Remotery

Senior Data Engineer

atAvayaUS flagNew YorkFull-timeUncategorizedSenior$128.2k – $157k/year

Posted 1 day ago

This is a fully remote position, open to applicants in New York.

📋 Description

• Design, construct, and manage low-latency streaming pipelines (Kafka, Spark Structured Streaming) alongside robust batch ETL/ELT processes on the Databricks Lakehouse platform.

• Implement reliable orchestration and dependency management (Airflow), ensuring strong SLAs and readiness for on-call support for critical business data flows.

• Model, optimize, and document curated datasets and interfaces that cater to analytics, product features, and AI workloads.

• Establish data quality checks, observability, and backfill processes; lead root-cause analyses and incident prevention efforts.

• Collaborate with application teams (Go/Java), analytics, and ML/AI to deploy data products into production environments.

• Create and sustain datasets and services that drive RAG pipelines and agentic AI workflows (tool-use/function calling).

• In scenarios where Spark/Databricks is not ideal, design and manage custom processors/services in Go to fulfill strict latency or specialized transformation needs.

• Instrument prompt/response and token usage telemetry to support LLMOps evaluation and cost optimization; provide datasets for labeling and golden sets.

• Enhance performance and cost (storage/compute), conduct code reviews, and elevate engineering standards.


⛳️ Requirements

• Over 6 years of experience in building production-grade data pipelines at scale (both streaming and batch).

• Extensive expertise in Python and SQL; significant experience with Spark on Databricks (or a similar platform).

• Advanced SQL skills: including window functions, CTEs, partitioning/z-ordering, as well as query planning and tuning in lakehouse environments.

• Practical experience with Kafka (or equivalent) and an orchestration tool (Airflow preferred).

• Strong skills in data modeling and performance tuning for low latency and high throughput scenarios.

• Production-oriented mindset: SLAs, monitoring, alerting, CI/CD, and participation in on-call rotations.

• Proficient in utilizing AI coding assistants (Cursor, Claude Code) as part of regular development tasks.

• Competence in building data services/processors in Go (or a willingness to quickly learn), with familiarity with alternative frameworks (e.g., Flink/Beam) being a plus.


🏝️ Benefits

• Performance-related bonus

• Benefits

People also viewed

Anchor Utility11 hours ago

Rate Analyst

US flagTexas OnlyFull-timeUncategorized
ApplyView job
Honeywell11 hours ago

HSE Manager

US flagNorth Carolina OnlyFull-timeUncategorized
ApplyView job
Cision France11 hours ago

People Partner

CA flagCanada OnlyFull-timeUncategorized$85k/year
ApplyView job
Navigate Power11 hours ago

B2B Outside Sales Consultant

US flagPennsylvania OnlyFreelanceUncategorized$50k – $250k/year
ApplyView job
TELUS11 hours ago

Business Development Executive, Early Career – European Language Required

GB flagUnited Kingdom OnlyFull-timeUncategorized
ApplyView job
Gilead Sciences11 hours ago

Statistical Programmer II

US flagUnited States OnlyFull-timeUncategorized$107.2k – $138.7k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers