
Senior AI / Data Engineer
Posted 1 day ago

Posted 1 day ago
This is a fully remote position, open to applicants in New York.
• Design, construct, and uphold scalable data pipelines, integrations, and AI workflows.
• Create reliable and maintainable ETL/ELT systems that facilitate analytics, operational reporting, and AI-driven products.
• Contribute to the architecture and ongoing development of the company’s data platform and AI infrastructure.
• Continuously enhance data architecture to align with changing business and product needs.
• Develop infrastructure automation and deployment workflows to boost engineering speed and operational consistency.
• Implement infrastructure as code (IaC) methodologies using tools such as Terraform or CloudFormation.
• Construct and maintain CI/CD pipelines along with automated testing workflows.
• Create monitoring, alerting, and observability solutions for data and AI systems.
• Enhance reliability, scalability, and operational efficiency through automation.
• Contribute to the development of production-ready AI systems and workflows that yield measurable business value.
• Mentor junior and mid-level engineers through code reviews, pairing, and technical guidance.
• Strong expertise in Python and SQL.
• Practical experience with data orchestration tools (preferably Airflow, Dagster, or AWS Step Functions).
• Proven track record of building and managing AWS cloud infrastructure, especially services like Lambda, ECS, and SQS.
• Experience in implementing infrastructure as code using Terraform or similar tools.
• Extensive experience designing event-driven, serverless architectures utilizing AWS Lambda, API Gateway, EventBridge, and SQS/SNS.
• Hands-on experience with large-scale data platforms in production settings (preferably Spark/PySpark, AWS Glue, or EMR).
• Strong understanding of AWS data lake technologies, including S3, Glue Catalog, and Lake Formation.
• Practical experience with cloud data warehouses (preferably Snowflake), covering schema design, performance tuning, cost optimization, and access control.
• Experience in designing and maintaining reliable ETL/ELT pipelines and distributed data workflows.
• Hands-on experience with SQL-based transformation frameworks such as dbt (Core or Cloud).
• Familiarity with CI/CD systems and tools like GitHub Actions or CircleCI.
• Understanding of observability, monitoring, and operational best practices for data systems.
• Strong grasp of data security, access controls, and safeguarding sensitive data.
• Experience building automation and operational tools using Python or similar programming languages.
• Familiarity with production AI/ML workflows and the operational considerations for AI-enabled systems.
• Experience responsibly utilizing AI-assisted engineering tools (e.g., Claude Code, Codex, GitHub Copilot) to enhance productivity and engineering quality.
• Competitive compensation package including base salary, bonus, and equity.
• Employer-sponsored 401(k) plan with matching contributions.
• Comprehensive medical, dental, and vision insurance coverage.
• Flexible time off and a hybrid work environment.
HubSpot
Prima
Get handpicked remote jobs straight to your inbox weekly.