
Data Engineer
Posted 2 hours ago

Posted 2 hours ago
This is a fully remote position, open to applicants in India.
• Create, develop, and sustain dependable data pipelines that facilitate the movement of information among EVS applications, databases, cloud services, and analytical platforms.
• Establish integrations between PostgreSQL databases, AWS services, internal software applications, and external systems.
• Construct and maintain data ingestion pipelines, incorporating embedding generation workflows for AI-driven applications.
• Apply testing, monitoring, version control, and observability practices to ensure the reliability and maintainability of data pipelines.
• Collaborate with Full Stack and AI Engineers to design scalable platform architecture and data integrations.
• Influence data architecture decisions, including schema design, normalization, indexing strategies, and data access patterns.
• Assist in supporting cloud-based data infrastructure, deployments, and initiatives to enhance platform reliability.
• Assess technical methodologies and propose scalable, maintainable data solutions.
• Execute data quality validation, governance, and monitoring within engineering systems.
• Engage in architecture reviews, code reviews, and technical discussions while helping to establish data engineering standards and best practices.
• Work collaboratively across the innovation team to bolster evolving data, analytics, and AI initiatives.
• Bachelor’s degree in Computer Science, Data Engineering, Software Engineering, or a related technical discipline.
• Over 3 years of professional experience in Data Engineering, Backend Software Development, or a similar technical position.
• Proven experience in designing, building, and maintaining production data pipelines and system integrations.
• Strong expertise with PostgreSQL or other relational database systems, including schema design, query optimization, and data modeling.
• Advanced programming skills in Python and SQL for data engineering and transformation workflows.
• Experience with data processing libraries like Pandas, Polars, or similar frameworks.
• Practical experience with AWS cloud services such as S3, EC2, IAM, Lambda, RDS, or equivalent cloud technologies.
• Familiarity with Git, GitHub, CI/CD pipelines, and contemporary software development best practices.
• Excellent analytical, problem-solving, and communication skills, with the ability to collaborate across technical teams.
• Experience with Infrastructure-as-Code tools like Terraform or AWS CDK is preferred.
• Knowledge of orchestration tools such as Airflow, AWS Step Functions, Glue, or EventBridge is advantageous.
• Familiarity with dbt, Snowflake, Redshift, BigQuery, or comparable modern data platforms is preferred.
• Understanding of vector databases, embedding pipelines, RAG architectures, or AI data infrastructure is beneficial.
• Experience with PostGIS, geospatial data, Docker, containerized applications, or event-driven architectures is a plus.
• Background in engineering, AEC, utilities, renewable energy, or technical consulting environments is advantageous.
• Health Insurance (Medical through United Healthcare/UMR)
• Dental and Vision Insurance
• STD (Short-Term Disability) and LTD (Long-Term Disability) Insurance
• Life Insurance
• EAP (Employee Assistance Program)
• Paid Time Off
• 401k Match
• Choice Benefit
• Flexible Work Time
• Performance-Based Bonus
• Point-Based Recognition
• Parental Benefits
• Referral Bonus
• Certification Bonus
• Hybrid / Remote Culture
Vertical Relevance
CenterWell Senior Primary Care
Medecision
Get handpicked remote jobs straight to your inbox weekly.