Remotery

Senior DevOps Engineer

Posted 22 hours ago

📋 Description

• Assist in transitioning the cloud platform from pre-production to a state of production readiness and scalability.

• Collaborate closely with engineering and data teams to implement infrastructure as code.

• Enhance deployment pipelines, establish monitoring and alerting systems, and facilitate the production deployment of data pipelines and risk oracle workloads.

• Evaluate and strengthen the existing platform setup, primarily on GCP.

• Transition infrastructure to Infrastructure-as-Code using Terraform or similar tools.

• Standardize development, staging, and production environments.

• Design and manage the platform's networking layer, including VPC architecture, private connectivity, and load balancing.

• Guide decisions regarding workload management between Cloud Run and GKE.

• Secure GitHub Actions pipelines.

• Implement monitoring, logging, tracing, alerting, and dashboards throughout the platform.

• Collaborate with data teams to productionize data pipelines and risk oracle workloads.

• Set up secrets management, audit logging, IAM, and access patterns.


⛳️ Requirements

• A minimum of 5 years of experience in DevOps, Platform, or SRE roles, with at least 2 years focused on GCP; familiarity with Vertex AI, AWS, or Azure is advantageous.

• Practical experience with Infrastructure-as-Code tools such as Terraform, Pulumi, CDK, or similar.

• Extensive CI/CD experience, particularly with GitHub Actions or comparable platforms.

• Experience in deploying and managing containerized services using Cloud Run, Kubernetes/GKE, ECS, or similar technologies.

• Sound judgment on when to opt for managed or serverless platforms versus Kubernetes or orchestrated methods.

• Experience in managing production data and caching infrastructure, including Cloud SQL/Postgres, Redis/Memorystore.

• Proficient in establishing production monitoring, logging, alerting, dashboards, and reliability targets.

• Strong understanding of cloud security principles, including IAM, secrets management, and audit logging.

• Familiarity with workflow orchestration or asynchronous task systems such as Temporal, Celery, or similar.

• Experience in supporting ML or AI inference workloads in a production environment, with hands-on experience across vector databases.


🏝️ Benefits

• Engage in global projects with clients from around the world.

• Be part of a remote-first culture, allowing flexibility to work from anywhere.

• Participate in team-building activities and regular outings.

• Collaborate and develop in a nurturing environment with opportunities to learn from experienced engineers.

• Enjoy a competitive salary and benefits package.

People also viewed

Arctiq18 hours ago

Site Reliability Engineer

US flagVirginia OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
Arctiq18 hours ago

Senior Site Reliability Engineer

US flagVirginia OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
Software Mind18 hours ago

Senior DevOps Manager, German speaking

PL flagPoland OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Mediastream18 hours ago

DevOps Engineer

RO flagRomania OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Kyndryl18 hours ago

Site Reliability Engineer

US flagOhio OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$161.5k – $290.8k/year
ApplyView job
Guidehouse18 hours ago

Senior Azure DevOps Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$118k – $196k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers