
Senior Software Engineer – Site Reliability
Posted 2 days ago

Posted 2 days ago
This is a fully remote position, open to applicants in United States.
• Ensures the platform maintains its stability, scalability, and performance.
• Improves product reliability by designing automated solutions for intricate infrastructure and operational issues.
• Advocates for application availability and efficiency through proactive monitoring, performance optimization, and strategic enhancements.
• Conducts post-mortem analyses, develops automation to minimize operational burdens, and collaborates with product owners and developers.
• Engages in tool selection, aids in capacity planning, and establishes monitoring and alerting systems to fulfill business-defined Service Level Objectives (SLOs).
• Guides junior engineers to promote a culture of operational excellence.
• Must be at least eighteen years old.
• Must have legal authorization to work in the United States.
• Experience with GCP - Cloud Infrastructure.
• Proficiency in Observability tools such as Grafana, Prometheus, Loki, and Tempo.
• Familiarity with Litmus Chaos for destructive testing.
• Knowledge of K6 for performance testing.
• Experience with Terraform Enterprise for Infrastructure as Code.
• Proficient in Github for source control management.
• Experience with CDK8S for Kubernetes manifest as code.
• Familiarity with GH Copilot for AI development acceleration.
• Knowledge of SRE practices, including Production Readiness Review, Capacity Planning, Change Validation, and Production Support.
• A minimum of 3 years of experience in software development.
• Health insurance
• 401(k) matching
• Flexible work hours
• Paid time off
• Remote work options
Cision France
Navigate Power
Get handpicked remote jobs straight to your inbox weekly.