
Staff Software Engineer – Databases SRE
Posted 1 hour ago

Posted 1 hour ago
This is a fully remote position, open to applicants in Germany.
• Collaborate closely with product engineering teams (embedded model)
• Take ownership of production reliability for complex customer environments with high SLAs
• Design and execute automation strategies to enhance our reliability practices
• Ensure our customers achieve their SLO targets
• Define and refine per-tenant SLOs and reliability frameworks
• Actively work to minimize SLO burn to avert recurring incidents
• Serve as the main escalation contact and be on-call for relevant incidents
• Lead incident responses that impact customers and conduct post-incident reviews
• Contribute to design documentation and participate in code reviews
• Influence feature designs to guarantee scalability and operability in production
• Develop automation to reduce unnecessary manual work
• Enhance alert quality and decrease unnecessary escalations
• 8+ years of engineering experience, with 4+ years in SRE/CRE/production engineering
• Strong expertise in Kubernetes within AWS, GCP, or Azure environments
• Familiarity with infrastructure-as-code tools (Helm, Terraform, Jsonnet, etc.)
• Experience managing multi-tenant systems in a production setting
• Robust experience in designing and implementing SLOs
• Proficiency in one or more programming languages (e.g., Go, Python, Java, etc.)
• Experience with Linux operating systems and their internals
• Understanding of networking, cloud storage, and scaling principles
• Excellent problem-solving and troubleshooting abilities
• Capability to analyze performance, scaling, and failure modes
• Comfortable collaborating within an engineering team
• Ability to engage deeply with product engineering teams
• Intellectually curious, value transparency, demonstrate a strong bias towards action, and exhibit kindness
• Equity
• Bonus (if applicable)
• 30 days of annual leave including 3 Grafana Shutdown Days
• Opportunities for professional development
• 100% remote work in a global culture
• Commitment to transparent communication
• Innovation-focused environment
• Open source foundations
• Empowered teams
• Clear pathways for career growth
• Approachable leadership
• In-person onboarding
PhoenixTeam
Pragmatike
Careflow
CI&T
Get handpicked remote jobs straight to your inbox weekly.