Remotery

Staff Software Engineer – Databases SRE

Posted 1 hour ago

This is a fully remote position, open to applicants in Germany.

📋 Description

• Collaborate closely with product engineering teams (embedded model)

• Take ownership of production reliability for complex customer environments with stringent SLAs

• Design and execute automation strategies to enhance our reliability practices

• Ensure our customers achieve their SLO targets

• Define and refine per-tenant SLOs and reliability frameworks

• Actively work to minimize SLO burn to avert recurring incidents

• Serve as the main escalation contact and be on-call for relevant incidents

• Lead incident responses that impact customers and conduct post-incident reviews

• Contribute to design documentation and participate in code reviews

• Influence feature designs to guarantee scalability and operability in production

• Develop automation to reduce unnecessary manual work

• Enhance alert quality and decrease unnecessary escalations


⛳️ Requirements

• 8+ years of engineering experience, with 4+ years in SRE/CRE/production engineering

• Strong expertise in Kubernetes within AWS, GCP, or Azure environments

• Familiarity with infrastructure-as-code tools (Helm, Terraform, Jsonnet, etc.)

• Experience managing multi-tenant systems in a production setting

• Robust experience in designing and implementing SLOs

• Proficiency in one or more programming languages (e.g., Go, Python, Java)

• Experience with Linux operating systems and their internals

• Understanding of networking, cloud storage, and scaling principles

• Excellent problem-solving and troubleshooting abilities

• Capability to analyze performance, scaling, and failure modes

• Comfortable collaborating within an engineering team

• Ability to engage deeply with product engineering teams

• Intellectual curiosity, a commitment to transparency, a strong bias towards action, and kindness


🏝️ Benefits

• Equity

• Bonus (if applicable)

• 30 days of annual leave including 3 Grafana Shutdown Days

• Opportunities for professional development

• 100% remote work in a global culture

• Commitment to transparent communication

• Innovation-focused environment

• Open source foundations

• Empowered teams

• Clear pathways for career growth

• Approachable leadership

• In-person onboarding
