Remotery

Database Reliability Engineer – Core Team

Posted May 21

This is a fully remote position, open to applicants in Germany.

📋 Description

• Continuously enhance the reliability and performance of the ClickHouse core system.

• Develop and improve metrics and alerts for ClickHouse to proactively identify and address issues in production before they impact customers.

• Investigate the most frequently encountered issues by customers in ClickHouse Core to determine root causes, submit bug fixes, report issues, and propose enhancements.

• Refine incident response procedures and conduct post-mortem analyses for outages related to ClickHouse core, collaborating with support and Cloud teams to communicate with affected customers.

• Plan, facilitate, and drive Chaos initiatives across Engineering teams based on internal priorities.

• Oversee on-call processes to address performance and reliability challenges, establishing best practices for escalation coordination to resolve issues and minimize customer impact.


⛳️ Requirements

• Bachelor’s or Master’s degree in Computer Science or a related discipline.

• Minimum of 5 years of experience in Reliability Engineering, QA, or customer-facing engineering roles.

• Prior experience managing ClickHouse or other SQL databases in a production environment.

• Strong understanding of distributed database internals and SQL, with particular expertise in ClickHouse being a significant advantage.

• Proficiency in scripting with Shell or Python, and the ability to read and comprehend C++ code.

• Familiarity with cloud computing platforms such as AWS, Azure, or Google Cloud Platform.

• Strong problem-solving abilities and excellent production debugging skills.

• Ability to thrive in a fast-paced environment as part of a global team, viewing yourself as a collaborative partner with the business to drive it forward.

• High level of responsibility, ownership, and accountability.

• Exceptional communication skills.


🏝️ Benefits

• Flexible work environment - ClickHouse is a globally distributed company that supports remote work. We currently operate in 20 countries.

• Healthcare - Employer contributions towards your healthcare expenses.

• Equity in the company - Every new team member receives stock options upon joining.

• Time off - Flexible time off in the US, with generous entitlements in other countries.

• A $500 home office setup allowance for remote employees.

• Global Gatherings – We value in-person connections and offer opportunities to engage with colleagues at company-wide offsites.

People also viewed

Work Life Group24 min ago

Lead DevOps Engineer, Data & AI Platform

HU flagHungary OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
accesa.eu24 min ago

DevOps Engineer, German

RO flagRomania OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Cisco30 min ago

Site Reliability Engineer – Kubernetes Platform

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Work Life Group37 min ago

Lead DevOps Engineer – Data & AI Platform

CZ flagCzechia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
JumpCloud37 min ago

Security Engineer, DevSecOps

MX flagMexico OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Unit437 min ago

Cloud Operations Engineer

PT flagPortugal OnlyFull-timeDevOps & Site Reliability Engineer (SRE)€30.5k – €35.1k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers