Remotery

Senior Database Site Reliability Engineer

atTherapyNotes, LLCUS flagUnited StatesFull-timeDevOps & Site Reliability Engineer (SRE)Senior$120k – $160k/year

Posted May 6

📋 Description

• Tasked with designing, implementing, and maintaining high-availability, high-throughput, data and compute-intensive database systems utilizing PostgreSQL to support a continuously growing 24x7 SaaS platform.

• Define and enhance database service reliability through effective monitoring/alerting, SLO-oriented metrics, and ensuring operational readiness.

• Engage in and facilitate incident response, root cause analysis, and post-incident corrective measures for database-related production occurrences.

• Collaborate with other technical leaders to confirm that all newly introduced systems are both supportable and maintainable by development and operations teams.

• Provide advanced technical guidance and support to various technology teams across the organization.

• Offer on-call support for production issues and other responsibilities as necessary.

• Responsible for adhering to HIPAA security policies within the database framework.

• Ensure that all solutions and operational tasks comply with the security and operational policies set forth by the organization.

• Lead the continuous enhancement of our Datadog database observability by creating actionable dashboards, alerts, and service-level views using an observability stack (e.g., Prometheus, Grafana, New Relic, or similar). Familiarity with PGAnalyze or Percona is advantageous.

• Automate system maintenance tasks using Bash, Powershell, Python, or Ansible. Manage infrastructure as code (IaC) by writing Ansible playbooks; some experience with Terraform is a plus.

• Experience in writing and designing ETL pipelines using Python is a plus.

• Understand and maintain various PostgreSQL ecosystem components such as PgBouncer, PgBackrest, HaProxy, and RepMgr.

• Possess excellent communication and interpersonal skills.


⛳️ Requirements

• Bachelor’s degree in Information Systems, Engineering, or equivalent experience.

• 7-10+ years of engineering experience in Database Engineering, Systems Engineering, DevOps, or SRE.

• Familiarity with cloud-based compute, storage, and containerization solutions (preferably Azure & Kubernetes).

• Proficient in operating PostgreSQL within a Linux environment is a plus.

• Expertise in observability/monitoring platforms (e.g., Prometheus/Grafana, New Relic, Datadog, or similar); experience with Datadog is a plus.

• Proven experience in Agile/DevOps settings and managing production services with ITSM practices as applicable.


🏝️ Benefits

• Employer-sponsored health, dental, vision, life, and disability insurance.

• Retirement plan with company contributions.

• Annual company profit sharing.

• Budget for personal development and training.

• Open and collaborative work environment.

• Comprehensive 2-week onboarding plan.

• In-depth mentorship program.

People also viewed

Arctiq18 hours ago

Site Reliability Engineer

US flagVirginia OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
Arctiq18 hours ago

Senior Site Reliability Engineer

US flagVirginia OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
Software Mind18 hours ago

Senior DevOps Manager, German speaking

PL flagPoland OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Mediastream18 hours ago

DevOps Engineer

RO flagRomania OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Kyndryl18 hours ago

Site Reliability Engineer

US flagOhio OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$161.5k – $290.8k/year
ApplyView job
Guidehouse18 hours ago

Senior Azure DevOps Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$118k – $196k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers