Remotery

Site Reliability Engineer

Posted 7 hours ago

This is a fully remote position, open to applicants in Pennsylvania.

📋 Description

• Schedule and execute large-scale batch workloads across Kubernetes clusters.

• Diagnose and troubleshoot job failures for clients.

• Collaborate with teams across the organization to understand workload requirements and improve platform capabilities.

• Enhance the reliability and speed of our systems and processes by increasing automation.

• Document procedures to create a detailed library of runbooks, serving as a knowledge base and foundation for automation.

• Participate in an on-call rotation to maintain the SLOs and SLAs of production services.

• Contribute to platform tooling, automation, and CI/CD workflows.


⛳️ Requirements

• A solid understanding of Linux operating system internals, TCP/IP networking, and storage subsystems.

• Extensive experience with Kubernetes and container orchestration in production-grade environments.

• Knowledge of engineering design limitations and the ability to advise teams on scaling their services to meet performance goals within budget.

• Strong experience implementing and troubleshooting cloud-native and open-source tools such as Kubernetes, etcd, Prometheus, and OpenTelemetry.

• Excellent communication skills and the ability to work effectively in a diverse, distributed team.


🏝️ Benefits

• We are proud to be an equal opportunity workplace.

• We believe that diverse teams produce the best ideas and outcomes.

• We are committed to fostering a culture of inclusion, entrepreneurship, and innovation across gender, race, age, sexual orientation, religion, disability, and identity.
