Remotery

Senior Cloud Site Reliability Engineer

Posted 1 hour ago

This is a fully remote position, open to applicants in India.

📋 Description

• Ensure the health and reliability of Solace Cloud Services, meeting all SLAs.

• Design and implement infrastructure tooling, observability, and automation.

• Enhance the efficiency of production operations, minimizing errors and improving processes.

• Possess expert-level knowledge in managing production incidents in multi-cloud environments, following industry-standard incident management protocols.

• Handle service requests and provisioning from customers effectively.

• Demonstrate a proven ability to manage customer escalations and drive resolutions in critical, high-impact production settings.

• Collaborate directly with customers to identify, troubleshoot, and resolve operational challenges.

• Utilize expert debugging skills in Linux and Kubernetes to identify operational issues.

• Participate in on-call rotations, providing 24x7 off-hours support.


⛳️ Requirements

• Demonstrated expertise with public cloud providers (AWS, Azure, GCP) services and features.

• Proven experience with cloud Kubernetes infrastructure platforms such as AWS Elastic Kubernetes Service, Azure Kubernetes Service, and Google Kubernetes Service.

• Hands-on experience with monitoring tools including Datadog, Kibana, and Prometheus.

• Practical experience with infrastructure automation tools like Terraform and Cloud Formation.

• Expertise in debugging production alerts.

• In-depth understanding of Linux Operating Systems.

• Proficiency in programming languages such as Groovy, Python, and Go.

• Certified Kubernetes Administrator.

• Certified Cloud Administrator (AWS, Azure, or GCP).


🏝️ Benefits

• Balance matters – We believe work should fit into your life, not the other way around.

• Hybrid-first – Flexibility is embedded in our work culture, ensuring everyone feels included and empowered.

• Values-driven – We embody our core values: craftsmanship, trust, courage, freedom, momentum, humility, and human experience.

• Growth mindset – Our training programs are crafted to help you advance quickly.

• Customer Obsessed – We take pride in our world-class customer lineup.

• Keep it fun – We are social, maintaining simplicity, and know how to enjoy ourselves.

• Creative culture – Our sense of humor shines through as we create engaging videos on topics like MITT (check them out!).

People also viewed

HealthEdge40 min ago

Senior Release Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$108k – $115k/year
ApplyView job
Equinix1 hour ago

Senior Staff Engineer, SRE/DevOps, Produit Logiciel

US flagTexas OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$136k – $204k/year
ApplyView job
Calendly1 hour ago

Senior Site Reliability Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$198k – $288k/year
ApplyView job
GFT Technologies1 hour ago

DevOps Cloud Networking Engineer – English Advanced

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Hotel Engine1 hour ago

Senior Software Engineer, DevOps/Infrastructure

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$121.4k – $168k/year
ApplyView job
CodiLime1 hour ago

Senior DevOps Engineer

EG flagEgypt OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers