Site Reliability Engineer II

at Akamai Technologies
Poland | Full-time | DevOps & Site Reliability Engineer (SRE) | Mid-level, Senior

Posted 1 hour ago

📋 Description

• Developing and maintaining dashboards, alerts, and monitoring systems for inference workloads utilizing Akamai's existing observability framework.

• Creating automation and tools in Python or Go to minimize operational burdens and enhance system reliability.

• Engaging in on-call rotations, addressing production incidents, and participating in post-incident analysis.

• Constructing and refining runbooks for inference-related operational tasks, integrating them into Akamai's current incident management workflows.

• Assisting in SLO tracking and reporting, identifying patterns and opportunities for improvement.

• Aiding in the maintenance of CI/CD pipelines, ensuring deployment safety checks, and managing rollback procedures.

• Collaborating with product engineering teams to resolve complex issues throughout the technology stack.


⛳️ Requirements

• Possess commercial experience in Site Reliability Engineering.

• Demonstrate proficiency in a programming language like Python or Go, with experience in developing automation solutions.

• Have experience with Linux systems administration and the capability to troubleshoot intricate infrastructure challenges.

• Be familiar with Kubernetes and containerization principles.

• Have experience using monitoring and observability tools such as Prometheus, Grafana, or equivalent.

• Have exposure to CI/CD pipelines and infrastructure-as-code tools (Terraform, SaltStack, or comparable).

• Exhibit a desire to learn and grow, with a genuine interest in AI infrastructure and distributed systems.


🏝️ Benefits

• Your health

• Your finances

• Your family

• Your time at work

• Your time pursuing other endeavors
