Remotery

Senior AIOps Engineer

atPORCH πŸ’šIN flagIndiaFull-timeOperationsSeniorβ‚Ή2500k – β‚Ή3500k/year

Posted May 20

This is a fully remote position, open to applicants in India.

πŸ“‹ Description

β€’ Take ownership of production reliability for AI/ML services.

β€’ Establish and uphold SLOs/SLIs for essential AI systems.

β€’ Enhance and develop AI observability.

β€’ Work alongside data scientists and ML engineers to optimize deployment workflows for models.

β€’ Provision, deploy, configure, and sustain new components of AI infrastructure.

β€’ Reduce costs associated with LLMs.

β€’ Engage in an on-call rotation for AI/ML systems.

β€’ Collaborate across various disciplines.

β€’ Create and maintain production-quality services and tools using Python.


⛳️ Requirements

β€’ Bachelor's degree in Computer Science, Engineering, or a related discipline (Master's degree is preferred).

β€’ Over 4 years of professional software engineering experience, including at least 3 years in commercial software engineering or production operations (SRE/DevOps/Platform/ML platform), managing services on Kubernetes and/or major cloud platforms.

β€’ Strong experience with GCP is preferred, including practical knowledge of: GKE (Kubernetes Engine), BigQuery (data warehousing, SQL, schema management), Pub/Sub (event messaging), Vertex AI (batch prediction, model deployment), Cloud SQL, GCS, and IAM.

β€’ Experience in operating production systems at scale, ideally with distributed, microservices- or Kubernetes-based architectures.

β€’ Practical experience with CI/CD tools and deployment automation (GitLab CI/CD, GitHub Actions, or similar).

β€’ Understanding of fundamental machine learning concepts and model lifecycles (training, evaluation, deployment, monitoring), as well as familiarity with common ML tools.

β€’ Familiarity with LLM ecosystem tools β€” AI gateways, vector databases, RAG pipelines, embedding models, or LLM evaluation frameworks β€” is a significant advantage.


🏝️ Benefits

β€’ Medical insurance.

β€’ Accident insurance.

β€’ Retiral benefits.

β€’ 12 company-paid holidays.

β€’ 2 flexible holidays.

β€’ Privilege/earned leave.

β€’ Casual/sick leave.

β€’ Paid maternity and paternity leaves.

β€’ Weekly wellness events.

People also viewed

Avaya11 hours ago

IT Operations Analyst II

IN flagIndia OnlyFull-timeOperations
ApplyView job
Sword Health11 hours ago

Deal Operations

PT flagPortugal OnlyFull-timeOperations
ApplyView job
Infios11 hours ago

Cloud Operations Manager

IN flagIndia OnlyFull-timeOperations
ApplyView job
Remote12 hours ago

Deal Lead – Commercial Strategy & Operations

EuropeFull-timeOperations$48k – $162k/year
ApplyView job
Gridware12 hours ago

Operations Analyst – Contractor Role

PH flagPhilippines OnlyFreelanceOperations$6 – $9/hour
ApplyView job
Delegate CX12 hours ago

Sales Analytics and Data Operations Analyst

PH flagPhilippines OnlyFull-timeOperations
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers