
Principal GenAI Platform Engineer
Posted 1 hour ago

Posted 1 hour ago
This is a fully remote position, open to applicants in California.
• Design, construct, and sustain the foundational infrastructure layer that supports GenAI products, which includes model gateways, prompt/versioning repositories, vector databases, and tools for evaluating LLM.
• Implement secure access controls and authentication mechanisms that are integrated by default into the components of the AI platform.
• Develop and oversee observability, monitoring, and logging solutions for GenAI workloads and infrastructure.
• Work closely with product and engineering teams to integrate the GenAI infrastructure with agent frameworks and downstream applications.
• Enhance infrastructure for scalability, high availability, and cost efficiency for production workloads.
• Significant experience in building and maintaining AI platform infrastructure, as well as expertise in Kubernetes and container security.
• Proven expertise in observability and monitoring frameworks, particularly with a focus on real-time performance (e.g., experience with OpenTelemetry, MLFlow).
• Familiarity with AI infrastructure components such as vector databases, prompt/versioning repositories, and AI IDEs.
• Knowledge of vLLM, SGLang, or similar frameworks for hosting LLM inference workloads.
• Experience with CI/CD pipelines and automation processes related to AI model deployment and platform operations.
• Strong understanding of authentication and authorization frameworks integrated into AI platforms.
• Opportunity to work on cutting-edge technology in the AI field.
• Collaborative and innovative work environment.
• Competitive salary and comprehensive benefits package.
SHOP APOTHEKE EUROPE
Zscaler
Hummingbird Healthcare
Empower
Get handpicked remote jobs straight to your inbox weekly.