Remotery

Forward Deployed Engineer, AI Inference, vLLM, Kubernetes

Posted 1 day ago

This is a fully remote position, open to applicants in California, +4 more states.

📋 Description

• Manage Distributed Inference: Implement and set up LLM-D and vLLM on Kubernetes clusters.

• Enhance Production Efficiency: Conduct performance testing and optimize vLLM settings.

• Collaborate on Code Development: Partner with customer engineers to produce high-quality production code.

• Tackle Complex Challenges: Resolve intricate interactions between model architectures and hardware accelerators.

• Establish Feedback Mechanisms: Relay insights from the field back to product development.


⛳️ Requirements

• Over 8 Years of Engineering Experience

• Strong Customer Engagement Skills

• Proactive Approach to Problem Solving

• Extensive Knowledge of Kubernetes

• Expertise in AI Inference

• Proficient in Systems Programming with Python and Go

• Familiarity with Infrastructure as Code, including Helm, Terraform, or similar tools

• Understanding of Cloud and GPU Hardware

• Experience with open-source AI infrastructure projects is advantageous

• Familiarity with Envoy Proxy or Inference Gateway (IGW) is a bonus


🏝️ Benefits

• Comprehensive medical, dental, and vision coverage

• Flexible Spending Account for healthcare and dependent care

• Health Savings Account for high deductible medical plans

• 401(k) retirement plan with employer matching

• Paid time off and holidays

• Paid parental leave for all new parents

• Leave benefits encompassing disability, paid family medical leave, and paid military leave

• Additional perks including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, and employee assistance program

People also viewed

Anchor Utility10 hours ago

Rate Analyst

US flagTexas OnlyFull-timeUncategorized
ApplyView job
Honeywell10 hours ago

HSE Manager

US flagNorth Carolina OnlyFull-timeUncategorized
ApplyView job
Cision France10 hours ago

People Partner

CA flagCanada OnlyFull-timeUncategorized$85k/year
ApplyView job
Navigate Power10 hours ago

B2B Outside Sales Consultant

US flagPennsylvania OnlyFreelanceUncategorized$50k – $250k/year
ApplyView job
TELUS10 hours ago

Business Development Executive, Early Career – European Language Required

GB flagUnited Kingdom OnlyFull-timeUncategorized
ApplyView job
Gilead Sciences10 hours ago

Statistical Programmer II

US flagUnited States OnlyFull-timeUncategorized$107.2k – $138.7k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers