
Senior Lead AI Engineer
Posted May 7

This is a fully remote position, open to applicants in Mexico.
• Create and develop the agent harness, encompassing skill loading, tool invocation, context management, execution sandbox, compaction patterns, and sub-agents.
• Establish the evaluation pipeline: how we assess skill effectiveness, detect regressions, and turn production failures into corrected behaviors.
• Collaborate with product and engineering teams to deliver the initial 3–5 supplier-facing use cases (invoicing, catalog, strategic views) using the platform you create.
• Define and oversee the cloud deployment and configuration for the platform, including its transition to production, scalability, and security measures.
• Make architectural decisions regarding runtime isolation, credential management, audit logging, and tenant separation, in partnership with DevOps experts.
• Elevate the team's standards for utilizing agentic tools, determining what to automate, what to verify, and what tasks should remain human-driven.
• Possess over 6 years of experience in delivering production software with strong proficiency in Python and/or TypeScript.
• Have developed platform or infrastructure systems that other engineers rely on, such as CLIs, runtimes, and internal platforms.
• Have successfully launched at least one LLM-powered product for actual users and possess insights into why many of these products fail.
• Be a **frequent, intensive user of agentic coding tools** — such as Claude Code, Cursor, Codex, or their equivalents — and have insights on maximizing their effectiveness.
• Engage in side projects that are genuine and significant, built out of passion and inspiration, and ideally carried to completion using new AI tools.
• Communicate fluently in English, both in writing and speaking, as skill authoring involves substantial prose and the team operates in English.
• Have experience deploying and managing containerized services in production, with an understanding of network isolation, infrastructure-as-code, secrets management, and least-privilege IAM that extends beyond the application layer.
• Think systemically, considering tokens, latency, failure modes, and user outcomes as interconnected elements rather than isolated issues.
• Pioneering Technology
• Collaborative Culture
• Global Impact