
Senior Technical Program Manager, Machine Learning Infrastructure
Posted 1 hour ago

Posted 1 hour ago
This is a fully remote position, open to applicants in Canada.
• Oversee the program portfolio that encompasses inference, efficiency, serving, and endpoints to ensure that Cohere’s infrastructure adequately scales to support a rapidly growing internal and external user base.
• Direct the comprehensive coordination and execution of the program, with an emphasis on fostering cross-functional collaboration between Modeling and customer-facing teams.
• Recognize obstacles and implement processes that allow the team to concentrate on development while addressing the needs of both internal and external model users, and enhancing engineering best practices.
• Cultivate a robust culture centered around continuous improvement by collaborating with incident management leaders, ensuring accurate root cause analysis of issues, and delivering necessary fixes promptly.
• Handle multiple overlapping projects and programs, prioritizing requests to guarantee that the company's foremost objectives are achieved.
• Partner with stakeholders across the organization to establish, monitor, and manage timelines, deliverables, budgets, and scope to facilitate successful program execution.
• Provide clear, timely, and consistent updates to engineering, leadership, and non-technical teams regarding progress, plans, and incidents.
• Serve as a strong tactical and strategic ally to senior leaders within your program, whether offering detailed tactical support or addressing high-level strategic issues that create company-wide impact.
• A minimum of 5 years of experience in Technical Program Management with a focus on Machine Learning Infrastructure, particularly in areas like model inference, serving, efficiency, and endpoint design and implementation.
• Comprehensive technical expertise in ML infrastructure design and implementation, along with engineering best practices that support a balance of speed and structure in rapidly evolving organizations.
• Proven experience in a dynamic, fast-paced, and loosely structured environment.
• (optional, but strongly preferred) Hands-on technical experience.
• (optional, but preferred) Familiarity with ML teams and a fundamental understanding of the machine learning model development process.
• A weekly lunch stipend of $75/£75 or its equivalent in your local currency for dining expenses.
• Comprehensive health and dental coverage, including a separate budget for mental health services.
• RRSP matching, 401K, and Pension Scheme options available.
• 100% parental leave top-up for up to 6 months, available to either parent.
• Annual enrichment benefits covering arts & culture, fitness/wellness, quality time, and workspace improvement credits.
• Education and learning stipend for conferences, courses, and coaching opportunities.
• 6 weeks of paid vacation (30 working days!)
• Budget for travel to other offices if you are working remotely, plus an annual company offsite.
Quanata
brightwheel
Community Memorial Healthcare
Revelation Pharma
Get handpicked remote jobs straight to your inbox weekly.