
AI Engineer – Data Intelligence
Posted 5 days ago

This is a fully remote position, open to applicants in the United States.
Responsibilities
• Develop and sustain components of Clarium's master data enrichment pipeline, which classifies and enhances every product that passes through our platform.
• Design and manage classification and entity resolution workflows that integrate deterministic logic and LLMs for processing production data.
• Create and maintain evaluation harnesses, label sets, and regression suites (using Braintrust) to measure and improve pipeline quality with confidence.
• Write production-level Python and SQL; the majority of your time will be dedicated to coding rather than using configuration tools.
• Analyze intricate datasets using statistical methods and machine learning to extract actionable insights and guide pipeline enhancements.
• Take the initiative to audit data for quality concerns; identify issues that may have gone unnoticed, diagnose root causes, and implement solutions.
Requirements
• Proficient in Python with a proven history of writing production-level code, beyond mere scripts or notebooks.
• Strong expertise in SQL, including complex joins, window functions, performance optimization, and data modeling.
• Comfortable navigating ambiguous situations; able to define a problem, devise a plan, and execute independently.
• A sincere and unwavering dedication to data quality; you perceive silent bugs as significant failures.
• Ability to dig deeply into an unfamiliar domain and build substantial expertise over time.
• Experience with LLM integrations, prompt evaluation, or large-scale classification (Nice to Have).
• Familiarity with evaluation frameworks such as Braintrust, Promptfoo, or similar (Nice to Have).
Benefits
• Health insurance
• 401K
• Unlimited PTO
• Incentive Stock Options proportionate to your salary
• Fully remote, with access to an NYC co-working space