
Principal Software Engineer – AI Experiment Tracking
Posted May 25

This is a fully remote position, open to applicants in Ireland.
• Design and spearhead the implementation of new features and solutions for MLflow on Red Hat OpenShift AI.
• Provide technical direction and leadership on critical, high-impact projects, ensuring quality, scalability, and reliability across systems.
• Drive innovation in the MLOps field by engaging with upstream communities, especially Kubeflow and MLflow.
• Establish and advocate for quality engineering standards across teams, ensuring robust testing practices, CI/CD pipelines, and a quality-first culture at scale.
• Ensure that non-functional requirements, such as security, resiliency, performance, and maintainability, are consistently achieved.
• Write and review complex test strategies, frameworks, and automation methods that elevate quality standards across the organization.
• Contribute to a culture of continuous improvement by sharing insights and technical knowledge with team members.
• Collaborate with product management, other engineering teams, and cross-functional teams to analyze and clarify business requirements.
• Engage effectively with stakeholders and leadership to provide visibility and influence decision-making processes.
• Conduct thoughtful and timely code reviews, exemplifying high standards of quality, maintainability, and design.
• Represent Red Hat OpenShift AI (RHOAI) in external engagements, including industry events, customer meetings, and open-source communities.
• Mentor, influence, and coach a distributed team of engineers, nurturing future technical leaders and instilling strong engineering discipline.
• Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to integrate new AI capabilities into existing workflows and tools.
• Extensive experience in developing applications using Go, Python, or another programming language.
• Advanced proficiency with AI experiment tracking tools such as MLflow, Weights & Biases, or ClearML.
• Significant experience with Kubernetes, OpenShift, or other cloud-native technologies.
• Expertise in defining, scaling, and implementing testing strategies, automation frameworks, and CI/CD pipelines across large, distributed systems.
• Ability to quickly learn and guide others in using new tools and technologies, including AI-assisted development tools.
• Experience with source code management tools like Git.
• Proven ability to innovate and a passion for remaining at the cutting edge of technology, including quality engineering best practices.
• Strong system understanding and troubleshooting abilities, focusing on scalability, reliability, and performance.
• Technical leadership skills within a global team environment, including mentoring and coaching engineers at various levels.
• Exceptional written and verbal communication skills.
• Red Hat recognizes that the best ideas originate from diverse perspectives and experiences.
• Flexible working hours.
• Opportunities for professional development.
• Paid time off.
• Health insurance.
• Retirement plans.