
Principal / Staff Software Engineer – Backend, MLOps, Cloud Infrastructure
Posted 11 hours ago

Posted 11 hours ago
This is a fully remote position, open to applicants in Portugal.
• Spearhead the architectural design, evolution, and scaling of our distributed backend microservices and machine learning platforms.
• Establish, advocate for, and uphold engineering best practices across the organization, which include thorough code reviews, automated testing, design documentation, and security measures.
• Mentor and guide senior engineers, promoting a culture of technical curiosity, ongoing learning, and high execution velocity.
• Design and develop high-performance, resilient, and secure cloud-native solutions using Python and contemporary web application frameworks.
• Architect and enhance data layers, ensuring that database designs (SQL and NoSQL) are optimized for high throughput, low latency, and long-term scalability.
• Lead the strategy for containerization and orchestration utilizing Docker and Kubernetes to ensure effortless deployments and efficient resource use.
• Construct and manage a robust MLOps infrastructure to streamline the entire machine learning lifecycle—from data pipelining and model training to production deployment and monitoring.
• Oversee and secure our AWS cloud infrastructure, optimizing complex networking configurations that include VPCs, DNS, ingress controllers, and service meshes.
• Design end-to-end CI/CD pipelines and implement DevOps automation to support reliable, frictionless daily deployments.
• Foster comprehensive system visibility by implementing thorough observability stacks (Datadog, Prometheus, Grafana, OpenSearch/ELK) and enabling distributed tracing.
• Core Development: Expertise in Python with extensive experience in creating enterprise-grade backend systems and web applications.
• Distributed Systems: Proven experience in microservices architecture, API design, asynchronous event-driven patterns, and system scalability.
• Cloud & Networking: In-depth, hands-on knowledge of AWS cloud infrastructure, networking setups (VPCs, DNS, Ingress, Service Mesh), and infrastructure as code practices.
• Containers & Orchestration: Advanced proficiency in Docker and Kubernetes for scaling and managing containerized applications.
• Data Management: Strong skills in database design, query optimization, and managing complex data lifecycles.
• MLOps Foundations: Experience in building or integrating machine learning infrastructure, model deployment pipelines, and ML feature stores.
• Observability & DevOps: Advanced knowledge in deploying logging, metrics, and tracing stacks (Datadog, Prometheus, Grafana, ELK) alongside robust CI/CD pipelines.
• Experience: A minimum of 9 years of professional software engineering experience, with at least 3 years in a Staff, Principal, or similar technical leadership role.
• Track Record: Proven success in designing and operating large-scale distributed cloud systems from inception.
• Mindset: A strong sense of ownership, a drive for automation, and the ability to effectively communicate complex technical concepts into actionable tasks for the team.
• Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical discipline (or equivalent practical experience).
• Competitive salary and equity packages.
• Opportunity to significantly influence the technical direction of an innovative and growing company.
• Flexible remote work options.
• Budget allocated for continuous learning, conferences, and professional development.
The Health Management Academy
Paragone Solutions, Inc.
F5
CACI International Inc
Get handpicked remote jobs straight to your inbox weekly.