
Infrastructure Engineer
Posted Jun 20

This is a fully remote position, open to applicants in Spain.
• Lead the enhancement of our operational resilience as the organization expands.
• Manage the stability, observability, and debugging processes that ensure our systems function seamlessly.
• Serve as the primary contact for resolving intricate failures in real time.
• Develop tools that transform disorder into understanding, facilitating our transition from reactive to proactive operations.
• Shape our approach to reliability by reducing incident frequency, building internal tools, and directly improving developer focus and system availability.
• Over 3 years of practical experience in debugging production environments (logs, traces, incidents, etc.).
• Excellent problem-solving capabilities and adeptness in navigating unfamiliar backend codebases.
• Proficient in Go and Kubernetes.
• Knowledge of observability and monitoring tools (e.g., Datadog, Prometheus, Sentry).
• Ability to communicate clearly and calmly under pressure, particularly during live incidents.
• Chance to be part of a rapidly growing AI startup, supported by leading investors.
• Accelerated Growth - Supported by a16z and YC, aiming for double-digit ARR.
• Competitive Compensation - Attractive salary plus equity in a fast-growing startup.
• Ownership & Autonomy - Full responsibility for projects and the ability to deliver quickly.
• Collaborate with Top Talent - Join an exceptional team of engineers and innovators.