
Senior SRE, Blockchain Networks
Posted May 19

Posted May 19
This is a fully remote position, open to applicants in Europe.
• Take the lead in the comprehensive launch of new blockchain networks, transitioning from testnet to mainnet.
• Design and execute deployment architectures for validators, full nodes, RPCs, and ancillary services.
• Ensure all newly launched networks adhere to production readiness criteria, including monitoring, alerting, backups, failover, and security measures.
• Collaborate with protocol teams to grasp network-specific requirements, risks, and potential failure modes.
• Develop repeatable launch patterns and runbooks to expedite the time-to-market for new networks.
• Construct and manage infrastructure across both cloud and bare-metal environments.
• Enhance automation and standardization of deployments using tools like Terraform, Helm, and proprietary tools.
• Contribute to the internal platform by ensuring alignment of launches with existing Kubernetes, observability, and delivery standards.
• Implement high-availability and fault-tolerant configurations for validator infrastructure.
• Continuously refine SLOs, SLIs, and alerting mechanisms for newly launched networks.
• Guarantee that all services are fully observable, including metrics, logs, and traces.
• Define and execute alerts that are actionable and maintain a low-noise level.
• Engage in on-call rotations and incident response activities.
• Lead or contribute to post-incident reviews, focusing on systemic enhancements.
• Proactively identify and resolve reliability risks prior to their impact on production.
• Apply security best practices to all deployments, including secrets management, access control, and network isolation.
• Ensure compliance with internal standards and contribute to practices aligned with SOC 2.
• Support secure key management practices for validator infrastructure.
• Collaborate closely with Infrastructure, Core Networks, and Security teams.
• Take ownership of deliverables from design through to production.
• Contribute to documentation, runbooks, and knowledge sharing initiatives.
• Provide support and mentorship to junior engineers when necessary.
• Minimum of 5 years of experience in Site Reliability Engineering (SRE), DevOps, or infrastructure engineering.
• Extensive experience in operating production systems at scale.
• Hands-on experience with:
• - Kubernetes (deployment, troubleshooting, operations)
• - Terraform (infrastructure as code)
• - Linux systems and networking fundamentals.
• Experience with at least one cloud provider (GCP preferred, with AWS, Azure, OCI as alternatives).
• Familiarity with observability tools (Prometheus, Grafana, Loki, or similar).
• Knowledge of CI/CD systems and GitOps workflows (e.g., ArgoCD).
• Strong scripting or programming abilities (Go, Python, or similar).
• Experience in distributed systems or high-availability environments.
• Excellent debugging and problem-solving skills under pressure.
• Strong communication skills and the ability to collaborate across teams (minimum English B2 level).
• Paid vacation and sick leave.
• Well-being program.
• Mental health care program.
• Educational compensation, including for foreign language and professional development courses.
• Equipment and co-working reimbursement program.
• Opportunities for overseas conferences and community immersion.
1inch
RTB House
Ant-Tech
Binance
Get handpicked remote jobs straight to your inbox weekly.