
Principal Software Engineer
Posted 10 hours ago

Posted 10 hours ago
This is a fully remote position, open to applicants in California.
• Direct and collaborate with the team in coding initiatives.
• Drive a cultural and technical transformation to treat reliability as a key product feature.
• Transition the organization from reactive "operations" tasks to developing resilient platforms and self-healing systems.
• Exhibit advanced Incident Commander abilities while not being required to participate in the daily on-call schedule, intervening during critical outages to maintain composure and clarity, and leveraging those experiences to design systems that prevent similar incidents in the future.
• Outline the "Golden Paths" for our Cloud migration, ensuring that as Docusign expands globally, our architecture remains "Multi-Active" and resistant to regional cloud disruptions.
• Challenge existing norms, guiding Senior and Staff SREs to adopt a software architect mindset.
• Promote "Error Budgets" that carry significant weight, steering product roadmaps to emphasize long-term stability.
• Over 15 years of experience in large-scale distributed systems, software engineering, or infrastructure roles, with a proven history of influencing system architecture.
• Background as a software engineer, showcasing deep expertise in Go or Python, with a "code-first" mentality and a zeal for developing production-quality automation services in collaboration with the engineering team.
• Demonstrated technical leadership experience in constructing global, active-active distributed systems at hyperscale, effectively serving as both an architect and an engineering collaborator.
• Mastery of production-hardened Kubernetes and Terraform to oversee complex, multi-tenant cloud environments.
• Experience serving as the primary Lead Incident Commander during tier-0 global outages, adept at converting operational turmoil into actionable technical resolutions.
• Proven experience in defining "Developer Experience" strategies and contributing to Internal Developer Platforms (IDPs) that integrate resilience and infrastructure abstractions into developer workflows.
• Paid Time Off: accrued time off, along with paid company holidays based on your region.
• Paid Parental Leave: take up to six months off with your child following birth, adoption, or foster care placement.
• Full Health Benefits Plans: options for 100% employer-paid and minimal employee contribution health plans available from day one of employment.
• Retirement Plans: choose from retirement and pension programs with potential employer contributions.
• Learning and Development: access to coaching, online courses, and education reimbursements.
• Compassionate Care Leave: paid time off following the loss of a loved one and other significant life events.
VPS
Tango
Influur
Salesloft
Get handpicked remote jobs straight to your inbox weekly.