
Staff Operations Engineer
Posted 2 days ago

Posted 2 days ago
This is a fully remote position, open to applicants in Canada.
• Take ownership of and enhance architecture within a designated infrastructure domain.
• Design and deploy scalable, dependable systems that span multiple teams or environments.
• Establish and advocate for best practices, patterns, and standards within the domain.
• Lead the execution of ambiguous and high-impact infrastructure projects.
• Decompose complex system issues into actionable solutions.
• Drive migrations, re-architectures, and enhancements in performance and reliability.
• Stay hands-on with critical systems and implementations.
• Collaborate across teams (IT, SRE, Security, Service Owners) to unify solutions.
• Influence technical decisions through design reviews and collaborative efforts.
• Ensure seamless integration of systems across various infrastructures (office, data center, cloud).
• Enhance system reliability through effective monitoring, alerting, and operational design.
• Contribute to defining Service Level Indicators (SLIs), Service Level Objectives (SLOs), and capacity planning within the domain.
• Participate in and lead root cause analyses for intricate incidents.
• Minimize operational toil through automation and system enhancements.
• Design and support essential infrastructure components (compute, DNS, networking, identity, etc.).
• Drive advancements in performance, scalability, and reliability.
• Offer deep expertise in at least one area (e.g., DNS, network architecture, cloud infrastructure).
• Build and enhance automation using scripting and Infrastructure as Code methodologies.
• Contribute to the development of internal tools and platform improvements.
• Promote standardized and repeatable approaches to system management.
• Mentor engineers and guide system design and troubleshooting processes.
• Elevate the technical quality of the team through reviews and shared practices.
• Maintain clear documentation, diagrams, and runbooks for owned systems.
• Ensure systems are understandable and operable by others.
• Facilitate knowledge sharing across teams.
• A minimum of 6 years of experience in systems engineering or infrastructure roles.
• Extensive experience in designing and operating production infrastructure.
• Solid expertise in:
• VMware
• Cisco UCS
• Application/Network Load balancers
• Linux/Unix Operating Systems
• Networking fundamentals (DNS, TCP/IP, routing, firewalls)
• Data center environments
• Proven ability to lead complex technical work across teams.
• Preferred Skills:
• Familiarity with Infrastructure as Code.
• Puppet/Ansible/etc.
• Python
• Awareness of observability tooling and reliability practices.
• Experience with containerization and modern platform tooling.
• Exposure to security best practices in infrastructure design.
• Generous performance-based bonus plans for all eligible employees - we share in our success as one team.
• Comprehensive medical, dental, and vision insurance.
• Generous retirement contributions with 100% immediate vesting (regardless of whether you contribute).
• Quarterly all-company wellness days where everyone takes a break together.
• Country-specific holidays plus an additional day off for your birthday.
• One-time stipend for home office setup.
• Annual budget for professional development.
• Quarterly well-being stipend.
• Substantial paid parental leave.
• Employee referral bonus program.
• Additional benefits (life/AD&D, disability, EAP, etc. - varies by country).
EXL
Headspace
Allstate
Sargent & Lundy
Get handpicked remote jobs straight to your inbox weekly.