This is a fully remote position, open to applicants in Australia.

📋 Description

• Establish the infrastructure architecture for the APAC region. This role is focused on design rather than maintenance. You will be responsible for creating and managing the Azure footprint for our APAC operations from start to finish — including compute, networking, storage, identity, containerization, and application hosting — and you will set the foundational patterns for the region. New regional environments for our businesses in the Philippines and India will be operational from day one.

• Manage cross-region connectivity. Take charge of the network paths that connect our cloud environments with local partners and headquarters: including site-to-site VPN tunnels, partner IP allow-listing, edge and CDN custom domains, certificate management, and ensuring DNS and TLS hygiene to maintain healthy customer-facing endpoints.

• Lead migration and hardening initiatives. Oversee infrastructure migrations across multiple clouds while ensuring safety and scalability — which includes transferring sensitive customer data — and take responsibility for the APAC aspect of our PCI-DSS hardening initiative. These are active programs, not mere future plans.

• Direct incident response for the region. In the event of a disruption during APAC hours, you will manage the response — methodically and calmly, addressing all aspects. This includes conducting root cause analysis, performing blameless post-mortems, and implementing durable solutions. The on-call schedule follows the sun: during APAC business hours, not at 3 AM.

• Maintain and enhance the platform over time. Focus on improving reliability, deploying pipeline confidence, enhancing alert quality, and optimizing Azure expenditures. Security and compliance will be integral to our operations — you will collaborate with relevant teams on PCI-DSS scope, vulnerability scanning, least-privilege access, and secret management.

• Elevate the team's standards. Through your code, runbooks, architectural decisions, and collaborative work with our US-hours Platform engineers, your contributions will set the benchmark for infrastructure practices in the region.

• Embrace an AI-first approach. Leverage AI tools — such as coding assistants, observability tools, and infrastructure-as-code helpers — to expedite investigations, reduce manual tasks, and deliver more reliable results. Our Platform team is actively redefining engineering and operations with AI, and we anticipate that you will bring this mindset as well.

⛳️ Requirements

• A minimum of 5 years in Site Reliability Engineering (SRE), DevOps, or infrastructure roles supporting production systems — with a genuine sense of ownership over critical infrastructure, rather than just contributing to it.

• Strong expertise in infrastructure-as-code (Terraform: creating real modules and environments, not merely one-off scripts) and substantial production cloud experience. Azure is our primary cloud environment — if your background is in AWS or GCP, we expect you to adapt quickly. Transferable skills in networking, identity, compute, storage, and observability are more important than years of experience specifically on Azure.

• Practical networking experience: including site-to-site VPN, VNet design and peering, DNS, TLS and certificate management, firewall, and IP allow-listing.

• You make independent decisions and stand by them. We seek someone who recognizes what needs to be done and takes initiative — not someone who defers every decision to others. Own your reasoning and justify your trade-offs.

• Leadership in incident response — you handle incidents with composure, determine root causes, and ensure fixes are lasting and effective.

• Proficiency in scripting and automation using PowerShell, Bash, Python, and the Azure CLI. You should be comfortable enough with coding to create your own tools. Familiarity with C# and .NET is an added advantage.

• Experience with containers and CI/CD: including Docker, GitHub Actions or Azure DevOps, and a consistent practice of automation and tidying as you proceed.

• Awareness of security and compliance: you have worked in a regulated environment and understand PCI-DSS, least-privilege access, and effective secret management. Backgrounds in fintech or financial services are highly valued.

• You actively incorporate AI tools into your workflow — not just experimenting. You should be able to articulate your experiences.

• Located in APAC or capable of reliably working during APAC business hours — this is a fundamental requirement for the role.

🏝️ Benefits

• Virtual-first collaboration: The Tilt team operates across 14 countries, 12 time zones, and growing. You will begin with a work-from-home office reimbursement.

• Competitive salary: We value potential, and this is reflected in our competitive compensation packages and generous equity offerings.

• Comprehensive support: Access flexible health plans across various premium levels, along with substantial subsidies that meet global standards.

• Visibility in the organization: Expect direct exposure to our leadership team — we foster an environment where good ideas are shared quickly.

• Paid global onsite events: Real-life connections are important — we gather twice a year for meals and kayaking adventures. (Past locations include Vail, San Diego, and Mexico City, among others.)

• Recognition of impact: Opportunities for growth are tied to your contributions, rather than rigid promotion timelines.

Senior Site Reliability Engineer

📋 Description

⛳️ Requirements

🏝️ Benefits

People also viewed

DevOps Reliability Engineer

Senior Site Reliability Engineer – Network

Staff Site Reliability Engineer

DevOps Engineer, Mid Level

DevOps Engineer, Azure

DevOps Engineer, mk8s

Never miss a great job!