
Cluster & Systems Capacity Engineer

Backblaze | United States | Full-time | Systems Engineer | Mid-level, Senior | $123k – $175k/year

Posted 3 days ago

This is a fully remote position, open to applicants in the United States.

📋 Description

• Create and sustain forecasts for short, medium, and long-term capacity demand and hardware deployment across storage, compute, and network areas within the platform.

• Develop predictive models that convert business demand signals into infrastructure needs by utilizing historical usage, growth patterns, product sales strategies, hardware lifecycle roadmaps, and other essential business inputs.

• Collaborate with Infrastructure, Production, and Network Engineering teams to synchronize capacity planning with system design and scaling efforts.

• Design and automate forecasting pipelines, simulation calculators, tools, and capacity dashboards to enhance data quality, minimize manual analysis, and offer stakeholders clear insights into platform usage and cluster health metrics.

• Track and assess cluster and system-level utilization and performance across CPU, memory, IOPS, and network resources.

• Modify deployment plans and configuration recommendations in real time to ensure sufficient headroom and system stability, supporting the delivery of a world-class customer experience.

• Work alongside service and platform owners to create headroom and live buffer policies, optimize hardware BoMs, utilize virtualized orchestration, and reduce product costs.

• Collaborate closely with Operations and Finance colleagues to align capacity plans and hardware needs with capital budgets, cost objectives, and financial results.

• Assist in strategic optimization initiatives related to infrastructure investments, engineering development, and operational processes, contributing to long-term infrastructure strategy and capital planning.

• Lead initiatives to assess, procure, and provision requests for new or additional hardware, working with Systems and Network Engineering, SRE, NOC, and Data Center Operations teams to identify and deliver the best solutions.

• Maintain alignment with Product and Sales to facilitate customer onboarding, growth, and demand fluctuations.

• Clearly communicate complex capacity and infrastructure insights to both technical and non-technical stakeholders.


⛳️ Requirements

• Bachelor's degree in Computer Science, Engineering, Mathematics, Data Science, Information Systems, Statistics, or a related technical field (or equivalent experience).

• 3-6+ years of experience in Site Reliability Engineering, Infrastructure Capacity Planning, Systems/Infrastructure Engineering, Production Engineering, Data Center Operations, or a similar Cloud Operations role.

• Familiarity and experience with Cloud Storage infrastructure, especially highly available, large-scale distributed systems that manage substantial data volumes with high throughput and intricate performance demands.

• Background in capacity modeling, performance analysis, scenario modeling, and/or infrastructure cost optimization, with the capability to quantify concepts within financial frameworks and forecasts.

• Proficiency in database and data analysis tools (preferably Snowflake, Metabase, Grafana, Python, SQL, Prometheus, VictoriaMetrics, and Excel/Google Sheets).

• Proven ability in deep, creative, and logical thinking complemented by a robust data analysis skill set.

• Excellent communication and documentation abilities, with the capability to convey knowledge and explain concepts accurately and succinctly.

• A keen desire to work in a highly autonomous team that is committed to quality, cost management, and customer experience.


🏝️ Benefits

• Healthcare coverage for the family, including dental and vision.

• Competitive salary and 401(k) plan.

• RSU grants for full-time employees.

• Employee Stock Purchase Plan (ESPP).

• Flexible vacation policy.

• Maternity and paternity leave.

• A MacBook Pro for work, along with a generous stipend to customize your workstation.

• Childcare bonus (for human children only).

• Support for fertility treatments.

• Learning and development program.

• Commuter benefits.

• A culture that promotes a healthy work-life balance.
