
Senior Software Engineer β Grafana Databases, Managed Services
Posted May 19

Posted May 19
This is a fully remote position, open to applicants in Spain.
β’ Managing and advancing over 100 multi-cloud streaming clusters along with their associated database infrastructure.
β’ Identifying and resolving cross-layer failure issues (such as object storage latency, noisy neighbors, control-plane bottlenecks, and query performance regressions).
β’ Crafting secure upgrade and rollout strategies at scale.
β’ Enhancing observability, automation, and operational efficiency.
β’ Collaborating closely with database and platform teams to guarantee safe scaling, partitioning, consumer fan-out, and query performance.
β’ Engaging directly with the behavior of distributed systems, Kubernetes scheduling dynamics, storage engines, and compression trade-offs.
β’ Acting as a primary escalation point and being on-call for pertinent incidents.
β’ Managing relationships with all system vendors, including WarpStream Labs and others.
β’ A minimum of 6 years of engineering experience, including significant exposure to SRE, platform engineering, production engineering, infrastructure engineering, or roles involving distributed systems.
β’ Experience in operating distributed systems in a production environment (for instance, streaming systems, analytical databases, or large-scale storage backends). Examples include Kafka, Redpanda, WarpStream, Postgres, ClickHouse, Snowflake, or Cassandra.
β’ Strong expertise in Kubernetes within AWS, GCP, or Azure, along with familiarity with infrastructure-as-code tools (such as Helm, Terraform, Jsonnet, etc.).
β’ A solid grasp of distributed systems design and the trade-offs associated with large-scale systems.
β’ Proficiency in at least one programming language (Go is preferred but not mandatory).
β’ Working knowledge of Linux internals, networking, cloud storage, and performance/scaling behaviors.
β’ Experience in participating in blameless incident response and composing high-quality post-incident reviews.
β’ A clear communicator capable of collaborating across teams and working independently.
β’ Inquisitive, pragmatic, action-oriented, and kind (this is essential!).
β’ All positions come with Restricted Stock Units (RSUs), providing every team member with a stake in Grafana Labs' success.
β’ We have a global annual leave policy of 30 days per year.
β’ 3 days of your annual leave allocation are designated for Grafana Shutdown Days, allowing the team to fully disconnect.
β’ Opportunities for professional development.
Webedia
TechBiz Global
The Flex
Nodeworthy
Get handpicked remote jobs straight to your inbox weekly.