Senior Software Engineer - Grafana Databases (Distributed Systems)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Software Engineer (Distributed Systems): Operating and evolving production-critical streaming and database infrastructure for Grafana Cloud with an accent on reliability, scalability, and multi-cloud operational excellence. Focus on diagnosing cross-layer failure modes, designing safe rollout strategies, and optimizing high-throughput distributed systems.
Location: Remote (Must be based in German time zones)
Salary: EUR 97,034 - 116,440
Company
is the company behind the open observability cloud, focused on open source, open standards, and scalable managed platforms.
What you will do
- Operate and evolve 100+ multi-cloud streaming clusters (WarpStream) and related database infrastructure.
- Diagnose and eliminate cross-layer failure modes, including object storage latency and control-plane bottlenecks.
- Design and execute safe upgrade and rollout strategies for production systems at scale.
- Improve observability, automation, and operational ergonomics for the Managed Services squad.
- Collaborate with database and platform teams to optimize scaling, partitioning, and query performance.
- Act as a primary escalation point and manage on-call responsibilities for critical incidents.
Requirements
- 6+ years of engineering experience in SRE, platform engineering, or distributed systems roles.
- Proven experience operating distributed systems in production (e.g., Kafka, ClickHouse, Cassandra, or similar).
- Strong Kubernetes expertise across AWS, GCP, or Azure.
- Proficiency with infrastructure-as-code tooling such as Helm, Terraform, or Jsonnet.
- Deep understanding of Linux internals, networking, and cloud storage scaling behavior.
- Must be living in German time zones.
Nice to have
- Proficiency in Go programming language.
- Experience using modern AI coding assistants in a professional workflow.
Culture & Benefits
- 100% remote-first global culture with high autonomy and transparency.
- Equity ownership through Restricted Stock Units (RSUs).
- Global annual leave policy of 30 days, including 3 mandatory company shutdown days.
- Company-funded budget for frontier AI models and coding assistants.
- In-person onboarding process for new hires.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →