Staff Software Engineer (Databases, Managed Services)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff Software Engineer (Grafana Databases, Managed Services): Operating and evolving shared production-critical infrastructure powering Grafana Cloud’s database products (Mimir, Loki, Tempo) with an accent on high-throughput multi-cloud streaming clusters (WarpStream) and analytical/storage systems. Focus on diagnosing cross-layer failure modes, designing safe upgrade strategies, improving observability and automation, and leading reliability initiatives at massive scale.
Location: Remote (Spain time zones only)
Salary: EUR 94,025 - 112,830
Company
is a remote-first open-source company helping thousands of enterprises manage observability with Grafana Cloud and Enterprise Stack.
What you will do
- Operate and evolve 100+ multi-cloud WarpStream clusters and related database infrastructure for metrics, logs, and traces ingestion.
- Diagnose and eliminate failure modes like object storage latency, noisy neighbors, and query regressions.
- Design safe upgrade, rollout, and migration strategies at scale.
- Improve observability, automation, and operational ergonomics.
- Partner with database and platform teams on scaling, partitioning, and performance.
- Serve as escalation point, handle on-call, and own vendor relationships.
- Lead technical direction, mentor engineers, and drive SLOs and best practices.
Requirements
- Located in Spain time zones
- 8+ years engineering experience in SRE, platform, production, infrastructure, or distributed systems.
- Experience with high-throughput streaming (Kafka, Redpanda, WarpStream), analytical/storage backends (Postgres, ClickHouse), or large-scale databases.
- Strong Kubernetes in AWS/GCP/Azure; IaC (Helm, Terraform, Jsonnet).
- Proficiency in systems language (Go preferred); Linux internals, networking, cloud storage.
- Experience leading complex efforts, incident response, post-mortems.
- Clear communication for remote collaboration across regions.
Culture & Benefits
- 100% remote global culture with high trust, autonomy, and transparent communication.
- RSUs for all roles, equity ownership, and performance bonus.
- 30 days annual leave plus 3 Grafana Shutdown Days; comply with local laws.
- AI coding assistants with company budget; access to frontier models.
- Balanced on-call aligned to 12 daylight hours; in-person onboarding.
- Career growth, innovation-driven environment, open-source roots.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →