Staff Software Engineer (Grafana Databases, Managed Services)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff Software Engineer (Grafana Databases, Managed Services): Operating and evolving shared production-critical infrastructure powering Grafana Cloud’s database products (Mimir, Loki, Tempo) with an accent on high-throughput multi-cloud streaming clusters and analytical/storage systems. Focus on diagnosing cross-layer failure modes, designing safe upgrade strategies, improving observability and automation, and leading reliability initiatives at massive scale.
Location: Remote from Ireland time zones only
Salary: EUR 104,000 - 124,800
Company
Remote-first open-source company building observability solutions with Grafana Cloud and Enterprise Stack for metrics, logs, and traces.
What you will do
- Operate and evolve 100+ multi-cloud WarpStream clusters and related database infrastructure.
- Diagnose and eliminate cross-layer failure modes like object storage latency and query regressions.
- Design safe upgrade, rollout, and migration strategies at scale.
- Improve observability, automation, and operational ergonomics.
- Partner with database and platform teams on scaling, partitioning, and performance.
- Serve as escalation point, handle on-call, and own vendor relationships.
- Define technical direction, lead initiatives, establish SLOs, and mentor engineers.
Requirements
- Located in Ireland time zones.
- 8+ years engineering experience in SRE, platform, production, infrastructure, or distributed systems.
- Experience with high-throughput streaming systems (Kafka, Redpanda, WarpStream), analytical/storage backends, or large-scale databases.
- Strong Kubernetes in AWS/GCP/Azure; IaC (Helm, Terraform, Jsonnet).
- Proficiency in systems language (Go preferred); Linux internals, networking, cloud storage.
- Experience leading technical efforts, incident response, post-mortems.
- Clear communication for remote collaboration across regions.
Culture & Benefits
- 100% remote global culture with high trust, autonomy, and transparent communication.
- RSUs for all roles, equity ownership, and balanced on-call (12 daylight hours).
- 30 days annual leave plus 3 shutdown days; comply with local legislation.
- AI coding assistants with company budget; access to frontier models.
- In-person onboarding; career growth pathways and approachable leadership.
- Open-source roots, innovation-driven environment, empowered teams.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →