TL;DR
Senior Software Engineer (SRE, Grafana Databases): Ensuring exceptional reliability for Grafana Cloud's database products (Mimir, Loki, Tempo, Pyroscope) with an accent on owning production reliability for high-SLA customers and designing automation to scale reliability practices. Focus on proactively reducing SLO burn, leading incident response, and influencing feature design for scalability and operability.
Location: Remote from Spain, Germany, the UK, or Sweden
Salary: EUR 97,034 - EUR 116,441
Company
hirify.global is a remote-first, open-source company providing a visualization tool and observability solutions with the Grafana LGTM Stack.
What you will do
- Partner with product engineering squads to ensure production reliability for high-SLA customer environments.
- Design and implement automation to scale reliability practices and meet SLO targets.
- Proactively reduce SLO burn and lead customer-impacting incident response.
- Improve alert quality, eliminate toil, and contribute to design and code reviews.
- Influence feature design to ensure production scalability and operability.
- Participate in on-call rotation with global counterparts.
Requirements
- 6+ years of engineering experience, with 3+ years in SRE/CRE/production engineering.
- Strong Kubernetes experience in AWS, GCP, or Azure, and familiarity with infrastructure-as-code (Helm, Terraform, Jsonnet).
- Experience operating multi-tenant systems in production and designing/implementing SLOs.
- Proficiency in one or more programming languages (Go, Python, Java).
- Knowledge of Linux internals, networking, cloud storage, and scaling.
- Excellent problem-solving and troubleshooting skills, including incident response and PIRs.
- Ability to reason about performance, scaling, and failure modes.
Culture & Benefits
- 100% remote, global culture with a focus on collaboration and shared purpose.
- Opportunities to tackle meaningful work in a high-growth, innovation-driven environment.
- Transparent communication, empowered teams, and approachable leadership.
- 30 days of annual leave per annum, with 3 days reserved for Grafana Shutdown Days.
- Access to modern AI coding assistants with a company-funded usage budget.
- In-person onboarding for new team members.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →