Staff Software Engineer (Go)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff Software Engineer (Go/Python): Building and scaling the Grafana Cloud k6 performance testing product with an accent on operational excellence and SRE practices. Focus on establishing reliability frameworks (SLIs/SLOs), guiding distributed systems architecture, and scaling a culture of engineering excellence across teams.
Location: Must be based in Spain time zones
Salary: EUR 94,025 - 112,830
Company
is a remote-first, open-source powerhouse providing observability solutions through the LGTM Stack used by millions of users and thousands of companies globally.
What you will do
- Define standards for operational excellence and coach teams to own reliability and availability.
- Drive mature DevOps/SRE practices, including incident response, PIRs, on-call readiness, and observability.
- Establish reliability frameworks such as SLIs/SLOs and error budgets to guide engineering trade-offs.
- Guide the design, development, and operation of large-scale distributed cloud systems.
- Influence product and system direction through architectural discussions and design reviews.
- Grow into broader application and product development leadership as the reliability foundation matures.
Requirements
- Strong experience with DevOps/SRE practices and operating production systems at scale.
- Strong programming background in a modern language (Go and Python are primary).
- Experience designing, building, and operating large-scale distributed systems.
- Deep understanding of reliability engineering concepts, including incident management and failure modes.
- Experience with test automation, specifically performance and functional testing.
- Ability to operate autonomously in ambiguous environments with strong technical communication skills.
Nice to have
- Experience with containerized and cloud-native systems (Docker, Kubernetes, AWS).
- Familiarity with the Grafana observability stack.
- Experience with JavaScript or Jsonnet.
- Experience building event-driven or asynchronous systems.
Culture & Benefits
- 100% remote-first global culture with a focus on collaboration and transparency.
- Global annual leave policy of 30 days, including company-wide shutdown days.
- Provision of modern AI coding assistants and access to frontier LLMs (GPT, Claude, Gemini).
- Innovation-driven environment with high autonomy and trust.
- Defined career growth pathways and approachable, transparent leadership.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →