Назад
Company hidden
6 дней назад

Staff Software Engineer - Grafana Cloud K6 (DevOps/SRE)

174 986 - 209 983$
Формат работы
remote (только USA)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff Software Engineer (DevOps/SRE): Establishing and scaling a cross-team culture of engineering excellence by setting standards and guiding the adoption of strong DevOps/SRE practices that improve reliability, availability, and operational ownership. Focus on mature DevOps/SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.

Location: Must be based in United States time zones

Salary: $174,986 - $209,983

Company

hirify.global is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe.

What you will do

  • Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.
  • Drive mature DevOps/SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
  • Establish reliability frameworks such as SLIs/SLOs and error budgets, and use them to guide prioritization and engineering trade-offs.
  • Provide visibility into system health through clear operational metrics and reliability reporting.
  • Guide teams in the design, development, evolution, and operation of large-scale, distributed cloud systems.
  • Influence product and system direction through design reviews, architectural discussions, and cross-team collaboration.

Requirements

  • Strong experience with DevOps/SRE practices, including operating and evolving production systems at scale
  • Strong programming background in a modern language (Python and Go are primary, but prior experience is not required)
  • Experience designing, building, and operating large-scale distributed systems
  • Strong understanding of reliability engineering concepts (e.g. incident management, observability, and failure modes)
  • Experience with test automation, including performance and functional testing
  • Ability to influence engineering practices through clear technical communication, reviews, and collaboration

Nice to have

  • Experience with containerized and cloud-native systems (Docker, Kubernetes, AWS)
  • Familiarity with observability tooling and platforms (e.g. the Grafana stack)
  • Experience working with Python, Go, JavaScript and/or Jsonnet
  • Experience building or operating event-driven or asynchronous systems
  • Experience defining or applying SLIs/SLOs, error budgets, or reliability metrics

Culture & Benefits

  • 100% Remote, Global Culture - As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
  • Transparent Communication – Expect open decision-making and regular company-wide updates.
  • Innovation-Driven – Autonomy and support to ship great work and try new things.
  • Open Source Roots – Built on community-driven values that shape how we work.
  • Career Growth Pathways – Defined opportunities to grow and develop your career.
  • Balance is Key - We operate a global annual leave policy of 30 days per annum.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...