Назад
Company hidden
обновлено 4 дня назад

Senior Site Reliability Engineer

Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior Site Reliability Engineer (Gamedev): Architecting and evolving the enterprise-wide observability platform to provide visibility into infrastructure and application performance with an accent on defining and implementing observability standards and best practices across the SDLC. Focus on proactively identifying and resolving performance issues and integrating industry-leading monitoring and telemetry tools.

Location: Onsite in Austin, Texas, United States

Company

hirify.global is a global video game company, publishing titles developed by some of the most influential game development studios in the world.

What you will do

  • Architect, develop, and evolve our enterprise-wide observability platform to provide deep visibility into infrastructure and application performance.
  • Design and implement monitoring solutions leveraging modern metrics and visualization technologies, with support for additional platform integrations.
  • Collaborate with application and infrastructure teams to define and implement observability standards and best practices across the software development lifecycle (SDLC).
  • Drive cost optimization initiatives around monitoring and logging, balancing data retention, performance, and value.
  • Create automation and alerting processes to proactively identify and resolve performance issues before they impact business operations.
  • Evaluate emerging technologies to evolve 2K’s observability strategy.

Requirements

  • 5+ years of professional experience in information technology, including 3+ years specializing in observability, monitoring, or SRE engineering.
  • Deep knowledge of monitoring toolsets such as Prometheus, Grafana, ELK, Splunk, Dynatrace, Datadog, or equivalent.
  • Proficiency in Python for automation and tool development.
  • Hands-on experience with Kubernetes, Docker, and cloud platforms (AWS, GCP, or Azure).
  • Strong understanding of networking, infrastructure, and performance optimization.

Nice to have

  • Experience building an Observability practice from the ground up.
  • Experience with developing software for highly scalable/distributed systems
  • Experience in gaming or similar industries, combining large-scale internet-facing systems with software development and entertainment services culture

Culture & Benefits

  • Inclusive work environment.
  • Dedicated to diversity and inclusion.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...