Назад
Company hidden
1 час назад

Customer Reliability Engineer (Kubernetes)

125 000 - 130 000$
Формат работы
remote (только USA)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Customer Reliability Engineer (Kubernetes): Operating, monitoring, and maintaining managed Airflow services with an accent on cloud infrastructure reliability and Kubernetes cluster management. Focus on troubleshooting complex customer environments, building automation for operational efficiency, and delivering white-glove guidance to ensure successful product adoption.

Location: Must be based in the United States

Salary: $125,000 - $130,000

Company

hirify.global is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow, empowering data teams to bring mission-critical software, analytics, and AI to life.

What you will do

  • Provide solutions to customers to ensure successful product usage and meet SLAs.
  • Troubleshoot customer environments and engage in active triaging of issues.
  • Build and maintain monitoring, alerting, and automation systems for operational efficiency.
  • Participate in on-call rotation for weekend coverage.
  • Direct product architecture and provide feedback to development teams based on customer pain points.
  • Enhance customer documentation and provide white-glove guidance on the path to production.

Requirements

  • Must be based in the United States
  • 5 years of experience with large, complex cloud infrastructures operating at scale.
  • 3 years of experience with Kubernetes.
  • Experience managing production distributed systems with AWS, GCP, or Azure.
  • Strong Linux experience and Python scripting skills.
  • Previous experience handling internal or external customer issues.
  • Strong communication and troubleshooting skills.

Nice to have

  • Experience as a Site Reliability Engineer.
  • Worked with Kubernetes Custom Resources.
  • Depth of knowledge with Azure.
  • Airflow or Big Data Orchestration experience.
  • IaC experience.

Culture & Benefits

  • Comprehensive benefits package including equity.
  • Opportunity to work with the latest technology and multi-cloud implementations.
  • Fully distributed team environment.
  • Equal opportunity employer valuing diversity.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →