Customer Reliability Engineer (Kubernetes)
Job description
TL;DR
Customer Reliability Engineer (Kubernetes): Operating, monitoring, and maintaining managed Airflow services, with an emphasis on cloud infrastructure reliability and Kubernetes cluster management. The role focuses on troubleshooting complex customer environments, building automation for operational efficiency, and delivering white-glove guidance to ensure successful product adoption.
Location: Must be based in the United States
Salary: $125,000 - $130,000
Company
is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow, empowering data teams to bring mission-critical software, analytics, and AI to life.
What you will do
- Provide solutions to customers to ensure successful product usage and meet SLAs.
- Troubleshoot customer environments and engage in active triaging of issues.
- Build and maintain monitoring, alerting, and automation systems for operational efficiency.
- Participate in on-call rotation for weekend coverage.
- Direct product architecture and provide feedback to development teams based on customer pain points.
- Enhance customer documentation and provide white-glove guidance on the path to production.
Requirements
- Must be based in the United States
- 5 years of experience with large, complex cloud infrastructures operating at scale.
- 3 years of experience with Kubernetes.
- Experience managing production distributed systems with AWS, GCP, or Azure.
- Strong Linux experience and Python scripting skills.
- Previous experience handling internal or external customer issues.
- Strong communication and troubleshooting skills.
Nice to have
- Experience as a Site Reliability Engineer.
- Worked with Kubernetes Custom Resources.
- Depth of knowledge with Azure.
- Airflow or Big Data Orchestration experience.
- IaC experience.
Culture & Benefits
- Comprehensive benefits package including equity.
- Opportunity to work with the latest technology and multi-cloud implementations.
- Fully distributed team environment.
- Equal opportunity employer valuing diversity.