TL;DR
Site Reliability Engineer (ELK): Maintaining and improving monitoring and observability infrastructure for RingCentral’s global platform with an accent on cloud operations and monitoring technologies. Focus on automating deployments, enhancing system resilience, and collaborating with engineering teams.
- Location: On-site presence required 4 days a week
Company
hirify.global is focused on providing innovative cloud solutions.
What you will do
- Maintain the availability, reliability, and scalability of the global monitoring and logging infrastructure.
- Integrate and evolve observability tools (ELK, Grafana, ClickHouse, VictoriaMetrics, Prometheus).
- Develop automation and deployment pipelines for Kubernetes-based monitoring components.
- Collaborate with engineering teams to embed observability and alerting into the development lifecycle.
- Participate in global incident response and on-call rotation.
Requirements
- 2+ years of experience as an SRE, Systems Engineer, or DevOps Engineer in production environments.
- Strong Kubernetes experience.
- Hands-on experience with ELK or similar log pipelines.
- Proficiency in Python or Go for automation.
- Intermediate English.
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →