Site Reliability Engineer (Kubernetes)
Job description
TL;DR
Site Reliability Engineer (Kubernetes): Build and optimize scalable infrastructure for a global process automation platform, with an emphasis on reliability, availability, and distributed systems. The role focuses on implementing Infrastructure as Code (IaC), automating delivery pipelines, and ensuring system performance through advanced observability practices.
Location: Must be based in or able to work from the office in Johannesburg, South Africa (Hybrid role).
Company
is a global standard for process intelligence and automation, trusted by over 10,000 organizations to accelerate digital transformation.
What you will do
- Design and build complex infrastructure components for distributed systems using Kubernetes.
- Manage infrastructure using IaC tools like Terraform and automate workflows with GitHub Actions.
- Monitor platform performance and build alerting systems using Prometheus, Grafana, and PagerDuty.
- Debug and resolve infrastructure issues in production environments and prevent their recurrence.
- Lead post-mortems and root cause analysis for incidents to improve system reliability.
- Mentor other engineers and contribute to DevOps and SRE best practices within the team.
Requirements
- Must be based in Johannesburg, South Africa for hybrid office attendance.
- Extensive experience with Kubernetes and managing distributed systems.
- Strong proficiency in Infrastructure as Code (IaC) and configuration management.
- Proven ability to debug production issues across all levels of the stack.
- Experience with monitoring and observability tools like Prometheus and Grafana.
- Ability to lead projects of high complexity and ambiguous scope.
Culture & Benefits
- Hybrid working model emphasizing flexibility and collaboration.
- Paid parental leave and flexible paid time off policy.
- Employee wellness programs and counseling resources.
- Global community with opportunities for intercultural learning.
- Paid volunteer time and community impact initiatives.