Site Reliability Engineer (Kubernetes)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Site Reliability Engineer (Kubernetes/GCP): Building and improving observability, alerting, and incident management frameworks to ensure service reliability for a global travel app with an accent on automation and SLOs. Focus on developing tooling in Go and Python, optimizing Kubernetes infrastructure, and leading high-severity incident coordination.
Location: Hybrid in Paris, France (2-3 days at the office)
Company
is the world’s leading community-based travel app enabling millions of members to carpool or travel by bus across 21 countries.
What you will do
- Create and improve observability and alerting tools, leveraging AI to eliminate toil and streamline daily tasks.
- Own the SLO framework and assist in designing SLIs to ensure high service reliability.
- Manage the incident process, defining standards and serving as Incident Commander during high-severity incidents.
- Develop automation tools, such as Terraform modules or Go applications, to enhance reliability.
- Promote operational metrics and post-mortem analysis to drive continuous distributed improvement.
Requirements
- 1 to 5 years of experience in SRE, DevOps, or Software Engineering roles.
- Strong knowledge of observability tools (e.g., Datadog) and understanding of metrics, logging, and tracing.
- Experience diagnosing and resolving technical issues in production environments (Kubernetes experience is a plus).
- Full working proficiency in English.
- Must be based in or able to work hybridly from Paris, France.
Nice to have
- Familiarity with Grafana IRM and integrating OpenTelemetry.
- Experience working with SLOs/SLIs and programming in Go.
- Knowledge of object-oriented programming, scripting languages, and web/mobile testing tools.
Culture & Benefits
- Hybrid work model with 2-3 days in the office.
- Extended maternity/paternity leave (4 additional weeks).
- 50% healthcare coverage (Alan) and local meal plan (Swile).
- Financial support for home office equipment and 50% transportation reimbursement.
- Employee Stock Ownership Plan (ESOP) and free unlimited carpooling and bus rides.
Hiring process
- Screening video-call with Talent Acquisition.
- Technical discussion with the Hiring Manager.
- System design interview with team members.
- Final strategy and vision call with the Head of Foundations.
- Note: One of these interviews will be conducted onsite.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →