TL;DR
Senior Sre Engineer (Devops): Designing and maintaining a resilient, scalable, and highly available infrastructure with an accent on monitoring, alerting, logging, and observability practices. Focus on conducting root cause analysis (RCA) and implementing improvements based on the analysis.
Локация: Москва
Компания
Navio is a developer of autonomous driving technology compatible with various types of transport, from cars to trucks.
Что делать
- Design and maintain a resilient, scalable, and highly available infrastructure.
- Ensure high availability and fault tolerance of services.
- Implement and develop monitoring, alerting, logging, and observability practices using VictoriaMetrics, Grafana, and other tools.
- Ensure full system observability by organizing the collection of metrics, logs, and traces.
- Define, implement, and maintain SLI/SLO, conduct root cause analysis (RCA), and postmortem meetings.
- Actively use the "Infrastructure as Code" approach (Terraform, Ansible) in daily work.
Требования
- Deep understanding of SRE principles and reliability culture.
- Proven experience in designing and supporting highly available, fault-tolerant systems capable of withstanding high loads.
- Expert knowledge in Linux, monitoring, logging, alerting, and data visualization (experience with Prometheus, Grafana, ELK Stack, and similar tools).
- Confident command of Kubernetes, CI/CD tools, and Infrastructure as Code tools (Terraform, Ansible).
- Experience with SLO/SLI, conducting RCA, and writing high-quality postmortem reports.
- Developed mentoring and technical leadership skills, ability to share knowledge and inspire colleagues.
Хорошо, если есть
- Experience with cloud platforms (AWS, GCP, Azure).
Культура и преимущества
- Work in an accredited IT company.
- Work in a team of top developers, with the opportunity to develop unique and large-scale projects.
- Competitive working conditions (white indexed salary, salary + annual bonus).
- Standard working hours, but with a flexible approach to the start/end of the working day.
- DMS for employees from the first day (+dentistry after the trial period) and a preferential medical insurance program for relatives.
- Subsidies on mortgages.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →