TL;DR
Lead Engineer L3: Maintaining platform availability and resolving infrastructure incidents with an accent on monitoring, alerting, and automation. Focus on analyzing problems, implementing systematic solutions, and modernizing the infrastructure.
Локация: Москва, возможен гибридный формат работы.
Что делать
- Ensure platform availability and promptly resolve infrastructure incidents.
- Ensure platform updates with comprehensive testing and rollback capabilities.
- Organize effective infrastructure and application monitoring and alerting.
- Actively participate in analyzing problems and implementing systemic solutions.
- Plan and execute change requests and scheduled maintenance to prevent service degradation.
- Participate in deploying new platform instances.
Требования
- Deep knowledge of the OSI network model and TCP/IP stack.
- Expert-level administration of Unix-like operating systems.
- Experience in writing automation scripts (Bash, Python).
- Experience in automating deployment and infrastructure management, including cloud (Ansible, Terraform).
- Understanding of virtualization.
Культура и преимущества
- Employment in accordance with the labor laws of the Russian Federation.
- Competitive income: salary + annual bonus.
- Comprehensive health insurance with dental.
- Sports compensation.
- In-house therapist and psychologist.
- Office or hybrid work format, shortened workday on Fridays.
- Great office in Moscow and coworking spaces in various cities of Russia.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →