Senior Site Reliability Engineer (SRE)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Site Reliability Engineer (SRE): Designing, building, and maintaining scalable and reliable cloud infrastructure in AWS with an accent on observability solutions and incident management. Focus on improving detection capabilities, response processes, and reliability metrics.
Location: Must be based within commuting distance of 's hubs, specifically Barcelona. Relocation assistance is available from anywhere in the world.
Company
(formerly Travel) is the intelligent platform for travel and spend management, automating everything from travel bookings to expenses and invoice processing.
What you will do
- Design, build, and maintain scalable and reliable cloud infrastructure in AWS.
- Define and implement observability strategy across the organization, establishing consistent standards, tooling, and practices.
- Collaborate with engineering teams to establish shared infrastructure patterns and standards, ensuring systems are well-observed, reliable, and scalable.
- Improve incident management maturity by improving detection capabilities, response processes, and reliability metrics.
- Participate in the on-call rotation and resolve production issues.
Requirements
- Experience with AWS services such as ECS, S3, RDS, Lambda, CloudFront, etc.
- Experience designing and implementing observability solutions (metrics, logs, traces, APM) at scale.
- Experience with Docker, ECS, or similar containerization technologies.
- Experience using languages such as Python, NodeJS.
- Strong experience with Terraform, including module development, and managing multi-account deployments.
- Track record of improving reliability metrics (MTTD, MTTR, SLOs), and incident management processes.
Nice to have
- Experience with Datadog, CloudWatch.
- Experience with serverless architectures and modern CI/CD tooling (GitHub Actions, etc.).
- Experience working in large engineering organizations and influencing others to solve complex problems.
- Experience mentoring engineers or contributing to engineering culture and practices.
- Excellent problem-solving, written, and verbal communication skills.
Culture & Benefits
- IRL-first approach to work, with the team working together in-person 3 days a week.
- English is the official language at the office.
- is a global company with a diverse customer base, and we want to make sure the people behind our product reflect that.
- Equal opportunity employer.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →