Senior SRE (AI)
Job description
TL;DR
Senior SRE (DevOps/AI): Build and optimize the reliability and scalability of a global HR platform, with an emphasis on Kubernetes, AWS infrastructure, and AI-native operational workflows. The role centers on designing robust reliability frameworks, implementing agentic AI tools for infrastructure automation, and keeping systems stable in an async-first environment.
Location: Prioritising candidates based in Europe
Salary: $53,300 — $119,850 USD
Company
The company is a global HR platform that enables businesses to recruit, pay, and manage international teams compliantly.
What you will do
- Lead solution discovery and delivery for complex reliability and infrastructure problems with high ambiguity.
- Define and operate reliability practices, including SLOs, SLIs, error budgets, and observability frameworks.
- Develop and operationalize agentic AI workflows and reusable tooling to increase team shipping speed and safety.
- Collaborate with the Security team on platform hardening and threat mitigation.
- Participate in incident response and on-call rotations to maintain system reliability.
- Mentor less-senior engineers and contribute to platform architecture and RFC discussions.
Requirements
- Solid professional experience in SRE, DevOps, or Platform Engineering.
- Hands-on expertise in operating and scaling production Kubernetes clusters and the Docker ecosystem.
- Strong proficiency with AWS and Terraform (Infrastructure-as-Code).
- Competency in Golang and Bash/scripting.
- Practical experience implementing embedded AI and agentic workflows in infra/ops work.
- Ability to communicate clearly and thoughtfully in an async-first, global environment.
Nice to have
- Experience with Elixir, Node.js, or Python.
- Experience configuring Linux systems in non-cloud environments.
- Defensive and offensive security knowledge.
Culture & Benefits
- Fully remote, async-first work culture with flexible working hours.
- Flexible paid time off and 16 weeks of paid parental leave.
- Stock options and a dedicated learning budget.
- Home office budget and IT equipment provided.
- Mental health support services and budget for local in-person social events or co-working spaces.
Hiring process
- Interviews with Recruiter and Hiring Manager.
- Async infrastructure exercise (estimated 2-4 hours).
- Interviews with the Team, a Bar Raiser, and an Executive.
- Offer and background check.