Chaos Engineer (AI)
Job description
TL;DR
Chaos Engineer (DevOps/SRE): Design and run chaos experiments to proactively identify reliability gaps in the platform, with an emphasis on AI-native automation and fault injection. The role focuses on debugging complex multi-system failures across Kubernetes and cloud infrastructure to improve system resilience.
Location: Hybrid (Ottawa, Ontario) — Must be able to work onsite two days per week (Tuesday & Wednesday)
Salary: $90,000 – $120,000
Company
A leading platform for the enterprise AI era, providing real-time, event-driven data infrastructure for major global enterprises.
What you will do
- Design and implement chaos experiments using Chaos Mesh and AWS FIS to surface reliability gaps.
- Leverage AI tools to enhance experiment design, automate analysis, and accelerate root cause investigations.
- Debug complex failures across cloud infrastructure, Kubernetes, and application layers.
- Collaborate with R&D squads to share findings and embed resilience thinking into the development lifecycle.
- Define and track reliability metrics to translate results into concrete system improvements.
- Expand the chaos experiment library by staying current with cloud-native failure patterns and industry best practices.
Requirements
- 3+ years of experience in chaos engineering, site reliability engineering (SRE), or a related discipline.
- Proven experience with Chaos Mesh and cloud-provider fault injection (AWS FIS, Azure Chaos Studio, or GCP).
- Solid understanding of cloud environments (AWS, Azure, or GCP) and meaningful Kubernetes experience.
- Strong scripting and automation skills in Python, Bash, or similar languages.
- Familiarity with observability tooling for metrics, logs, and traces.
- Must be based in Ottawa, Ontario, and available to work onsite Tuesday & Wednesday.
Nice to have
- Experience with messaging systems or event streaming technologies.
Culture & Benefits
- Hybrid-first work model providing flexibility and inclusivity.
- Collaborative environment working alongside top-tier industry experts.
- Dedicated training programs designed for rapid professional growth.
- Values-driven culture centered on craftsmanship, trust, and human experience.
- Focus on work-life balance to ensure work fits around your life.