Site Reliability Engineer III (SRE III)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Site Reliability Engineer III (SRE III) (AWS/Kubernetes): Design, automate, and operate reliable cloud infrastructure and platform services for high-availability SaaS systems with an accent on scalability, observability, and resilience. Focus on developing IaC with Terraform, container orchestration, self-healing systems, and leading cross-functional incident resolution under 24/7 operational demands.
Hybrid in Toronto
Company
Leader in AI-powered travel and expense solutions helping organizations modernize financial operations and optimize spend across 12M+ users in 120 countries.
What you will do
- Proactively monitor, troubleshoot, and optimize services for 24/7 availability, low latency, and high uptime.
- Design and automate cloud infrastructure using IaC (Terraform), scripting (Python/Bash), and tools for operational efficiency.
- Collaborate with engineering teams on project planning, defining NFRs, and aligning reliability with product roadmaps.
- Lead cross-functional issue resolution, root cause analysis, postmortems, and continuous improvement initiatives.
- Mentor junior SREs, participate in architecture reviews, and promote reliability best practices across distributed teams.
Requirements
- Bachelor’s degree in Computer Science or STEM field.
- Minimum 6 years in engineering/operations focused on reliability, scalability, and automation.
- Strong proficiency in Linux distributed environments (up to 70% hands-on).
- Deep experience with AWS or Azure, Terraform IaC, Docker, Kubernetes (including Karpenter/KEDA).
- Scripting (Python, Bash, PowerShell), DevOps principles, CI/CD, observability (Prometheus, Grafana, OpenTelemetry).
- Excellent English communication, project management, and collaboration with global/offshore teams.
Nice to have
- Certified Kubernetes Administrator (CKA) and/or AWS Certification.
- Object-oriented programming experience.
- Background in SaaS or large-scale distributed applications.
Culture & Benefits
- Competitive pay and flexible hybrid work in an inclusive, collaborative environment.
- Work with bright minds in finance, tech, and AI solving real-world challenges.
- Drive efficiency, innovation, and smarter financial decisions for global businesses.
- Support for career growth, operational excellence, and continuous learning culture.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →