Company hidden
2 days ago

Site Reliability Engineer III (SRE III)

Work format
hybrid
Employment type
fulltime
Grade
senior
English
b2
Country
Canada
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Site Reliability Engineer III (SRE III) (AWS/Kubernetes): Design, automate, and operate reliable cloud infrastructure and platform services for high-availability SaaS systems, with an emphasis on scalability, observability, and resilience. Focus on developing IaC with Terraform, container orchestration, and self-healing systems, and on leading cross-functional incident resolution under 24/7 operational demands.

Hybrid in Toronto

Company

Leader in AI-powered travel and expense solutions helping organizations modernize financial operations and optimize spend across 12M+ users in 120 countries.

What you will do

  • Proactively monitor, troubleshoot, and optimize services for 24/7 availability, low latency, and high uptime.
  • Design and automate cloud infrastructure using IaC (Terraform), scripting (Python/Bash), and tools for operational efficiency.
  • Collaborate with engineering teams on project planning, defining NFRs, and aligning reliability with product roadmaps.
  • Lead cross-functional issue resolution, root cause analysis, postmortems, and continuous improvement initiatives.
  • Mentor junior SREs, participate in architecture reviews, and promote reliability best practices across distributed teams.

Requirements

  • Bachelor’s degree in Computer Science or another STEM field.
  • Minimum 6 years in engineering/operations focused on reliability, scalability, and automation.
  • Strong proficiency in distributed Linux environments (up to 70% hands-on).
  • Deep experience with AWS or Azure, Terraform IaC, Docker, Kubernetes (including Karpenter/KEDA).
  • Scripting (Python, Bash, PowerShell), DevOps principles, CI/CD, observability (Prometheus, Grafana, OpenTelemetry).
  • Excellent English communication, project management, and collaboration with global/offshore teams.

Nice to have

  • Certified Kubernetes Administrator (CKA) and/or AWS Certification.
  • Object-oriented programming experience.
  • Background in SaaS or large-scale distributed applications.

Culture & Benefits

  • Competitive pay and flexible hybrid work in an inclusive, collaborative environment.
  • Work with bright minds in finance, tech, and AI solving real-world challenges.
  • Drive efficiency, innovation, and smarter financial decisions for global businesses.
  • Support for career growth, operational excellence, and continuous learning culture.
