Назад
Company hidden
1 день назад

Site Reliability Engineer (Cloud/Kubernetes)

100 000 - 133 000CAD
Формат работы
remote (только Canada)
Тип работы
fulltime
Английский
b2
Страна
Canada
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Site Reliability Engineer (Cloud/Kubernetes): Building and optimizing cloud services for reliability, security, and scalability with an accent on global cloud environment operations and distributed systems. Focus on designing, automating, and elevating modern cloud platforms, solving complex challenges, and shaping reliability engineering practices.

Location: Must be based in Canada (hybrid in Ottawa or remote across Ontario)

Salary: $100,000 CAD–$133,000 CAD

Company

hirify.global transforms complex data landscapes into actionable insights, leveraging pervasive data quality and advanced AI/ML capabilities for over 40,000 global customers.

What you will do

  • Solve real-world scale challenges across a global cloud platform, handling millions of transactions.
  • Build tooling, automation, alerts, and scalable infrastructure patterns to proactively prevent problems.
  • Collaborate with SRE, Architecture, Platform, and Domain Engineering teams to influence infrastructure design.
  • Increase reliability and availability by implementing resilient infrastructure patterns and performance optimizations.
  • Reduce incidents and recovery time through improved observability, automation, and proactive engineering.
  • Participate in on-call duties to maintain cloud infrastructure availability and performance.

Requirements

  • Cloud engineering skills across AWS and/or Azure, with hands-on experience supporting production Kubernetes systems at scale.
  • Infrastructure as Code (Terraform, Crossplane, Ansible) and microservices experience in distributed live environments.
  • Automation and engineering mindset, proficient in Python, Go, or Bash, with CI/CD and autoscaling experience.
  • Depth in observability and incident management, including Prometheus, Grafana, OpenTelemetry, distributed tracing, and SIEM tooling.
  • Knowledge of security and networking, including secret management (Vault, AWS SSM) and infrastructure security best practices.
  • Experience with cloud-native tooling like Helm and exposure to modern databases such as MongoDB.

Culture & Benefits

  • Recognized as one of National Capital Region's 2025 Top Employers in Canada.
  • Genuine career progression pathways and mentoring programs.
  • Culture of innovation, technology, collaboration, and openness.
  • Flexible, diverse, and international work environment.
  • Extra "change the world" day and a personal development day.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...