Company hidden
Posted 2 days ago

Site Reliability Engineer (GCP)

Job type
Full-time
Grade
Middle
English
B2
Country
Canada
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Site Reliability Engineer (GCP): Maintain, optimize, and scale cloud implementations for large enterprises, with an emphasis on self-healing automation, monitoring, and high availability. The role focuses on building automated systems with Terraform and Python, managing GKE clusters, and resolving critical infrastructure outages.

Location: Vancouver, British Columbia, Canada. Pacific and Mountain Time Zones preferred.

Company

hirify.global is an end-to-end AI transformation partner helping enterprises turn chaotic data into strategic assets through cloud and artificial intelligence expertise.

What you will do

  • Ensure near-zero downtime through monitoring, alerting, and self-healing automation.
  • Design and deploy highly available, scalable systems using software engineering and infrastructure principles.
  • Advise clients on DevOps and SRE practices, including deployment pipelines and service reliability.
  • Anticipate failures and automate routine tasks to improve the customer experience.
  • Collaborate with clients and Google engineers to resolve complex infrastructure issues.
  • Manage client requests via Jira and contribute to open-source initiatives and documentation.

Requirements

  • 4+ years of cloud and infrastructure experience (Linux, Windows, k8s, databases, networking).
  • Proficiency with Python and strong provisioning skills using Terraform.
  • Experience in troubleshooting across systems, networks, and code.
  • Bachelor’s degree in Computer Science, Engineering, or equivalent work experience.
  • Alignment with Pacific or Mountain Time Zones.
  • Availability for a weekend on-call rotation.

Nice to have

  • 2+ years of full-time Google Cloud (GCP) experience.
  • Microsoft Server and SQL Server experience.
  • Experience with 24x7x365 monitoring, incident response, and on-call support.
  • Experience negotiating error budgets, SLIs, SLOs, and SLAs with product owners.

Culture & Benefits

  • Culture of innovation that supports professional and personal growth.
  • Opportunity to work with cutting-edge Google Cloud technologies like GKE and Anthos.
  • Collaborative environment focused on winning together and delivering high-impact outcomes.
  • Commitment to being an Equal Opportunity employer.
