Company hidden
9 hours ago

Senior Site Reliability Engineer

125,200 - 132,500 CAD
Work format
remote (Canada only)
Employment type
full-time
Grade
senior
English
B2
Country
Canada
Listing from Hirify.Global, a list of international tech companies

Job description

TL;DR

Senior Site Reliability Engineer (AI Platform): Own the reliability and AI operations foundation for an AI-first intelligence platform running demanding semiconductor intelligence workflows, with an emphasis on SLOs, error budgets, and AI agent pipelines. Focus on designing reliability patterns for AI workloads, architecting blast radius containment, maturing multi-region failover, and enabling development teams via an internal developer platform (IDP) and observability.

Remote position for candidates based in Canada. Occasional travel may be required.

Expected salary range: $125,200 - $132,500 CAD

Company

hirify.global is the information platform for the semiconductor industry, providing reverse engineering, teardowns, and market analysis accessed by over 650 companies and 150,000 users.

What you will do

  • Own SLOs, SLIs, and error budgets, and drive error-budget discipline across engineering for production services.
  • Design reliability patterns for AI agent pipelines including LLM observability, failure detection, and graceful degradation.
  • Architect blast radius containment, mature active-active multi-region architecture, and lead incident response with durable fixes.
  • Enable Software and AI Engineering teams via CI/CD strategy, IDP adoption, SRE practices, and reliability standards.
  • Operate and extend observability with Datadog for services, infrastructure, and AI workloads; build service catalog and golden paths.
  • Manage IaC with Terraform, FinOps for AWS costs, mentor junior SREs, and build AI-assisted automation to reduce toil.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or equivalent.
  • 6–8 years in SRE, platform engineering, or DevOps with senior individual contributor technical leadership.
  • Deep AWS expertise (EKS, Lambda, CloudWatch); Terraform, GitOps, policy-as-code.
  • Hands-on Datadog (dashboards, SLOs, alerting); Docker, Kubernetes (EKS); Python/Bash; CI/CD (Bitbucket Pipelines, GitHub Actions).
  • Ability to submit citizenship or permanent residency information for U.S. Export Control compliance.

Nice to have

  • Experience with agentic AI systems reliability, AWS certifications, FinOps, IDP tooling.
  • Background in semiconductor, SaaS, data-intensive platforms, or regulated data environments.

Culture & Benefits

  • Company-sponsored training, development opportunities, and formal mentoring.
  • Comprehensive benefits: health, dental, vision, wellness, RRSP matching, annual fitness reimbursement.
  • Flexible vacation policy, wellness resources, inclusive environment prioritizing diversity, equity, accessibility.
  • Community involvement, high-growth high-performance culture.
