Назад
Company hidden
3 дня назад

Staff Aiops Engineer (Generative AI Platform)

135 600 - 204 380$
Формат работы
hybrid
Тип работы
fulltime
Грейд
lead
Английский
b2
Страна
Canada
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff AIOps Engineer (Generative AI Platform): Design, build, and operate core GenAI platform capabilities, powering product copilots, agentic workflows, and customer-facing AI services with an accent on reliability, scalability, and policy enforcement. Focus on incident response, root cause analysis, and systemic reliability improvements to strengthen platform stability and operational excellence.

Location: Burnaby, BC, Canada

Salary: $135,600 - $204,380 plus bonus

Company

hirify.global provides cloud-first networking and security solutions.

What you will do

  • Design, build, and operate core GenAI platform services, ensuring reliability, scalability, and policy enforcement.
  • Establish and evolve observability, tracing, and telemetry frameworks for GenAI systems across development and production environments.
  • Define and operationalize SLAs, SLOs, and automated CI/CD and lifecycle management processes for GenAI services.
  • Build and maintain guardrails, safety mechanisms, and governance controls to mitigate risks across copilots and agentic workflows.
  • Lead incident response, root cause analysis, and systemic reliability improvements.
  • Drive architectural standards and cross-team alignment, mentoring engineers.

Requirements

  • 10+ years of software engineering experience, including experience building and operating large-scale distributed systems in production environments.
  • Strong experience designing and running highly available, scalable backend services in cloud-native environments (e.g., AWS, GCP, or Azure).
  • Hands-on experience implementing CI/CD pipelines, automation frameworks, and lifecycle management processes for distributed or AI-powered systems.
  • Proven experience defining and operating services against SLAs/SLOs, leading incident response, and driving structured post-incident improvements.
  • Experience building observability, tracing, and monitoring systems for complex distributed platforms.
  • Practical experience supporting AI/ML or Generative AI systems in production, including model access control, performance monitoring, governance, and safety enforcement.

Culture & Benefits

  • Comprehensive health coverage, generous PTO, and flexible work options.
  • Learning opportunities, career-mobility programs, and leadership workshops.
  • Sixteen paid volunteer hours each year, global employee resource groups.
  • Modern offices with EV charging, healthy snacks, hackathons, game nights, and culture celebrations.
  • Charitable Giving Program supported by Company Match.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →