Назад
Company hidden
2 дня назад

Senior Site Reliability Engineer, AI Research

Формат работы
remote (только Australia)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
Australia
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior Site Reliability Engineer (AI): Supporting and evolving reliability of platforms for the AI Research team, including production inference services and AI data feature stores, with an accent on cloud-first, service-oriented architectures and iteration speed. Focus on building and maintaining Kubernetes-based services on GCP using infrastructure-as-code, improving CI/CD pipelines, and designing observability systems.

Location: Remote within Australia

Company

hirify.global is a pioneer and market leader in AI Search, empowering 17,000+ businesses with blazing-fast, predictive search and browse experiences.

What you will do

  • Support and evolve the reliability of platforms used by the AI Research team (e.g., production inference service, AI data feature store).
  • Ensure production services meet availability, latency, and operational readiness expectations.
  • Design infrastructure and operational patterns prioritizing iteration speed with safeguards.
  • Work closely with researchers and engineers as an advisor on infrastructure and operational concerns.
  • Build and maintain Kubernetes-based services on GCP using infrastructure-as-code (Terraform, ArgoCD).
  • Own and improve CI/CD pipelines for services primarily in Go and some Python.
  • Design and operate observability systems using tools like Datadog.
  • Participate in a light on-call rotation, responding to incidents and improving systems.

Requirements

  • Strong experience operating cloud-first infrastructure.
  • Hands-on experience running production services on Kubernetes.
  • Proficiency with infrastructure-as-code (Terraform) and CI/CD systems.
  • Experience supporting production services written in Go (Python is a plus).
  • Solid grounding in service reliability, incident response, and operational best practices.
  • Comfort working in environments with ambiguity and high ownership.
  • Must be based in Australia.

Nice to have

  • Experience supporting mission-critical internal platforms.
  • Exposure to research or experimentation-heavy environments.
  • Familiarity working alongside researchers or highly specialized domain experts.

Culture & Benefits

  • Flexible workplace model emphasizing individual impact and autonomy.
  • Opportunity to work remotely within Australia.
  • Collaborate with experienced SREs, engineers, and PhD researchers.
  • High impact work directly enabling new AI-powered capabilities for customers.
  • Company values: Grit, Trust, Candor, Care, Humility.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →