Назад
Company hidden
4 дня назад

Senior Reliability Operations Engineer (Robotics)

90 000 - 110 000MYR
Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
Malaysia/Sweden/Mexico
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior Reliability Operations Engineer (Robotics): Leading operational reliability and incident response for robotic and cloud systems with an accent on Tier 2 support and automated remediation. Focus on designing runbooks, leveraging monitoring tools like Grafana and Prometheus, and coordinating high-pressure incident resolution across global teams.

Location: Penang, Malaysia

Salary: MYR 90,000 – 110,000

Company

hirify.global is reimagining urban logistics by deploying personable sidewalk robots for efficient commercial deliveries in major US cities.

What you will do

  • Serve as the primary regional incident lead, coordinating technical investigations and stakeholder communication.
  • Manage Tier 2 escalations using system diagnostics, logs, and metrics to remediate issues.
  • Develop and maintain comprehensive runbooks, workflows, and operational documentation.
  • Build and enhance automation scripts to streamline remediation and reduce manual operational overhead.
  • Proactively monitor system health using Grafana, Prometheus, GCP Monitoring, and OpenTelemetry.
  • Collaborate with SRE and product engineering teams to improve system stability and operability.

Requirements

  • Bachelor’s degree in Computer Science, IT, Engineering or equivalent practical experience.
  • 5+ years of professional experience in Reliability Operations, SRE, DevOps, or IT Operations.
  • Strong proficiency with Linux and experience with Google Cloud Platform (GCP).
  • Proven track record in Tier 2/3 technical investigations and structured incident response.
  • Ability to participate in a shared weekend on-call rotation.
  • Must be based in Penang, Malaysia.

Nice to have

  • Experience operating robot fleets, IoT devices, or edge systems.
  • Previous experience as an Incident Commander for high-severity events.
  • Familiarity with incident management tools like PagerDuty, OpsGenie, or Grafana IRM.
  • Strong networking fundamentals and experience with Tailscale or zero-trust networking.

Culture & Benefits

  • Opportunity to work with tech industry veterans in a fast-paced, agile environment.
  • Collaborative and respectful team culture focused on solving complex real-world robotics problems.
  • Work with a diverse team leveraging machine learning and computer vision.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →