Назад
Company hidden
4 дня назад

Reliability Operations Engineer (Robotics)

80 000 - 100 000MYR
Формат работы
onsite
Тип работы
fulltime
Грейд
middle
Английский
b2
Страна
Malaysia/Sweden
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Reliability Operations Engineer (Robotics): Supporting the operational reliability of robotic and cloud systems with an accent on Tier 2 escalations and incident investigation. Focus on improving runbooks, enhancing observability using Grafana and Prometheus, and ensuring system stability across distributed physical device environments.

Location: Must be based in Penang, Malaysia

Salary: MYR 80K – MYR 100K

Company

hirify.global is reimagining urban delivery using personable sidewalk robots to reduce street congestion and support local businesses.

What you will do

  • Lead incident investigations during regional daytime hours, providing updates and coordinating escalations.
  • Respond to Tier 2 escalations from Tier 1 support using established runbooks, metrics, and diagnostics.
  • Update and maintain operational documentation and runbooks based on new discoveries and feedback.
  • Utilize observability tools such as Grafana, Prometheus, and GCP Monitoring to interpret metrics and identify anomalies.
  • Collaborate with SREs and product engineering to enhance tooling and scripts for troubleshooting.
  • Participate in a shared weekend on-call rotation to maintain continuous operational coverage.

Requirements

  • Location: Based in Penang, Malaysia
  • 2–4 years of experience in Reliability Operations, SRE, DevOps, or IT Operations.
  • Proficiency with Linux, including system navigation and performing basic diagnostics.
  • Experience with cloud platforms, preferably Google Cloud Platform (GCP).
  • Ability to interpret metrics, logs, and traces using tools like OpenTelemetry.
  • Bachelor’s degree in Computer Science, IT, Engineering, or equivalent hands-on experience.

Nice to have

  • Exposure to robot fleets, IoT systems, or distributed physical device environments.
  • Ability to write or modify lightweight scripts and automation to improve workflows.
  • Familiarity with incident management platforms such as PagerDuty or OpsGenie.
  • Strong networking fundamentals and familiarity with zero-trust tools like Tailscale.

Culture & Benefits

  • Collaborative and respectful team environment focused on solving complex dynamic problems.
  • Opportunity to work with cutting-edge robotics, machine learning, and computer vision.
  • Agile and diverse workplace driven by tech industry veterans.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →