Reliability Operations Engineer (Robotics)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Reliability Operations Engineer (Robotics): Supporting the operational reliability of robotic and cloud systems with an accent on Tier 2 escalations and incident investigation. Focus on improving runbooks, enhancing observability using Grafana and Prometheus, and ensuring system stability across distributed physical device environments.
Location: Must be based in Penang, Malaysia
Salary: MYR 80K – MYR 100K
Company
is reimagining urban delivery using personable sidewalk robots to reduce street congestion and support local businesses.
What you will do
- Lead incident investigations during regional daytime hours, providing updates and coordinating escalations.
- Respond to Tier 2 escalations from Tier 1 support using established runbooks, metrics, and diagnostics.
- Update and maintain operational documentation and runbooks based on new discoveries and feedback.
- Utilize observability tools such as Grafana, Prometheus, and GCP Monitoring to interpret metrics and identify anomalies.
- Collaborate with SREs and product engineering to enhance tooling and scripts for troubleshooting.
- Participate in a shared weekend on-call rotation to maintain continuous operational coverage.
Requirements
- Location: Based in Penang, Malaysia
- 2–4 years of experience in Reliability Operations, SRE, DevOps, or IT Operations.
- Proficiency with Linux, including system navigation and performing basic diagnostics.
- Experience with cloud platforms, preferably Google Cloud Platform (GCP).
- Ability to interpret metrics, logs, and traces using tools like OpenTelemetry.
- Bachelor’s degree in Computer Science, IT, Engineering, or equivalent hands-on experience.
Nice to have
- Exposure to robot fleets, IoT systems, or distributed physical device environments.
- Ability to write or modify lightweight scripts and automation to improve workflows.
- Familiarity with incident management platforms such as PagerDuty or OpsGenie.
- Strong networking fundamentals and familiarity with zero-trust tools like Tailscale.
Culture & Benefits
- Collaborative and respectful team environment focused on solving complex dynamic problems.
- Opportunity to work with cutting-edge robotics, machine learning, and computer vision.
- Agile and diverse workplace driven by tech industry veterans.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →