Senior Reliability Operations Engineer (Robotics)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Reliability Operations Engineer (Robotics): Leading operational reliability and incident response for robotic and cloud systems with an accent on Tier 2 support and automated remediation. Focus on designing runbooks, leveraging monitoring tools like Grafana and Prometheus, and coordinating high-pressure incident resolution across global teams.
Location: Penang, Malaysia
Salary: MYR 90,000 – 110,000
Company
is reimagining urban logistics by deploying personable sidewalk robots for efficient commercial deliveries in major US cities.
What you will do
- Serve as the primary regional incident lead, coordinating technical investigations and stakeholder communication.
- Manage Tier 2 escalations using system diagnostics, logs, and metrics to remediate issues.
- Develop and maintain comprehensive runbooks, workflows, and operational documentation.
- Build and enhance automation scripts to streamline remediation and reduce manual operational overhead.
- Proactively monitor system health using Grafana, Prometheus, GCP Monitoring, and OpenTelemetry.
- Collaborate with SRE and product engineering teams to improve system stability and operability.
Requirements
- Bachelor’s degree in Computer Science, IT, Engineering or equivalent practical experience.
- 5+ years of professional experience in Reliability Operations, SRE, DevOps, or IT Operations.
- Strong proficiency with Linux and experience with Google Cloud Platform (GCP).
- Proven track record in Tier 2/3 technical investigations and structured incident response.
- Ability to participate in a shared weekend on-call rotation.
- Must be based in Penang, Malaysia.
Nice to have
- Experience operating robot fleets, IoT devices, or edge systems.
- Previous experience as an Incident Commander for high-severity events.
- Familiarity with incident management tools like PagerDuty, OpsGenie, or Grafana IRM.
- Strong networking fundamentals and experience with Tailscale or zero-trust networking.
Culture & Benefits
- Opportunity to work with tech industry veterans in a fast-paced, agile environment.
- Collaborative and respectful team culture focused on solving complex real-world robotics problems.
- Work with a diverse team leveraging machine learning and computer vision.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →