Назад
Company hidden
5 дней назад

Software Engineering MTS (AIOps, AI/ML)

Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify RU Global, списка компаний с восточно-европейскими корнями
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Software Engineering MTS (AIOps, AI/ML): Building, operating, and scaling an intelligent AIOps product that supports Tier-0 and Tier-1 services across hirify.global with an accent on observability, automated detection, root cause analysis, and remediation for large scale distributed systems. Focus on implementing scalable services, continuously improving system reliability and operational efficiency, and contributing to detection and causation capabilities using AI assisted mechanisms.

Location: Onsite in San Francisco or Palo Alto, California, USA

Company

hirify.global is a leading cloud-based software company focused on customer relationship management and intelligent AIOps products.

What you will do

  • Develop and maintain core Warden AIOps product services, including data ingestion, signal processing, detection pipelines, and reliability workflows.
  • Analyze operational data and system behaviors to identify anomalies, recurring failure patterns, and performance regressions.
  • Contribute to detection and causation capabilities by implementing rule-based and AI-assisted mechanisms.
  • Identify and address reliability, scalability, and performance issues in product components and document findings.
  • Collaborate with cross-functional teams to integrate Warden AIOps with upstream and downstream systems.
  • Participate in design reviews, code reviews, and operational readiness activities, ensuring adherence to engineering standards.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience.
  • Strong proficiency in one or more backend programming languages (e.g., Java, Go, Python).
  • Experience building and operating distributed systems.
  • Understanding of cloud native architectures, microservices, and service-to-service communication patterns.
  • Strong analytical and problem-solving skills, with the ability to reason about complex system behaviors.
  • Effective written and verbal communication skills, with the ability to collaborate across teams.

Nice to have

  • Experience working with observability data such as metrics, logs, traces, or events, and familiarity with monitoring or reliability concepts.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...