Назад
Company hidden
5 дней назад

Site Reliability/Production Engineer (AI)

160 000 - 300 000$
Формат работы
onsite
Тип работы
fulltime
Грейд
middle/senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Site Reliability/Production Engineer (AI): Owning critical production systems end-to-end, designing, building, and improving them, focusing on maintaining platform reliability at scale. Focus on instrumenting services, eliminating performance bottlenecks, building deployment platforms, and translating incident post-mortems into lasting architectural improvements.

Location: New York City; San Francisco, CA

Salary: $160,000 to $300,000

Company

The AI platform for investors and bankers that generates alpha and drives upside.

What you will do

  • Own critical production services end-to-end, from design and code review through deployment, operation, and incident response.
  • Profile, benchmark, and rewrite hot paths to eliminate bottlenecks as hirify.global scales.
  • Lead incident response and drive post-mortem culture, translating findings into code changes and architectural improvements.
  • Design and build observability frameworks from scratch, writing custom instrumentation, alerting logic, and debugging tooling.
  • Own capacity planning and cost efficiency: model growth, right-size infrastructure, and write automation that prevents over-provisioning and resource exhaustion.

Requirements

  • 5+ years of software development experience writing, shipping, and maintaining production services.
  • Production-grade proficiency in at least one systems or backend language: Go, Python, C++, or Rust.
  • Deep understanding of distributed systems.
  • Container orchestration expertise and experience debugging complex distributed failures in production.
  • Cloud platform fluency (AWS preferred).
  • Strong CI/CD pipeline expertise and a track record of improving developer velocity without sacrificing safety.

Nice to have

  • Background at a company with a Production Engineering or software-focused SRE culture.
  • Experience building platforms for AI/ML workloads or high-throughput document processing pipelines.

Culture & Benefits

  • Unlimited PTO.
  • Medical, Dental, and Vision insurance + 401K.
  • Catered lunch daily + DoorDash dinner credit for late stays.
  • Parental leave policy: 3 months for non-birthing parent, 4 months for birthing parent.
  • $15k lifetime fertility benefit.
  • Competitive equity package.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...