Назад
Company hidden
2 дня назад

Senior/Staff+ Software Engineer, Voice Platform (AI)

320 000 - 485 000$
Формат работы
hybrid
Тип работы
fulltime
Грейд
senior/lead
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior/Staff+ Software Engineer, Voice Platform (AI): Building the real-time streaming infrastructure that powers voice conversations with Claude with an accent on low-latency serving systems and optimizing time-to-first-audio. Focus on designing the voice mode API surface and implementing graceful barge-in and interruption handling.

Location: San Francisco, CA | New York City, NY | Seattle, WA. Expect all staff to be in one of our offices at least 25% of the time.

Salary: $320,000 - $485,000 USD

Company

hirify.global’s mission is to create reliable, interpretable, and steerable AI systems.

What you will do

  • Design and build the real-time streaming infrastructure that powers voice conversations with Claude.
  • Build low-latency serving systems for speech models, optimizing time-to-first-audio and end-to-end conversational responsiveness.
  • Develop the public and internal APIs that expose voice capabilities to Claude.ai, mobile clients, and third-party developers.
  • Own the audio transport layer—codecs, jitter buffers, adaptive bitrate, packet loss recovery—so conversations stay smooth across unreliable networks.
  • Build observability and quality-measurement systems for voice.
  • Partner with Audio research to move new model architectures from experiment to production.

Requirements

  • Have 6+ years of experience building distributed systems, real-time infrastructure, or platform services at scale.
  • Have shipped production systems where latency is measured in tens of milliseconds.
  • Are comfortable working across the stack—from transport protocols and serving infrastructure up to the APIs product teams build on.
  • Are results-oriented, with a bias toward flexibility and impact.
  • Care about the societal impacts of voice AI and want to help shape how these systems are developed responsibly.
  • We require at least a Bachelor's degree in a related field or equivalent experience.

Nice to have

  • Real-time media protocols and stacks: WebRTC, RTP, gRPC bidirectional streaming, or WebSockets at scale.
  • Audio engineering fundamentals: codecs (Opus, AAC), voice activity detection, echo cancellation, jitter buffering, or audio DSP.
  • Low-latency ML inference serving, streaming model outputs, or GPU-based serving infrastructure.
  • Telephony, live streaming, video conferencing, or voice assistant platforms.
  • Mobile audio pipelines on iOS (AVAudioEngine, AudioUnits) or Android (Oboe, AAudio).

Culture & Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Lovely office space in which to collaborate with colleagues.

Hiring process

  • Applications will be reviewed on a rolling basis.
  • We encourage you to apply even if you do not believe you meet every single qualification.
  • If we make you an offer, we will make every reasonable effort to get you a visa.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →