Senior/Staff+ Software Engineer, Voice Platform (AI)

320 000 - 485 000$

Формат работы

hybrid

Тип работы

fulltime

Грейд

senior/lead

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Senior/Staff+ Software Engineer, Voice Platform (AI): Building the real-time streaming infrastructure that powers voice conversations with Claude with an accent on low-latency serving systems and optimizing time-to-first-audio. Focus on designing the voice mode API surface and implementing graceful barge-in and interruption handling.

Location: San Francisco, CA | New York City, NY | Seattle, WA. Expect all staff to be in one of our offices at least 25% of the time.

Salary: $320,000 - $485,000 USD

Company

hirify.global’s mission is to create reliable, interpretable, and steerable AI systems.

What you will do

Design and build the real-time streaming infrastructure that powers voice conversations with Claude.
Build low-latency serving systems for speech models, optimizing time-to-first-audio and end-to-end conversational responsiveness.
Develop the public and internal APIs that expose voice capabilities to Claude.ai, mobile clients, and third-party developers.
Own the audio transport layer—codecs, jitter buffers, adaptive bitrate, packet loss recovery—so conversations stay smooth across unreliable networks.
Build observability and quality-measurement systems for voice.
Partner with Audio research to move new model architectures from experiment to production.

Requirements

Have 6+ years of experience building distributed systems, real-time infrastructure, or platform services at scale.
Have shipped production systems where latency is measured in tens of milliseconds.
Are comfortable working across the stack—from transport protocols and serving infrastructure up to the APIs product teams build on.
Are results-oriented, with a bias toward flexibility and impact.
Care about the societal impacts of voice AI and want to help shape how these systems are developed responsibly.
We require at least a Bachelor's degree in a related field or equivalent experience.

Nice to have

Real-time media protocols and stacks: WebRTC, RTP, gRPC bidirectional streaming, or WebSockets at scale.
Audio engineering fundamentals: codecs (Opus, AAC), voice activity detection, echo cancellation, jitter buffering, or audio DSP.
Low-latency ML inference serving, streaming model outputs, or GPU-based serving infrastructure.
Telephony, live streaming, video conferencing, or voice assistant platforms.
Mobile audio pipelines on iOS (AVAudioEngine, AudioUnits) or Android (Oboe, AAudio).

Culture & Benefits

Competitive compensation and benefits.
Optional equity donation matching.
Generous vacation and parental leave.
Flexible working hours.
Lovely office space in which to collaborate with colleagues.

Hiring process

Applications will be reviewed on a rolling basis.
We encourage you to apply even if you do not believe you meet every single qualification.
If we make you an offer, we will make every reasonable effort to get you a visa.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →