Company hidden
10 hours ago

AI Engineer (Voice Design)

145,000 - 172,500 CAD
Work format
hybrid
Employment type
full-time
Grade
middle
English
B2
Country
Canada

Job description

TL;DR

AI Engineer (Voice Design): Building and optimizing the Text-to-Speech (TTS) layer for next-generation AI voice agents, with an emphasis on linguistic optimization and backend implementation. The role focuses on integrating TTS vendor APIs, designing conversational turn flows, and architecting voice attribute controls for the product UI.

Location: Hybrid in Kitchener, Canada

Salary: $145,000 - $172,500 CAD

Company

hirify.global is an AI-native business communications platform that unifies calling, messaging, and meetings powered by real-time AI insights.

What you will do

  • Implement and optimize the TTS backend using vendor APIs, and research open-source or in-house architectures.
  • Apply phonetics and sociolinguistics to ensure naturalness via SSML orchestration and pronunciation handling.
  • Design context-specific utterances and manage LLM and TTS prompt templates to define agent personalities.
  • Architect the logic that exposes voice attributes such as speed, pitch, and tone in the product UI so customers can customize them.
  • Partner with ASR and Audio AI engineers to minimize latency in the end-to-end ASR → LLM → TTS pipeline.

Requirements

  • Location: Must be based in Kitchener, Canada (Hybrid)
  • 3+ years of experience in Speech Synthesis (TTS) or Voice Design.
  • Strong Python programming skills and experience with deep learning frameworks like PyTorch.
  • Degree in Computational Linguistics, Computer Science, or AI/ML with knowledge of phonetics and prosody.
  • Hands-on experience with TTS APIs (ElevenLabs, Rime, Cartesia) and frameworks (NVIDIA NeMo, ESPnet, Coqui).
  • Experience building production-grade APIs and integrating services in a cloud environment (GCP preferred).

Nice to have

  • Knowledge of speech quality metrics such as MOS, intelligibility, and latency.
  • Ability to design and execute rigorous A/B tests for voice personas.

Culture & Benefits

  • Opportunity to build and ship agentic AI products redefining business communications.
  • Competitive salary and comprehensive benefits package.
  • Inclusive and vibrant office environment designed for collaboration.
  • Access to cutting-edge AI tools and robust training programs for professional growth.
