TL;DR
Senior AI Engineer (AI): Researching, building, optimizing, and deploying production ML systems to push the boundaries of speech modeling (STT & TTS), with an emphasis on data collection, efficient training infrastructure, RL alignment environments, and ultra-low-latency inference optimizations. Focus on solving complex research and engineering problems to build the engine for the next generation of AI-driven software.
Location: Candidates must be based in the SF Bay Area or willing to relocate to the United States and work on-site a few days a week.
Salary: $250,000 - $300,000 USD + bonus + equity + benefits.
Company
Inworld is a product-oriented research lab of top AI researchers and engineers, developing best-in-class realtime multimodal models and the only realtime orchestration platform optimized for thousands of queries per second.
What you will do
- Research, build, optimize, and deploy production ML systems integrated by thousands of developers.
- Push the boundaries of speech modeling (STT & TTS).
- Research and apply ML ideas to achieve state-of-the-art results.
- Focus on the difficult research and engineering problems of building the engine for the next generation of AI-driven software.
Requirements
- Power user of AI agents for work automation.
- A BA/BS, MS, or PhD in a technical field (CS, Math, Physics) with a strong foundation in Machine Learning.
- 3+ years of combined experience in software development (e.g. with Python or C++) and applied ML engineering.
- Demonstrated experience applying or researching Machine Learning in one or more of the following domains: speech or video processing, Natural Language Processing (NLP), or action planning.
- Strong foundation in data structures, algorithms, and neural network architectures.
- Proficiency with ML frameworks such as PyTorch.
Nice to have
- A passion for learning and staying up-to-date with the latest advancements in ML/Voice AI research and its applications.
- Ability to work collaboratively in a fast-paced environment with shifting priorities.
- Familiarity with pre-training, fine-tuning, RLHF, and evaluation of large language and speech models.
- Knowledge of working with embedded systems and/or running ML on edge devices.
- Strong background in mathematics and/or physics.
Culture & Benefits
- The US base salary range for this full-time position is $250,000 - $300,000 USD + bonus + equity + benefits.