TL;DR
Member of Technical Staff, Multimodal Inference (AI): Building large-scale inference infrastructure and supporting the efficient serving of multimodal generative models with an accent on optimal serving latency and throughput. Focus on model architecture, data curation, training and inference infrastructure, evaluation protocols, alignment, and reinforcement learning from human feedback (RLHF).
Location: Redmond, United States. MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) of that location.
Salary: USD $139,900 – $331,200 per year
Company
hirify.global is dedicated to advancing Copilot and other consumer AI products and research.
What you will do
- Develop and maintain the inference engine for multimodal generative models.
- Develop and maintain the model deployment pipeline for various product lines.
- Benchmark, profile and tune the model inference performance with model and hardware specific techniques.
- Gather data and insights to develop the multimodal inference roadmap.
- Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively.
Requirements
- Bachelor’s Degree in Computer Science or related technical field.
- 6+ years technical engineering experience with coding in languages including C, C++, C#, Java, JavaScript, or Python.
- Experience with generative AI is preferred.
- Experience with distributed computing is preferred.
- Must work from the Microsoft office in Redmond, United States, at least four days a week.
Culture & Benefits
- Embrace a growth mindset, innovate to empower others, and collaborate to achieve shared goals.
- Foster a culture of inclusion built on values of respect, integrity, and accountability.
- Work in a fast-paced, design-driven, product development cycle.
- Eligible for benefits and other compensation; additional pay information available on the careers site.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →