Эта вакансия в архиве
Посмотреть похожие вакансии ↓обновлено 1 месяц назад
Engineering Manager (LLM)
220 000 - 285 000$
Описание вакансии
Текст:
TL;DR
Engineering Manager (LLM): Lead and mentor a team of Forward Deployed Engineers building, scaling, and optimizing LLM inference workloads with an accent on AI/ML production deployment, performance, and cost efficiency. Focus on designing, deploying, and managing high-performance, low-latency AI applications and driving strategic product initiatives.
Location: San Francisco, United States (Onsite)
Salary: $220,000–$285,000
Company
powers inference for leading AI companies by uniting applied AI research, flexible infrastructure, and developer tooling, backed by $150M Series D funding.
What you will do
- Lead, mentor, and grow a team of Forward Deployed Engineers with technical and managerial guidance.
- Set goals and ensure high-quality delivery across multiple customer-facing LLM deployment projects.
- Collaborate with leadership to align team priorities with company and customer goals.
- Act as a player-coach driving strategic product initiatives and customer engagements.
- Develop and maintain software systems using Python, focusing on ML inference optimization.
- Own end-to-end product and customer projects, including design, deployment, and monitoring.
Requirements
- Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related field.
- 4+ years professional software engineering experience, including 1+ year leadership or mentorship.
- Strong Python programming skills with production ML inference experience.
- Experience with LLMs, inference optimization, and serving frameworks (e.g., vLLM, TensorRT, Triton).
- Familiarity with observability, profiling, and cost/performance tradeoffs in production ML systems.
- Excellent communication and collaboration skills for cross-functional leadership.
Nice to have
- Experience leading customer-facing engineering teams or working with enterprise partners.
- Deep understanding of GPU infrastructure, distributed inference, or model compression techniques.
Culture & Benefits
- Competitive compensation with meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employee and dependents.
- Generous PTO policy including company-wide Winter Break.
- Paid parental leave and company-facilitated 401(k).
- Exposure to a variety of ML startups for learning and networking opportunities.