Engineering Manager (LLM)

$220,000–$285,000

Work format

onsite

Employment type

fulltime

Grade

senior

English

Country

Job description

Text:

TL;DR

Engineering Manager (LLM): Lead and mentor a team of Forward Deployed Engineers building, scaling, and optimizing LLM inference workloads, with an emphasis on AI/ML production deployment, performance, and cost efficiency. The role focuses on designing, deploying, and managing high-performance, low-latency AI applications and on driving strategic product initiatives.

Location: San Francisco, United States (Onsite)

Salary: $220,000–$285,000

Company

hirify.global powers inference for leading AI companies by uniting applied AI research, flexible infrastructure, and developer tooling, backed by $150M Series D funding.

What you will do

Lead, mentor, and grow a team of Forward Deployed Engineers, providing both technical and managerial guidance.
Set goals and ensure high-quality delivery across multiple customer-facing LLM deployment projects.
Collaborate with leadership to align team priorities with company and customer goals.
Act as a player-coach driving strategic product initiatives and customer engagements.
Develop and maintain software systems using Python, focusing on ML inference optimization.
Own end-to-end product and customer projects, including design, deployment, and monitoring.

Requirements

Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related field.
4+ years of professional software engineering experience, including 1+ year in a leadership or mentorship role.
Strong Python programming skills with production ML inference experience.
Experience with LLMs, inference optimization, and serving frameworks (e.g., vLLM, TensorRT, Triton).
Familiarity with observability, profiling, and cost/performance tradeoffs in production ML systems.
Excellent communication and collaboration skills for cross-functional leadership.

Nice to have

Experience leading customer-facing engineering teams or working with enterprise partners.
Deep understanding of GPU infrastructure, distributed inference, or model compression techniques.

Culture & Benefits

Competitive compensation with meaningful equity.
100% coverage of medical, dental, and vision insurance for employees and their dependents.
Generous PTO policy including company-wide Winter Break.
Paid parental leave and company-facilitated 401(k).
Exposure to a variety of ML startups for learning and networking opportunities.