Senior Software Engineer, ML Infrastructure (AI)
ΠΡΡΡ & Π‘ΠΎΠΏΡΠΎΠ²ΠΎΠ΄
ΠΠ»Ρ ΠΌΡΡΡΠ° Ρ ΡΡΠΎΠΉ Π²Π°ΠΊΠ°Π½ΡΠΈΠ΅ΠΉ Π½ΡΠΆΠ΅Π½ Plus
ΠΠΏΠΈΡΠ°Π½ΠΈΠ΅ Π²Π°ΠΊΠ°Π½ΡΠΈΠΈ
TL;DR
Senior Software Engineer, ML Infrastructure (AI): Owning the platforms powering 's model training and inference with an accent on distributed training systems and inference architecture across multiple providers. Focus on improving latency and cost efficiency across the training and serving stack.
Location: Onsite in San Francisco
Salary: $250K β $330K
Company
is the leading conversational AI platform empowering every brand to deliver concierge customer experiences.
What you will do
- Design and build distributed training platforms for LLM and multimodal fine-tuning and post-training at scale
- Integrate state-of-the-art training algorithms into production pipelines
- Own inference architecture and multi-provider routing, including failover and optimization
- Lead initiatives to improve latency and cost efficiency across the training and serving stack
- Build evaluation and experimentation infrastructure that enables rapid, reliable iteration
- Drive technical direction, mentor engineers, and establish best practices for ML infrastructure
Requirements
- 6+ years building ML infrastructure or production systems at scale
- Deep experience with distributed training: multi-node GPU clusters, fault tolerance, and optimization
- Strong understanding of LLM inference: latency optimization, provider tradeoffs, and serving architecture
- Proven track record leading complex, multi-quarter technical projects
Culture & Benefits
- Medical, dental, and vision benefits
- Take what you need vacation policy
- Daily lunches, dinners and snacks in the office to keep you at your best
ΠΡΠ΄ΡΡΠ΅ ΠΎΡΡΠΎΡΠΎΠΆΠ½Ρ: Π΅ΡΠ»ΠΈ ΡΠ°Π±ΠΎΡΠΎΠ΄Π°ΡΠ΅Π»Ρ ΠΏΡΠΎΡΠΈΡ Π²ΠΎΠΉΡΠΈ Π² ΠΈΡ ΡΠΈΡΡΠ΅ΠΌΡ, ΠΈΡΠΏΠΎΠ»ΡΠ·ΡΡ iCloud/Google, ΠΏΡΠΈΡΠ»Π°ΡΡ ΠΊΠΎΠ΄/ΠΏΠ°ΡΠΎΠ»Ρ, Π·Π°ΠΏΡΡΡΠΈΡΡ ΠΊΠΎΠ΄/ΠΠ, Π½Π΅ Π΄Π΅Π»Π°ΠΉΡΠ΅ ΡΡΠΎΠ³ΠΎ - ΡΡΠΎ ΠΌΠΎΡΠ΅Π½Π½ΠΈΠΊΠΈ. ΠΠ±ΡΠ·Π°ΡΠ΅Π»ΡΠ½ΠΎ ΠΆΠΌΠΈΡΠ΅ "ΠΠΎΠΆΠ°Π»ΠΎΠ²Π°ΡΡΡΡ" ΠΈΠ»ΠΈ ΠΏΠΈΡΠΈΡΠ΅ Π² ΠΏΠΎΠ΄Π΄Π΅ΡΠΆΠΊΡ. ΠΠΎΠ΄ΡΠΎΠ±Π½Π΅Π΅ Π² Π³Π°ΠΉΠ΄Π΅ β