Engineering Manager (AI)
ΠΡΡΡ & Π‘ΠΎΠΏΡΠΎΠ²ΠΎΠ΄
ΠΠ»Ρ ΠΌΡΡΡΠ° Ρ ΡΡΠΎΠΉ Π²Π°ΠΊΠ°Π½ΡΠΈΠ΅ΠΉ Π½ΡΠΆΠ΅Π½ Plus
ΠΠΏΠΈΡΠ°Π½ΠΈΠ΅ Π²Π°ΠΊΠ°Π½ΡΠΈΠΈ
TL;DR
Engineering Manager (AI): Leading the API Core team to optimize the request lifecycle and "hot path" of the Claude API with an accent on service-level efficiency, throughput scaling, and rate-limiting systems. Focus on reducing per-request overhead, designing distributed systems at scale, and managing infrastructure capacity to maximize model-serving throughput.
Location: Hybrid (must be based in San Francisco, CA or New York City, NY)
Salary: $405,000 - $485,000 USD
Company
is a public benefit corporation creating reliable, interpretable, and steerable AI systems.
What you will do
- Lead the API Core team, managing hiring, performance management, and career development.
- Define the technical strategy and delivery roadmap for service efficiency, throughput scaling, and rate-limiting systems.
- Drive multi-quarter initiatives to improve token-path efficiency and implement protocol-level optimizations.
- Partner with Inference and Compute teams on capacity planning and regional load balancing.
- Own the end-to-end rate-limiting and acceleration-limit subsystems, including quota models and enforcement.
- Set and uphold reliability standards and latency SLOs for the /v1/messages API path.
Requirements
- 10+ years of experience managing engineering teams building high-throughput, latency-sensitive backend services.
- Proven track record of improving service efficiency on systems operating at scale (millions of QPS).
- Deep expertise in Rust, Go, systems-level performance engineering, or large-scale distributed systems.
- Ability to make architectural decisions under capacity pressure.
- Must be based in San Francisco or New York City (minimum 25% office presence required).
- Bachelor's degree or equivalent professional experience.
Culture & Benefits
- Competitive compensation and benefits with optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Collaborative research-driven environment focused on "big science".
- Visa sponsorship available for eligible candidates.
ΠΡΠ΄ΡΡΠ΅ ΠΎΡΡΠΎΡΠΎΠΆΠ½Ρ: Π΅ΡΠ»ΠΈ ΡΠ°Π±ΠΎΡΠΎΠ΄Π°ΡΠ΅Π»Ρ ΠΏΡΠΎΡΠΈΡ Π²ΠΎΠΉΡΠΈ Π² ΠΈΡ ΡΠΈΡΡΠ΅ΠΌΡ, ΠΈΡΠΏΠΎΠ»ΡΠ·ΡΡ iCloud/Google, ΠΏΡΠΈΡΠ»Π°ΡΡ ΠΊΠΎΠ΄/ΠΏΠ°ΡΠΎΠ»Ρ, Π·Π°ΠΏΡΡΡΠΈΡΡ ΠΊΠΎΠ΄/ΠΠ, Π½Π΅ Π΄Π΅Π»Π°ΠΉΡΠ΅ ΡΡΠΎΠ³ΠΎ - ΡΡΠΎ ΠΌΠΎΡΠ΅Π½Π½ΠΈΠΊΠΈ. ΠΠ±ΡΠ·Π°ΡΠ΅Π»ΡΠ½ΠΎ ΠΆΠΌΠΈΡΠ΅ "ΠΠΎΠΆΠ°Π»ΠΎΠ²Π°ΡΡΡΡ" ΠΈΠ»ΠΈ ΠΏΠΈΡΠΈΡΠ΅ Π² ΠΏΠΎΠ΄Π΄Π΅ΡΠΆΠΊΡ. ΠΠΎΠ΄ΡΠΎΠ±Π½Π΅Π΅ Π² Π³Π°ΠΉΠ΄Π΅ β