Tech Lead Manager, Agentic Runtime (AI)
Job description
TL;DR
Tech Lead Manager (Agentic Runtime): Build low-latency, reliable, secure runtime services that power AI agents and assistants, with an emphasis on multi-turn orchestration, tool calling, model routing, memory, and streaming. Focus on designing for performance, correctness, cost optimization, fault isolation, and deep observability in distributed systems.
Location: Hybrid (4 days a week in either Mountain View or San Francisco offices)
Salary: $250,000 - $300,000 annually
Company
The company is the Work AI platform that powers intelligent search, AI assistants, and scalable AI agents across enterprises, with over 100 SaaS connectors and robust APIs.
What you will do
- Own end-to-end runtime problems from architecture and design to production launch and reliability.
- Build core services for session lifecycle, streaming responses, structured tool execution, memory/state, and policy/guardrails.
- Design for performance, correctness, and cost: reduce latency, improve tail behavior, and optimize token/tool budgets.
- Integrate with LLM providers such as OpenAI, Anthropic, and Google Gemini, and with internal evaluation frameworks.
- Harden the platform with fault isolation, retries, circuit-breaking, backpressure, and graceful degradation.
- Instrument services with tracing, metrics, and logs, and define SLOs for high availability.
- Collaborate with product, quality, and application teams on roadmap prioritization.
Requirements
- 8+ years of software engineering experience building production distributed systems or cloud-native applications.
- 1+ years of engineering management experience.
- BS/BA in Computer Science or equivalent.
- Strong coding skills in Python, Go, Java, or C++, with a focus on reliability, performance, and testing.
- Experience operating services on Kubernetes and a major cloud (GCP, AWS, or Azure).
- Familiarity with event/streaming systems (Pub/Sub, Kafka), caching (Redis), and low-latency data stores.
- Practical understanding of LLMs and agents: tool calling, structured outputs, streaming, and model routing.
- Strong observability skills: tracing (OpenTelemetry), metrics, and production debugging.
Nice to have
- Background in policy/guardrails, multi-tenant isolation, rate-limiting, concurrency control, or cost optimization.
Culture & Benefits
- Comprehensive benefits: medical, vision, and dental coverage, generous time off, and a 401k contribution.
- Home office improvement stipend, plus annual education and wellness stipends.
- Vibrant culture with regular events and daily healthy lunches.
- Commitment to diversity, inclusion, and AI fluency for all employees.
Hiring process
- AI-focused exercise or discussion to assess AI thinking and usage.
- Standard interviews evaluating technical skills, management experience, and cultural fit.