Company hidden
12 hours ago

Tech Lead Manager, Agentic Runtime (AI)

$250,000 - $300,000
Work format
hybrid
Employment type
full-time
Grade
lead
English
B2
Country
US

Job description

TL;DR

Tech Lead Manager (Agentic Runtime): Build low-latency, reliable, and secure runtime services that power AI agents and assistants, with an emphasis on multi-turn orchestration, tool calling, model routing, memory, and streaming. The role focuses on designing for performance, correctness, cost optimization, fault isolation, and deep observability in distributed systems.

Location: Hybrid (4 days a week in either the Mountain View or San Francisco office)

Salary: $250,000 - $300,000 annually

Company

hirify.global is the Work AI platform that powers intelligent search, AI assistants, and scalable AI agents across enterprises with over 100 SaaS connectors and robust APIs.

What you will do

  • Own end-to-end runtime problems from architecture and design to production launch and reliability.
  • Build core services for session lifecycle, streaming responses, structured tool execution, memory/state, and policy/guardrails.
  • Design for performance, correctness, and cost: reduce latency, improve tail behavior, optimize token/tool budgets.
  • Integrate with LLM providers such as OpenAI, Anthropic, and Google Gemini, and with internal evaluation frameworks.
  • Harden the platform with fault isolation, retries, circuit-breaking, backpressure, and graceful degradation.
  • Instrument observability with tracing, metrics, and logs, and define SLOs for high availability.
  • Collaborate with product, quality, and application teams on roadmap prioritization.

Requirements

  • 8+ years of software engineering experience building production distributed systems or cloud-native applications.
  • 1+ years of engineering management experience.
  • BS/BA in Computer Science or equivalent.
  • Strong coding skills in Python, Go, Java, or C++ with a focus on reliability, performance, and tests.
  • Experience operating services on Kubernetes and a major cloud (GCP, AWS, or Azure).
  • Familiarity with event/streaming systems (Pub/Sub, Kafka), caching (Redis), and low-latency data stores.
  • Practical understanding of LLMs/agents: tool calling, structured outputs, streaming, model routing.
  • Strong observability skills: tracing (OpenTelemetry), metrics, and production debugging.

Nice to have

  • Background in policy/guardrails, multi-tenant isolation, rate-limiting, concurrency control, cost optimization.

Culture & Benefits

  • Comprehensive benefits: medical, vision, and dental coverage, generous time off, and a 401k contribution.
  • Home office improvement stipend, plus annual education and wellness stipends.
  • Vibrant culture with regular events and daily healthy lunches.
  • Commitment to diversity, inclusion, and AI fluency for all employees.

Hiring process

  • AI-focused exercise or discussion to assess AI thinking and usage.
  • Standard interviews evaluating technical skills, management experience, and cultural fit.
