TL;DR
Engineering Manager (AI): Leading the Event Platform team responsible for high-throughput event processing infrastructure, powering the company’s LLM evaluation and observability platform with an accent on optimizing distributed systems for durability, high availability, and low latency. Focus on shaping mission-critical infrastructure and driving architectural decisions for distributed systems.
Location: United States of America
Company
The company is the leading AI observability and evaluation platform, empowering AI engineers to build and deploy high-performing, reliable models.
What you will do
- Lead and scale a team of engineers building next-generation event processing and storage infrastructure that handles millions of events per second.
- Drive architectural decisions for mission-critical distributed systems, focusing on reliability, performance, and scalability.
- Design and implement sophisticated storage and query engines optimized for AI observability workloads.
- Partner with product teams to evolve platform capabilities while maintaining strict performance and reliability SLAs.
- Mentor and grow engineering talent while fostering a culture of distributed systems excellence.
Requirements
- Strong background in performance optimization, particularly around low-latency storage and retrieval systems.
- Track record of leading engineering teams working on distributed infrastructure while maintaining technical depth.
- Experience with modern infrastructure (Kubernetes, cloud-native architectures) and distributed systems patterns.
- Focus on pragmatic solutions while meeting strict performance and reliability requirements.
Culture & Benefits
- We encourage underrepresented talent to apply to all our roles & support accessibility needs
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →