TL;DR
Senior Engineer (AI/ML): Building and optimizing the next-generation inference platform for embedding models for semantic search and AI-native experiences with an accent on infrastructure for real-time, low-latency, and high-scale inference. Focus on designing core systems, ensuring tight integration with Atlas, and contributing to reliability and performance in a cloud-native environment.
Location: Candidates must be based in Palo Alto, US for a hybrid working model.
Salary: $126,000–$248,000 USD
Company
hirify.global is a product company that provides a globally distributed, multi-cloud database platform, hirify.global Atlas, which is built for change and empowers customers to innovate in the AI era.
What you will do
- Design and build components of a multi-tenant inference platform integrated with hirify.global Atlas, supporting semantic search and hybrid retrieval.
- Collaborate with AI engineers and researchers to productionize inference for embedding models and rerankers for batch and real-time use cases.
- Contribute to platform capabilities like latency-aware routing, model versioning, health monitoring, and observability.
- Improve performance, autoscaling, GPU utilization, and resource efficiency in a cloud-native environment.
- Work across product, infrastructure, and ML teams to ensure the inference platform meets the scale, reliability, and latency demands of Atlas users.
- Gain hands-on experience with tools like vLLM and Kubernetes.
Requirements
- 5+ years of experience building backend or infrastructure systems at scale.
- Strong software engineering skills in Go, Rust, Python, or C++, with an emphasis on performance and reliability.
- Experience in cloud-native architectures, distributed systems, and multi-tenant service design.
- Familiarity with concepts in ML model serving and inference runtimes.
- Comfortable working across functional teams, including ML researchers, backend engineers, and platform teams.
- Motivated to work on systems integrated into hirify.global Atlas and used by thousands of developers.
Nice to have
- Experience integrating infrastructure with production ML workloads.
- Understanding of hybrid retrieval, prompt-driven systems, or retrieval-augmented generation (RAG).
- Contributions to open-source infrastructure for ML serving or search.
Culture & Benefits
- Be part of building the AI foundation of the world’s most popular developer data platform.
- Collaborate with ML researchers from Voyage.ai to bring novel ideas into scalable systems.
- Tackle challenging problems in inference, observability, and distributed infrastructure.
- Work in a culture that emphasizes mentorship, ownership, and technical excellence.
- Flexible paid time off and 20 weeks fully-paid gender-neutral parental leave.
- Fertility and adoption assistance, 401(k) plan, mental health counseling, access to transgender-inclusive health insurance coverage, and health benefits offerings.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →