TL;DR
Staff Software Engineer, Inference Infrastructure (AI): Developing, deploying, and operating AI platform for large language models through easy-to-use API endpoints with an accent on low latency, high throughput, and high availability environments. Focus on deploying optimized NLP models to production and creating customized deployments to meet specific customer needs.
Location: Hybrid (San Francisco, Toronto, London, New York, Montreal)
Company
hirify.global is training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences.
What you will do
- Develop, deploy, and operate the AI platform delivering hirify.global's large language models through easy to use API endpoints.
- Deploy optimized NLP models to production in low latency, high throughput, and high availability environments.
- Interface with customers and create customized deployments to meet their specific needs.
- Design large, highly available distributed systems.
- Collaborate and troubleshoot to build mission-critical systems and ensure smooth operations.
Requirements
- 5+ years of engineering experience running production infrastructure at a large scale.
- Experience designing large, highly available distributed systems with Kubernetes and GPU workloads.
- Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving.
- Experience in complex Linux-based computing environments.
- Strong understanding or working experience with distributed systems.
- Experience in Golang, C++ or other languages designed for high-performance scalable servers.
Culture & Benefits
- Open and inclusive culture and work environment.
- Work closely with a team on the cutting edge of AI research.
- Full health and dental benefits, including a separate budget to take care of your mental health.
- 100% Parental Leave top-up for up to 6 months.
- 6 weeks of vacation (30 working days!).
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →