TL;DR
Sr. Software Engineer, Inference (AI): Building and maintaining critical systems that serve Claude to millions of users worldwide with an accent on maximizing compute efficiency and enabling breakthrough research. Focus on tackling complex, distributed systems challenges across multiple accelerator families and emerging AI hardware running in multiple cloud platforms.
Location: London, UK. Currently, staff are expected to be in one of our offices at least 25% of the time.
Salary: £225,000 - £325,000 GBP
Company
hirify.global’s mission is to create reliable, interpretable, and steerable AI systems.
What you will do
- Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators.
- Autoscaling compute fleet to dynamically match supply with demand across production, research, and experimental workloads.
- Building production-grade deployment pipelines for releasing new models to millions of users.
- Integrating new AI accelerator platforms to maintain hardware-agnostic competitive advantage.
- Supporting inference for new model architectures.
- Analyzing observability data to tune performance based on real-world production workloads.
Requirements
- Significant software engineering experience, particularly with distributed systems.
- Bachelor's degree in a related field or equivalent experience.
- Results-oriented, with a bias towards flexibility and impact.
- Pick up slack, even if it goes outside your job description.
- Want to learn more about machine learning systems and infrastructure.
- Thrive in environments where technical excellence directly drives both business results and research breakthroughs.
Nice to have
- High-performance, large-scale distributed systems experience.
- Experience implementing and deploying machine learning systems at scale.
- Experience with load balancing, request routing, or traffic management systems.
- Experience with LLM inference optimization, batching, and caching strategies.
- Experience with Kubernetes and cloud infrastructure (AWS, GCP).
- Experience with Python or Rust.
Culture & Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Lovely office space in which to collaborate with colleagues.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →