Назад
Company hidden
3 часа назад

Lead Member of Technical Staff (AI Infrastructure)

Формат работы
remote/hybrid
Тип работы
fulltime
Грейд
lead
Английский
b2
Страна
France/UK/US +1 еще
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Lead Member of Technical Staff (AI Infrastructure): Designing and scaling high-performance machine learning serving systems for frontier LLMs with an accent on low latency and high throughput. Focus on architecting distributed GPU clusters, optimizing multi-cloud deployments, and establishing engineering standards for AI model serving.

Location: Remote-flexible; offices in San Francisco, New York, Toronto, Montreal, London, and Paris

Company

hirify.global is an AI company training and deploying frontier models for developers and enterprises to power content generation, semantic search, RAG, and agents.

What you will do

  • Lead the architecture and strategy for deploying optimized NLP models into production in low latency, high throughput, and high availability environments.
  • Design high-performance, scalable, and reliable ML systems and provide technical leadership across multiple teams.
  • Serve as a primary technical contact for customers, designing customized deployments to meet specific business needs.
  • Mentor engineers and set team-wide standards for Kubernetes development and production coding.
  • Own compute, storage, and network resource management and cost optimization at an organizational level.

Requirements

  • 8+ years of experience running large-scale production infrastructure with a proven track record of technical leadership.
  • Demonstrated expertise in architecting large, highly available distributed systems with Kubernetes and GPU workloads.
  • Extensive experience with multi-cloud environments, including GCP, Azure, AWS, and OCI.
  • Proficiency in Golang, C++, or other languages designed for high-performance scalable servers.
  • Deep knowledge of accelerator characteristics (GPUs, TPUs) and how to leverage them for performance improvements.
  • Strong collaboration and communication skills for leading cross-functional initiatives.

Culture & Benefits

  • Opportunity to work on the cutting edge of AI research.
  • Generous vacation policy with 6 weeks (30 working days) of annual leave.
  • Comprehensive health and dental benefits, including a dedicated mental health budget.
  • 100% Parental Leave top-up for up to 6 months.
  • Flexible remote work options and a co-working stipend.
  • Weekly lunch stipends, in-office lunches, and snacks.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →