Назад
Company hidden
обновлено 3 дня назад

Software Engineer, Platform Systems (AI)

Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
UK
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Software Engineer, Platform Systems (AI): Design and build distributed systems for large-scale AI training workloads with an accent on failure detection, tracing, and observability. Focus on identifying performance bottlenecks, optimizing massive distributed training jobs, and ensuring system reliability at frontier scale.

Location: Onsite in London, UK

Company

hirify.global is an AI research and deployment company focused on ensuring that general-purpose artificial intelligence benefits all of humanity.

What you will do

  • Design and build distributed failure detection, tracing, and profiling systems for large-scale AI training jobs.
  • Develop tooling to identify slow, faulty, or misbehaving nodes and provide actionable visibility into system behavior.
  • Improve observability, reliability, and performance across hirify.global’s training platform.
  • Debug and resolve issues in complex, high-throughput distributed systems.
  • Collaborate with systems, infrastructure, and research teams to evolve platform capabilities.
  • Extend and adapt failure detection or tracing systems to support new training paradigms and workloads.

Requirements

  • Care deeply about performance, stability, and observability in distributed systems.
  • Experience finding and fixing issues in large-scale systems and automating operational workflows.
  • Experience writing low-level software where system details matter.
  • Understand hardware, operating systems, networking, concurrency, and distributed systems.
  • Background in high-performance computing or low-level systems engineering.
  • Excitement to work on critical infrastructure that powers frontier AI research.

Culture & Benefits

  • Committed to providing reasonable accommodations to applicants with disabilities.
  • Dedicated to ensuring general-purpose artificial intelligence benefits all of humanity.
  • Fosters a diverse and inclusive environment.
  • Offers equal employment opportunity, not discriminating on protected characteristics.
  • Focused on pushing AI system capabilities and safely deploying them through products.

Hiring process

  • Background checks for applicants will be administered in accordance with applicable law.
  • Qualified applicants with arrest or conviction records will be considered for employment consistent with laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates.
  • Requests for reasonable accommodations can be made via a provided link.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →