Назад
Company hidden
1 день назад

Staff + Sr. Software Engineer (Cloud Inference)

320 000 - 485 000$
Формат работы
hybrid
Тип работы
fulltime
Грейд
senior/lead
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff + Sr. Software Engineer (Cloud Inference): Designing and optimizing backend infrastructure to serve Claude across multiple cloud providers with an accent on high-performance distributed systems and multi-cloud abstraction. Focus on scaling inference execution, capacity management, and building reliable CI/CD pipelines for LLM deployment.

Location: Hybrid (San Francisco, CA). Must be in office at least 25% of the time.

Salary: $320,000 - $485,000 USD

Company

hirify.global is a public benefit corporation dedicated to creating reliable, interpretable, and steerable AI systems that are safe and beneficial for society.

What you will do

  • Design and own backend services and infrastructure for Claude across AWS, GCP, and Azure.
  • Build and evolve CI/CD automation systems to ship model versions to millions of users without regressions.
  • Create tooling abstractions to reduce per-platform complexity and enable cost-effective inference management.
  • Implement capacity planning, autoscaling, and workload routing strategies to optimize compute resources.
  • Analyze observability data to identify and remediate performance bottlenecks and cost anomalies.

Requirements

  • Significant experience with high-performance, large-scale distributed systems serving millions of users.
  • Experience building or operating services on at least one major cloud platform (AWS, GCP, or Azure).
  • Proficiency with Kubernetes, Infrastructure as Code (IaC), or container orchestration.
  • Must be based in or able to work from San Francisco, CA.
  • Ability to collaborate cross-functionally with internal teams and external cloud service provider partners.

Nice to have

  • Experience scaling infrastructure across multiple platforms, navigating differences in networking, security, and billing.
  • Hands-on experience with capacity management and resource planning at scale.
  • Understanding of multi-region deployments and global traffic management.
  • Proficiency in Python or Rust.

Culture & Benefits

  • Competitive compensation and optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours and collaborative office environment.
  • Visa sponsorship available for qualified candidates.
  • Collaborative "big science" research culture focused on long-term AI safety and steerability.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →