Staff + Sr. Software Engineer (Cloud Inference)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff + Sr. Software Engineer (Cloud Inference): Designing and optimizing backend infrastructure to serve Claude across multiple cloud providers with an accent on high-performance distributed systems and multi-cloud abstraction. Focus on scaling inference execution, capacity management, and building reliable CI/CD pipelines for LLM deployment.
Location: Hybrid (San Francisco, CA). Must be in office at least 25% of the time.
Salary: $320,000 - $485,000 USD
Company
is a public benefit corporation dedicated to creating reliable, interpretable, and steerable AI systems that are safe and beneficial for society.
What you will do
- Design and own backend services and infrastructure for Claude across AWS, GCP, and Azure.
- Build and evolve CI/CD automation systems to ship model versions to millions of users without regressions.
- Create tooling abstractions to reduce per-platform complexity and enable cost-effective inference management.
- Implement capacity planning, autoscaling, and workload routing strategies to optimize compute resources.
- Analyze observability data to identify and remediate performance bottlenecks and cost anomalies.
Requirements
- Significant experience with high-performance, large-scale distributed systems serving millions of users.
- Experience building or operating services on at least one major cloud platform (AWS, GCP, or Azure).
- Proficiency with Kubernetes, Infrastructure as Code (IaC), or container orchestration.
- Must be based in or able to work from San Francisco, CA.
- Ability to collaborate cross-functionally with internal teams and external cloud service provider partners.
Nice to have
- Experience scaling infrastructure across multiple platforms, navigating differences in networking, security, and billing.
- Hands-on experience with capacity management and resource planning at scale.
- Understanding of multi-region deployments and global traffic management.
- Proficiency in Python or Rust.
Culture & Benefits
- Competitive compensation and optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours and collaborative office environment.
- Visa sponsorship available for qualified candidates.
- Collaborative "big science" research culture focused on long-term AI safety and steerability.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →