Назад
ΠΎΠ±Π½ΠΎΠ²Π»Π΅Π½ΠΎ 11 часов Π½Π°Π·Π°Π΄

Staff Software Engineer (AI Inference)

325Β 000 - 390Β 000GBP
Π€ΠΎΡ€ΠΌΠ°Ρ‚ Ρ€Π°Π±ΠΎΡ‚Ρ‹
hybrid
Π’ΠΈΠΏ Ρ€Π°Π±ΠΎΡ‚Ρ‹
fulltime
Π“Ρ€Π΅ΠΉΠ΄
principal
Английский
b2
Π‘Ρ‚Ρ€Π°Π½Π°
UK
Вакансия ΠΈΠ· списка Hirify.GlobalВакансия ΠΈΠ· Hirify Global, списка ΠΌΠ΅ΠΆΠ΄ΡƒΠ½Π°Ρ€ΠΎΠ΄Π½Ρ‹Ρ… tech-ΠΊΠΎΠΌΠΏΠ°Π½ΠΈΠΉ
Для мэтча ΠΈ ΠΎΡ‚ΠΊΠ»ΠΈΠΊΠ° Π½ΡƒΠΆΠ΅Π½ Plus

ΠœΡΡ‚Ρ‡ & Π‘ΠΎΠΏΡ€ΠΎΠ²ΠΎΠ΄

Для мэтча с этой вакансиСй Π½ΡƒΠΆΠ΅Π½ Plus

ОписаниС вакансии

ВСкст:
/

TL;DR

Staff Software Engineer (AI Inference): Building and optimizing high-performance inference systems for large-scale AI models with an accent on compute efficiency, intelligent request routing, and fleet-wide orchestration. Focus on solving complex distributed systems challenges across diverse AI accelerators and cloud platforms to serve millions of users and enable breakthrough research.

Location: London, UK. This role operates under a location-based hybrid policy, requiring staff to be in one of the offices at least 25% of the time. Visa sponsorship is available, with reasonable efforts made to secure a visa if an offer is extended.

Salary: Β£325,000 – Β£390,000 GBP

Company

Anthropic is a public benefit corporation with a mission to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.

What you will do

  • Identify and address key infrastructure blockers for serving Claude to millions of users globally.
  • Design intelligent routing algorithms to optimize request distribution across thousands of accelerators.
  • Autoscale the compute fleet to dynamically match supply with demand for production, research, and experimental workloads.
  • Build production-grade deployment pipelines for releasing new AI models.
  • Integrate new AI accelerator platforms to maintain hardware-agnostic competitive advantage.
  • Analyze observability data to fine-tune performance based on real-world production workloads.

Requirements

  • Significant software engineering experience, particularly with distributed systems.
  • Familiarity with performance optimization, large-scale service orchestration, and intelligent request routing.
  • Experience implementing and deploying machine learning systems at scale.
  • Proficiency in Python or Rust.
  • At least a Bachelor's degree in a related field or equivalent experience.

Nice to have

  • Familiarity with LLM inference optimization, batching strategies, and multi-accelerator deployments.
  • Experience with load balancing or traffic management systems.
  • Knowledge of Kubernetes and cloud infrastructure (AWS, GCP).

Culture & Benefits

  • Competitive compensation and benefits with optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative environment focused on high-impact AI research.
  • Emphasis on advancing long-term goals of steerable, trustworthy AI.
  • Regular research discussions to ensure pursuit of high-impact work.

Π‘ΡƒΠ΄ΡŒΡ‚Π΅ остороТны: Ссли Ρ€Π°Π±ΠΎΡ‚ΠΎΠ΄Π°Ρ‚Π΅Π»ΡŒ просит Π²ΠΎΠΉΡ‚ΠΈ Π² ΠΈΡ… систСму, ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡ iCloud/Google, ΠΏΡ€ΠΈΡΠ»Π°Ρ‚ΡŒ ΠΊΠΎΠ΄/ΠΏΠ°Ρ€ΠΎΠ»ΡŒ, Π·Π°ΠΏΡƒΡΡ‚ΠΈΡ‚ΡŒ ΠΊΠΎΠ΄/ПО, Π½Π΅ Π΄Π΅Π»Π°ΠΉΡ‚Π΅ этого - это мошСнники. ΠžΠ±ΡΠ·Π°Ρ‚Π΅Π»ΡŒΠ½ΠΎ ΠΆΠΌΠΈΡ‚Π΅ "ΠŸΠΎΠΆΠ°Π»ΠΎΠ²Π°Ρ‚ΡŒΡΡ" ΠΈΠ»ΠΈ ΠΏΠΈΡˆΠΈΡ‚Π΅ Π² ΠΏΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΊΡƒ. ΠŸΠΎΠ΄Ρ€ΠΎΠ±Π½Π΅Π΅ Π² Π³Π°ΠΉΠ΄Π΅ β†’