Anthropic

AI safety and research company

301открытых вакансий

AI / Research

1 000-5 000

Средняя вакансия

Описание вакансии подробное, но отсутствует информация о зарплате, что является значительным недостатком. Роль находится в модной области ИИ с хорошей репутацией компании, но неясный диапазон зарплаты вызывает опасения по поводу ожиданий по компенсации.

Кликните для подробной информации

Оценка от Hirify AI

Anthropic

4 месяца назад

Research Engineer, Machine Learning (Reinforcement Learning)

Формат работы

hybrid

Тип работы

fulltime

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Research Engineer, Machine Learning (Reinforcement Learning): Collaborating with researchers and engineers to advance the capabilities and safety of large language models with an accent on implementing novel approaches and contributing to research direction. Focus on fundamental research in reinforcement learning, creating 'agentic' models via tool use, and improving reasoning abilities in areas like mathematics.

Location: London, UK. This is a hybrid role requiring staff to be in one of the offices at least 25% of the time. Visa sponsorship is available.

Company

Anthropic is a public benefit corporation focused on creating reliable, interpretable, and steerable AI systems for societal benefit.

What you will do

Lead reinforcement learning research and development for Anthropic's AI systems.
Develop systems enabling models to effectively use computers and advance code generation.
Pioneer fundamental RL research for large language models, improving model reasoning.
Architect and optimize core RL infrastructure and distributed experiment management across GPU clusters.
Design, implement, and test novel training environments, evaluations, and methodologies for RL agents.
Drive performance improvements through profiling, optimization, and debugging distributed systems.

Requirements

Proficiency in Python and async/concurrent programming (e.g., Trio).
Experience with machine learning frameworks (PyTorch, TensorFlow, JAX).
Industry experience in machine learning research.
Ability to balance research exploration with engineering implementation.
Strong systems design and communication skills.
Passion for the potential impact of AI and commitment to developing safe and beneficial systems.

Nice to have

Familiarity with LLM architectures and training methodologies.
Experience with reinforcement learning techniques and environments.
Experience with virtualization, sandboxed code execution, or Kubernetes.
Experience with distributed systems or high-performance computing.
Experience with Rust and/or C++.

Culture & Benefits

Work as a single cohesive team on a few large-scale research efforts.
Focus on impact: advancing long-term goals of steerable, trustworthy AI.
Extremely collaborative group with frequent research discussions.
Competitive compensation and benefits, optional equity donation matching.
Generous vacation and parental leave, flexible working hours.
Lovely office space in London for collaboration.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Research Engineer, Machine Learning (Reinforcement Learning)

Anthropic

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Похожие вакансии

Specialist Machine Learning Researcher (AI)

Senior AI/ML Engineer (Medical Imaging)

Architect of Autonomous Research Platform (AI)

LLM Engineer (AI)

Solution Engineer (AI)

Director, Search & AI Evaluation (AI)

Разработка

Game Dev

Design и Creative

Аналитика

Менеджмент

People & Business