Company hidden

5 часов назад

Research Scientist (AI Safety)

150 000 - 250 000$

Формат работы

hybrid

Тип работы

fulltime

Английский

Страна

France/UK

Релокация

France

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Research Scientist (AI Safety): Studying how LLM agents fail in the wild and eliciting unsafe behaviors through concrete experiments with an accent on agent misalignment and deception. Focus on developing automated audit agents, pressure-testing frontier models, and building empirical benchmarks for AI safety.

Location: Hybrid in Paris or London. Relocation package available for Paris only.

Salary: $150K – $250K + Equity

Company

AI Safety company building a reliability and optimization layer for AI systems using natural-language policies.

What you will do

Own end-to-end research projects from hypothesis formulation to falsifiable experiments and results.
Develop automated audit agents to discover and characterize suspect model behavior at scale.
Study misalignment and bias in real user interactions to create shippable evals.
Pressure-test frontier agents in high-stakes scenarios to identify failure modes.
Perform white-box and black-box investigations of AI model failures.
Publish findings in public blog posts and conference papers.

Requirements

Proven track record of empirical research in agent behavior, model evaluation, or alignment.
Strong ML engineering skills, including independent ability to build MVPs with fine-tuning and agent inference.
Expertise in experimental design, including isolating failure modes and calibrating baselines.
Ability to define and iterate experiments for vague behavioral questions.
Fluent AI power-user experienced with frontier models and coding agents.

Nice to have

Publications at A* venues (NeurIPS, ICML, ICLR, ACL).
Depth in interpretability (NLAs, SAEs, persona vectors).
MSc or PhD in ML, Computer Science, Physics, or related quantitative fields.
AI safety fellowship (MATS, ASTRA, etc.).

Culture & Benefits

Paid time off according to local regulations.
Comprehensive medical insurance for France-based employees.
Provision of all necessary hardware, tools, and AI agent/IDE subscriptions.
Bi-annual team off-sites (e.g., Alps, Saint-Tropez).

Hiring process

Introductory HR call (25 min).
Take-home technical test.
Technical interview with the Head of Fundamental Research (60 min).
Final conversation with the CEO (45 min).

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Похожие вакансии

Research Scientist (AI Safety)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Hiring process

Похожие вакансии

Data Scientist (AI)

Lead Data Scientist (Cybersecurity)

Data Scientist (AI/ML)

Machine Learning Engineer (AI)

Principal Data Scientist (AI)

Data Scientist (Fintech)

Разработка

Game Dev

Design и Creative

Аналитика

Менеджмент

People & Business

Research Scientist (AI Safety)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Hiring process

Categories

Похожие вакансии

Data Scientist (AI)

Lead Data Scientist (Cybersecurity)

Data Scientist (AI/ML)

Machine Learning Engineer (AI)

Principal Data Scientist (AI)

Data Scientist (Fintech)