Product Manager, Safeguards Rare Harms (AI)
Job description
TL;DR
Product Manager, Safeguards Rare Harms (AI): Design and deploy safeguard systems that prevent misuse of frontier AI models, with an emphasis on detections, evals, and interventions. The role focuses on building state-of-the-art safety systems, mitigating deployment risks, and balancing business and technical tradeoffs in a zero-to-one environment.
Location: San Francisco, CA. Hybrid: expected to be in the office at least 25% of the time.
Salary: $305,000 - $385,000 USD
Company
AI safety and research company dedicated to creating reliable, interpretable, and steerable AI systems that are helpful, harmless, and honest.
What you will do
- Lead the ideation, design, development, and deployment of Safeguards systems and product UX across various cloud platforms.
- Develop detections, evaluations, interventions, and tools to measure and mitigate deployment and user risks.
- Drive impact through ruthless prioritization, defining problems and clear requirements for MVP versus ideal states.
- Collaborate with policy, enforcement, research, and engineering stakeholders to build safety by design.
- Lead the development of metrics to identify blind spots and inform future project planning.
- Analyze the AI landscape to plan for mitigation of risks from increasingly powerful models and adversaries.
Requirements
- 5+ years in product management with a focus on roadmaps, data, and infrastructure.
- Deep technical expertise in the development, deployment, and measurement of Safeguards systems.
- Experience working with policy experts, AI/ML research engineers, and software engineering teams.
- Proven ability to launch and measure new products in a zero-to-one environment.
- Strong ability to articulate complex technical concepts to non-technical audiences.
- Bachelor’s degree or an equivalent combination of education and experience.
Culture & Benefits
- Competitive compensation with optional equity donation matching.
- Generous vacation and parental leave policies.
- Flexible working hours and a high-quality collaborative office space.
- Visa sponsorship available for eligible candidates.
- Strong emphasis on a cohesive, research-driven team culture and open communication.