TL;DR
Safeguards Analyst (AI): Building and executing enforcement workflows to detect and mitigate human exploitation and abuse in AI products, with an emphasis on technical detection signals and policy strategy. The role focuses on architecting automated review systems, conducting deep-dive threat investigations, and collaborating with external intelligence partners to protect user well-being.
Location: Remote-friendly within the US, with an expected hybrid presence (at least 25% of the time) in San Francisco or Washington, DC.
Salary: $245,000 – $285,000 USD
Company
hirify.global is an AI safety and research company building reliable, interpretable, and steerable AI systems.
What you will do
- Design and architect automated enforcement and review workflows for human exploitation and abuse.
- Develop and tune detection signals for high-harm policy areas in collaboration with Engineering and Data Science teams.
- Conduct deep-dive investigations into threat patterns using data analysis tools.
- Curate golden evaluation datasets and track enforcement outcomes across consumer and API surfaces.
- Build and maintain relationships with NGOs, hotlines, and industry hash-sharing consortia.
- Review flagged content to drive enforcement decisions and inform policy improvements.
Requirements
- 3+ years of experience in trust and safety, content moderation, or counter-exploitation work.
- Subject matter expertise in human trafficking, sextortion, or commercial sexual exploitation.
- Experience building or operating detection and review workflows at scale.
- Proficiency in SQL, Python, or similar data analysis tools.
- Ability to analyze complex situations and make reasoned decisions under pressure.
- Comfort working with explicit content of a sexual, violent, or disturbing nature.
Nice to have
- Familiarity with NGO ecosystems like NCMEC, IWF, or Thorn.
- Experience with open-source investigations or threat actor profiling.
- Experience working with generative AI products and prompt engineering for content review.
- Deep interest in AI safety and responsible technology development.
Culture & Benefits
- Competitive compensation including equity.
- Flexible working hours and generous vacation policy.
- Collaborative, research-focused environment with frequent team discussions.
- Supportive office spaces in San Francisco and Washington, DC.
- Visa sponsorship available for qualified candidates.