TL;DR
Researcher, Frontier Cybersecurity Risks (AI): Designing and implementing an end-to-end mitigation stack to reduce severe cyber misuse across hirify.global’s products with an accent on prevention, monitoring, detection, and enforcement. Focus on evaluating technical trade-offs, collaborating with threat modeling partners, and executing rigorous testing and red-teaming workflows against evolving AI threats.
Location: San Francisco, US
Salary: $295,000–$445,000
Company
hirify.global is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
What you will do
- Design and implement mitigation components for model-enabled cybersecurity misuse.
- Integrate safeguards across product surfaces in partnership with product and engineering teams.
- Evaluate technical trade-offs within the cybersecurity risk domain and propose pragmatic solutions.
- Collaborate closely with risk and threat modeling partners on mitigation design.
- Execute rigorous testing and red-teaming workflows to stress-test the mitigation stack.
Requirements
- Passion for AI safety and motivation to make cutting-edge AI models safer for real-world use.
- Demonstrated experience in deep learning and transformer models.
- Proficiency with frameworks such as PyTorch or TensorFlow.
- Strong foundation in data structures, algorithms, and software engineering principles.
- Familiarity with methods for training and fine-tuning large language models.
- Significant experience designing and deploying technical safeguards for abuse prevention, detection, and enforcement at scale.
Nice to have
- Background knowledge in cybersecurity or adjacent fields.
Culture & Benefits
- Be part of a Safety Systems organization ensuring responsible development and deployment of capable AI models.
- Contribute to building protections that remain robust as products, model capabilities, and attacker behaviors evolve.
- Work in a fast-paced, exciting environment with far-reaching importance for the company and society.
- Collaborate with cross-functional teams across research, security, policy, product, and engineering.
- Committed to providing reasonable accommodations to applicants with disabilities.
- hirify.global is an equal opportunity employer valuing diverse perspectives, voices, and experiences.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →