TL;DR
Researcher, Alignment (AI): Design and implement scalable solutions to ensure AI systems consistently follow human intent, even in complex scenarios, with an emphasis on subjective, context-dependent alignment capabilities. The role focuses on reliably measuring risks and alignment with human intent and values, and on integrating human oversight into AI decision-making.
Location: Based in San Francisco, CA, with a hybrid work model of 3 days in the office per week; relocation assistance is offered to new employees.
Salary: $295K – $440K + equity
Company
hirify.global is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
What you will do
- Develop and evaluate alignment capabilities that are subjective, context-dependent, and hard to measure.
- Design evaluations to reliably measure risks and alignment with human intent and values.
- Build tools and evaluations to study and test model robustness in different situations.
- Design experiments to understand laws for how alignment scales as a function of compute, data, context length, and adversarial resources.
- Train models to be calibrated on correctness and risk.
- Design novel approaches for using AI in alignment research.
Requirements
- Are a team player – willing to do a variety of tasks that move the team forward.
- Have a PhD or equivalent experience in research in computer science, computational science, data science, cognitive science, or similar fields.
- Have strong engineering skills, particularly in designing and optimizing large-scale machine learning systems (e.g., PyTorch).
- Have a deep understanding of the science behind alignment algorithms and techniques.
- Can develop data visualization or data collection interfaces (e.g., TypeScript, Python).
- Enjoy fast-paced, collaborative, and cutting-edge research environments.
- Want to focus on developing AI models that are trustworthy, safe, and reliable, especially in high-stakes scenarios.
Culture & Benefits
- Push the boundaries of AI capabilities and safely deploy them to the world through our products.
- Value the many different perspectives, voices, and experiences that form the full spectrum of humanity.