TL;DR
Research Scientist (AI): Driving end-to-end ambiguous research problems in RL or mid-training, forming hypotheses, and building training/eval/data to test them. Focus on improving understanding of RL for longer horizon tasks, training graders for coding, and enhancing data quality for model training.
Location: In-person in North Beach, San Francisco or Manhattan, New York
Company
hirify.global is building the best tool for professional programmers by automating coding through inventive research, design, and engineering.
What you will do
- Own ambiguous, hard research problems end-to-end, forming hypotheses and designing experiments.
- Build training, evaluation, and data infrastructure to test hypotheses and push results into models.
- Improve understanding of Reinforcement Learning (RL) for longer horizon tasks with less compute.
- Train graders to improve performance on coding tasks with non-verifiable reward.
- Improve the quality and difficulty of datapoints used for model training.
Requirements
- Deep background in RL and strong machine learning fundamentals.
- Excellent programmer and software engineer.
- Ability to handle ambiguous research tasks with little guidance.
- Strong focus on data quality.
- Must work in-person from offices in San Francisco or New York.
Culture & Benefits
- Small, talent-dense, flat organization.
- Culture of truth-seeking, passion, creativity, spirited debate, and shipping code.
- Cozy offices in North Beach, San Francisco and Manhattan, New York.
- Well-stocked libraries.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →