Назад
Company hidden
1 день назад

Machine Learning Engineer (AI Architecture Research)

Формат работы
remote (Global)
Тип работы
fulltime
Английский
b2
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Machine Learning Engineer (AI Architecture Research): Research and develop new neural network architectures (e.g. alternatives or extensions to Transformers, recurrent/hybrid models, long-context systems) with an accent on architecture-level experiments (scaling laws, memory mechanisms, compute trade-offs). Focus on prototyping models end-to-end from research code to training-ready implementations, analyzing model behavior and failure modes, and collaborating with inference engineers for deployable systems.

Location: Remote (world)

Company

Series-A AI company focused on next-generation model architectures.

What you will do

  • Research and develop new neural network architectures including alternatives or extensions to Transformers, recurrent/hybrid models, and long-context systems
  • Design and run architecture-level experiments on scaling laws, memory mechanisms, and compute trade-offs
  • Prototype models end-to-end from research code to training-ready implementations
  • Collaborate with inference and systems engineers to ensure architectures are deployable and efficient
  • Analyze model behavior, failure modes, and inductive biases
  • Read, reproduce, and extend cutting-edge research papers; contribute to internal notes, benchmarks, and open-source efforts

Requirements

  • Strong background in machine learning fundamentals and deep learning
  • Hands-on experience implementing model architectures from scratch
  • Solid understanding of attention mechanisms, RNNs, state-space models or hybrid architectures
  • Knowledge of training dynamics, scaling behavior, optimization, memory, latency, and compute constraints
  • Comfortable working in PyTorch or JAX
  • Ability to move fluidly between theory, experimentation, and engineering; clear communicator on architectural trade-offs

Nice to have

  • Experience with non-Transformer architectures (RNN variants, SSMs, long-context models)
  • Background in research-driven startups or open-source ML projects
  • Experience with large-scale training or custom training loops
  • Publications, preprints, or notable research contributions
  • Familiarity with inference optimization and deployment constraints

Culture & Benefits

  • Work on core model architecture, not just fine-tuning
  • Direct influence on technical direction of a Series-A company
  • Small, high-caliber team with fast feedback loops
  • Opportunity to ship research into production
  • Competitive compensation + meaningful equity

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →