TL;DR
Senior ML Engineer (AI): Building an inference and fine-tuning platform for foundation models (text, vision, audio, multimodal) at massive scale with an accent on model quality, training efficiency, and production speedups. Focus on enhancing fine-tuning methodologies for cutting-edge LLMs, identifying inference bottlenecks, and investigating low-precision training and inference for modern hardware.
Location: The job is available remote from Europe or the United States.
Company
hirify.global is a Nasdaq-listed company headquartered in Amsterdam, leading in cloud computing for the global AI economy with R&D hubs across Europe, North America, and Israel.
What you will do
- Enhance fine-tuning methodologies (LoRA-based and full-parameter) for cutting-edge LLMs, focusing on model quality and training efficiency.
- Identify LLM inference bottlenecks to drive production speedups.
- Build model training and evaluation pipelines in JAX for speculative decoding.
- Experiment with architectures (dense/MoE, auto-regressive/parallel) and derive scaling laws.
- Investigate low-precision (FP8, NVFP4/MXFP4) methodologies for supervised fine-tuning and reinforcement learning.
Requirements
- Profound understanding of theoretical foundations of machine learning and reinforcement learning.
- Deep expertise in modern deep learning for language processing and generation.
- Experience with training large models on multiple computational nodes.
- Reasonable understanding of performance aspects of large neural network training (sharding strategies, custom kernels, hardware features etc.).
- Strong software engineering skills, primarily in Python.
- Deep experience with modern deep learning frameworks (JAX).
- Proficiency in contemporary software engineering approaches, including CI/CD, version control and unit testing.
- Strong communication and leadership abilities.
Nice to have
- Previous experience working with language models or other similar NLP technologies.
- Familiarity with important ideas in LLM space (MHA, RoPE, ZeRO/FSDP, Flash Attention, quantization).
- Track record of building and delivering products in a dynamic startup-like environment.
- Strong engineering skills, including experience in developing large distributed systems or high-load web services.
- Open-source projects showcasing engineering prowess.
- Excellent command of the English language, alongside superior writing, articulation, and communication skills.
Culture & Benefits
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth.
- Flexible working arrangements.
- Dynamic and collaborative work environment that values initiative and innovation.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →