Senior AI Engineer (LLM)
Job description
TL;DR
Senior AI Engineer (LLM): Build and optimize core systems for a vertically integrated GenAI cloud platform, with an emphasis on distributed systems, inference efficiency, and large-scale training pipelines. The role focuses on solving hard systems problems at the hardware/software boundary to deliver high-performance, developer-facing AI services.
Company
The company is a vertically integrated GenAI cloud platform provider that builds and manages the data centers, software, and applications powering the next generation of AI.
What you will do
- Design, build, and optimize scalable AI platform systems for distributed training and inference.
- Improve inference performance and efficiency through techniques such as quantization, batching, and model compression.
- Develop robust post-training and fine-tuning services (RLHF, DPO, LoRA).
- Create evaluation and benchmarking frameworks to measure model quality and system performance.
- Build developer-facing APIs, SDKs, and tooling to enable seamless platform consumption.
- Analyze performance bottlenecks across the full AI stack from hardware to model execution.
Requirements
- 5+ years of experience in machine learning, distributed systems, or high-performance infrastructure.
- 4+ years of hands-on experience in production AI environments.
- Strong expertise in Python and PyTorch.
- Deep understanding of transformer architectures and LLMs.
- Experience with distributed compute paradigms such as data/model parallelism and sharding.
- Knowledge of hardware-level optimization (CUDA, ROCm, or memory management).
Nice to have
- Experience with containerized, distributed environments like Kubernetes.
- Contributions to widely used open-source AI frameworks.
- Knowledge of advanced techniques such as mixture-of-experts (MoE) architectures or speculative decoding.
- Experience developing APIs using OpenAPI 3.0+ specifications.
Culture & Benefits
- Focus on relentless innovation, ownership, and accountability.
- Transparent culture that prioritizes open collaboration.
- Opportunity to work on complex, large-scale systems at the cutting edge of AI.
- Emphasis on developer experience and high-quality, production-grade engineering.