Machine Learning Infra Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Machine Learning Infra Engineer (AI): Building training and inference frameworks for vision models processing enterprise documents with an accent on scaling across multi-node GPU clusters and high-performance serving. Focus on designing distributed systems, developing benchmarks, applying SOTA advances, and creating observability tooling.
Location: On-site in San Francisco office. In-person role required.
Salary: $150K – $300K
Company
enables AI teams to ingest unstructured enterprise data like PDFs and spreadsheets using vision models for accurate extraction at scale.
What you will do
- Build and maintain training and inference stacks emphasizing fast iteration, flexibility, and high performance.
- Develop benchmarks to identify bottlenecks in training and inference.
- Explore and apply state-of-the-art advances in training and inference.
- Design systems for reliable multi-node, multi-GPU training with observability.
- Scale distributed workloads across GPU clusters for better utilization, reliability, and cost efficiency.
- Create tooling and abstractions to accelerate ML engineers from experiment to production.
Requirements
- Strong Python skills and systems engineering background.
- Comfortable with Kubernetes and distributed training frameworks.
- High standards for quality, precision, and first-principles problem-solving.
- Experience with real-world implementation challenges.
- Ability to thrive in fast-changing, high-growth environments.
- Effective collaboration across technical and non-technical teams with full ownership.
Nice to have
- Experience at early-stage or high-growth startups.
- Contributions to open source training/inference stacks.
- Excitement for distributed inference on 100s-1000s of GPUs.
- Focus on technical excellence with business impact.
Culture & Benefits
- Unlimited PTO for recharging.
- Daily free lunch with teammates at the office.
- Reimbursed transportation costs.
- Generous health, dental, and vision insurance.
- $150/mo health and wellness budget for gym, classes, etc.
- Flexible parental leave scheduling.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →