Lead ML/AI Engineer (MLOps)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Lead ML/AI Engineer (MLOps): Designing and implementing automated MLOps pipelines and operational strategies for production AI systems with an accent on CI/CD automation and high-performance AI infrastructure. Focus on orchestrating agentic AI solutions, managing distributed systems, and leading technical delivery for enterprise customers.
Location: Remote (United States)
Salary: $175,000–$215,000
Company
An end-to-end technology company solving the industry’s most complex challenges in computing, memory, and LED solutions through high-performance enterprise infrastructure.
What you will do
- Lead, mentor, and provide technical guidance to a team of ML/AI Engineers and AI Infrastructure Specialists.
- Architect and implement robust, automated CI/CD pipelines for AI/ML models and agentic AI solutions.
- Oversee operational strategy, including monitoring, scaling, maintenance, and security of production AI systems.
- Manage the technical execution of multiple customer-facing project delivery activities and resolve critical issues.
- Lead the presentation of project delivery status, performance metrics, and technical resolution plans to stakeholders.
Requirements
- 7+ years of experience in software engineering, DevOps, or ML engineering.
- At least 2 years in a technical leadership, mentorship, or lead engineer capacity.
- Deep hands-on experience with CI/CD pipelines (Jenkins, GitLab CI, Actions) and infrastructure-as-code (Ansible, Terraform, Puppet).
- Production-level experience with Docker and Kubernetes.
- Proficiency with monitoring and observability tools (Prometheus, Grafana, ELK Stack).
- Must be based in the United States.
Nice to have
- Experience with MLOps platforms like Kubeflow, MLflow, or Seldon Core.
- Hands-on experience with NVIDIA AI Enterprise stack (Triton Inference Server, TensorRT-LLM, NeMo).
- Background in customer-facing professional services or consulting.
- Strong scripting skills in Python or Go.
- Experience deploying infrastructure in both public cloud (AWS, Azure, GCP) and on-premises data centers.
Culture & Benefits
- Medical, dental, and vision benefits.
- 401k saving plan.
- Paid Time Off, Life Insurance, and an Employee Assistance Plan.
- Commitment to an inclusive environment that embraces differences and fosters belonging.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →