Senior AI/ML Specialist Solutions Architect (AI Infra & Cloud)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior AI/ML Specialist Solutions Architect (AI Infra & Cloud): Architect and optimize distributed training and inference systems for large-scale AI models with an accent on multi-node multi-GPU environments and cloud infrastructure. Focus on designing customer-focused solutions, scaling ML pipelines from POC to production, and delivering technical leadership on AI deployment strategies.
Location: Remote U.S. Legal authorization to work in the United States on a full-time basis without sponsorship required.
Salary: $225,000 to $315,000 per year
Company
Publicly traded company at the forefront of AI revolution, offering AI-centric cloud platform with large-scale GPU clusters, tools, and services for Fortune 1000 companies, startups, and AI researchers.
What you will do
- Architect and optimize distributed training and inference systems for large-scale AI models
- Design and deliver customer-focused solutions maximizing performance and business value
- Lead transition of ML pipelines from POC to scalable production systems
- Build long-term customer relationships and ensure alignment with strategic goals
- Create whitepapers, deliver technical presentations, and host webinars
- Provide technical leadership, mentor teams on AI infrastructure, and collaborate with engineering/product teams
Requirements
- 5+ years experience with cloud technologies and infrastructure, ideally in senior MLOps or Solutions Architect roles
- Proven expertise scaling and optimizing AI workloads across multi-node and multi-GPU environments
- Demonstrated success delivering ML products from POC to production
- Deep knowledge of ML frameworks like PyTorch and JAX
- Strong background in NVIDIA HPC ecosystem (CUDA, NCCL, Infiniband)
- Exceptional communication skills for technical and business stakeholders
- Legal authorization to work in the United States without sponsorship
Nice to have
- Programming: Python, Go, Java, C++
- IaC: Terraform, Ansible
- Orchestration: Kubernetes, Slurm
- DevOps: Git, Docker, Helm
- Big Data: Spark, Kafka, Hadoop
- Databases: SQL, NoSQL, vector databases
- ML: TensorFlow, HuggingFace, Scikit-learn
Culture & Benefits
- Competitive compensation with stock options
- 100% company-paid medical, dental, vision for employees and families
- 401(k) with 4% match
- Flexible remote work environment
- 20 weeks paid parental leave for primary caregivers, 12 weeks for secondary
- Company-paid disability and life insurance, up to $85/month mobile/internet
- Work with state-of-the-art AI tech including NVIDIA GPUs and supercomputers
Hiring process
- Level 1: Interview with Talent Acquisition
- Level 2: Interview with Hiring Manager
- Level 3: Technical Assessment
- Reference and Background Checks
- Job Offer
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →