Bare Metal Support Engineer
Job description
TL;DR
Bare Metal Support Engineer (System Engineering): Supporting, operating, and maintaining a large GPU fleet across multiple data centers, with an emphasis on hardware troubleshooting, system reliability, and customer support. The role focuses on diagnosing complex issues, coordinating with data center teams, and automating support workflows to ensure high performance and scalability.
Location: Hybrid in Livingston, NJ; New York, NY; Sunnyvale, CA; Bellevue, WA
Salary: $83,000–$110,000
Company
The company is a publicly traded AI-focused cloud infrastructure provider delivering high-performance GPU cloud services for AI labs, startups, and enterprises.
What you will do
- Provide high-level support for customers using bare-metal GPU fleets on the company's cloud.
- Diagnose, triage, and investigate customer issues and incidents, identifying root causes and escalating as needed.
- Coordinate remote troubleshooting and hardware interventions with data center technicians.
- Create and maintain internal documentation and knowledge base articles.
- Participate in on-call rotations to support production clusters and ensure operational reliability.
- Collaborate with engineering teams to improve hardware reliability, software stability, and system performance.
Requirements
- Must be eligible to access U.S. export controlled information (U.S. person or eligible for export authorization).
- Experience with data centers, GPU clusters, system administration, and hardware troubleshooting.
- Intermediate Linux knowledge and command-line proficiency.
- Experience with NVIDIA GPUs, SuperMicro/Dell systems, HPC, and large-scale data centers.
- Networking fundamentals and troubleshooting skills.
- Experience with firmware updates, BIOS configurations, driver management, and scripting (Python, Bash, Ansible).
Nice to have
- Curiosity about Kubernetes, Docker, and containerized infrastructure.
- Strong problem-solving skills with a proactive and analytical mindset.
- Excellent communication and collaboration skills in a fast-paced environment.
Culture & Benefits
- Medical, dental, and vision insurance fully paid by the company.
- Company-paid life insurance and disability coverage.
- Flexible spending and health savings accounts.
- Tuition reimbursement and employee stock purchase program.
- 401(k) with employer match and flexible PTO.
- Casual work environment with catered lunches and a culture focused on innovation.
Hiring process
- Onboarding at one of the company hubs within the first month.
- Participation in on-call rotations and collaboration with cross-functional teams.