Company hidden

5 часов назад

Datacenter Hardware Technician Lead (AI)

86 400 - 228 000$

Формат работы

onsite

Тип работы

fulltime

Грейд

lead

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Datacenter Hardware Technician Lead (AI): Serving as the senior on-site technical authority for hardware reliability and fleet health at a flagship AI campus with an accent on diagnosing complex hardware failures and driving root cause analysis. Focus on establishing hardware maintenance standards, optimizing fleet performance, and scaling operational frameworks for large-scale GPU and server infrastructure.

Location: Must be able to sit onsite at our datacenters 5 days per week in the US.

Compensation: $86,400 – $228,000

Company

hirify.global is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

What you will do

Drive technical triage and resolution of complex hardware failures impacting production systems.
Lead root cause analysis (RCA) efforts for critical hardware incidents and develop corrective action plans.
Partner with Fleet Health Engineering to identify failure patterns and improve fleet reliability.
Collaborate with Cloud Service Provider operations teams and OEM vendors on repairs and upgrades.
Establish and improve hardware maintenance procedures, operational runbooks, and troubleshooting standards.
Mentor on-site technicians on advanced troubleshooting methodologies and operational excellence.

Requirements

8+ years of experience supporting large-scale datacenter hardware infrastructure.
Deep expertise with server platforms, GPU systems, storage infrastructure, and rack integration.
Proven experience conducting root cause analysis and driving long-term corrective actions.
Strong understanding of hardware reliability engineering principles and fleet-health management.
Must be able to sit onsite at our datacenters 5 days per week.
Excellent written and verbal communication skills with the ability to influence technical decisions.

Nice to have

Experience supporting large-scale GPU clusters or AI/ML infrastructure environments.
Familiarity with fleet health systems, telemetry platforms, and hardware monitoring tools.
Knowledge of Linux system administration and hardware validation workflows.
Experience with failure analysis methodologies such as FRACAS, RCCA, 5-Why, or FMEA.
Industry certifications such as CompTIA Server+ or OEM hardware certifications.

Culture & Benefits

Opportunity to work on world-class AI infrastructure at scale.
Collaborative environment working with engineering, manufacturing, and infrastructure teams.
Commitment to diversity, equity, and inclusion.
Equal opportunity employer with comprehensive policy statements.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Похожие вакансии

Datacenter Hardware Technician Lead (AI)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Похожие вакансии

Senior Hardware Support Engineer (AI)

Data Center Technician (AI)

HPC Systems Engineer (Semiconductor)

IT Manager (AI)

Data Center IT Technician (AI)

Data Center Technician

Разработка

Game Dev

Design и Creative

Аналитика

Менеджмент

People & Business

Datacenter Hardware Technician Lead (AI)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Categories

Похожие вакансии

Senior Hardware Support Engineer (AI)

Data Center Technician (AI)

HPC Systems Engineer (Semiconductor)

IT Manager (AI)

Data Center IT Technician (AI)

Data Center Technician