Head of Infrastructure Support (AI)

Work format

remote (USA only)/hybrid

Employment type

full-time

Grade

head

English

Country

Job from Hirify Global, a list of international tech companies

Job description

TL;DR

Head of Infrastructure Support (AI): Lead the US support team to ensure reliable, high-performance infrastructure for AI-focused customers, with an emphasis on operational excellence, team mentorship, and incident management. The role bridges strategic direction with frontline execution, optimizes Kubernetes and Linux-based environments, and drives continuous improvement in service delivery.

Location: Must be based in the US

Company

hirify.global is a GPU cloud provider engineered for AI, delivering high-performance infrastructure to support rapid innovation and strategic business outcomes for AI startups and enterprises.

What you will do

Manage day-to-day operations and people leadership for the US Infrastructure Support team.
Oversee ticket queue management, ensuring SLA adherence and timely resolution of complex incidents.
Collaborate with Senior Engineers on technical improvements, operational tooling, and high-impact troubleshooting.
Drive continuous improvement by refining runbooks, dashboards, and automation workflows.
Set team objectives, manage shift planning, and conduct performance reviews to foster professional growth.
Ensure compliance with ITIL processes and maintain high standards for security and operational documentation.

Requirements

Must be based in the US
Proven experience leading or managing engineers in an operational support environment.
Strong Linux systems engineering experience and troubleshooting skills in production.
Experience operating and debugging Kubernetes environments and distributed systems.
Solid understanding of networking fundamentals (L2/L3, routing, load balancing) and datacenter technologies.
Proficiency in scripting (Bash, Python) and Infrastructure as Code tools (Ansible, Terraform).
Understanding of ITIL processes and SRE practices.

Nice to have

Experience with GPU platforms (NVIDIA/AMD) and performance diagnostics.
Exposure to HPC or distributed workloads like RDMA and InfiniBand.
Experience with CI/CD or GitOps tooling.
Experience working in multi-region environments.

Culture & Benefits

Competitive compensation package including base salary and equity with annual reviews.
Opportunity to work at a fast-growing tech startup in the cutting-edge AI infrastructure space.
Human-first flexibility with a remote-first culture that trusts employees to manage their own time.
Dynamic progression plan tailored to individual ambitions and career growth.
Collaborative and innovative environment focused on transparency and ownership.
