Назад
Company hidden
1 день назад

Staff Systems Engineer (Cloud Operations & Support)

Формат работы
remote (только USA)/hybrid/onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff Systems Engineer (Cloud Operations & Support) (HPC/Cloud): Architecting and managing high-performance CPU and GPU clusters for cloud infrastructure with an accent on operational strategy, performance tuning, and multi-tenant service optimization. Focus on designing scalable HPC environments, automating system maintenance, and ensuring high availability across global sites.

Location: Must be based in the US (Remote, San Jose, CA, or Austin, TX)

Company

hirify.global is a global leader in electronic design automation (EDA) software and hardware.

What you will do

  • Architect, build, and optimize high-performance CPU and GPU clusters for the hirify.global cloud.
  • Deploy and manage multi-tenant cloud services across both private and public infrastructure.
  • Drive the overall operational strategy for internal HPC clusters to improve efficiency and reporting.
  • Collaborate with engineering teams to develop and implement solutions that optimize their working environment.
  • Develop automation scripts using Python, Bash, or Perl to streamline deployment and maintenance.
  • Implement monitoring solutions for system health, GPU utilization, and container performance.

Requirements

  • 8+ years of technical experience architecting and managing Linux-based HPC environments.
  • 3+ years of experience coordinating support and operations across multiple global geographies.
  • Deep expertise in Linux system administration (RHEL preferred), including networking, storage, and performance tuning.
  • Extensive hands-on experience with Docker, image management, and container orchestration.
  • Proven experience in GPU Cluster Management, including installation and optimization over OpenStack.
  • Proficiency in Python, Bash, or Perl for system automation and reporting.

Nice to have

  • Direct Electronic Design Automation (EDA) experience.

Culture & Benefits

  • Opportunity to work in a high-impact role developing leadership and innovation in technology.
  • Collaborative environment focusing on customer success and productivity.
  • Flexible location options within the USA (Remote or Onsite).

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →