Company hidden
Posted 2 months ago

Staff Engineer, HPC Systems Software (AI)

$100,000 - $500,000
Work format
hybrid
Employment type
fulltime
Grade
senior
English
B2
Country
US/Canada

Job description

TL;DR

Staff Engineer, HPC Systems Software (AI): Architect and maintain the operating system foundation for global hardware design infrastructure, with an emphasis on bare-metal provisioning and configuration-as-code. The role focuses on scaling OS lifecycle management across hundreds of compute nodes and optimizing Linux kernel performance for AI hardware development.

Location: Hybrid. Must be based in Austin (TX), Santa Clara (CA), or Toronto (ON, Canada)

Salary: $100k - $500k

Company

hirify.global is a startup leading the industry in cutting-edge AI technology and high-performance RISC-V CPUs.

What you will do

  • Design and maintain automated OS deployment pipelines for global bare-metal HPC clusters.
  • Manage large-scale configuration using Ansible to ensure consistency across compute infrastructure.
  • Deploy RHEL and Ubuntu systems and manage their lifecycle across diverse hardware platforms.
  • Implement infrastructure-as-code for repeatable, version-controlled system configurations.
  • Troubleshoot OS-level issues and optimize kernel parameters to resolve performance bottlenecks.
  • Collaborate with hardware design teams to standardize system configurations and development environments.

Requirements

  • Experience with RHEL and Ubuntu administration in HPC or large-scale compute environments.
  • High proficiency in Ansible for automation across hundreds of nodes.
  • Experience with bare-metal provisioning systems such as MAAS, Foreman, Cobbler, or Warewulf.
  • Deep understanding of Linux internals, networking, kernel tuning, and performance troubleshooting.
  • Familiarity with HPC cluster architecture and infrastructure-as-code practices.
  • Must be eligible to access U.S. export-controlled technology (EAR compliance).

Nice to have

  • Hands-on experience with IBM Spectrum LSF or similar HPC workload managers.
  • Integration with commercial HPC storage platforms like Pure Storage, Weka, or Vast Data.
  • Exposure to EDA tools and hardware design workflows in semiconductor development.
  • Experience with container technologies including Docker, Singularity, or Podman.
  • Cluster monitoring skills using Prometheus, Grafana, and custom tooling.
  • Python and bash scripting for production-level infrastructure automation.

Culture & Benefits

  • Highly competitive compensation package including base and variable targets.
  • Collaborative environment with a focus on curiosity and solving hard technical problems.
  • Opportunity to work on revolutionary AI platforms and RISC-V CPU architecture.
  • Equal opportunity employer.
