TL;DR
Staff+ Software Engineer (Systems): Own the technical strategy and roadmap for building and scaling AI clusters (thousands to hundreds of thousands of machines), with an emphasis on compute uptime, resilience, and infrastructure architecture. The role focuses on defining infrastructure strategy, solving the hardest distributed systems problems, and evolving operational excellence practices.
Location: Hybrid in San Francisco, New York City, or Seattle, US. Visa sponsorship is available.
Salary: $405,000–$485,000 USD annually.
Company
hirify.global is a public benefit corporation focused on creating reliable, interpretable, and steerable AI systems.
What you will do
- Own the technical strategy and roadmap for your area, translating team-level goals into concrete execution plans.
- Drive cross-team initiatives to build and scale AI clusters (thousands to hundreds of thousands of machines).
- Define infrastructure architecture and make sure the hardest distributed systems problems get solved.
- Partner with cloud providers and internal stakeholders to shape long-term compute, data, and infrastructure strategy.
- Establish and evolve operational excellence practices (incident response, postmortem culture, on-call).
Requirements
- 10+ years of software engineering experience.
- Experience leading complex, multi-quarter technical initiatives spanning multiple teams or systems.
- Ability to set technical direction for a team.
- Deep expertise in distributed systems, reliability, and cloud platforms (Kubernetes, IaC, AWS/GCP).
- Strong proficiency in at least one systems language (Python, Rust, Go, Java).
- Ability to uplevel other engineers and build alignment across senior stakeholders.
- Bachelor's degree in a related field or equivalent experience required.
- English: B2 required.
Nice to have
- Security and privacy best practice expertise.
- Experience with machine learning infrastructure (GPUs, TPUs, Trainium) and networking (NCCL).
- Low-level systems experience (Linux kernel tuning, eBPF).
Culture & Benefits
- Work as a single cohesive team on large-scale AI research efforts.
- Focus on advancing long-term goals of steerable, trustworthy AI.
- Collaborative environment with frequent research discussions.
- Competitive compensation and benefits, with optional equity donation matching.
- Generous vacation and parental leave, plus flexible working hours.
- Hybrid work policy: staff are expected in the office at least 25% of the time.