TL;DR
Senior Site Reliability Engineer (AI): Building and maintaining highly scalable, secure, and resilient infrastructure for a cloud platform with an accent on systems programming and DevOps. Focus on designing low-level systems, automating infrastructure, and ensuring platform reliability.
Location: Remote
Company
hirify.global is leading the charge in AI compute, building a versatile cloud platform that drives the next generation of AI innovation.
What you will do
- Design, build, and maintain infrastructure systems using Linux and NixOS.
- Manage infrastructure-as-code with Terraform to provision and scale resources.
- Architect and operate Kubernetes clusters with a focus on performance, security, and automation.
- Write high-performance tooling and internal utilities in Go, Javascript, Rust.
- Develop and maintain CI/CD pipelines for infrastructure and code deployments.
- Monitor system performance, resolve issues, and improve reliability through observability tooling.
Requirements
- 5+ years in DevOps, Site Reliability, or Infrastructure Engineering roles.
- Deep experience with Linux systems and configuration management (preferably NixOS).
- Hands-on experience with Terraform, Kubernetes, and containerized environments.
- Proficiency in one or more low-level languages: Rust, C, Zig, Javascript, and Go.
- Strong understanding of systems programming, performance tuning, and operating system internals.
- Familiarity with CI/CD practices and infrastructure monitoring/alerting tools.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →