Infrastructure Support Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Infrastructure Support Engineer (AI): Maintaining service availability and reliability for high-performance GPU cloud infrastructure with an accent on troubleshooting data center systems and supporting AI-focused enterprise customers. Focus on managing support tickets, automating operational processes, and collaborating with engineering teams to ensure system scalability.
Location: Must be based in the US (AMER region)
Salary: $100,000 - $140,000 USD
Company
is a GPU cloud provider engineered specifically for AI startups and large enterprises to reduce the complexity of AI development.
What you will do
- Join the support duty rotation to handle day-to-day tickets and alerts.
- Manage and resolve technical issues while keeping stakeholders informed.
- Follow and improve runbooks for common infrastructure issues.
- Participate in monitoring, troubleshooting, and incident triage.
- Identify opportunities for automation to optimize operational processes.
- Collaborate with cross-functional teams and act as an escalation point for onsite staff.
Requirements
- 2–4 years of experience in support, operations, or infrastructure engineering.
- Solid understanding of Linux fundamentals, including CLI and systemd.
- Networking basics including IP addressing, subnets, VLANs, and firewalls.
- Exposure to Kubernetes core concepts and basic troubleshooting.
- Familiarity with GPU diagnostics like nvidia-smi.
- Proficiency in Bash or Python scripting and Git version control.
Nice to have
- Hands-on experience with Kubernetes administration and operators.
- Knowledge of HPC concepts like InfiniBand, RDMA, or NCCL.
- Experience with Infrastructure as Code tools like Ansible or Terraform.
- Familiarity with GitOps, CI/CD pipelines, and security tooling like Vault.
Culture & Benefits
- Competitive base salary with equity and annual reviews.
- Dynamic progression plan tailored to individual ambitions.
- Human-first flexibility with autonomy to shape your workday.
- Collaborative and innovative environment in a fast-growing AI startup.
- Comprehensive benefits package including medical, dental, vision, and retirement plans.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →