Systems Engineer (HPC)
Job description
TL;DR
Systems Engineer (HPC): Install, deploy, and administer HPC clusters and Linux environments, with an emphasis on system stability, automation, and high-performance computing. The role focuses on root-cause troubleshooting, optimizing cluster resource management, and maintaining SLA compliance in a customer-facing data center environment.
Location: Onsite in Orangeburg, NY
Salary: $91,000 - $113,000
Company
An end-to-end technology company solving complex challenges in computing, memory, and LED solutions.
What you will do
- Install, deploy, and administer HPC clusters in a customer-facing data center environment.
- Maintain, patch, and administer Linux operating systems and associated software.
- Develop automation scripts using Shell, Python, and Ansible to streamline operations.
- Analyze system logs, perform troubleshooting, and determine root causes for system errors.
- Respond to system alerts and support users with Move/Add/Change requests.
- Document processes and improve procedures to ensure strict adherence to SLAs.
Requirements
- Bachelor's degree in Computer Science, IT, or related field (or equivalent experience).
- 5+ years of hands-on experience with UNIX/Linux server environments.
- Must be able to work onsite in Orangeburg, NY.
- Knowledge of HPC Systems Management and Linux networking implementation/protocols.
- Experience with open-source technologies and working within ITIL operating models.
- Ability to pass a background check that includes a credit check.
Nice to have
- Expertise with HPC Schedulers such as SLURM, PBS, or LSF.
- Practical knowledge of InfiniBand, Ethernet networking, and cluster optimization techniques.
- Experience with virtualization, container orchestration, and AI hardware design.
- Knowledge of high-performance storage and parallel file systems used in AI and Cloud.
Culture & Benefits
- Comprehensive medical, dental, and vision insurance.
- 401k savings plan and life insurance.
- Paid Time Off (PTO) and Employee Assistance Plan.
- Commitment to an inclusive environment that fosters belonging for all.