AI Systems Administrator (AI)
Job description
TL;DR
AI Systems Administrator (AI): Implementing and maintaining a closed GPT environment and AI infrastructure, with an emphasis on GPU server management, system observability, and Linux platform reliability. The role focuses on automating workflows, optimizing resource allocation for LLM workloads, and ensuring secure, high-performance operations in a regulated environment.
Location: Must be based in or able to commute to Cambridge, MA (Hybrid: 3 days/week)
Salary: $82,300–$220,000
Company
An independent, nonprofit research and development company tackling national challenges in defense, space, and biomedical engineering.
What you will do
- Build, operate, and troubleshoot RHEL/Oracle systems supporting GPU-intensive AI workloads.
- Manage the GPU enablement layer, including driver toolkits, kernel compatibility, and health monitoring.
- Implement observability solutions using Prometheus and Grafana to track system and GPU performance.
- Maintain LLM servers to ensure high uptime and efficient resource allocation across the organization.
- Develop automation and scripting using Python and Ansible to streamline Linux team workflows.
- Collaborate with network and storage peers to identify bottlenecks and tune platform performance.
Requirements
- Active Secret Clearance required.
- Bachelor's degree in Computer Science or a related field.
- 3+ years of experience in production Linux system administration.
- Strong proficiency in Bash, Python, and Ansible.
- Experience supporting enterprise platforms with a focus on security and audit logging.
- Ability to work on-site in Cambridge, MA 3 days per week.
Culture & Benefits
- Support for work-life balance through workplace flexibility.
- Access to employee clubs, health and finance workshops, and social events.
- Discounts to local museums and cultural activities.
- Collaborative environment focused on innovation and national impact.