HPC Data Center Production Engineer (Golang)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
HPC Data Center Production Engineer (Golang): Building and owning automation and tooling for HPC data center operations with an accent on hardware onboarding, capacity planning, and monitoring integration. Focus on developing production-ready systems for lifecycle management and leveraging AI tools to accelerate development velocity.
Location: On-site 5 days/week in Chicago, IL or New York, NY
Company
Group is a global quantitative trading firm that applies cutting-edge research to financial markets through a culture of innovation and collaboration.
What you will do
- Design and maintain automation to onboard servers, network switches, rack PDUs, CDUs, and environmental sensors.
- Build end-to-end provisioning workflows from racking and cabling through to a production-ready state.
- Develop tools for power and cooling capacity planning and outage simulation to model failure impacts.
- Integrate hardware telemetry via IPMI, BMC, Redfish, and SNMP into centralized observability platforms.
- Collaborate with HPC Planning and Engineering leads to translate operational pain points into automated solutions.
- Utilize AI tools daily for coding, debugging, and implementing predictive capacity planning.
Requirements
- 5+ years of professional experience in production engineering, infrastructure automation, or SRE.
- High proficiency in Golang and at least one other language such as Python.
- Deep Linux systems knowledge, including administration, networking, and OS-level troubleshooting.
- Experience with IaC tools (SaltStack, Ansible, Terraform) and observability stacks (Grafana, Prometheus, InfluxDB).
- Solid understanding of L2/L3 protocols, VLANs, BGP, and switch configuration (Arista, Cisco).
- Must be able to work on-site 5 days a week in Chicago or New York.
Nice to have
- Bachelor's degree.
Culture & Benefits
- Culture of fearlessness, creativity, and intellectual honesty.
- Collaborative environment where research outcomes drive superior returns.
- Opportunity to work with high-performance computing and world-class infrastructure.
- Commitment to maintaining extremely high personal standards for work quality.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →