Senior Data Center Operations Systems Engineer
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Data Center Operations Systems Engineer: Ensure new server, storage, and network infrastructure is properly racked, labeled, cabled, and configured with an accent on troubleshooting hardware and software issues in advanced GPU and networking systems. Focus on documenting data center layouts, managing parts inventory, and partnering with teams for deployments, incident resolution, and RMA processes.
Location: Québec City, QC, Canada - On-site presence required 5 days per week; shift work
Salary: CA$83.2K – CA$124.8K
Company
is a leader in AI cloud infrastructure serving tens of thousands of customers from AI researchers to enterprises and hyperscalers.
What you will do
- Rack, label, cable, and configure new server, storage, and network infrastructure.
- Troubleshoot hardware and software issues in advanced GPU and networking systems.
- Document and update data center layout and network topology in DCIM software.
- Manage parts depot inventory and track equipment through delivery, storage, staging, deployment, and handoff.
- Partner with HW support, supply chain, manufacturing, and RMA teams for deployments, incident resolution, and solutions dissemination.
- Follow installation standards for consistency in placement, labeling, and cabling across data centers.
Requirements
- On-site in Québec City, Canada Data Center 5 days per week; shift work
- Strong experience with critical infrastructure systems: power distribution, air flow management, environmental monitoring, capacity planning, DCIM software, structured cabling, cable management.
- Familiarity with carrier DIA circuit testing, fiber testing, cable optics, single/three-phase power, PDU balancing, multiple cable media types, cold/hot aisle containment.
- Solid understanding of server hardware and boot process.
- Ability to structure, collaborate, and improve complex maintenance MOPs; action-oriented with willingness to train junior staff.
- Willingness to travel for new data center bring-ups as needed.
Nice to have
- 3+ years with critical infrastructure systems supporting data centers.
- Experience with network topology, 400Gb Infiniband architectures, DDP or SCM cluster storage.
- 3+ years with ticketing systems like JIRA and Zendesk.
- Advanced Linux administration.
- Experience with high-performance compute GPU systems, especially Nvidia NVL72.
Culture & Benefits
- Generous cash and equity compensation.
- Health, dental, and vision coverage for you and dependents.
- Wellness and commuter stipends for select roles.
- 401k plan with 2% company match (USA employees).
- Flexible paid time off plan.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →