Назад
Company hidden
2 дня назад

Data Center Facility Telemetry & Controls Management Engineer

185 000 - 247 000$
Формат работы
remote (только USA)/hybrid
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Data Center Facility Telemetry & Controls Management Engineer (BMS/DCIM): Design, deploy, integrate, and operate building management systems, DCIM platforms, and facility telemetry pipelines for high-density GPU data centers with an accent on real-time monitoring, alarming, and controllable power/cooling/thermal systems. Focus on BMS/DCIM architecture, liquid-cooling controls (TCS/CDU loops), and incident response with root-cause analysis to improve MTTR.

Location

Location: Remote (USA) or Hybrid eligible; San Jose office (Zanker)

Salary: $185K–$247K annually (Remote, USA)

Company

hirify.global builds AI cloud infrastructure for high-density GPU deployments.

What you will do

  • Architect and manage BMS integration across facilities (chillers, CRAHs, CDUs, cooling towers, UPS, PDUs, automatic transfer switches) and define standards for point lists, naming, control sequences, and integration protocols (BACnet, Modbus, SNMP, OPC-UA, REST APIs).
  • Own DCIM strategy and roadmap, including asset management, capacity planning, environmental monitoring, and power chain visibility; build dashboards for PUE, thermal performance, stranded capacity, and cooling efficiency.
  • Build and maintain telemetry pipelines ingesting BMS/PDUs/in-rack sensor/CDU/network data into centralized monitoring and alerting (Prometheus, Grafana, InfluxDB or equivalent), including alarm thresholds and escalation workflows.
  • Develop liquid-cooling control strategies for TCS loops (220–380 kW per rack), qualify CDU vendors, and define procedures for commissioning, setpoint changes, loop pressure management, and fluid quality monitoring.
  • Run facility event management and on-call response for telemetry anomalies; lead root-cause analysis and corrective actions; maintain emergency response runbooks tied to BMS alerts and automated controls.
  • Manage BMS integrators and DCIM/control vendors from RFP through commissioning and ongoing support; coordinate controls readiness for new colocation and modular data center buildouts.

Requirements

  • Location/eligibility: must be based in the USA (Remote/Hybrid eligible).
  • 7+ years in data center infrastructure engineering, with 4+ years focused on BMS, DCIM, or controls systems in hyperscale, colocation, or AI/HPC environments.
  • Hands-on experience designing and integrating BMS for mission-critical facilities (UPS, PDU, CRAH/CRAC, chiller plant, cooling tower, and liquid cooling CDU/in-row systems).
  • Strong knowledge of industrial control protocols: BACnet IP/MS-TP, Modbus TCP/RTU, SNMP, DNP3, and API-based integrations.
  • Experience with DCIM platforms (Nlyte, Sunbird, Vertiv TRELLIS, or equivalent) including deployment, configuration, and administration.
  • Experience with real-time telemetry stacks (Prometheus, InfluxDB, Grafana or similar) for infrastructure monitoring; strong understanding of power/cooling systems and redundancy (2N, N+1).

Nice to have

  • Direct experience with direct liquid cooling (DLC), CDU controls integration, and TCS loop management for high-density AI GPU deployments (100+ kW per rack).
  • Familiarity with OCP hardware and telemetry standards.
  • Experience working with major colocation providers (Equinix, Digital Realty, CyrusOne) on BMS/EPMS integration and data sharing agreements.
  • Background in scripting/automation (Python, Ansible, Terraform) for infrastructure management workflows.
  • Experience operating data centers at international scale (including Asia-Pacific or Southeast Asian markets) and relevant BMS/controls certifications.

Culture & Benefits

  • Hybrid/remote flexibility depending on facility portfolio needs.
  • Competitive compensation with salary, equity, and comprehensive benefits.
  • Health, dental, and vision coverage for employees and dependents.
  • 401k plan with 2% company match for USA employees.
  • Flexible paid time off plan.
  • Wellness and commuter stipends for select roles.

Hiring process

  • Interviews to evaluate controls/BMS/DCIM architecture experience and telemetry/incident-response ownership.
  • Technical discussions focused on protocol integrations, liquid-cooling controls, and monitoring/alerting design.
  • Final evaluation of fit for building telemetry and controls readiness across a scaling data center portfolio.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →