Назад
Company hidden
7 дней назад

System Performance Modeling Engineer/Architect (NPU)

Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
Singapore
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

System Performance Modeling Engineer/Architect (NPU): Building and maintaining full-chip performance models for NPU tile arrays and NoC to drive architecture decisions with an accent on cycle-approximate modeling and workload analysis. Focus on optimizing memory bandwidth, solving NoC contention, and correlating model predictions with RTL and silicon.

Location: Singapore

Company

hirify.global is a world-leading technology company specializing in Bitcoin mining solutions and AI cloud computing.

What you will do

  • Build and maintain cycle-approximate and analytical models of the tile array, NoC, memory subsystem, and IO using C++, SystemC, and Python.
  • Characterize LLM and CNN inference workloads to produce roofline and bottleneck analyses.
  • Model NoC contention, memory bandwidth, and QoS under realistic traffic scenarios to parameterize architecture studies.
  • Run architecture trade-off sweeps on tile count, memory size, and NoC topology to provide recommendations for architecture freeze.
  • Partner with the AI compiler team on dataflow mapping, tiling, and scheduling to ensure models reflect achievable performance.
  • Correlate model predictions against RTL, emulation, FPGA, and silicon to quantify and close the modeling-to-reality gap.

Requirements

  • 5+ years of experience in performance modeling or architecture for SoCs, accelerators, GPUs, or CPUs.
  • Strong proficiency in C++ and Python, with experience using SystemC or similar cycle-approximate modeling frameworks.
  • Solid understanding of computer architecture fundamentals, including memory hierarchy, NoC behavior, and roofline reasoning.
  • Demonstrated ability to translate modeling results into concrete architecture decisions.
  • Ability to work effectively with incomplete specifications and collaborate across architecture, design, and software teams.

Nice to have

  • Experience modeling AI/ML inference workloads (LLM, CNN, transformers) and analyzing quantization/precision effects.
  • Familiarity with HBM, 3D-DRAM bandwidth analysis, or memory-controller behavior.
  • Exposure to compiler and dataflow co-design for accelerators.
  • Background with tile-based or dataflow architectures.

Culture & Benefits

  • Inclusive environment with an open workspace and an exciting start-up spirit.
  • Opportunity to collaborate with industrial pioneers in the digital asset and AI cloud industries.
  • Culture of personal accountability, autonomy, and rapid professional growth.
  • Attractive welfare benefits including training and mentoring opportunities.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →