Назад
Company hidden
4 часа назад

Software Engineer, Ml & Data Infra (AI)

180 000 - 440 000$
Формат работы
onsite
Тип работы
fulltime
Грейд
middle/senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Software Engineer, ML & Data Infra (AI): Building foundational infrastructure for frontier AI models, focusing on petabyte-to-exabyte scale distributed systems for data acquisition, web crawling, and multimodal pipelines. Focus on high-performance search/retrieval engines and low-level performance optimization using CUDA kernels and compiler/runtime innovations.

Location: Must be based in Palo Alto, CA

Salary: $180,000 - $440,000 USD

Company

hirify.global’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

What you will do

  • Design, build, and operate petabyte-to-exabyte scale distributed systems for data acquisition, web crawling, preprocessing, filtering/classification, and multimodal pipelines.
  • Architect high-performance search/retrieval engines at trillion-document scale, integrating with LLMs/agents for truth-seeking and real-time knowledge access.
  • Develop reliable inference serving infrastructure, including load balancing, autoscaling, and monitoring for 100% uptime and optimal tail latency.
  • Optimize low-level performance using CUDA kernels, Triton/CUTLASS extensions, and model-hardware co-design.
  • Innovate on compilers/runtimes, distributed profiling/debugging tools, and interconnect fabrics.
  • Manage complex workloads across clouds/clusters, including orchestration, data bookkeeping, and failure analysis.

Requirements

  • Strong systems engineering skills with proven impact on large-scale distributed infrastructure.
  • Proficiency in Python and at least one compiled language (Rust, C++, Go, Java).
  • Hands-on experience with at least one key area: data pipelines/crawling, web-scale search/retrieval, inference optimization, compiler features, or high-speed interconnects.
  • Deep understanding of distributed systems challenges, including high-throughput ops/sec, latency/throughput tradeoffs, and fault-tolerance.
  • Passion for AI infrastructure and delivering rigorous, high-quality results.

Nice to have

  • Experience with multimodal data, epistemics/truth-seeking in retrieval, or agentic systems.
  • Low-level optimizations experience, including CUDA kernel development and GPU profiling.
  • Production expertise in inference reliability, CI/CD for ML, or cluster networking.
  • Track record owning end-to-end projects in hyperscale environments.

Culture & Benefits

  • Equity, comprehensive medical, vision, and dental coverage.
  • Access to a 401(k) retirement plan.
  • Short & long-term disability insurance.
  • Life insurance.
  • Various other discounts and perks.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...