Software Engineer (AI Infrastructure)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Software Engineer (AI Infrastructure): Building the orchestration layer for large-scale AI compute by partitioning, scheduling, optimizing, and executing AI workloads across heterogeneous hardware with an accent on compiler/runtime systems and production inference performance. Focus on MLIR/compiler optimizations, runtime execution planning, scheduling and memory movement efficiency, and latency-focused serving architectures.
Location: San Francisco, CA (Onsite)
Salary: $150,000-$400,000 Base + Equity
Company
is an AI infrastructure company building software to orchestrate AI workloads across diverse compute hardware.
What you will do
- Design and implement compiler optimizations and IR transformations for AI workloads using MLIR.
- Build runtime systems and execution planning to improve how AI workloads run in production.
- Develop scheduling and workload partitioning across heterogeneous hardware architectures.
- Optimize memory movement, kernel orchestration, and execution efficiency for inference workloads.
- Improve AI inference serving with latency optimization, including speculative decoding and next-generation serving architectures.
- Profile and debug performance bottlenecks across the AI software stack.
Requirements
- Strong systems programming and performance engineering fundamentals.
- Experience building compiler systems, runtime systems, or execution infrastructure.
- Experience implementing compiler passes, IR transformations, lowering, or code generation systems.
- Strong understanding of memory systems, scheduling, and hardware performance.
- Strong C++ and/or Python engineering skills.
- Experience working on performance-critical systems.
Nice to have
- Experience optimizing large-scale inference workloads.
- Experience with GPUs, AI accelerators, or heterogeneous compute systems.
- Familiarity with kernel dispatch, launch APIs, or memory allocators.
- Experience with distributed systems and serving infrastructure.
- Experience profiling and debugging production performance bottlenecks.
Culture & Benefits
- Onsite role in San Francisco, CA.
- Base salary plus equity.
- Work on non-traditional compiler problems spanning compilers, runtime systems, serving infrastructure, and execution optimization.
Hiring process
- Initial conversation to discuss compilers, runtime systems, and AI workload execution challenges.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →