Software Engineer, Productivity - Inference Runtime (AI)

230 000 - 385 000$

Формат работы

onsite

Тип работы

fulltime

Грейд

senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Software Engineer, Productivity - Inference Runtime (AI): Developing and scaling engineering systems, safeguards, and developer workflows for hirify.global's inference runtime with an accent on deploy gate validation and CI/CD infrastructure. Focus on reducing noise from flaky tests, improving release automation, and enhancing the reliability of large-scale model deployments.

Location: San Francisco, USA

Salary: $230K – $385K

Company

AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

What you will do

Develop and evolve tooling for deploy gate validation to ensure inference engine releases are performant and regression-free.
Standardize and harden release, validation, branching, and deployment processes across the inference stack.
Optimize CI, testing, and validation infrastructure to reduce flakiness and improve signal quality.
Build automation for failure triage, ownership detection, debugging, and escalation.
Collaborate with inference and infrastructure teams to improve rollout safety and reduce developer friction.

Requirements

Strong experience with CI/CD systems, testing infrastructure, and large-scale build systems.
Proficiency in Python, as much of the current validation infrastructure is Python-based.
Ability to debug complex distributed systems and operate in ambiguous environments.
High ownership and developer empathy to proactively drive workflow improvements.
Must be based in San Francisco.

Nice to have

Experience with C++ for performance-sensitive systems or inference engine code.
Prior experience with large-scale inference systems.

Culture & Benefits

Opportunity to work on one of the largest and most performance-sensitive inference platforms in the world.
High-impact environment where work directly enables new model launches.
Commitment to safety, human needs, and diverse perspectives in AI development.
Equal opportunity employer with a focus on inclusivity.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →