Software Engineer, Productivity - Inference Runtime (AI)
Job description
TL;DR
Software Engineer, Productivity - Inference Runtime (AI): Developing and scaling engineering systems, safeguards, and developer workflows for the company's inference runtime, with an emphasis on deploy gate validation and CI/CD infrastructure. The focus is on reducing noise from flaky tests, improving release automation, and making large-scale model deployments more reliable.
Location: San Francisco, USA
Salary: $230K – $385K
Company
AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
What you will do
- Develop and evolve tooling for deploy gate validation to ensure inference engine releases are performant and regression-free.
- Standardize and harden release, validation, branching, and deployment processes across the inference stack.
- Optimize CI, testing, and validation infrastructure to reduce flakiness and improve signal quality.
- Build automation for failure triage, ownership detection, debugging, and escalation.
- Collaborate with inference and infrastructure teams to improve rollout safety and reduce developer friction.
Requirements
- Strong experience with CI/CD systems, testing infrastructure, and large-scale build systems.
- Proficiency in Python, as much of the current validation infrastructure is Python-based.
- Ability to debug complex distributed systems and operate in ambiguous environments.
- High ownership and developer empathy to proactively drive workflow improvements.
- Must be based in San Francisco.
Nice to have
- Experience with C++ for performance-sensitive systems or inference engine code.
- Prior experience with large-scale inference systems.
Culture & Benefits
- Opportunity to work on one of the largest and most performance-sensitive inference platforms in the world.
- High-impact environment where work directly enables new model launches.
- Commitment to safety, human needs, and diverse perspectives in AI development.
- Equal opportunity employer with a focus on inclusivity.