TL;DR
MTS, Developer Experience (AI): Design and implement modern, fast, and ergonomic development environments for AI researchers, with an emphasis on eliminating pain points in build times, testing workflows, and iteration speed. The role focuses on building and managing CI/CD pipelines, developing tooling that lets researchers use massive compute seamlessly, and optimizing container build systems for GPU workloads.
Location: This position is open to candidates based in US geographic markets, including Los Angeles County and San Francisco.
Salary: $150,000–$325,000/year
Company
The AGI Autonomy organization at Amazon is a talent-dense team focused on advancing the accuracy and efficiency of Artificial General Intelligence (AGI) systems.
What you will do
- Design and implement development environments for AI researchers, addressing pain points in build times and iteration speed.
- Build and manage CI/CD pipelines for large-scale AI research workflows, orchestrating thousands of agentic experiments.
- Develop tooling that bridges local environments with remote supercomputing resources, so researchers can use large-scale compute seamlessly.
- Manage and optimize code repository infrastructure for collaborative research at scale.
- Implement release management processes for reliable deployments of research code and models.
- Optimize container build systems for GPU workloads to ensure fast iteration and efficient resource utilization.
Requirements
- 5+ years of experience in DevOps, release engineering, or developer tools/infrastructure.
- Expertise with shell scripting and command-line tools (bash, zsh).
- Experience managing CI/CD systems (AWS CodePipeline, Jenkins, CircleCI) and code repositories (GitLab, GitHub, Phabricator).
- Proficiency in at least one programming language (Python, Go, Rust) for automation and tooling.
- Experience building and maintaining developer tooling or infrastructure at scale.
- Understanding of containerization (Docker, containerd) and container orchestration.
- English: B2 required.
Nice to have
- Experience with release management and maintaining large-scale software deployments.
- Knowledge of container build internals (Docker multi-stage builds, BuildKit, layer caching optimization).
- Experience working with GPU infrastructure and CUDA development workflows.
- Background in IDE development or customization (VSCode extensions, JetBrains plugins).
- Experience building development tools for machine learning or data science teams.
- Knowledge of ML frameworks (PyTorch, TensorFlow) and their build/dependency requirements.
- Experience with AWS developer tools and services (CodeBuild, CodeDeploy, CodeCommit).
Culture & Benefits
- Inclusive culture empowering Amazonians to deliver the best results for customers.
- Commitment to equal opportunity, with no discrimination on the basis of protected veteran status, disability, or other legally protected status.
- Support for workplace accommodation or adjustment during the application and hiring process.
- Total compensation package includes equity, sign-on payments, and a full range of medical, financial, and other benefits.