Software Engineer, SRE Tooling & Reliability Platforms (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Software Engineer, SRE Tooling & Reliability Platforms (AI): Building and maintaining high-availability services that integrate core Incident Management tools with ’s internal ecosystem with an accent on AI-augmented workflows to increase engineering velocity. Focus on developing new AI services and agents that ingest massive amounts of telemetry to provide on-call engineers with real-time summaries and automated root-cause hypotheses.
Location: Flexible hybrid work options. You may occasionally be asked to attend in-person events or team sessions at a office or facility.
Salary: $88,500.00 - $184,375.00/yr. The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions.
Company
connects brands and partners with an audience of hundreds of millions of people through powerful technology.
What you will do
- Build, deploy, and maintain high-availability services that integrate core Incident Management tools with ’s internal ecosystem.
- Develop new AI services and agents that ingest massive amounts of telemetry to provide on-call engineers with real-time summaries, historical context, and automated root-cause hypotheses.
- Use Infrastructure as Code (Terraform/CloudFormation) to manage serverless infrastructure, ensuring reliability tools are as resilient as the services they monitor.
- Identify and implement AI-driven efficiencies in your day-to-day development, replacing manual, repetitive tasks with automated or AI-assisted workflows.
- Leverage AI pair-programming tools to accelerate code reviews and ensure high unit-test coverage for all new reliability services.
- Verify and validate AI-generated code and infrastructure outputs to ensure they meet ’s security and resilience standards.
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
- Proficiency in at least one programming language (Python or Go preferred).
- Working knowledge of AI-assisted development tools (e.g., GitHub Copilot, Amazon CodeWhisperer, or Cursor).
- Understanding of Linux/Unix environments and basic networking concepts.
- Familiarity with Git and version control.
Nice to have
- Experience with Cloud providers (AWS, GCP).
- Familiarity with Docker, Kubernetes, or serverless architectures.
- Demonstrated experience using Prompt Engineering to assist in debugging complex system failures or generating technical documentation.
- Interest in Machine Learning applications for DevOps and site reliability.
Culture & Benefits
- Flexible hybrid work options.
- Comprehensive benefits include healthcare, a great 401k, backup childcare, and education stipends.
- Diverse and inclusive workplace with 11 employee resource groups (ERGs).
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →