Senior AIOps Engineer, Incident Response (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior AIOps Engineer (AIOps/SRE): Owning production health and incident response by evolving the operational support model through AI-driven automation and intelligent agent workflows with an accent on system reliability and scalability. Focus on designing AI-native tooling to reduce operational toil and accelerating issue resolution in cloud-based production environments.
Location: Remote (Must be based in the U.S., excluding U.S. territories)
Salary: $215,000 – $280,000
Company
An insurance technology innovation company building an AI-native insurance platform backed by State Farm.
What you will do
- Own production health, reliability, and operational support processes across critical services.
- Lead incident response, stakeholder communication, root cause analysis, and post-incident reviews.
- Design and implement AI-driven agents and workflows to automate support and operational tasks.
- Collaborate with engineering and AI orchestration teams to improve system resilience and efficiency.
- Maintain operational runbooks and documentation for human and AI-assisted workflows.
- Manage observability, monitoring, and troubleshooting in cloud-based production environments.
Requirements
- 6–8 years of experience in production operations, SRE, or technical support engineering.
- Strong expertise in incident management and root cause analysis.
- Experience with modern SDLC, DevOps, and change management environments.
- Must be based in the United States.
- Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
Nice to have
- Experience with AI/LLM-powered systems and intelligent agents.
- Proficiency with AWS and modern observability ecosystems.
- Knowledge of event-driven architectures and orchestration frameworks.
- Experience leading operational transformation initiatives.
Culture & Benefits
- Comprehensive health, dental, and vision insurance with supplemental plans.
- 401(k) plan with company match.
- $2,000 one-time home office equipment allowance and provided MacBook Pro.
- Four weeks of PTO in the first year and twelve weeks of paid parental leave.
- Annual $5,000 professional development budget, LinkedIn Learning, and BetterUp coaching.
- Remote-first culture with core meeting hours (9 AM - 2 PM PT).
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →