Staff Site Reliability Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff Site Reliability Engineer (AI): Building and scaling AI-assisted reliability tooling and infrastructure for a high-growth data science platform with an accent on observability, incident automation, and cloud operations. Focus on designing systems that reduce toil, improving signal quality for critical services, and mentoring engineering teams to foster a culture of operational excellence.
Location: Remote (US)
Compensation: $230,000 USD
Company
builds software that helps AI-driven organizations develop and operate advanced data science solutions at scale.
What you will do
- Lead the development of internal AI-assisted reliability tooling to automate incident diagnosis and reduce toil.
- Improve observability coverage and signal quality for critical customer-facing systems.
- Own incident response end-to-end, ensuring problems are documented and resolved with long-term fixes.
- Define and mature SLO/SLI frameworks to establish measurable reliability standards.
- Scale cloud operations for the single-tenant SaaS offering and improve deployment repeatability.
- Mentor engineers and shape the SRE culture, including incident workflows and operational readiness.
Requirements
- Must be based in the US
- Deep experience in SRE, platform engineering, or software engineering with operational ownership.
- Fluency with Kubernetes, Linux, cloud platforms, and observability tooling.
- Strong software engineering skills in Python or Go.
- Experience leading technically ambiguous work and influencing cross-team direction.
- Sound judgment regarding the application of AI/LLM tooling in operational workflows.
Nice to have
- Experience with LLM-based systems and retrieval workflows.
- Background in SaaS platform operations.
- Experience building tooling specifically for support or developer teams.
Culture & Benefits
- Comprehensive benefits including 401(k) plan, medical, dental, and vision insurance.
- Equity and company bonus opportunities.
- Wellness stipends to support employee well-being.
- Growth-oriented environment with a focus on teaching, learning, and continuous improvement.
- Startup spirit within a company backed by leading investors like Sequoia and Snowflake.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →