Senior Manager, Site Reliability Engineering, Follow Up Boss
Job description
TL;DR
Senior Manager, Site Reliability Engineering, Follow Up Boss (SRE/AWS): Lead a multidisciplinary team responsible for FUB infrastructure reliability, operability, and developer experience, with an emphasis on proactive, roadmap-driven investments across scalability, security, and cost management. Focus on building low-toil on-call operations based on SLOs and error budgets, modernizing AWS-based services and CI/CD, and driving cross-org strategy while operationalizing AI tooling for automation and diagnostics.
Company
The company builds real estate and career platforms and operates Follow Up Boss infrastructure through a dedicated SRE organization.
What you will do
- Own execution and delivery for the FUB infrastructure & security roadmap, turning strategic goals into sequenced plans with milestones and success measures.
- Be accountable for the reliability, performance, operability, and cost of core FUB services and infrastructure (EC2, RDS/Aurora, Redis/Valkey, networking, queues, SRE tooling).
- Lead low-toil on-call with clear SLOs/error budgets, actionable alerting, fast incident response, high-quality RCAs, and remediation follow-through.
- Drive database scaling and performance improvements, including capacity management, query/schema optimization, and modernization of data infrastructure.
- Modernize prioritized workloads and improve developer environments and onboarding, and enable safer, faster deployments via CI/CD and progressive delivery.
- Build and grow the SRE/infrastructure/security team, partner across orgs, and operationalize AI agents for automation, diagnostics, and operations.
Requirements
- Proven track record as a Senior Engineering Manager or equivalent leading SRE, platform, or infrastructure teams for high-availability SaaS products.
- Experience scaling production systems and databases in a cloud environment (ideally AWS) with measurable improvements to reliability, performance, and cost.
- Ability to shift teams from reactive to proactive roadmap-driven execution across multiple quarters, defining strategy and metrics.
- Strong developer experience and CI/CD background with hands-on familiarity with Terraform/Ansible, GitLab, Kubernetes/ZGCP, and modern observability stacks.
- Demonstrated people leadership managing senior engineers and developing leaders who can operate autonomously.
- Comfortable experimenting with and operationalizing AI tools in engineering workflows.
Culture & Benefits
- Remote role: work from a physical location of choice within the U.S.
- Base pay range (annual) depends on state of residence: $206,700.00–$330,300.00 in CA, CT, MD, MA, NJ, NY, WA, and Washington DC; $196,400.00–$313,800.00 in CO, HI, IL, MN, NV, OH, RI, and VT.
- Eligible for equity awards based on experience, performance, and location.
- Pay will not be below the exempt salary threshold for the state where you reside.
Hiring process
- Interviews focused on leadership scope, reliability/incident management approach, and cross-org execution.
- Evaluation of technical depth across SRE practices, cloud infrastructure, CI/CD, and observability.
- Discussion of strategy, roadmap planning, and how AI tooling is operationalized in engineering workflows.
Location: Remote (USA)
Salary: $196,400–$330,300 annually (state-dependent base pay range)