Senior Staff Site Reliability Engineer
Job description
TL;DR
Senior Staff Site Reliability Engineer (SRE): Define and own the global SRE strategy and infrastructure architecture, with an emphasis on reliability, scalability, and operational excellence. The role focuses on driving organization-wide system design and FinOps strategy, and on mentoring staff-level engineers to raise the engineering bar across a global platform.
Location: San Francisco (Onsite)
Salary: $181,000–$263,000
Company
The company is a leading data collaboration platform focused on consumer privacy, data ethics, and foundational identity solutions for global innovators.
What you will do
- Define and own the organization-wide SRE strategy, including SLOs, SLAs, and operational excellence frameworks.
- Drive system design, automation, and performance optimization standards across all engineering teams.
- Lead distributed systems architecture reviews and serve as the final escalation point for high-impact production incidents.
- Champion FinOps strategy across Kubernetes, cloud resources, and database infrastructure.
- Mentor Staff Engineers and provide technical guidance to influence product and platform architecture decisions.
- Contribute technical due diligence to M&A evaluations and represent the company in the broader technology ecosystem.
Requirements
- 10+ years in SRE, production, or platform engineering, with 3+ years at a senior or staff level.
- Expertise in Infrastructure as Code (Terraform) at scale.
- Deep Kubernetes knowledge, including internals, autoscaling, and multi-tenant workload management.
- Strong proficiency in Python and/or Go for building production-grade internal tooling.
- Advanced experience with real-time and NoSQL databases such as SingleStore, ScyllaDB, or Cassandra.
- Strong cloud security background (IAM, network segmentation, SOC 2/ISO 27001) in GCP or AWS.
Nice to have
- Experience building multi-region active-active architectures.
- Contributions to open source observability or infrastructure tooling.
- Familiarity with chaos engineering frameworks like Gremlin or Chaos Monkey.
- Experience with LLMs and AI-assisted development workflows for infrastructure automation.
Culture & Benefits
- Comprehensive benefits package including medical, dental, vision, and disability insurance.
- Flexible paid time off, paid holidays, and parental leave.
- 401(k) matching plan (1:1 match up to 6%) and Employee Stock Purchase Plan.
- Active social culture with game nights, happy hours, and team events.
- Commitment to diversity, inclusion, and belonging across a global team.