Senior Database Reliability Engineer (DBRE)
Job description
TL;DR
Senior Database Reliability Engineer (DBRE) (PostgreSQL/MySQL): Design, operationalize, and optimize large-scale data persistence layers for mission-critical systems, with an emphasis on high availability, performance, and automation. Focus on implementing resilient PostgreSQL clusters, incident response, root-cause analysis, and cross-functional collaboration with engineering teams.
Location: Hybrid in San Francisco Bay Area, CA (requires in-person onboarding and travel to the SF or Chicago office during the first week; U.S. Person status required for federal access). Open to candidates in California, Colorado, Illinois, New York, and Washington.
Salary: $160,000 to $220,000 USD (San Francisco Bay Area); $143,000 to $196,000 USD (other specified US locations).
Company
builds trusted infrastructure to secure identities, enabling organizations to safely embrace AI.
What you will do
- Design and operate highly available PostgreSQL clusters with physical/logical replication, sharding, failover automation, multi-AZ/region setups, and disaster recovery.
- Optimize query performance, indexing, schema design, storage engines, capacity planning, and workload modeling.
- Develop automation for provisioning, backups, failovers, vacuum tuning, and schema management using Terraform, Ansible, and Kubernetes Operators.
- Build monitoring, alerting, self-healing systems for PostgreSQL and MySQL.
- Lead incident response for performance issues, replication lag, deadlocks; conduct root-cause analysis and fixes.
- Collaborate with engineers on SQL reviews, schema optimization, design patterns, migrations, upgrades.
Requirements
- 4+ years hands-on PostgreSQL in high-volume, distributed production environments.
- Deep PostgreSQL internals (WAL, MVCC, vacuum tuning, query planner, indexing, logical replication).
- Production MySQL experience (InnoDB, replication, tuning).
- Advanced SQL, schema design, query optimization.
- Linux, networking, troubleshooting; automation with Go/Python.
- Monitoring tools (Prometheus, Grafana, Datadog); cloud (AWS/GCP).
Nice to have
- PgBouncer, HAProxy, connection pooling.
- Event streaming (Kafka, Debezium), CDC.
- 24/7 on-call support.
- Open-source PostgreSQL contributions.
Culture & Benefits
- Equity, bonus, comprehensive health/dental/vision insurance, 401(k), FSA, PTO, parental leave.
- In-person onboarding for connection and impact from day one.
- Global community across 20+ offices, focus on innovation, well-being, social impact, talent development.
- Flexible spending, paid leave per policies.