Senior Site Reliability Engineer
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Site Reliability Engineer (Fintech): Design and operate resilient distributed systems for real-time financial platform with an accent on observability, SLOs/SLIs, and incident management. Focus on building signal-heavy monitoring with Datadog/CloudWatch, improving incident lifecycles, and leading reliability initiatives across teams.
Location: Remote from Mexico; hybrid possible in Mexico City office. No visa sponsorship or immigration support
Company
Pioneer in earned wage access, delivering real-time financial flexibility without fees, interest, or credit checks, backed by top VCs.
What you will do
- Design systems for resilience, graceful degradation, and capacity planning.
- Define and measure SLOs/SLIs reflecting customer experience.
- Build observability using Datadog, CloudWatch, and incident.io for alerting and management.
- Improve incident lifecycle from detection to blameless post-mortems.
- Combine software engineering with reliability practices for highly available, debuggable systems.
Requirements
- Bachelor's or Master's in computer science or equivalent experience
- 4+ years in SRE or Software Engineering
- Hands-on coding in Python and/or Go
- Distributed systems expertise: design, operation, production at scale
- Deep knowledge of SLOs, SLIs, error budgets, MTTR
- Observability, incident response, cross-functional communication
- Operational tooling and AI fluency; leadership and mentorship
Culture & Benefits
- Healthcare, internet/cell phone reimbursement, lg stipend
- Potential travel to Mountain View HQ
- Diverse, inclusive team focused on belonging
- Salary based on role, level, location
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →