Staff Site Reliability Engineer (Storage)
Job description
TL;DR
Staff Site Reliability Engineer (Storage): Ensure the reliability, resilience, and safe operation of critical storage systems (PostgreSQL, Kafka, Redis), evolving them toward a seamless Platform-as-a-Service for developers, with an emphasis on disaster recovery, capacity planning, and developer experience. Focus on assessing resilience maturity, delivering improvements to DR readiness and alerting, leading incidents, and building automation for AI-operated systems.
Location: Remote from Paris, Barcelona, Berlin, Milan, or Belgrade (Europe)
Company
Europe's leading finance workspace for SMEs with banking at its core, serving 600,000+ customers across 8 European countries, profitable since 2023.
What you will do
- Assess the resilience of the Kafka and Redis stacks, identify risks, and propose a business-aligned improvement roadmap within the first 3 months.
- Deliver improvements on disaster recovery, safe upgrades, alerting, and capacity planning with production impact.
- Act as a consultant for backend and product teams, leading design reviews and enabling efficient storage use.
- Lead high-severity incidents on stateful infrastructure, mitigate rapidly, and communicate clearly.
- Build automation, tooling, and APIs to enhance developer experience and prepare for AI-operated systems.
Requirements
- Strong hands-on experience operating distributed stateful systems at scale, especially Kafka (MSK) and Redis (ElastiCache)
- SRE mastery in disaster recovery planning, incident management, observability, and capacity planning
- Track record of platform engineering: building automation (IaC), tooling, or DBaaS-like solutions
- High rigor and autonomy in complex production environments and major incidents
- Strong communication to guide backend engineers on infrastructure constraints
Culture & Benefits
- Fully remote distributed team with high autonomy and focus (~50% deep projects, ~25% ops/incidents, ~25% consulting)
- Diversity-focused: 80+ nationalities, 45% women, 56% in leadership, discrimination-free hiring
- Unlimited access to the best AI tools, with experimentation encouraged
- Customer-first culture with a high NPS (75) and Trustpilot rating (4.8 from 55k reviews)
- Clear targets, technical growth support, ownership, and high standards from your manager
Hiring process
- Takes 20 working days on average
- Final decisions are made by humans; AI may assist in screening