Principal Software Reliability Engineer (Python)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Principal Software Reliability Engineer (Python/Ruby/Node.js): Defining and driving product-level reliability for consumer identity systems with an accent on release safety, observability, and system architecture. Focus on building reliability tooling, reducing incident volume, and ensuring 99.99% availability for critical biometric and authentication services.
Location: Must be based in Portugal or the United Kingdom; roles are hybrid or remote eligible.
Company
is a global industry leader in identity-centric security solutions, protecting data and identities for organizations in over 150 countries.
What you will do
- Define the reliability roadmap and strategy for the Consumer Identity product engineering team.
- Partner with engineering leadership to implement resilience patterns and optimize service performance.
- Build tooling for automated rollbacks, progressive delivery, and advanced observability.
- Lead postmortem processes and drive systemic improvements to reduce change-induced incidents.
- Influence technical direction across the organization to enhance system reliability and reduce complexity.
Requirements
- 8+ years of software engineering experience, with 4+ years specifically in reliability or SRE roles.
- Proficiency in at least one backend language (Python, Ruby, or Node.js).
- Deep understanding of resilience patterns, observability, and incident management best practices.
- Proven ability to influence without authority and communicate complex reliability concepts to senior stakeholders.
- Must be located in Portugal or the United Kingdom.
Nice to have
- Experience with chaos engineering and fault injection.
- Reliability experience with ML systems or complex model serving.
- Familiarity with Datadog, Kubernetes, AWS, and GitLab CI/CD.
- Experience leveraging LLMs for automated runbook generation or code analysis.
Culture & Benefits
- Flexible work environment supporting remote, hybrid, or on-site arrangements.
- Opportunities for professional growth through learning-forward initiatives.
- Collaborative, global culture with a strong emphasis on diversity and inclusion.
- Direct impact on high-scale security solutions used by major financial institutions.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →