Назад
Company hidden
12 часов назад

Principal Software Reliability Engineer (Python)

Формат работы
remote (только Europe)/hybrid
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
UK/Portugal
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Principal Software Reliability Engineer (Python/Ruby/Node.js): Defining and driving product-level reliability for consumer identity systems with an accent on release safety, observability, and system architecture. Focus on building reliability tooling, reducing incident volume, and ensuring 99.99% availability for critical biometric and authentication services.

Location: Must be based in Portugal or the United Kingdom; roles are hybrid or remote eligible.

Company

hirify.global is a global industry leader in identity-centric security solutions, protecting data and identities for organizations in over 150 countries.

What you will do

  • Define the reliability roadmap and strategy for the Consumer Identity product engineering team.
  • Partner with engineering leadership to implement resilience patterns and optimize service performance.
  • Build tooling for automated rollbacks, progressive delivery, and advanced observability.
  • Lead postmortem processes and drive systemic improvements to reduce change-induced incidents.
  • Influence technical direction across the organization to enhance system reliability and reduce complexity.

Requirements

  • 8+ years of software engineering experience, with 4+ years specifically in reliability or SRE roles.
  • Proficiency in at least one backend language (Python, Ruby, or Node.js).
  • Deep understanding of resilience patterns, observability, and incident management best practices.
  • Proven ability to influence without authority and communicate complex reliability concepts to senior stakeholders.
  • Must be located in Portugal or the United Kingdom.

Nice to have

  • Experience with chaos engineering and fault injection.
  • Reliability experience with ML systems or complex model serving.
  • Familiarity with Datadog, Kubernetes, AWS, and GitLab CI/CD.
  • Experience leveraging LLMs for automated runbook generation or code analysis.

Culture & Benefits

  • Flexible work environment supporting remote, hybrid, or on-site arrangements.
  • Opportunities for professional growth through learning-forward initiatives.
  • Collaborative, global culture with a strong emphasis on diversity and inclusion.
  • Direct impact on high-scale security solutions used by major financial institutions.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →