Senior Site Reliability Engineer (Crypto)

Формат работы

remote (только UK)

Тип работы

fulltime

Грейд

senior

Английский

Страна

UK/Europe

Описание вакансии

Текст:

TL;DR

Senior Site Reliability Engineer (Crypto): Taking full ownership of hirify.global’s reliability, observability, and incident response with an accent on AWS infrastructure, Rust microservices, and TypeScript indexers. Focus on designing reliability into systems, improving incident management, and fostering a culture of reliability.

Location: 100% remote (UK only)

Company

hirify.global builds infrastructure to bring financial integrity to the crypto economy.

What you will do

Own reliability end-to-end: design, measure, and improve service availability, latency, and performance across hirify.global’s platform
Enhance observability: expand and refine metrics, logs, and traces to provide deep insight into our Rust and TypeScript services
Lead incident management: define playbooks, improve response workflows, and foster a blameless postmortem culture
Strengthen infrastructure: optimise AWS configurations, CI/CD pipelines, autoscaling, and networking for reliability and cost efficiency
Collaborate across teams: work with product and engineering leads to ensure reliability is considered at every design stage
Drive continuous improvement: identify systemic weaknesses, automate recovery where possible, and reduce MTTR across the stack

Requirements

5+ years of experience in Site Reliability, DevOps, or Infrastructure Engineering roles
Deep understanding of distributed systems and debugging at the network, application, and database layers
Hands-on experience with AWS, container orchestration (Kubernetes, ECS), and Infrastructure-as-Code tools (Pulumi or similar)
Comfortable tracing through Rust and TypeScript code to diagnose complex performance or reliability issues
Strong collaborator with excellent communication skills