TL;DR
Senior Site Reliability Engineer (Crypto): Taking full ownership of hirify.global’s reliability, observability, and incident response with an accent on AWS infrastructure, Rust microservices, and TypeScript indexers. Focus on designing reliability into systems, improving incident management, and fostering a culture of reliability.
Location: 100% remote (UK only)
Company
hirify.global builds infrastructure to bring financial integrity to the crypto economy.
What you will do
- Own reliability end-to-end: design, measure, and improve service availability, latency, and performance across hirify.global’s platform
- Enhance observability: expand and refine metrics, logs, and traces to provide deep insight into our Rust and TypeScript services
- Lead incident management: define playbooks, improve response workflows, and foster a blameless postmortem culture
- Strengthen infrastructure: optimise AWS configurations, CI/CD pipelines, autoscaling, and networking for reliability and cost efficiency
- Collaborate across teams: work with product and engineering leads to ensure reliability is considered at every design stage
- Drive continuous improvement: identify systemic weaknesses, automate recovery where possible, and reduce MTTR across the stack
Requirements
- 5+ years of experience in Site Reliability, DevOps, or Infrastructure Engineering roles
- Deep understanding of distributed systems and debugging at the network, application, and database layers
- Hands-on experience with AWS, container orchestration (Kubernetes, ECS), and Infrastructure-as-Code tools (Pulumi or similar)
- Comfortable tracing through Rust and TypeScript code to diagnose complex performance or reliability issues
- Strong collaborator with excellent communication skills
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →