Senior Site Reliability Engineer (Crypto)
Job description
TL;DR
Senior Site Reliability Engineer (Crypto): Take full ownership of the company's reliability, observability, and incident response, with an emphasis on AWS infrastructure, Rust microservices, and TypeScript indexers. The role focuses on designing reliability into systems, improving incident management, and fostering a culture of reliability.
Location: 100% remote (UK only)
Company
The company builds infrastructure to bring financial integrity to the crypto economy.
What you will do
- Own reliability end-to-end: design, measure, and improve service availability, latency, and performance across the company's platform
- Enhance observability: expand and refine metrics, logs, and traces to provide deep insight into our Rust and TypeScript services
- Lead incident management: define playbooks, improve response workflows, and foster a blameless postmortem culture
- Strengthen infrastructure: optimise AWS configurations, CI/CD pipelines, autoscaling, and networking for reliability and cost efficiency
- Collaborate across teams: work with product and engineering leads to ensure reliability is considered at every design stage
- Drive continuous improvement: identify systemic weaknesses, automate recovery where possible, and reduce MTTR across the stack
Requirements
- 5+ years of experience in Site Reliability, DevOps, or Infrastructure Engineering roles
- Deep understanding of distributed systems and debugging at the network, application, and database layers
- Hands-on experience with AWS, container orchestration (Kubernetes, ECS), and Infrastructure-as-Code tools (Pulumi or similar)
- Comfortable tracing through Rust and TypeScript code to diagnose complex performance or reliability issues
- Strong collaborator with excellent communication skills