Senior Site Reliability Engineer (Web3)
Job description
TL;DR
Senior Site Reliability Engineer (Web3): Ensuring the reliability, scalability, and operational excellence of systems powering Chainlink's Cross-Chain Interoperability Protocol (CCIP), with an emphasis on production resilience and observability. Focus on driving adoption of SLOs/SLIs, automating operational toil, and scaling infrastructure for high-throughput distributed services.
Location: Global remote role, with a preference for overlapping working hours with Eastern Standard Time (EST).
Salary: $129,000 – $244,000 (U.S. locations) or €68,000 – €176,000 (Spain).
Company
Chainlink is the industry-standard oracle platform powering decentralized finance and enabling advanced blockchain use cases for global financial institutions.
What you will do
- Advance production engineering practices to improve deployment safety and delivery velocity.
- Implement distributed tracing using OpenTelemetry to enhance observability and incident response.
- Automate operational tasks to eliminate toil and increase platform efficiency.
- Drive the adoption of meaningful SLOs, SLIs, and error budgets across engineering teams.
- Scale production infrastructure and ensure operational readiness for CCIP growth.
- Strengthen system reliability and reduce operational overhead for critical services.
Requirements
- Demonstrated experience in Site Reliability or Production Engineering for large-scale distributed systems.
- Deep expertise in defining and implementing SLOs, SLIs, and error budgets.
- Proven experience building and operating production Kubernetes environments.
- Applied knowledge of OpenTelemetry for observability in distributed systems.
- Ability to improve reliability, scalability, and operability of production infrastructure.
- Must be able to overlap working hours with Eastern Standard Time (EST).
Nice to have
- Technical leadership experience influencing reliability practices.
- Experience with capacity planning and performance tuning for high-throughput services.
- Background in Web3 infrastructure or crypto-native organizations.
- Experience with chaos engineering or fault-injection techniques.
- Experience leading on-call operations and defining escalation policies.
Culture & Benefits
- Global, remote-first work environment.
- Competitive cash compensation with long-term incentives.
- Comprehensive benefits package.
- Opportunity to work on industry-standard blockchain infrastructure.
- Collaborative culture focused on operational excellence and engineering efficiency.