TL;DR
Site Reliability Engineer (Fintech): Designing and implementing scalable systems, architecting observability solutions, and building automation for a financial platform with an accent on defining SLOs/SLIs, standardizing monitoring as code, and driving systemic improvements through incident response. Focus on enhancing platform reliability, reducing alert noise, and increasing signal quality across distributed systems.
Location: Fully remote
Company
hirify.global is a company focused on providing a platform for alternative investments.
What you will do
- Define and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
- Lead monitoring and alerting standardization using infrastructure as code, preferably Terraform.
- Develop observability standards for metrics, logs, and traces, including OpenTelemetry.
- Implement reliability and operability standards for Kubernetes-based services.
- Drive automation to eliminate toil, improve repeatability, and accelerate recovery.
- Serve as Incident Commander for high-severity incidents, lead postmortems, and drive systemic improvements.
Requirements
- 7+ years of experience in SRE or related roles with technical seniority.
- Strong experience with AWS and Kubernetes in production environments.
- Demonstrated experience defining and utilizing SLOs/SLIs and implementing actionable observability solutions.
- Strong Infrastructure as Code (IaC) skills (Terraform preferred) for automation and standardization.
- Strong incident response skills, including leading postmortems and driving systemic reliability improvements.
- Clear written and verbal English communication skills.
Culture & Benefits
- Competitive salary, annual performance bonus, and equity for all full-time employees.
- 100% employer-paid health and dental insurance.
- Generous paid time off (PTO).
- Opportunity to participate in on-call rotations focused on improving reliability.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →