Staff Site Reliability Engineer (Fintech)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff Site Reliability Engineer (GCP/Python): Designing and scaling core technology foundations for a global derivatives marketplace with an accent on reliability as a product and ultra-low latency. Focus on building self-healing infrastructure using Generative AI and architecting high-level abstractions for the global technology stack.
Location: Hybrid in Chicago, IL (requires 2 days per week on-site)
Salary: $132,100–$220,100
Company
is the world’s leading derivatives marketplace, providing critical infrastructure for the global financial system.
What you will do
- Define the 12–18 month technical strategy for Platform SRE teams, focusing on global footprint and self-service capabilities.
- Act as the final technical authority for major infrastructure changes involving GCP, GKE, and Kafka event buses.
- Lead responses for complex cross-functional outages and drive a blameless culture prioritizing systemic fixes.
- Architect and oversee an Internal Development Platform (IDP) using Python to simplify the global technology stack.
- Standardize SLIs, SLOs, and Error Budgets across platform teams to ensure market integrity.
- Mentor the SRE organization through design reviews and architectural office hours.
Requirements
- 10+ years of experience in SRE, Systems Engineering, or Software Engineering in high-pressure environments.
- 3+ years in a Staff, Principal, or Tech Lead capacity overseeing multiple teams or complex domains.
- Expert-level proficiency in Python (and ideally Go) for building production-grade distributed systems.
- Deep expertise in GCP (Networking, IAM, GKE) and scaling Kafka clusters for high-throughput environments.
- Mastery of Terraform module design and ArgoCD for enterprise-scale immutable infrastructure.
- Must be based in the Chicago area or able to work hybrid on-site in Chicago.
Nice to have
- Experience leveraging Generative AI and Agentic workflows (e.g., Gemini) for self-healing infrastructure.
- GCP Professional Cloud Architect or Kubernetes (CKA/CKAD) certifications.
- Proficiency in Node.js or modern frontend frameworks.
- Domain expertise in Financial Markets or highly regulated, high-concurrency environments.
Culture & Benefits
- Code-first engineering culture that prioritizes systematic improvement over manual intervention.
- Comprehensive health coverage, retirement package including 401(k), and an active pension plan.
- Competitive education reimbursement and mental health benefits.
- Paid time off and a flexible, holistic benefits package.
- Opportunity to work on technology maintaining the integrity of the global financial system.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →