AVP, Reliability Engineer (SRE)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
AVP, Reliability Engineer (SRE): Ensuring high availability, stability, and performance of OnePay-integrated applications with an accent on SRE principles, automation, and incident management. Focus on building resilient distributed systems, optimizing observability, and reducing operational toil through AIOps and infrastructure automation.
Location: Must be based in the U.S. and able to commute to a Hub for in-person engagement.
Salary: $100,000 – $170,000 USD Annual.
Company
A leading consumer financial services company providing diverse financing products and digital payment solutions.
What you will do
- Drive cross-functional investigations to identify root causes of production failures and implement long-term fixes.
- Design and maintain observability dashboards and monitoring capabilities to ensure adherence to SLAs/SLOs.
- Develop automation and leverage AIOps to reduce operational noise and expedite incident response.
- Support CI/CD pipeline operations and coordinate release processes with partner teams.
- Participate in on-call rotations to ensure the reliability of critical production systems.
- Communicate technical status, risks, and reliability initiatives to stakeholders and leadership.
Requirements
- Legal authorization to work in the U.S. is required (no visa sponsorship).
- Bachelor’s degree and 5+ years of experience in reliability, systems engineering, or application support (or 8+ years without a degree).
- Demonstrated experience troubleshooting and supporting distributed systems in cloud environments.
- Proficiency in scripting/automation (Python, Bash, JavaScript, PowerShell, or Go).
- Familiarity with configuration automation tools like Terraform or Ansible.
- Strong understanding of UNIX fundamentals, network protocols, and cloud concepts (containerization, load balancing).
Nice to have
- Experience with cloud providers (AWS, Azure, GCP).
- Knowledge of application languages such as Java, Golang, Rust, or C++.
- ITIL Foundation or SRE/DevOps certifications.
- Experience with tools like Splunk, New Relic, Grafana, or PagerDuty.
Culture & Benefits
- Inclusive culture with active Employee Resource Groups (ERGs).
- Flexible work environment with options for remote work near company hubs.
- Commitment to professional growth and continuous improvement.
- Comprehensive benefits package and performance-based annual bonus.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →