Senior Cloud Reliability Engineer (AWS)

162 200 - 187 200$

Формат работы

onsite

Тип работы

fulltime

Грейд

senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Senior Cloud Reliability Engineer (AWS): Architecting and implementing systems to ensure cloud environment reliability with an accent on automation tools, frameworks, and toil reduction. Focus on designing complex Terraform modules, managing SLIs/SLOs, and conducting chaos engineering to harden distributed systems.

Location: Dallas, TX

Salary: $162,200 – $187,200

Company

hirify.global is a staffing and technical recruiting firm providing specialized talent for diverse engineering projects.

What you will do

Design, develop, and maintain SRE utilities and automation solutions to minimize toil and drive self-service infrastructure.
Architect and maintain complex Terraform modules to manage AWS resources using cost-efficient design principles.
Develop custom APIs and tools in Python to integrate disparate cloud services using TDD and version control best practices.
Define and manage SLIs/SLOs, monitor system health, and lead root-cause analysis (RCA) for blameless postmortems.
Conduct resilience testing and chaos engineering experiments to harden system architecture.
Establish SRE standards, guidelines, and governance frameworks for adoption across cross-functional teams.

Requirements

Minimum 7 years of professional software development experience focused on platform engineering or reliability.
Minimum 5 years of experience building enterprise-grade tools and APIs with advanced Python.
Minimum 3 years of deep hands-on experience with core AWS services (EC2, VPC, S3, Lambda, IAM, EventBridge, Step Functions).
Expert-level proficiency with Terraform (module development/state management) and CI/CD pipeline implementation.
Minimum 3 years of experience defining SLIs/SLOs and managing error budgets.
Must be located in or able to work onsite in Dallas, TX

Nice to have

Proficiency in GoLang.
Hands-on experience with observability tools such as Grafana, CloudWatch, and AWS Canary.
Familiarity with ITSM workflows (Incident, Change, and Problem Management).

Culture & Benefits

Major medical, dental, and vision insurance for assignments lasting 13 weeks or longer.
401k retirement plan.
Statutory sick pay where required.
Commitment to equal opportunity and providing reasonable accommodations for individuals with disabilities.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →