Назад
Company hidden
1 день назад

Senior Cloud Reliability Engineer (AWS)

162 200 - 187 200$
Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior Cloud Reliability Engineer (AWS): Architecting and implementing systems to ensure cloud environment reliability with an accent on automation tools, frameworks, and toil reduction. Focus on designing complex Terraform modules, managing SLIs/SLOs, and conducting chaos engineering to harden distributed systems.

Location: Dallas, TX

Salary: $162,200 – $187,200

Company

hirify.global is a staffing and technical recruiting firm providing specialized talent for diverse engineering projects.

What you will do

  • Design, develop, and maintain SRE utilities and automation solutions to minimize toil and drive self-service infrastructure.
  • Architect and maintain complex Terraform modules to manage AWS resources using cost-efficient design principles.
  • Develop custom APIs and tools in Python to integrate disparate cloud services using TDD and version control best practices.
  • Define and manage SLIs/SLOs, monitor system health, and lead root-cause analysis (RCA) for blameless postmortems.
  • Conduct resilience testing and chaos engineering experiments to harden system architecture.
  • Establish SRE standards, guidelines, and governance frameworks for adoption across cross-functional teams.

Requirements

  • Minimum 7 years of professional software development experience focused on platform engineering or reliability.
  • Minimum 5 years of experience building enterprise-grade tools and APIs with advanced Python.
  • Minimum 3 years of deep hands-on experience with core AWS services (EC2, VPC, S3, Lambda, IAM, EventBridge, Step Functions).
  • Expert-level proficiency with Terraform (module development/state management) and CI/CD pipeline implementation.
  • Minimum 3 years of experience defining SLIs/SLOs and managing error budgets.
  • Must be located in or able to work onsite in Dallas, TX

Nice to have

  • Proficiency in GoLang.
  • Hands-on experience with observability tools such as Grafana, CloudWatch, and AWS Canary.
  • Familiarity with ITSM workflows (Incident, Change, and Problem Management).

Culture & Benefits

  • Major medical, dental, and vision insurance for assignments lasting 13 weeks or longer.
  • 401k retirement plan.
  • Statutory sick pay where required.
  • Commitment to equal opportunity and providing reasonable accommodations for individuals with disabilities.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →