Эта вакансия в архиве

Посмотреть похожие вакансии ↓
Company hidden
обновлено 1 месяц назад

Senior Cloud Site Reliability Engineer (AI)

Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
c1
Страна
US

Описание вакансии

Текст:
/

TL;DR

Senior Cloud Site Reliability Engineer (AI): Improving the reliability and availability of cloud solutions with an accent on providing on-call support for Major Incidents and reducing outage duration. Focus on automating manual activities, system design consulting, capacity planning, and blameless post-mortems.

Location: USA - Sandy, UT

Company

hirify.global Ltd. is a corporation whose software products are used by 25,000+ global businesses, including 85 of the Fortune 100, excelling in AI, cloud and digital solutions.

What you will do

  • Create dashboards for application observability, including SLI/SLO metrics.
  • Automate manual activities to reduce toil and assist development teams with SRE services.
  • Participate in design, definition, and scoping of new solutions, ensuring thorough documentation.
  • Provide on-call support for high-priority incidents and assist in identifying root causes and permanent fixes.
  • Support services through system design consulting, developing software platforms, and capacity planning.
  • Provide technical guidance and coaching to team members and ensure compliance with policies and standards.

Requirements

  • 4+ years programming/scripting experience.
  • 4+ years of experience working within public or private cloud environments.
  • 4+ years of SRE or related experience.
  • Experience with Agile, Jira, GitHub, monitoring, automation, and dashboarding.
  • English: 6+ years communicating in a technical field (C1 equivalent).
  • Ability to troubleshoot complex issues and proactively engage with peers and stakeholders.

hirify.global-to-have">hirify.global to have

  • Experience with Prometheus, Datadog, Grafana, Splunk, BMC, Dynatrace, AppDynamics, or New Relic.
  • Experience working with Kubernetes, Docker, microservices, or serverless compute.
  • Experience with Ansible or Terraform.
  • Proficiency in C#, C++, Java, Python, Perl, or Ruby.

Culture & Benefits

  • Ambitious and challenge-driven environment with high standards.
  • Commitment to equal opportunity employment.
  • Focus on innovation in AI, cloud, and digital.
  • Global presence with over 8,500 employees across 30+ countries.
  • Opportunities for technical guidance and mentoring.