Эта вакансия в архиве
Посмотреть похожие вакансии ↓обновлено 1 месяц назад
Senior Cloud Site Reliability Engineer (AI)
Описание вакансии
Текст:
TL;DR
Senior Cloud Site Reliability Engineer (AI): Improving the reliability and availability of cloud solutions with an accent on providing on-call support for Major Incidents and reducing outage duration. Focus on automating manual activities, system design consulting, capacity planning, and blameless post-mortems.
Location: USA - Sandy, UT
Company
Ltd. is a corporation whose software products are used by 25,000+ global businesses, including 85 of the Fortune 100, excelling in AI, cloud and digital solutions.
What you will do
- Create dashboards for application observability, including SLI/SLO metrics.
- Automate manual activities to reduce toil and assist development teams with SRE services.
- Participate in design, definition, and scoping of new solutions, ensuring thorough documentation.
- Provide on-call support for high-priority incidents and assist in identifying root causes and permanent fixes.
- Support services through system design consulting, developing software platforms, and capacity planning.
- Provide technical guidance and coaching to team members and ensure compliance with policies and standards.
Requirements
- 4+ years programming/scripting experience.
- 4+ years of experience working within public or private cloud environments.
- 4+ years of SRE or related experience.
- Experience with Agile, Jira, GitHub, monitoring, automation, and dashboarding.
- English: 6+ years communicating in a technical field (C1 equivalent).
- Ability to troubleshoot complex issues and proactively engage with peers and stakeholders.
hirify.global-to-have"> to have
- Experience with Prometheus, Datadog, Grafana, Splunk, BMC, Dynatrace, AppDynamics, or New Relic.
- Experience working with Kubernetes, Docker, microservices, or serverless compute.
- Experience with Ansible or Terraform.
- Proficiency in C#, C++, Java, Python, Perl, or Ruby.
Culture & Benefits
- Ambitious and challenge-driven environment with high standards.
- Commitment to equal opportunity employment.
- Focus on innovation in AI, cloud, and digital.
- Global presence with over 8,500 employees across 30+ countries.
- Opportunities for technical guidance and mentoring.
Похожие вакансии
2 дня назад
Senior Site Reliability Engineer (AI)
109 600 - 164 400$
4 дня назад
Senior DevOps Engineer
150 000 - 170 000$
7 дней назад
AI Infrastructure Engineer
157 487 - 174 713$
6 дней назад
Site Reliability Engineer (Platform Infrastructure)
2 дня назад
Senior Site Reliability Engineer (Kubernetes/Terraform)
150 000 - 200 000$
6 дней назад