6 дней назад
Site Reliability Engineer (Azure)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
Текст:
TL;DR
Site Reliability Engineer (Azure): Managing stability and reliability of B2B & B2C ecommerce platforms with an accent on automation, telemetry, and cloud infrastructure. Focus on optimizing Azure AKS environments, implementing proactive monitoring using Splunk and AppInsights, and automating deployments via CI/CD pipelines.
Location: Northbrook, IL
Company
is a professional services firm providing technical staffing and platform operations support for enterprise clients.
What you will do
- Monitor and maintain Development, QA, Staging, and Production environments to ensure high availability.
- Mitigate production performance issues and implement automated remediations to prevent recurrence.
- Configure monitors, alerts, and Service Level Indicators (SLIs) using various telemetry technologies.
- Automate the deployment of cloud resources using Infrastructure as Code (IaC) pipelines.
- Collaborate with application and cloud teams to guide container and pod deployments.
- Perform root-cause analysis (RCA) and incident management for software and telemetry defects.
Requirements
- Expert experience with ATG Commerce or building custom Java/Java EE solutions on Azure AKS.
- 3+ years of professional experience with Microsoft Azure.
- Hands-on experience with containerization, Kubernetes, and microservices architecture.
- Proficiency with APM tools such as Splunk APM, AppDynamics, or Azure AppInsights.
- Experience with DevOps platforms including Jenkins, Artifactory, ACR, or Azure DevOps.
- Ability to participate in after-hours on-call rotations and maintenance windows.
- Bachelor's degree in Computer Science or a relevant four-year degree.
Nice to have
- Experience using Terraform for Infrastructure as Code.
- Deep knowledge of Azure networking, Application Gateway, APIM, and IAM Policy.
- Production support experience specifically for E-commerce websites.
- Experience tracking and reporting KPIs such as MTBI, MTRS, and MTTD.
Culture & Benefits
- Work within an Agile environment partnering closely with a Scrum Master.
- Opportunity to mentor full-time employees and drive operational excellence.
- Fast-paced environment focusing on continuous improvement through operational metrics.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →
Похожие вакансии
16 часов назад
DevOps Engineer (AWS/Azure)
19 часов назад
Senior Cloud Platform Engineer (AWS/GCP)
2 дня назад
Senior Site Reliability Engineer (Kubernetes/Terraform)
150 000 - 200 000$
2 дня назад
Senior Site Reliability Engineer (AI)
109 600 - 164 400$
7 дней назад
Site Reliability Engineer (Platform Infrastructure)
4 дня назад
Site Reliability Engineer (AWS, GCP, Kubernetes)
100 000 - 145 000$