Site Reliability Consultant (SRE)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Site Reliability Consultant (SRE): Designing, deploying, and operating large-scale distributed systems across compute, storage, networking, and AI/ML environments with an accent on automation, scalability, and reliability. Focus on building resilient infrastructure, optimizing Kubernetes clusters, and collaborating with clients to solve complex performance challenges.
Location: Ottawa, Canada (Remote-friendly)
Company
is a multinational expert in strategic database and analytics services, helping mid and large-sized businesses leverage cloud and AI technologies to drive digital transformation.
What you will do
- Operate and optimize Kubernetes clusters, Istio service mesh, and Linux-based systems.
- Automate workflows using Go, Python, and Shell scripting.
- Build monitoring and observability solutions with Prometheus, Grafana, and Loki.
- Troubleshoot complex networking, storage, and system performance issues.
- Partner with AI/ML teams to ensure infrastructure readiness for model training and data pipelines.
- Participate in on-call rotations and postmortem reviews to improve system resilience.
Requirements
- Experience with Google Cloud and IaC tools like Terraform.
- Strong knowledge of microservices, containers (Kubernetes, Docker), and networking.
- Hands-on experience with PKI, service mesh, and Linux systems administration.
- SRE mindset with a focus on automation, scalability, and reliability.
- Ability to fulfill requirements for a background check.
Nice to have
- Experience with Golang.
Culture & Benefits
- Competitive total rewards package.
- Substantial training allowance and professional development days.
- Annual wellness budget for health and fitness.
- Generous paid vacation and sick days.
- Equipment provided for home office setup with an annual personalization budget.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →