Назад
4 часа Π½Π°Π·Π°Π΄

Senior Site Reliability Engineer (SRE)

Π€ΠΎΡ€ΠΌΠ°Ρ‚ Ρ€Π°Π±ΠΎΡ‚Ρ‹
hybrid
Π’ΠΈΠΏ Ρ€Π°Π±ΠΎΡ‚Ρ‹
fulltime
Π“Ρ€Π΅ΠΉΠ΄
senior
Английский
b2
Π‘Ρ‚Ρ€Π°Π½Π°
India
vacancy_detail.hirify_telegram_tooltip Π—Π°Π³Ρ€ΡƒΠΆΠ°Π΅ΠΌ источник...

ΠœΡΡ‚Ρ‡ & Π‘ΠΎΠΏΡ€ΠΎΠ²ΠΎΠ΄

ΠŸΠΎΠΊΠ°ΠΆΠ΅Ρ‚ Π²Π°ΡˆΡƒ ΡΠΎΠ²ΠΌΠ΅ΡΡ‚ΠΈΠΌΠΎΡΡ‚ΡŒ ΠΈ Π½Π°ΠΏΠΈΡˆΠ΅Ρ‚ письмо

ОписаниС вакансии

πŸš€ Hiring: Senior Site Reliability Engineer (SRE)
πŸ“ Location: Bengaluru (Hybrid)
πŸ’Ό Experience: 6–10 Years
⚠️ Important:
βœ” Only local Bengaluru candidates will be considered
βœ” Must be available for face-to-face interview on short notice
______________
πŸ”Ž Role Overview
We are looking for a hands-on Senior SRE with deep expertise in Observability, Kubernetes, and Cloud Platforms. This role focuses on building and operating highly reliable, scalable, and observable systems in GCP (preferred) and AWS environments.
______________
πŸ”Ή Key Responsibilities
Reliability & Operations
β€’ Design and operate highly available Kubernetes-based systems
β€’ Define & manage SLOs, SLIs, and Error Budgets
β€’ Lead incident response, RCA, and blameless postmortems
β€’ Improve platform reliability through automation
Observability (Core Focus)
β€’ Build centralized observability platforms (metrics, logs, traces)
β€’ Hands-on with Prometheus, Alertmanager, Grafana is Must
β€’ Logging/Tracing using ELK / OpenSearch, Loki, OpenTelemetry
β€’ Cloud-native monitoring (GCP Monitoring preferred)
β€’ Define actionable, low-noise alerting standards
Cloud & Platform Engineering
β€’ Infrastructure on GCP (GKE preferred) / AWS (EKS)
β€’ Kubernetes cluster operations
β€’ Helm deployments & Docker workloads
β€’ Infra automation using Terraform / Ansible / Packer
Automation & Tooling
β€’ Strong Python coding for reliability tooling
β€’ Build internal tools for SLO tracking & incident workflows
β€’ Integrate observability into CI/CD (Jenkins)
Leadership
β€’ Mentor engineers
β€’ Influence reliability architecture
β€’ Collaborate with platform & cloud teams
______________
βœ… Mandatory Skills
SRE | Python (Coding) | Kubernetes | ELK | Prometheus | Grafana | AWS/GCP | Docker | Helm | Terraform | Linux | Jenkins CI/CD
⭐ Nice to Have
Splunk | Datadog | Cribl | Vector | OpenTelemetry | Multi-cloud | Platform Security
______________
πŸ“… Project Highlights
✨ Build a centralized observability platform
πŸ“‰ Reduce MTTR using SLO-driven engineering
🚨 Lead production incident response
⚑ Optimize performance, scalability & cloud cost
______________
πŸ“© Interested?
Share the cv to

Π‘ΡƒΠ΄ΡŒΡ‚Π΅ остороТны: Ссли Ρ€Π°Π±ΠΎΡ‚ΠΎΠ΄Π°Ρ‚Π΅Π»ΡŒ просит Π²ΠΎΠΉΡ‚ΠΈ Π² ΠΈΡ… систСму, ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡ iCloud/Google, ΠΏΡ€ΠΈΡΠ»Π°Ρ‚ΡŒ ΠΊΠΎΠ΄/ΠΏΠ°Ρ€ΠΎΠ»ΡŒ, Π·Π°ΠΏΡƒΡΡ‚ΠΈΡ‚ΡŒ ΠΊΠΎΠ΄/ПО, Π½Π΅ Π΄Π΅Π»Π°ΠΉΡ‚Π΅ этого - это мошСнники. ΠžΠ±ΡΠ·Π°Ρ‚Π΅Π»ΡŒΠ½ΠΎ ΠΆΠΌΠΈΡ‚Π΅ "ΠŸΠΎΠΆΠ°Π»ΠΎΠ²Π°Ρ‚ΡŒΡΡ" ΠΈΠ»ΠΈ ΠΏΠΈΡˆΠΈΡ‚Π΅ Π² ΠΏΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΊΡƒ. ΠŸΠΎΠ΄Ρ€ΠΎΠ±Π½Π΅Π΅ Π² Π³Π°ΠΉΠ΄Π΅ β†’

ВСкст вакансии взят Π±Π΅Π· ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΠΉ

Π˜ΡΡ‚ΠΎΡ‡Π½ΠΈΠΊ -