This vacancy is archived
Senior Site Reliability Engineer
Job description
TL;DR
Senior Site Reliability Engineer (SRE/DevOps): Architecting, building, and maintaining scalable and reliable production-grade infrastructure using Terraform, with an emphasis on AWS cloud infrastructure and Kubernetes administration. Focus on monitoring, troubleshooting, and optimizing the infrastructure platform for performance and cost.
Location: Pune, India. Hybrid model (2-3 days in-office).
Company
is the world’s leading open hotel commerce platform, supporting 50,000 hotels in 150+ countries.
What you will do
- Architect, build, and maintain scalable and reliable production-grade infrastructure using Terraform.
- Monitor, troubleshoot, and optimize the infrastructure platform for performance and cost.
- Collaborate with data scientists and engineers to understand their data requirements and provide efficient solutions, adhering to security policies.
- Maintain and audit the security of the infrastructure, ensuring adherence to security standards and regulatory requirements such as PCI DSS, GDPR, and ISO 27001.
- Act as an escalation point and subject matter expert for business-as-usual (BAU) support issues related to the infrastructure platform.
- Participate in a 24x7 on-call rotation to support critical production infrastructure.
Requirements
- Extensive professional experience in an infrastructure platform engineering role (e.g., SRE, DevOps).
- Strong proficiency in at least one programming language (e.g., Python, Go).
- Strong Linux Administration skills and security hardening experience.
- Demonstrated deep experience with AWS, Terraform, and associated data services.
- Demonstrated deep experience with Kubernetes or equivalent container orchestration service and underlying technologies.
- Excellent problem-solving, analytical, and communication skills.
Nice to have
- Familiarity with data warehousing concepts and technologies (e.g., Databricks, Snowflake, BigQuery).
- Understanding of MLOps and LLMOps principles.
- Experience with big data and streaming technologies (e.g., Spark, Hadoop, Kafka).
Culture & Benefits
- Mental health and well-being initiatives.
- Generous parental leave policy.
- Flexibility to work in a Hybrid model (2-3 days in-office).
- Paid birthday, study, and volunteering leave every year.
- Sponsored social clubs, team events, and celebrations.
- Investment in your personal growth, including training for your advancement.