Senior Site Reliability Engineer (AWS/GCP)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Site Reliability Engineer (AWS/GCP): Building and optimizing infrastructure automation and monitoring systems for an AI-powered education platform with an accent on reliability, scalability, and automation. Focus on reducing on-call toil, designing core cloud infrastructure on AWS and GCP, and enhancing release processes for seamless deployments.
Location: Remote (Must be based in Argentina, Brazil, Chile, Colombia, or Costa Rica)
Company
is a leading education technology company delivering AI-powered solutions designed to unlock student engagement and empower teachers in K–12 classrooms.
What you will do
- Maintain and extend cloud infrastructure using Terraform, GitHub Actions CI/CD, Prefect, and AWS/GCP services.
- Build symptom-based monitoring and alerting using Datadog, Sentry, and CloudWatch to improve system visibility.
- Automate repeatable manual tasks to reduce on-call toil and optimize operational processes like deployments and migrations.
- Design and maintain core cloud infrastructure to support thousands of concurrent users with high fault tolerance.
- Debug complex production issues across various levels of the stack and provide architectural planning as an embedded team member.
- Participate in on-call rotations to respond to incidents impacting platform availability.
Requirements
- Strong experience with Infrastructure as Code (Terraform) and GitHub CI/CD for automation.
- Proficiency in containerization (Docker, ECS) and leveraging cloud technologies (AWS, GCP).
- Experience managing and troubleshooting high-availability datastores (MySQL, Postgres, Neo4J) and Redis clusters.
- Expertise in operating system configuration, storage, and networking (VPCs, proxies, CDNs).
- Proficiency in Shell, Python, and SQL.
- Must be based in Argentina, Brazil, Chile, Colombia, or Costa Rica.
Culture & Benefits
- Fully remote work environment with a provided monthly tech stipend.
- Opportunity to contribute to a mission-driven product improving the lives of students and teachers.
- Collaborative engineering culture emphasizing knowledge sharing, mentoring, and constructive feedback.
- Focus on professional growth through annual learning and development allowances.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →