Site Reliability Engineer
Job description
TL;DR
Site Reliability Engineer: Build and maintain scalable cloud infrastructure for a Carrier-as-a-Service platform, with an emphasis on automation, observability, and system reliability. The role focuses on designing robust deployment pipelines, managing mission-critical production environments, and enabling engineering teams through advanced tooling and infrastructure-as-code.
Company
The company is a NeoTelco building a global, accessible, and insightful telecom network platform that empowers users to launch their own carrier services.
What you will do
- Design and implement cloud-based platform infrastructure to support backend services.
- Automate technical operations including deployments, scaling, and disaster recovery.
- Monitor and maintain mission-critical production infrastructure to ensure maximum uptime.
- Participate in on-call rotations and contribute to a culture of continuous improvement through blameless postmortems.
- Provide tools and support to Engineering, Telecom, and Data Engineering teams to operate their services effectively.
Requirements
- Strong understanding of Linux/Unix system internals and networking.
- Proficiency in at least one programming language (Python, Go, or Ruby) and advanced scripting skills.
- Hands-on experience with infrastructure provisioning tools like Terraform or Ansible.
- Experience with containerization and orchestration using Docker and Kubernetes.
- Proven experience with cloud providers (AWS, Google Cloud, or Azure) and CI/CD pipeline management.
- Experience in on-call rotations and incident management practices.
Nice to have
- Experience with distributed systems like Kafka, Cassandra, or Elasticsearch.
- Knowledge of database management (SQL and NoSQL).
- Familiarity with distributed tracing and advanced log aggregation strategies.
- Experience with performance profiling and load testing.
Culture & Benefits
- Opportunity to work on a modern technology platform in the telecom space.
- Collaborative environment focused on innovation and solving complex connectivity challenges.
- Commitment to continuous improvement and blameless engineering culture.
- Remote-first work environment.