TL;DR
Site Reliability Engineer (Telco): Support and maintain Linux-based servers and telephony services in production, with an emphasis on high availability and incident resolution. The role focuses on automating tasks, improving system reliability, and collaborating with development teams.
Location: On-site presence required 4 days a week
Company
hirify.global is a company focused on maintaining business-critical telephony and communication services.
What you will do
- Support and maintain Linux-based servers and telephony services in production.
- Investigate and resolve incidents in a high-load, distributed environment.
- Participate in on-call shifts and ensure the stability of systems under strict SLAs.
- Analyze service performance, reliability, and architecture bottlenecks; propose improvements.
- Work with development teams to safely deliver and validate changes before production deployment.
- Contribute ideas and help evolve team processes, automation, and monitoring practices.
Requirements
- Strong experience with UNIX/Linux systems, including troubleshooting from the CLI.
- Good understanding of networking protocols and SIP.
- Strong hands-on experience with Kubernetes (k8s) and containerized environments.
- Proven track record of working in production environments, with a careful and methodical approach to changes.
- Understanding of high-availability systems, fault tolerance, and performance optimization.
- Good command of English (B2 or higher).