Telecom Observability Engineer
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Telecom Observability Engineer (Mobile Networks): Building and operating observability for real-time core network and infrastructure issues with an accent on monitoring, alerting, and escalation workflows across distributed systems. Focus on designing observability strategies, implementing dashboards and distributed tracing/log aggregation, and using AI-powered anomaly detection to maintain optimal mobile network performance.
Location: Remote
Company
is a telecom company focused on monitoring and supporting mobile network services.
What you will do
- Maintain and improve telecom network observability, including fault investigation and troubleshooting for real-time core network/infrastructure issues.
- Design and implement observability strategies across distributed systems and microservices.
- Deploy and maintain monitoring solutions using SevOne, Grafana, and CloudPak4AiOPs; create real-time dashboards and on-demand dashboards for service validation after failures.
- Develop automated alerting with AI-powered anomaly detection and implement distributed tracing and log aggregation.
- Support NOC engineers with analysis, issue resolution, and proper escalation channel assessment.
- Define and track KPIs/metrics for mobile network performance using SLO/SLI/SLA concepts, including dashboards, alerts, and visualizations.
Requirements
- 10+ years of experience in a telecom environment performing L3 support or similar roles.
- Strong understanding of Mobile Networks 2G/3G/4G/5G and telecom network performance concepts.
- Expertise with observability/monitoring tools such as Prometheus, Grafana, ELK Stack, SevOne, and CloudPak.
- Strong knowledge of distributed systems and microservices architecture.
- Basic Python, Linux, and Bash knowledge.
- Proficient written and verbal English communication skills.
Nice to have
- Experience with cloud platforms (AWS, GCP, Azure).
- Experience creating training materials related to KPIs and dashboards.
Culture & Benefits
- Remote work with collaboration across telecom, core engineering, and Level 1/2 customer support escalation groups.
- Hands-on role focused on monitoring, escalation, and maintaining network performance against KPI/SLA targets.
- Opportunity to learn mobile telecom networks by working with highly experienced specialists.
Hiring process
- Interviews focused on telecom observability/monitoring experience, distributed systems knowledge, and troubleshooting/escalation approach.
- Discussion of practical experience with dashboards, alerting, tracing/log aggregation, and KPI/SLA/SLO concepts.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →