Senior Backend Software Engineer (Observability)
Job description
TL;DR
Senior Backend Software Engineer (Observability): Build and scale core observability services for a cloud AI platform, with an emphasis on high-volume telemetry ingestion, distributed storage, query engines, and alerting pipelines. Solve complex production reliability and performance challenges while enabling AI-assisted troubleshooting across logs, metrics, traces, and operational data.
Location: Amsterdam, Netherlands; Germany; Prague, Czech Republic; Remote - Europe; United Kingdom
Company
The company builds a full-stack AI cloud platform for developers and enterprises, spanning compute, storage, networking, and applied AI.
What you will do
- Build and scale backend services for the Observability Platform, powering logs, metrics, traces, alerting, and troubleshooting.
- Design and implement high-volume telemetry ingestion and distributed storage systems.
- Develop query engines and alerting pipelines for operational data at scale.
- Improve system reliability, scalability, and performance across Kubernetes, managed services, databases, and networking.
- Troubleshoot complex production issues and drive fixes for operational stability.
- Collaborate with engineers across the stack to deliver new capabilities, including AI-assisted troubleshooting.
Requirements
- 5+ years of professional software engineering experience.
- Strong knowledge of Go (Golang) or willingness to switch to it quickly.
- Experience building distributed backend systems.
- Solid understanding of software reliability, scalability, and performance.
- Ability to troubleshoot complex production issues.
- Teamwork-oriented approach and strong communication skills.
Nice to have
- Experience with observability platforms/telemetry systems or open-source projects such as Prometheus, Grafana, Loki, Jaeger, OpenTelemetry, VictoriaMetrics, Mimir, or Tempo.
- Production experience using ClickHouse.
Culture & Benefits
- Competitive compensation and opportunities for career growth and learning.
- Flexibility, ownership, and a collaborative, innovative culture.
- Work on impactful AI projects in an international environment.
- Fast-moving team with meaningful impact and trust.
Hiring process
- Interviews to assess experience with distributed backend systems, reliability/performance, and observability-related work.
- Technical evaluation focused on building and operating backend services for telemetry and alerting.
- Final discussions around collaboration and communication fit.