5 hours ago

Site Reliability Engineer (Observability)

Work format
remote (Germany only)/hybrid
Employment type
full-time
English
B2
Country
Germany


Job description


TL;DR

Site Reliability Engineer (Observability & Internal Tools): Managing and evolving the internal platform tooling and observability stack, with an emphasis on automation, developer infrastructure, and open-source alternatives. The role focuses on designing observability as a platform capability, defining SLOs, and embedding security engineering into the delivery process.

Location: Remote, with a required connection to Berlin or Cologne, Germany, for occasional on-site collaboration on major features and brainstorming sessions.

Company

smartclip is a technology-driven company specializing in internal platform tooling and TV Labs innovation.

What you will do

  • Operate and advance the observability stack utilizing Prometheus, Grafana, and Forgejo.
  • Pursue a "build and maintain" strategy that favors robust open-source alternatives over enterprise SaaS.
  • Design observability as a platform capability, defining SLOs and actionable alerting to prevent incidents.
  • Embed security engineering into the delivery process to identify vulnerabilities prior to penetration tests.
  • Maintain Linux systems and distributed tooling, balancing exploration with stability.

Requirements

  • Proven ability to implement a comprehensive strategy for metrics, logs, and traces.
  • Strong "you build it, you run it" philosophy and a deep sense of ownership.
  • Systems thinking and a builder's mindset driven by technical curiosity.
  • Must be based in or have a connection to Berlin or Cologne for on-site requirements.

Nice to have

  • Experience designing production-grade setups on GCP or AWS.
  • Active contributions to open-source projects.
  • Passion for root-cause analysis and conducting blameless post-mortems.

Culture & Benefits

  • High-ownership environment with no micromanagement or unnecessary bureaucracy.
  • "Build > Talk" culture that prioritizes testing and rapid learning.
  • 30 days of vacation plus December 24th and 31st off.
  • Smart Fridays allowing for a potential 4-day work week.
  • Mobility support including Germany ticket and JobRad.
  • Investment in professional growth via hackathons and conferences.
