Engineering Manager - Site Reliability Engineering/SRE
Job description
TL;DR
Engineering Manager (SRE): Nurturing and supporting a collaborative team that ensures the reliability, scalability, and performance of our products and equips engineers with well-designed tools and workflows, with an emphasis on cultivating and growing SRE practices. The role focuses on architectural transformation and on ensuring our systems meet their availability targets.
Location: Hybrid work environment in Berlin. We love to build real connections and welcome everyone to our beautiful Berlin office on certain days.
Company
’s voice-first GenAI platform for contact centers is built on the best AI technology to automate customer service with natural-sounding conversations for outstanding experiences on all communication channels.
What you will do
- Build and nurture a supportive team that harmonizes SRE excellence with developer experience.
- Collaborate to establish SRE practices: SLI/SLOs, error budgets, and trust-based postmortems using Datadog metrics.
- Create comprehensive observability strategies leveraging our monitoring stack.
- Support sustainable incident response, on-call processes, and automation using GitHub Actions and Terraform to improve MTTR.
- Partner with engineering teams to integrate reliability practices into CI/CD pipelines (ArgoCD, GitHub Actions) while supporting developer wellbeing.
- Guide teams in leveraging our Azure cloud platform effectively while preparing for multi-cloud architectures.
Requirements
- Experience supporting SRE, DevOps, or platform teams with a focus on reliability and collaboration.
- Understanding of SRE principles: SLI/SLOs, error budgets, and toil reduction.
- Hands-on experience with our observability stack (Datadog for metrics/APM, ELK for sensitive logs) and production systems at scale.
- Deep empathy for developer workflows and a commitment to creating sustainable on-call processes that support work-life balance.
- Familiarity with Infrastructure as Code using Terraform and container orchestration with Kubernetes.
- Experience with CI/CD platforms (GitHub Actions, ArgoCD) and integrating reliability into deployment pipelines.
Nice to have
- Background with databases (MySQL, Redis, MongoDB) and their reliability considerations is valued.
- Experience with multi-cloud architectures and distributed systems is warmly welcomed.
Culture & Benefits
- Join a diverse team of 40+ nationalities with flat hierarchies and a collaborative company culture.
- Opportunity to build and scale your career at the intersection of customer-facing roles and engineering in a dynamic startup on its journey to become an international leader in SaaS platforms for Conversational AI.
- Deutschlandticket, Urban Sports Club, JobRad, Nilo Health, and weekly sponsored office lunches.
- Competitive compensation and equity package.
- Flexible working hours, 28 vacation days, and workation opportunities.
- Access to a training and development budget for continuous professional growth.
Hiring process
- Recruiter video call → Meet your manager → Technical Interview + Technical Leadership Interview → Bar Raiser Interview