TL;DR
Senior Platform Monitoring Engineer: Leading incident investigation and designing observability solutions for a data and AI infrastructure platform with an accent on rapid detection, mitigation, and resolution of incidents. Focus on conducting root cause analysis, building automation, and enhancing customer experience and platform stability.
Location: Hybrid role based in Amsterdam, Netherlands, requiring office attendance at least 3x a week.
Company
hirify.global is a data and AI company that empowers data teams to solve challenging problems by providing a leading data and AI infrastructure platform.
What you will do
- Lead platform incident investigation, coordinating cross-functional teams for rapid detection, mitigation, and resolution.
- Conduct thorough post-incident root cause analysis across infrastructure, services, and cloud providers.
- Design and implement customer-focused alerting pipelines and end-to-end observability workflows.
- Build automation tools, establish reusable monitoring patterns, and resolve reliability gaps impacting customer experience.
Requirements
- Minimum of 5 years of experience as an SRE, DevOps Engineer, or Production Engineer.
- Production-level experience with at least one major cloud provider (AWS, Azure, GCP).
- Proficiency in container and orchestration technologies (Docker, Kubernetes).
- Hands-on experience with monitoring, logging, and alerting tools such as ELK, Prometheus, Grafana, PagerDuty.
- Strong proficiency in Python or similar languages with the ability to build production-quality automation tools.
- Experience owning critical phases of the incident lifecycle from detection through resolution and post-mortem analysis.
- Must be based in Amsterdam, Netherlands, and attend the office at least 3x a week.
Culture & Benefits
- Comprehensive benefits and perks are provided to meet the needs of all employees.
- Opportunity to work on challenging problems in the data and AI infrastructure space.
- Committed to fostering a diverse and inclusive culture.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →