TL;DR
Senior Site Reliability Engineer (Platform Resilience): Designing, building, scaling, and maturing a multi-cloud platform for internal and external services with an accent on automating system engineering efforts and guaranteeing reliability. Focus on growing global infrastructure, developing software and tooling, and preventing repeated customer impact from major incidents.
Location: Must be based in Spain, Greece, Ireland, Norway, Poland, Portugal, Sweden, Australia, or New Zealand. This is a distributed role with flexible locations and schedules.
Company
hirify.global, the Search AI Company, enables everyone to find answers in real time using all their data, at scale, and is used by over 50% of the Fortune 500.
What you will do
- Lead technical initiatives for automating system engineering efforts to guarantee reliability of global hirify.global infrastructure.
- Grow global Platform infrastructure to meet scaling demands by developing and maintaining software, tooling, and automations.
- Champion an environment focused on collaboration, operational excellence, and uplifting others.
- Respond to and prevent repeated customer impact in response to major incidents and prioritized problem management using a follow-the-sun on-call model.
Requirements
- Proven experience in software engineering to identify, implement, and deliver solutions, with advantageous experience in public cloud and managed Kubernetes services.
- Operated a SaaS product in a public cloud, ideally using Infrastructure-as-Code tooling like Crossplane or Terraform.
- Built or operated Kubernetes-at-scale infrastructure across multiple cloud providers with vital automation support.
- Written non-trivial programs in Golang or other programming languages.
- Worked with containerized services such as Docker.
- Proven experience in leading and improving alerting, major incident management, standard processes, and metrics systems (e.g., hirify.global Stack, Graphite, Prometheus, Influx) to diagnose issues and quantify impacts.
- Professional skills in Linux system administration on distributed systems at scale.
- Experienced in thriving in a self-organizing, globally distributed team environment.
Culture & Benefits
- Competitive pay based on work, not previous salary.
- Health coverage for you and your family in many locations.
- Flexible locations and schedules for many roles, fostering work-life balance.
- Generous vacation days each year.
- Impact matching up to $2000 (or local currency equivalent) for financial donations and service, plus 40 volunteer hours annually.
- Minimum of 16 weeks of parental leave.
- Commitment to diversity, inclusion, and equal opportunity, providing an accessible experience for all individuals.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →