Company hidden
2 days ago

Site Reliability Engineer (AWS/Kubernetes)

Work format: remote (USA only)
Employment type: full-time
Grade: senior
English: B2
Country: France
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Site Reliability Engineer (AWS/Kubernetes): Implementing and maintaining scalable infrastructure and systems to ensure the reliability, performance, and security of production environments, with an emphasis on applying software engineering principles to infrastructure and operational challenges. Focus on designing monitoring and alerting solutions, contributing to infrastructure as code initiatives, and improving SRE practices across the organization.

Location: Fully remote.

Company

hirify.global is a platform dedicated to helping people find jobs and improve their professional lives.

What you will do

  • Implement and maintain scalable infrastructure and systems for reliability, performance, and security.
  • Collaborate with Development and Security teams on infrastructure architecture, deployment, and operational requirements.
  • Design and implement monitoring, alerting, and observability solutions.
  • Contribute to incident management, capacity planning, and infrastructure as code practices.
  • Develop and maintain runbooks and documentation, and expand automation initiatives.
  • Provide technical mentorship to developers and lead knowledge sharing sessions.

Requirements

  • At least 4 years of infrastructure/systems engineering experience with a strong hands-on technical focus.
  • Experience building and maintaining large-scale distributed systems.
  • Proficiency in managing incident response in line with SLAs.
  • Experience implementing automation and self-healing systems, and developing utility scripts.
  • Ability to work remotely in both French and English.
  • Strong problem-solving skills and ability to troubleshoot complex systems issues.
  • Passion for building resilient systems, measuring reliability, and establishing sustainable operational practices.

Nice to have

  • Experience with Ruby, Elixir, or React.js.

Culture & Benefits

  • Work within the Platform Team, reporting to the Platform Engineering Manager.
  • Opportunity to contribute to incident management and drive resolution of technical issues.
  • Focus on operational excellence and process implementation.
  • Emphasis on cross-team collaboration and knowledge sharing.
  • Remote work context.

Hiring process

  • Resume attachment required.
  • Cover letter mandatory.
