Назад
Company hidden
7 месяцев назад

Site Reliability Engineer

230 000 - 310 000$
Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Site Reliability Engineer (DevOps): Own and improve the reliability, availability, and performance of a large-scale backend platform on AWS with an accent on automation, observability, and incident management. Focus on designing resilient systems, reducing toil, and scaling infrastructure to support millions of users.

Location: San Francisco, CA, onsite 4–5 days per week

Salary: $230,000–$310,000 plus equity

Company

hirify.global is a product company building AI-powered creative tools for modern communication, valued at $2.1B with over $100M ARR and profitable since 2023.

What you will do

  • Own reliability, availability, and performance of production systems primarily on AWS infrastructure
  • Build observability infrastructure with metrics, logging, tracing, and alerting
  • Design automation to reduce toil, improve deployment safety, and accelerate incident resolution
  • Lead incident response, conduct blameless post-mortems, and drive systemic improvements
  • Partner with engineering teams on architecture reviews, SLOs/SLIs, and reliability best practices
  • Manage and optimize infrastructure including compute, networking, databases, and managed services

Requirements

  • Must have 5+ years experience in Site Reliability Engineering, DevOps, or systems engineering with deep AWS expertise
  • Strong programming skills in Python, Go, or TypeScript/Node.js for automation
  • Experience with infrastructure-as-code tools like Terraform and CloudFormation
  • Solid understanding of networking, distributed systems, containerization (Docker, Kubernetes), and database performance
  • Strong incident management and debugging skills for complex production issues
  • Work onsite in San Francisco 4–5 days per week

Nice to have

  • Experience scaling SaaS applications to millions of users
  • Background with real-time collaborative systems, Kafka, chaos engineering, or service mesh technologies
  • AWS certifications or experience with security/compliance requirements (SOC 2, ISO 27001)

Culture & Benefits

  • Warm, quirky, and curious culture valuing creativity and craftsmanship
  • Small, passionate teams working on impactful projects
  • Flexibility to work from home when focus matters most
  • Fast-paced, creative, and occasionally chaotic environment

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →