Site Reliability Engineer (AWS)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Site Reliability Engineer (AWS): Building and scaling the reliability of production systems for a conversational voice intelligence platform with an accent on monitoring, incident response, and operational excellence. Focus on designing observability systems, leading root cause analysis, and ensuring enterprise-grade uptime for distributed APIs.
Location: Hybrid in Somerville, MA
Salary: $150,000 - $200,000
Company
is a leader in conversational voice intelligence, building a platform that enables enterprises to understand voice communication to detect harm and prevent fraud.
What you will do
- Own and operate production systems supporting APIs and enterprise products.
- Design and implement monitoring, alerting, and observability systems from the ground up.
- Lead incident response, root cause analysis, and postmortem processes.
- Establish and improve on-call rotations and operational workflows.
- Collaborate with engineers to deploy and scale distributed systems.
- Evaluate deployment models across cloud, on-prem, and hybrid environments.
Requirements
- Experience deploying and maintaining production software systems.
- Experience building monitoring and alerting systems for production environments.
- Proven track record with on-call rotations and incident response.
- Strong experience with AWS, Python, and Linux.
- Familiarity with CloudWatch, SNS, PagerDuty, or similar technologies.
- Ability to communicate effectively during high-pressure incidents.
Nice to have
- Deep experience with AWS EC2, load balancers, RDS, SQS, and SES.
- Proficiency with infrastructure-as-code tools like Terraform or CloudFormation.
- Experience supporting high-scale, distributed systems.
- Familiarity with hybrid or on-prem deployment models.
Culture & Benefits
- Competitive salary and equity packages.
- Full health, dental, and vision coverage, including HSA and FSA.
- Flexible PTO with a strong internal culture of taking time off.
- Hybrid work model with core in-office days and flexible remote options.
- Up to 8 weeks work-from-anywhere policy.
- Weekly team lunches and support for professional growth and continued learning.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →