Site Reliability Engineer
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Site Reliability Engineer (SRE): Ensuring SaaS application stability, scalability, and resilience across cloud environments with an accent on automation, monitoring/alerting, and incident response. Focus on building real-time dashboards, leading root cause analysis, and driving reliability improvements including disaster recovery and operational playbooks.
Location: Hybrid (Denver, Colorado or Raleigh, North Carolina); candidates should reside within reasonable commuting distance and provide on-site presence at least three days per week.
Salary: $120,000–$150,000 per year
Company
delivers legal technology solutions for law firms and corporate legal teams.
What you will do
- Own SaaS application reliability and architecture
- Improve operational efficiency using automation and workflow tooling
- Build and maintain dashboards for real-time monitoring
- Lead root cause analysis, remediation, and cross-team incident resolution
- Participate in a 24/7 on-call rotation and respond to escalated customer-facing issues
- Manage disaster recovery and operational playbooks while enforcing security and compliance standards
Requirements
- 5 years of experience as an SRE
- Experience with log analysis and software remediation
- Experience with configuration management tools (Terraform, Puppet, Ansible)
- Experience working with cloud platforms (Azure/AWS) and remote collocated systems
- Deep knowledge of monitoring and alerting tools (New Relic, Datadog, Dynatrace)
- Strong troubleshooting skills for operating systems (Windows, Linux) and clear cross-team communication
Nice to have
- Experience designing and troubleshooting large-scale distributed systems
- Knowledge of mainstream databases (querying and tuning)
- Experience in regulated environments (GDPR, SOX, HIPAA, PCI)
- AWS/Azure certifications
- Hands-on software development and networking background
Culture & Benefits
- Health insurance, dental, and vision insurance
- 401(k) with company contribution and retirement savings plans
- Generous paid time off and supportive work-life balance
- Company bonus plan eligibility and incentive/recognition programs
- Career growth with paths to technical and leadership roles
Hiring process
- Interviews to assess SRE experience, incident/reliability problem-solving, and cloud/automation skills
- Evaluation of communication and ability to collaborate across teams
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →