Senior Site Reliability Engineer

181 688 - 225 000$

Формат работы

onsite

Тип работы

fulltime

Грейд

senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Senior Site Reliability Engineer: Building and scaling internal platform offerings to ensure the reliability and performance of applications with an accent on monitoring, alerting, and incident response systems. Focus on collaborating with software engineers to guide their design and improve systems as the company expands globally.

Location: San Francisco, California; Santa Clara, California; Seattle, WA

Salary: $181,688 - $213,750 in Seattle, WA; $191,250 - $225,000 in Santa Clara, CA or San Francisco, CA

Company

hirify.global connects founders, investors, and limited partners through world-class software, purpose-built for everyone in venture capital, private equity and private credit.

What you will do

Build and scale internal platform offerings (compute, storage and networking services) to ensure the reliability, and performance of applications.
Design and implement monitoring, alerting, and incident response systems.
Collaborate with application software engineers to guide their design and ensure it scales for what hirify.global needs in the long run.
Act as an agent of change and push boundaries to incrementally improve systems as the company expands globally.

Requirements

Extensive experience with cloud services such as AWS, Google Cloud Platform, or Azure, including services like EC2, S3, RDS, and Lambda. Experience with Kubernetes or other container orchestration is preferred.
Proficient in using tools such as Terraform, Ansible, or CloudFormation for managing and provisioning cloud infrastructure.
Experience with networking concepts and tools, including Container Network Interface (CNI), Network policy implementations. Experience with proxies and service mesh is a big plus.
Strong knowledge of monitoring tools and practices, such as Prometheus, Grafana, ELK Stack, or Datadog, and the ability to set up and maintain comprehensive monitoring solutions.
Proficiency in Python, with the ability to write efficient, maintainable, and scalable code.
Experience in designing, deploying, and maintaining API services, with a strong understanding of RESTful and/or GraphQL API design principles.
You use AI tools in your own day-to-day work in addition to enabling others. You're comfortable building agents to reduce toil and expect this to be a normal part of how you operate.

Culture & Benefits

Market competitive salary.
Equity for all full time roles.
Exceptional benefits.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →