Senior Site Reliability Engineer
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Site Reliability Engineer: Building and scaling internal platform offerings to ensure the reliability and performance of applications with an accent on monitoring, alerting, and incident response systems. Focus on collaborating with software engineers to guide their design and improve systems as the company expands globally.
Location: San Francisco, California; Santa Clara, California; Seattle, WA
Salary: $181,688 - $213,750 in Seattle, WA; $191,250 - $225,000 in Santa Clara, CA or San Francisco, CA
Company
connects founders, investors, and limited partners through world-class software, purpose-built for everyone in venture capital, private equity and private credit.
What you will do
- Build and scale internal platform offerings (compute, storage and networking services) to ensure the reliability, and performance of applications.
- Design and implement monitoring, alerting, and incident response systems.
- Collaborate with application software engineers to guide their design and ensure it scales for what needs in the long run.
- Act as an agent of change and push boundaries to incrementally improve systems as the company expands globally.
Requirements
- Extensive experience with cloud services such as AWS, Google Cloud Platform, or Azure, including services like EC2, S3, RDS, and Lambda. Experience with Kubernetes or other container orchestration is preferred.
- Proficient in using tools such as Terraform, Ansible, or CloudFormation for managing and provisioning cloud infrastructure.
- Experience with networking concepts and tools, including Container Network Interface (CNI), Network policy implementations. Experience with proxies and service mesh is a big plus.
- Strong knowledge of monitoring tools and practices, such as Prometheus, Grafana, ELK Stack, or Datadog, and the ability to set up and maintain comprehensive monitoring solutions.
- Proficiency in Python, with the ability to write efficient, maintainable, and scalable code.
- Experience in designing, deploying, and maintaining API services, with a strong understanding of RESTful and/or GraphQL API design principles.
- You use AI tools in your own day-to-day work in addition to enabling others. You're comfortable building agents to reduce toil and expect this to be a normal part of how you operate.
Culture & Benefits
- Market competitive salary.
- Equity for all full time roles.
- Exceptional benefits.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →