Staff Site Reliability Engineer (Cloud)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff Site Reliability Engineer (Cloud): Building and maintaining real-time scalable infrastructure across own hardware, multiple locations, and major cloud providers with an accent on observability tooling, automated processes, and configuration as code. Focus on monitoring platform stability, deep-diving into networking and database strategies, and expanding cloud deployments.
Location: Remote – United States
Salary: $165,000 - $200,000
Company
is the network intelligence platform unifying cloud, device, flow, and synthetic data to demystify complex network operations for modern infrastructure teams.
What you will do
- Ensure real-time scalable infrastructure is set up for growth and efficiency across hardware and all major cloud vendors.
- Develop tools and processes to monitor and stabilize the platform during rapid growth.
- Deep-dive into topics like firewalls, IP routing, database replication, and build automation.
- Collaborate with engineering and infrastructure teams on operational solutions.
- Assist in expanding cloud deployments and contribute code, reviews, and design documents.
- Provide feedback on team goals, projects, and processes for continuous improvement.
Requirements
- 8+ years in cloud-based Systems Administration, IT, or SRE projects.
- Expertise in public clouds like AWS, GCP, Azure, or OCI.
- Strong skills in Docker, Kubernetes, Bash, Python, or Go for containerization, orchestration, and automation.
- Proficiency in IaC with Terraform, Ansible, Puppet, and Linux administration.
- Understanding of internet protocols (TCP/IP, DNS, HTTP, TLS) and networking (routing, firewalls).
- Experience with metrics monitoring like Grafana, Prometheus, Telegraf, OpenTelemetry.
- Passion for documentation and managing vendor relationships.
Nice to have
- Kubernetes automation with Helm and Helmfile for complex deployments.
- Scaling Kubernetes workloads and CI/CD optimization with GitHub Actions or Jenkins.
- Exposure to PagerDuty integrations and SRE/DevOps/GitOps practices.
Culture & Benefits
- 100% company-paid health, vision, dental premiums for you and dependents, plus HRA ($3,000 individual/$4,500 family).
- Paid family & medical leave, open PTO, quarterly Wellness Day, 10+ paid holidays.
- 401(k) retirement account, home office reimbursement, stock options.
- Fully remote global company with emphasis on collaboration, independence, and growth.
- Inclusive culture committed to diversity, belonging, and underrepresented groups.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →