Senior Platform Engineer (Cloud Workloads)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Platform Engineer (Cloud Workloads): Designing and maintaining observability pipelines and critical infrastructure for Azure and AWS workloads with an accent on Elastic Stack, SLO/SLI reporting, and incident response. Focus on building proactive support tooling, scaling observability gaps, and optimizing multi-tenant cloud workloads.
Location: Hybrid in San Jose, CA
Salary: $158,400 — $294,100 USD
Company
is the Data and AI Trust Company specializing in helping organizations ensure their data and AI are secure, resilient, and scalable.
What you will do
- Design and maintain observability pipelines using the Elastic Stack (Elasticsearch, Kibana, Fleet) across Azure and AWS workloads.
- Develop and own SLO/SLI dashboards and error budget reporting for BaaS platform services.
- Lead incident response for distributed, multi-tenant cloud workloads, including runbook creation and continuous improvement.
- Build proactive support tooling for pattern analysis and baseline deviation alerting to reduce reactive support burden.
- Manage Elastic Fleet agent policies, enrollment health, and log streaming pipelines across worker fleets.
- Collaborate with SRE, R&D, and Proactive Support teams to close observability gaps.
Requirements
- 5+ years of experience in cloud platform engineering, SRE, or infrastructure for commercial SaaS products.
- Deep hands-on experience with Elastic Stack: building dashboards, writing KQL/Query DSL, and managing Fleet.
- Proven experience operating and troubleshooting distributed, multi-tenant workloads on Azure and/or AWS.
- Strong understanding of Azure cloud services (AKS, Entra ID, Key Vault, Service Bus, Cosmos DB, etc.).
- Experience with IaC tools (Azure Bicep, Terraform) and CI/CD pipelines (Azure DevOps, GitHub Actions).
- Strong scripting skills in Bash, Python, or PowerShell.
Nice to have
- Familiarity with Data Platform products.
Culture & Benefits
- Unlimited paid time off, 12 paid holidays, and extra global e Days for self-care.
- Comprehensive medical, dental, and vision coverage starting on the first day.
- 401(k) retirement plan with company matching contributions.
- Paid parental leave (8 weeks for all parents, 16 weeks for birthing parents).
- Mental health support via Employee Assistance Program and virtual veterinary care.
- Access to on-demand learning libraries including LinkedIn Learning and O’Reilly.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →