Senior Platform Engineer (Cloud)
Job description
TL;DR
Senior Platform Engineer (Cloud): Build and maintain observability infrastructure and incident response systems for BaaS platform services on Azure and AWS, with an emphasis on the Elastic Stack and multi-tenant cloud workloads. The role focuses on designing SLO/SLI dashboards, refining proactive support tooling, and scaling observability pipelines.
Location: Remote (US)
Salary: $172,800 — $320,900 USD
Company
The company describes itself as the Data and AI Trust Company, specializing in data resilience and security posture management to enable safe AI at scale.
What you will do
- Design and maintain observability pipelines using the Elastic Stack (Elasticsearch, Kibana, Fleet) across Azure and AWS workloads.
- Develop SLO/SLI dashboards and error budget reporting for BaaS platform services.
- Lead incident response for distributed, multi-tenant cloud workloads, including runbook creation and continuous improvement.
- Build proactive support tooling, including pattern analysis, tenant correlation dashboards, and baseline deviation alerting.
- Manage Elastic Fleet agent policies, enrollment health, and log streaming pipelines across worker fleets.
- Partner with SRE, R&D, and Proactive Support teams to close observability gaps and integrate admin portal workflows.
Requirements
- 5+ years of experience in cloud platform engineering, SRE, or infrastructure roles supporting commercial SaaS products.
- Deep hands-on experience with Elastic Stack: building dashboards, writing KQL/Query DSL, and managing Fleet.
- Proven experience operating and troubleshooting distributed, multi-tenant workloads on Azure and/or AWS.
- Strong understanding of Azure cloud services, including AKS, Entra ID, Key Vault, Service Bus, and Cosmos DB.
- Experience with IaC tools (Azure Bicep, Terraform) and CI/CD pipelines (Azure DevOps, GitHub Actions).
- Strong scripting skills in Bash, Python, or PowerShell.
Nice to have
- Familiarity with Data Platform products.
Culture & Benefits
- Unlimited paid time off, 12 paid holidays, and 24 paid volunteer hours annually.
- Comprehensive medical, dental, and vision coverage starting on the first day.
- 401(k) retirement plan with company matching contributions.
- Paid parental leave (8 weeks for all parents, 16 weeks for birthing parents) and fertility support via Maven.
- Mental health support, therapy sessions, and digital wellness tools.
- Access to on-demand learning libraries (LinkedIn Learning, O’Reilly) and an annual Global Day of Learning.