Engineering Manager (Infrastructure Platform/SRE)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Engineering Manager (Infrastructure Platform/SRE): Lead infrastructure platform engineering and establish a Site Reliability function to enable self-service environments and operational excellence for SaaS product delivery, with an accent on infrastructure-as-code, Kubernetes operations at scale, and incident/SLO-driven reliability. Focus on building design-before-build engineering rigor, professionalizing incident management, and balancing developer enablement with roadmap execution in a remote environment.
Company
is a remote cybersecurity company building the NodeZero platform for production-safe autonomous pentesting and scalable security assessment operations.
What you will do
- Lead infrastructure platform engineering teams delivering infrastructure-as-code (IaC) components and frameworks for self-service provisioning.
- Set governance, best practices, and controls so application feature teams can safely self-service infrastructure.
- Establish and professionalize an SRE function: hire SRE staff and a line manager, define incident processes, and drive adoption across engineering.
- Drive engineering excellence through design-before-build discipline, architecture decision records/RFCs, design/code reviews, and blameless retrospectives.
- Manage a growing team of infrastructure engineers and engineering managers: intake, prioritization, backlog visibility, hiring, coaching, and retention.
Requirements
- Demonstrated experience leading teams operating SaaS service infrastructure.
- Deep hands-on experience deploying and operating production infrastructure on public cloud platforms (AWS strongly preferred; Azure and GCP familiarity a plus).
- Strong Infrastructure as Code skills (Terraform); experience with Crossplane and GitOps patterns strongly preferred.
- Experience managing production Kubernetes environments at scale.
- Solid security knowledge (zero trust, secrets management, IAM, and software supply chain security).
- Experience building/leading SRE functions including incident management, on-call programs, SLO/SLA definition, and operational runbooks; strong observability experience (logs, traces, golden signals, service metrics).
Culture & Benefits
- Remote role with occasional business travel up to 10%.
- Competitive base salary plus equity (stock options) and benefits.
- Health, vision & dental insurance for you and your family, flexible vacation policy, and generous parental leave.
- Hybrid & remote work model depending on role and location, including a Chicago office for roles that require regular in-office presence.
Hiring process
- Interviews and evaluation of technical leadership, SRE/infrastructure experience, and ability to lead remote teams.
- Discussion of role expectations around roadmap execution, incident professionalism, and engineering rigor.
Location: Remote (US). Occasional business travel up to 10%.
Salary: $260,000 - $280,000 base (equity/stock options and benefits eligible for full-time roles).
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →