Senior Infra Engineer: Baremetal Orchestration
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Infra Engineer (Baremetal Orchestration): Build and maintain host provisioning stack including PXE boot, Ansible, and orchestration engine for clusters, containers, and VMs with an accent on efficiency, resilience, and scalability. Focus on optimizing bin packing algorithms, internal tooling, observability, alerting, and immutable infrastructure using Terraform and Ansible.
Location: Globally distributed team; diligent about boundaries with end-of-day overlapping someone else's start.
Company
builds an all-encompassing infrastructure platform solving deployment, zero-downtime, service communications, and networking for software engineers.
What you will do
- Build and maintain host provisioning stack with PXE boot, Ansible, and burn-in agents for new bare metal.
- Evolve homegrown orchestration engine to manage clusters, containers, and VMs.
- Optimize bin packing algorithm for utilization, performance, and cost minimization.
- Own internal tooling for engineers interacting with the fleet.
- Build observability, alerting, and CI pipelines for infrastructure code.
- Design immutable infrastructure with Terraform and Ansible for tear-down and failover.
- Build Golang/Rust gRPC services supporting millions of users.
- Write Engineering Requirement Documents from idea to monitoring.
Requirements
- Strong understanding of distributed systems, fault tolerance, resilience, and scalability; care about 3am failures.
- Hands-on experience with bare metal provisioning, configuration management, and hardware production readiness.
- Comfort building and operating internal tools with focus on developer experience.
- Intuition on solution longevity in startups (12-18 months).
- Tact to implement solutions, monitor error boundaries, and document for absences.
- Strong prioritization in ambiguity, grit to solve/scale/replace, and communication skills.
Culture & Benefits
- High ownership, autonomy, and leverage-focused environment with few meetings (Monday/Friday company board).
- Best-in-class compensation: great salary, full health benefits including dependents, strong equity, equipment stipend.
- Globally distributed, fast-scaling team emphasizing systems over coordination and judgment over process.
- Novel problems in well-funded startup allowing creative, high-leverage solutions without busywork.
- Growth support whether staying or moving on.
Hiring process
- 1: Open-ended talk about you, role, and goals.
- 2: Asynchronous small project on orchestration engine design + 60-min interview to build/discuss.
- 3: Review solution with team member.
- 4: Meet 4 team members from different areas.
- 5: 1:1 with CEO.
- 6: Offer call and onboarding.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →