Senior Infra Engineer (Observability)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Infra Engineer (Observability): Build ingestion pipelines for 1M+ RPS streams of logs, metrics, and telemetry, scalable fault-tolerant alerting engines, and rich backend observability APIs with an accent on distributed systems resilience and real-time user notifications. Focus on crafting Golang/Rust gRPC services, defining immutable infrastructure with Terraform and Ansible, and providing APIs for dashboard integration.
Location: Fully remote, distributed team across the globe
Company
is a startup platform empowering software engineers with comprehensive infrastructure tools for deployment, networking, and observability to enable higher leverage.
What you will do
- Build ingestion pipelines handling 1M+ RPS for logs, metrics, and telemetry data
- Develop scalable, fault-tolerant alerting engines for real-time threshold breach notifications
- Craft backend observability APIs collaborating with product teams for intuitive application insights
- Provide real-time log and metrics stream APIs for dashboard and product consumption
- Build Golang/Rust gRPC services supporting tens of thousands of users at scale
- Define immutable infrastructure using Terraform and Ansible for failover and reconstitution
- Write Engineering Requirement Documents from ideation to implementation and monitoring
- Interface with TypeScript/GraphQL edge for microservice API exposure
Requirements
- Strong understanding of distributed systems for building fault-tolerant, resilient, scalable services
- Interest in VictoriaMetrics, ClickHouse, and observability stacks
- Intuition for solution longevity in fast-scaling startups
- Ability to implement solutions, create monitors for error boundaries, and document handoffs
- Strong prioritization and direction in ambiguous early-stage environments
- Grit to dive deep, scale solutions, and replace them as needed
- Excellent communication for collaboration and implementation
Culture & Benefits
- High ownership and autonomy culture with minimal meetings (Monday/Friday company boards only)
- Distributed global team requiring diligent boundary management due to time zone overlaps
- Best-in-class compensation including great salary, full health benefits for dependents, strong equity, equipment stipend
- Focus on novel problems, creative high-leverage solutions, and personal/professional growth
- Small passionate team (21 people) serving hundreds of thousands of users with high impact
Hiring process
- Open-ended chat about the role, your background, and goals
- Asynchronous small project: design observability engine for compute workloads, followed by 60-min interview to build/expand solution and Q&A
- Team review of your solution focusing on problem-solving and presentation
- Meet the team (4 members from different areas) to assess collaboration and communication
- 30-min 1:1 with CEO for open conversation
- Offer call to discuss details and onboarding
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →