Infrastructure Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Infrastructure Engineer (AI): Leading the scaling of operational resilience for an AI workforce platform with an accent on stability, observability, and debugging workflows. Focus on untangling complex failures, designing internal tooling, and improving developer focus and system uptime.
Location: Onsite in San Francisco
Salary: $200K – $240K
Company
is the infrastructure for enterprises to build and orchestrate AI workforces, powering critical operations for global enterprises worldwide.
What you will do
- Take the lead on scaling operational resilience as the company grows.
- Own the stability, observability, and debugging workflows that keep systems running smoothly.
- Untangle complex failures in real time and design tools that turn chaos into clarity.
- Help shift from reactive to proactive operations, reducing incident load and improving system uptime.
- Build internal tooling to directly improve developer focus.
Requirements
- 3+ years of hands-on experience debugging production systems (logs, traces, incidents, etc.).
- Strong problem-solving skills and ability to dive into unfamiliar backend codebases.
- Strong Go and Kubernetes experience.
- Familiarity with observability and monitoring tools (e.g., Datadog, Prometheus, Sentry).
- Clear, calm communication under pressure — especially during live incidents.
Nice to have
- Experience working with distributed systems or services at scale.
- Built or maintained internal tooling for on-call teams or reliability workflows.
- Familiarity with deployment pipelines, CI/CD, or infra-as-code.
- Experience improving system observability (e.g., custom metrics, traces, log pipelines).
Culture & Benefits
- Opportunity to work at a high-growth AI startup, backed by top investors.
- Competitive salary + equity in a high-growth startup.
- Take full ownership of projects and ship fast.
- Join a world-class team of engineers and builders.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →