Infrastructure Engineer (AI)
Job description
TL;DR
Infrastructure Engineer (AI): Operate and scale the cloud infrastructure for an open-source LLM observability platform, with an emphasis on AWS ECS Fargate and ClickHouse Cloud. The focus is on automating CI/CD pipelines, optimizing cost efficiency, and ensuring high availability for high-throughput event processing.
Location: Must be based in an EU timezone; hybrid setup with one week per month expected in the Berlin office
Salary: €90K – €160K + Equity
Company
Open-source LLM engineering platform (now part of ClickHouse) that helps teams build AI applications through tracing, evaluation, and prompt management.
What you will do
- Own cloud operations on AWS ECS Fargate and ClickHouse Cloud, managing autoscaling, capacity planning, and cost optimization.
- Build and maintain a world-class observability stack using Datadog, including dashboards, alerts, and SLOs.
- Scale and evolve the public self-hosted infrastructure, owning Helm charts and Docker Compose configurations.
- Automate all infrastructure processes, including CI/CD pipelines, IaC, and zero-downtime deployments.
- Harden security and compliance to meet the requirements of large enterprise organizations.
- Collaborate with the ClickHouse team to optimize the core database dependency of the stack.
Requirements
- Strong experience operating production workloads on AWS (ECS/Fargate, networking, IAM, S3).
- Proficiency with container orchestration using Kubernetes, ECS, Docker, and Helm charts.
- Experience with infrastructure-as-code tools such as Terraform or Pulumi.
- Proven ability to build monitoring and observability systems that effectively catch production issues.
- Strong ownership mindset and ability to ship infrastructure changes safely.
- Must be located in an EU timezone.
Nice to have
- Experience with ClickHouse Cloud or other managed analytical databases.
- Background in operating high-throughput event processing or observability infrastructure.
- Contributions to open-source infrastructure tooling.
- Previous experience as a founder.
Culture & Benefits
- High ownership environment where you identify problems, propose RFCs, and ship solutions.
- Maker's schedule with minimal meetings: only a Monday check-in and a Friday demo.
- Strong mentorship culture with a focus on code reviews and collaborative whiteboard sessions.
- AI-first workflow, encouraging the use of AI tooling to maximize efficiency.
- Direct impact and visibility, with features attributed to you in public changelogs.
Hiring process
- Accelerated process designed to go from initial contact to offer letter in less than 7 days.