(Senior) Cloud Infrastructure Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
(Senior) Cloud Infrastructure Engineer (AI): Operate high-scale production environments on AWS ECS Fargate and ClickHouse Cloud for LLM observability platform with an accent on uptime, performance, cost efficiency, and self-hosting deployments. Focus on building world-class observability with Datadog, automating CI/CD and infrastructure-as-code, scaling for 10x growth, and hardening security/compliance for enterprises.
Location: Europe (Berlin, London, Munich, Paris, Zurich); EU timezones required; one week per month in Berlin office
Salary: €90K - €160K
Company
Open Source LLM Engineering Platform now part of ClickHouse, processing billions of trace events monthly, trusted by 19 Fortune 50 companies.
What you will do
- Own Cloud operations: manage deployments, autoscaling, capacity planning, and cost optimization on AWS ECS Fargate and ClickHouse Cloud.
- Build observability: own Datadog dashboards, alerts, SLOs to detect issues proactively.
- Maintain self-hosting: evolve Helm charts, Docker Compose, and documentation for seamless deployments from single-node to multi-region.
- Automate infrastructure: implement CI/CD pipelines, IaC with Terraform/Pulumi, zero-downtime deployments.
- Scale ahead: anticipate needs for new features like agent observability and real-time evaluation at 10x scale.
- Harden security: ensure cloud and self-hosted setups meet enterprise compliance standards.
Requirements
- EU timezones and one week per month in Berlin office
- Strong infrastructure/SRE experience operating production workloads at scale on AWS (ECS/Fargate, networking, IAM, S3) or equivalent.
- Container orchestration: Kubernetes/ECS, Helm, Docker.
- Infrastructure-as-code: Terraform, Pulumi, CloudFormation.
- Monitoring/observability: built effective dashboards and alerts (Datadog plus).
- Self-organized, with strong opinions on reliability, automation, safe changes; interest in open source and user support.
- Thrives in small, accountable teams; CS/quantitative degree preferred.
Nice to have
- ClickHouse Cloud or managed analytical databases experience.
- High-throughput event processing or observability infrastructure background.
- Open source contributions to infra tooling (Helm, Terraform).
- Former founder.
Culture & Benefits
- Engineering-heavy team in Berlin/SF; trust in ownership with RFCs, whiteboarding support, maker schedule.
- Minimal meetings: Monday priorities (15min), Friday demos (60min); code reviews as mentorship.
- Use AI in workflows; ship daily, customer-focused, open source DX emphasis.
- Continuous learning in fast AI space; close collaboration with ClickHouse team.
Hiring process
- Full process to offer in less than 7 days.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →