Senior ML Platform Engineer (AI Engineering)
Job description
TL;DR
Senior ML Platform Engineer (AI Engineering): Build and operate a production-grade ML platform for sovereign AI cybersecurity products, with an emphasis on distributed training, model serving, and air-gapped deployment. The role focuses on designing scalable infrastructure for LLMs, optimizing inference latency, and ensuring reproducible experiment workflows.
Location: Tel Aviv
Company
The company is an AI cybersecurity firm that combines AI and human expertise to protect nations and critical infrastructure through a sovereign AI platform.
What you will do
- Build and operate ML training infrastructure, including distributed pipelines and compute scheduling.
- Own model serving and inference systems, focusing on packaging, autoscaling, and latency/cost optimization.
- Implement feature stores, model registries, and dataset versioning to enable reproducible experiments.
- Develop experiment tracking and evaluation infrastructure with automated evals and drift detection.
- Maintain production pipelines for training, fine-tuning, and serving domain models in mission-critical environments.
- Optimize the ML stack for training throughput and manage compute costs.
Requirements
- 5+ years in software engineering, with 2+ years focused on ML infrastructure, MLOps, or data-intensive systems.
- Strong proficiency in Python, distributed systems design, API design, and CI/CD.
- Experience with model serving frameworks such as Triton, TorchServe, vLLM, or Ray Serve.
- Knowledge of distributed training frameworks like PyTorch or JAX.
- Experience with ML lifecycle tools (MLflow, Weights & Biases) and data pipelines (Spark, Airflow/Dagster).
- Must be based in Tel Aviv.
Nice to have
- Experience operating in on-premise, private cloud, or air-gapped deployments.
- Hands-on experience with Kubernetes, AWS, and Terraform (IaC).
- Expertise in simulation environments, synthetic data generation, or reinforcement learning.
- Background in applied ML or data science.
Culture & Benefits
- Work on mission-critical sovereign AI products that impact national security.
- High-impact role turning research into reliable, production-grade intelligence.
- Collaborative environment working across Data Platform, AI, and DevOps teams.
- Emphasis on engineering craft, secure coding, and end-to-end production ownership.