TL;DR
Machine Learning Engineer (AI Inference): Leading the design and development of real-time inference services, the core engine powering algorithmic decision-making at scale. Focus on integrating ML models with business logic under strict SLAs, ensuring high availability, scalability, and observability of production systems.
Company
A leading mobile marketing and audience platform, empowers the app ecosystem with cutting-edge solutions in mobile marketing, audience building, and monetization.
What you will do
- Own and lead the design and development of low-latency Algo inference services handling billions of requests per day.
- Build and scale robust real-time decision-making engines, integrating ML models with business logic under strict SLAs.
- Design systems for model versioning, shadowing, and A/B testing at runtime.
- Ensure high availability, scalability, and observability of production systems.
- Continuously optimize latency, throughput, and cost-efficiency using modern tooling and techniques.
- Work independently while interfacing with cross-functional stakeholders from Algo, Infra, Product, Engineering, BA & Business.
Requirements
- B.Sc. or M.Sc. in Computer Science, Software Engineering, or a related technical discipline.
- 5+ years of experience building high-performance backend or ML inference systems.
- Deep expertise in Python and experience with low-latency APIs and real-time serving frameworks (e.g., FastAPI, Triton Inference Server, TorchServe, BentoML).
- Experience with scalable service architecture, message queues (Kafka, Pub/Sub), and async processing.
- Strong understanding of model deployment practices, online/offline feature parity, and real-time monitoring.
- Experience in cloud environments (AWS, GCP, or OCI) and container orchestration (Kubernetes).
Nice to have
- Experience working with in-memory and NoSQL databases (e.g. Aerospike, Redis, Bigtable) to support ultra-fast data access in production-grade ML services.
- Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry) and best practices for alerting and diagnostics.
- A strong sense of ownership and the ability to drive solutions end-to-end.
- Passion for performance, clean architecture, and impactful systems.
Culture & Benefits
- Polish public holidays.
- 20 working days per year is Non-Operational Allowance and settled to be used for personal recreation matters and are compensated in full.
- Health Insurance.
- Gym Subscription (Multisport).
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →