Data Scientist (NLP, Generative AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Data Scientist (NLP, Generative AI): Design and build machine learning, NLP, and generative AI solutions for scientific discovery, knowledge extraction, decision support, and intelligent content understanding with an accent on large-scale heterogeneous scientific data and production-ready model integration. Focus on solving complex information retrieval, entity extraction, ranking/summarization, and evidence-grounded generation challenges while delivering reliable systems through evaluation, fine-tuning, and continuous monitoring.
Company
builds information and analytics products for researchers and healthcare professionals.
What you will do
- Design and build machine learning, NLP, and generative AI systems for scientific discovery, knowledge extraction, decision support, and intelligent content understanding.
- Work with large-scale heterogeneous data (scientific publications, datasets, knowledge graphs, ontologies, taxonomies, citations, metadata, and content across disciplines).
- Apply appropriate techniques (classification, regression, clustering, ranking, feature engineering, deep learning, embeddings, LLMs, retrieval, and generative AI) to solve complex problems.
- Develop capabilities for semantic search, information retrieval, entity extraction, content classification, recommendation/ranking, summarization, question answering, and evidence-grounded generation.
- Build, evaluate, fine-tune, prompt, and integrate models into robust production systems; improve quality, relevance, reliability, and user value.
- Write clean, tested, production-quality Python and contribute reusable data science components and scalable data pipelines for preprocessing, inference, experimentation, monitoring, and continuous improvement.
Requirements
- Experience in data science, machine learning/AI, NLP, statistics, applied mathematics, computer science, or a related quantitative field.
- Experience with frontier LLMs (e.g., OpenAI GPTs, Anthropic Claude, Google Gemini), including fine-tuning LLMs and/or SLMs.
- Strong Python skills with a habit of writing clean, maintainable, well-tested code.
- Solid grasp of ML fundamentals (supervised/unsupervised learning, feature engineering, evaluation, model selection, and performance measurement).
- Experience with structured, semi-structured, or unstructured large-scale text/content datasets.
- Familiarity with common tools such as Pandas, NumPy, SciPy, scikit-learn, PyTorch, TensorFlow, and Matplotlib.
Culture & Benefits
- Healthy work/life balance with flexible working hours.
- Wellbeing initiatives, shared parental leave, study assistance, and sabbaticals.
- Country-specific benefits based on location.
Hiring process
- Interviews and evaluation of technical fit for building production-ready ML/NLP/GenAI systems.
- Discussion of collaboration approach and ability to translate ambiguous requirements into measurable outcomes.
Location: NLD Amsterdam (Radarweg)
Salary: €44,500 - €74,300 (base pay range)
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →