Data Science Intern
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Data Science Intern (AI/LLM): Annotating datasets and preprocessing language-specific data in Bahasa Indonesia to enhance LLM performance with an accent on grammar, syntax, cultural nuances, and dialects. Focus on fine-tuning models, evaluating outputs for accuracy and appropriateness, and curating evaluation sets.
On-site in Jakarta, Indonesia
Company
is Indonesia's largest digital ecosystem offering mobility, delivery, payments, financial services, e-commerce, and merchant solutions.
What you will do
- Annotate datasets in Bahasa Indonesia to improve multilingual LLM processing.
- Clean, preprocess, and validate language-specific data for training.
- Analyze grammar, syntax, idioms, slang, and cultural nuances.
- Assist in LLM fine-tuning by creating and refining datasets.
- Develop preprocessing strategies like tokenization and stemming.
- Curate evaluation sets for Bahasa and dialects.
- Review LLM outputs for accuracy, grammar, and cultural fit as a human evaluator.
Requirements
- Final year student in mathematics, computer science, computer engineering, or related fields.
- Proficiency in Python and basic data processing.
- Attention to detail for high data quality.
- Strong communication for collaboration and documentation.
- Ability to work independently and in a team.
- Available for 6-month internship in Jakarta.
Culture & Benefits
- Join the Data Science team building ML solutions for GoTo's payment ecosystem.
- Collaborative environment with team discussions, forums, and knowledge-sharing.
- Fast-paced setting focused on real business challenges for GoPay.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →