Leveraging Estonian Olympiad Problems for Evaluating Large Language Models

Organisatsiooni nimi
TartuNLP
Kokkuvõte
Estonian Olympiad problems present a valuable data source for evaluating the performance of Large Language Models (LLMs). This thesis involves identifying relevant Olympiad tasks, collecting and processing data into benchmarks, and evaluating both open-source and commercial LLMs. The outcomes will provide insights into the models' capabilities in handling complex and possibly multimodal tasks in the Estonian language.
Lõputöö kaitsmise aasta
2024-2025
Juhendaja
Taido Purason, Hele-Andra Kuulmets
Suhtlemiskeel(ed)
eesti keel, inglise keel
Nõuded kandideerijale
Familiarity with machine learning and NLP. Experience with website/PDF scraping, LLM inference, and the HuggingFace ecosystem will be useful.
Tase
Bakalaureus, Magister
Märksõnad
#LLMs

Kandideerimise kontakt

 
Nimi
Taido Purason
Tel
E-mail
taido.purason@ut.ee