Institute of Computer Science - Graduation Theses Topics Registry

Graduation theses topics (Submit a thesis topic) Completed theses (Submit your thesis)

Leveraging Estonian Olympiad Problems for Evaluating Large Language Models

Organization

TartuNLP

Abstract

Estonian Olympiad problems present a valuable data source for evaluating the performance of Large Language Models (LLMs). This thesis involves identifying relevant Olympiad tasks, collecting and processing data into benchmarks, and evaluating both open-source and commercial LLMs. The outcomes will provide insights into the models' capabilities in handling complex and possibly multimodal tasks in the Estonian language.

Graduation Theses defence year

2024-2025

Supervisor

Taido Purason, Hele-Andra Kuulmets

Spoken language (s)

Estonian, English

Requirements for candidates

Familiarity with machine learning and NLP. Experience with website/PDF scraping, LLM inference, and the HuggingFace ecosystem will be useful.

Level

Bachelor, Masters

Keywords

#LLMs

Application of contact

Name

Taido Purason

Phone

E-mail

taido.purason@ut.ee

UT Institute of Computer Science Graduation Theses Topics Registry

Leveraging Estonian Olympiad Problems for Evaluating Large Language Models

Application of contact