Open LLM for baltic languages
ChatGPT is cool but closed behind an API. Llama2 is cool and open, but does not support Estonian. This thesis will try to train an LLM for Estonian and other Baltic and neighboring languages using the LUMI supercomputer.
Graduation Theses defence year
Hele-Andra Kuulmets, Mark Fishel
Spoken language (s)
Requirements for candidates
Python, HPC, text data processing, neural net / transformer training
Application of contact