Mechanistic Interpretability: Finding Universal Concepts in Large Language Models

Organisatsiooni nimi
Tartu NLP
Kokkuvõte
Think of a concept of a "cat". Can we extract it from language model representations? Does it appear across multiple models?
Lõputöö kaitsmise aasta
2024-2025
Juhendaja
Maksym Del
Suhtlemiskeel(ed)
inglise keel
Nõuded kandideerijale
You should understand and be able to tweak a Transformer architecture. The best fit is students who are good engineers aiming to possibly get a research publication.
Tase
Bakalaureus, Magister
Märksõnad

Kandideerimise kontakt

 
Nimi
Maksym Del
Tel
E-mail
maksym.del@ut.ee