Named Entity Recognition (NER) on Estonian Social Media: A Benchmark Dataset and Baselines

Organisation name
TartuNLP
Summary
Named Entity Recognition (NER) is a fundamental task in Natural Language Processing (NLP). It identifies mentions of named entities in a segment of text and assigns each mention a type, such as person, location or organization.
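As a rough illustration of the task, the sketch below decodes token-level tags into typed entity spans. It assumes the common BIO tagging scheme and the labels PER and LOC; the proposal fixes neither the scheme nor the label set. The function name `bio_to_spans` and the example sentence are made up for illustration.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (entity text, entity type) pairs.

    Hypothetical helper: the BIO scheme and label names are assumptions,
    not part of the proposal.
    """
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Close the entity that is currently open, if any.
        if current_tokens and current_type is not None:
            spans.append((" ".join(current_tokens), current_type))

    for token, tag in zip(tokens, tags):
        starts_new = tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != current_type
        )
        if starts_new:
            # A B- tag, or an I- tag of a different type, opens a new entity.
            flush()
            current_tokens = [token]
            current_type = tag[2:]
        elif tag.startswith("I-"):
            # Continuation of the entity that is already open.
            current_tokens.append(token)
        else:
            # An "O" tag (outside any entity) closes the open entity.
            flush()
            current_tokens = []
            current_type = None

    flush()
    return spans


# Made-up, tweet-like example with assumed labels PER and LOC.
tokens = ["Mari", "Tamm", "moved", "to", "Tartu", "#newjob"]
tags = ["B-PER", "I-PER", "O", "O", "B-LOC", "O"]
print(bio_to_spans(tokens, tags))
# [('Mari Tamm', 'PER'), ('Tartu', 'LOC')]
```

Token-level tags of this kind are one common way to store NER annotations; whether the dataset would use them is an open design choice.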

This proposal has three parts: i) creating a benchmark dataset by annotating Estonian tweets, ii) proposing a coarse-grained entity taxonomy, and iii) establishing neural baselines.

Outcome: a benchmark dataset and baselines for NER on Estonian social media data.
Thesis defence year
2022-2023
Supervisor
Somnath Banerjee
Communication language(s)
Requirements for candidates
Level
Bachelor's, Master's
Keywords
#SocialNLP, #SocialMedia

Application contact

 
Name
Somnath Banerjee
Phone
E-mail
somnath.banerjee@ut.ee
More information
https://sites.google.com/view/somnath-banerjee