Exploring Cross-Lingual Named Entity Recognition: A Study of the ConNER Model for the Italian Language
Ferraresso, Francesca
2023/2024
Abstract
This dissertation provides a comprehensive overview of Named Entity Recognition (NER) in Natural Language Processing (NLP), covering its historical context, key principles, and current techniques. It focuses on cross-lingual NER models and explores how they leverage knowledge shared among languages to improve performance. We investigate the advantages and limitations of cross-lingual NER: its benefits, such as reduced data annotation and improved generalization, and its challenges, including language variation and resource availability. A pivotal aspect is the analysis of ConNER, a state-of-the-art cross-lingual NER model, with a focus on its performance in the Italian language. Our empirical study employs a modified MultiNERD dataset covering English, German, French and Spanish, shedding light on ConNER's adaptability to other languages. Ultimately, this research aims to enrich NER methodology, offering insights into the potential of cross-lingual approaches for improving NER systems.

File | Access | Type | Size | Format |
---|---|---|---|---|
866698-1278844.pdf | Open access | Other attached material | 1.89 MB | Adobe PDF |
Documents in UNITESI are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/20.500.14247/16441