This dissertation provides a comprehensive overview of Named Entity Recognition (NER) in Natural Language Processing (NLP), encompassing its historical context, key principles, and cutting-edge techniques. It focuses on cross-lingual NER models, exploring how they leverage shared knowledge among languages to enhance performance. We investigate the advantages and limitations of cross-lingual NER, considering reduced data annotation and improved generalization and their challenges, including language variations and resource availability. A pivotal aspect is the analysis of ConNER, a state-of-the-art cross-lingual NER model, with a focus on its performance in the Italian language. Our empirical study employs a modified MultiNERD dataset covering English, German, French and Spanish, shedding light on ConNER's adaptability to other languages. Ultimately, this research aims to enrich NER methodology, offering insights into the potential of cross-lingual approaches for improving NER systems.

Exploring Cross-Lingual Named Entity Recognition: A Study of the ConNER Model for the Italian Language

Ferraresso, Francesca
2023/2024

Abstract

This dissertation provides a comprehensive overview of Named Entity Recognition (NER) in Natural Language Processing (NLP), encompassing its historical context, key principles, and cutting-edge techniques. It focuses on cross-lingual NER models, exploring how they leverage shared knowledge among languages to enhance performance. We investigate the advantages and limitations of cross-lingual NER, considering reduced data annotation and improved generalization and their challenges, including language variations and resource availability. A pivotal aspect is the analysis of ConNER, a state-of-the-art cross-lingual NER model, with a focus on its performance in the Italian language. Our empirical study employs a modified MultiNERD dataset covering English, German, French and Spanish, shedding light on ConNER's adaptability to other languages. Ultimately, this research aims to enrich NER methodology, offering insights into the potential of cross-lingual approaches for improving NER systems.
2023-11-03
File in questo prodotto:
File Dimensione Formato  
866698-1278844.pdf

accesso aperto

Tipologia: Altro materiale allegato
Dimensione 1.89 MB
Formato Adobe PDF
1.89 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14247/16441