What happens when a named entity recognition (NER) system encounters entities it has never seen before? In practical applications, models must generalize to unseen entity types where labeled training data is either unavailable or severely limited—a challenge that demands zero-shot learning capabilities. While large language models (LLMs) offer extensive parametric knowledge, they fall short in cost-effectiveness compared to specialized small encoders. Existing zero-shot methods predominantly adopt a relaxed definition of the term with potential leakage issues and rely on entity type names for generalization, overlooking the value of richer descriptions for disambiguation. In this work, we introduce ZeroNER, a description-driven framework that enhances hard zero-shot NER in low-resource settings. By leveraging general-domain annotations and entity type descriptions with LLM supervision, ZeroNER enables a BERT-based student model to successfully identify unseen entity types. Evaluated on three real-world benchmarks, ZeroNER consistently outperforms LLMs by up to 16% in F1 score, and surpasses lightweight baselines that use type names alone. Our analysis further reveals that LLMs derive significant benefits from incorporating type descriptions in the prompts.
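The two ideas the abstract contrasts, a hard zero-shot split with no type leakage and inputs enriched with a type description rather than the bare type name, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `EntityType` class, the `leaked_types` and `build_input` helpers, the ` [SEP] ` separator, and the example types and descriptions are hypothetical and are not taken from the ZeroNER paper or its code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EntityType:
    """Hypothetical container for an entity type and its description."""
    name: str
    description: str


def leaked_types(train_types, test_types):
    """Return test type names that also occur among the training types.

    An empty result means the split has no type-name overlap, which is
    the (assumed) condition for a hard zero-shot evaluation.
    """
    seen = {t.name.lower() for t in train_types}
    return sorted(t.name for t in test_types if t.name.lower() in seen)


def build_input(entity_type, sentence, use_description=True, sep=" [SEP] "):
    """Prefix a sentence with the type name, optionally with its description.

    The separator and layout are illustrative assumptions, not the
    input format used by the paper.
    """
    prompt = entity_type.name
    if use_description:
        prompt = f"{entity_type.name}: {entity_type.description}"
    return prompt + sep + sentence


# Made-up example types: general-domain training types, one unseen test type.
train = [
    EntityType("person", "Names of individual people."),
    EntityType("location", "Names of places such as cities or countries."),
]
test = [EntityType("drug", "A chemical substance used to treat disease.")]

print(leaked_types(train, test))
print(build_input(test[0], "Aspirin reduces fever.", use_description=False))
print(build_input(test[0], "Aspirin reduces fever."))
```

With `use_description=False` the model input carries only the type name, which corresponds to the name-only baselines the abstract mentions; the default adds the description as extra context for disambiguation.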

Cocchieri, A., Martínez Galindo, M., Frisoni, G., Moro, G., Sartori, C., Tagliavini, G. (2025). ZeroNER: Fueling Zero-Shot Named Entity Recognition via Entity Type Descriptions [10.18653/v1/2025.findings-acl.805].

ZeroNER: Fueling Zero-Shot Named Entity Recognition via Entity Type Descriptions

Alessio Cocchieri (co-first author); Giacomo Frisoni (co-first author); Gianluca Moro (co-first author); Claudio Sartori; Giuseppe Tagliavini
2025

Findings of the Association for Computational Linguistics: ACL 2025, pp. 15594-15616
Cocchieri, Alessio; Martínez Galindo, Marcos; Frisoni, Giacomo; Moro, Gianluca; Sartori, Claudio; Tagliavini, Giuseppe
Files in this record:
File: 2025.findings-acl.805 (1).pdf
Access: open access
Type: Publisher's version (PDF) / Version of Record
License: Open access license. Creative Commons Attribution (CC BY)
Size: 2.94 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/1027337