Purpose: This paper aims to expand the scope and mitigate the biases of extant archival indexes. Design/methodology/approach: The authors use automatic entity recognition on the archives of the Dutch East India Company to extract mentions of underrepresented people. Findings: The authors release an annotated corpus and baselines for a shared task and show that the proposed goal is feasible. Originality/value: Colonial archives are increasingly a focus of attention for historians and the public, broadening access to them is a pressing need for archives.

Luthra Mrinalini, Todorov Konstantin, Jeurgens Charles, Colavizza Giovanni (2024). Unsilencing colonial archives via automated entity recognition. JOURNAL OF DOCUMENTATION, 80(5), 1080-1105 [10.1108/JD-02-2022-0038].

Unsilencing colonial archives via automated entity recognition

Colavizza Giovanni
2024

Abstract

Purpose: This paper aims to expand the scope and mitigate the biases of extant archival indexes. Design/methodology/approach: The authors use automatic entity recognition on the archives of the Dutch East India Company to extract mentions of underrepresented people. Findings: The authors release an annotated corpus and baselines for a shared task and show that the proposed goal is feasible. Originality/value: Colonial archives are increasingly a focus of attention for historians and the public, broadening access to them is a pressing need for archives.
2024
Luthra Mrinalini, Todorov Konstantin, Jeurgens Charles, Colavizza Giovanni (2024). Unsilencing colonial archives via automated entity recognition. JOURNAL OF DOCUMENTATION, 80(5), 1080-1105 [10.1108/JD-02-2022-0038].
Luthra Mrinalini; Todorov Konstantin; Jeurgens Charles; Colavizza Giovanni
File in questo prodotto:
File Dimensione Formato  
Luthra et al. - 2023 - Unsilencing colonial archives via automated entity.pdf

accesso aperto

Descrizione: Articolo
Tipo: Postprint
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale (CCBYNC)
Dimensione 13.59 MB
Formato Adobe PDF
13.59 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/948745
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 3
social impact