Large-scale public datasets are vital for driving the progress of abstractive summarization, especially in law, where documents have highly specialized jargon. However, the available resources are English-centered, limiting research advancements in other languages. This paper introduces LAWSUIT, a collection of 14K Italian legal verdicts with expert-authored abstractive maxims drawn from the Constitutional Court of the Italian Republic. LAWSUIT presents an arduous task with lengthy source texts and evenly distributed salient content. We offer extensive experiments with sequence-to-sequence and segmentation-based approaches, revealing that the latter achieve better results in full and few-shot settings. We openly release LAWSUIT to foster the development and automation of real-world legal applications.

Ragazzi, L., Moro, G., Guidi, S., Frisoni, G. (2024). LAWSUIT: a LArge expert-Written SUmmarization dataset of ITalian constitutional court verdicts. ARTIFICIAL INTELLIGENCE AND LAW, 33, 1-37 [10.1007/s10506-024-09414-w].

LAWSUIT: a LArge expert-Written SUmmarization dataset of ITalian constitutional court verdicts

Luca Ragazzi; Gianluca Moro; Stefano Guidi; Giacomo Frisoni
2024


Use this identifier to cite or link to this document: https://hdl.handle.net/11585/1007074