CRIS Current Research Information System

The structures of discourse used by legal and ordinary languages share differences that foster technical issues when applying or fine-tuning general-purpose language models for open-domain question answering on legal resources. For example, longer sentences may be preferred in European laws (i.e., Brussels I bis Regulation EU 1215/2012) to reduce potential ambiguities and improve comprehensibility, dis- tracting a language model trained on ordinary English. In this article, we investi- gate some mechanisms to isolate and capture the discursive patterns of legalese in order to perform zero-shot question answering, i.e., without training on legal docu- ments. Specifically, we use pre-trained open-domain answer retrieval systems and study what happens when changing the type of information to consider for retrieval. Indeed, by selecting only the important parts of discourse (e.g., elementary units of discourse, EDU for short, or abstract representations of meaning, AMR for short), we should be able to help the answer retriever identify the elements of interest. Hence, with this paper, we publish Q4EU, a new evaluation dataset that includes more than 70 questions and 200 answers on 6 different European norms, and study what happens to a baseline system when only EDUs or AMRs are used during infor- mation retrieval. Our results show that the versions using EDUs are overall the best, leading to state-of-the-art F1, precision, NDCG and MRR scores.

Sovrano, F., Palmirani, M., Sapienza, S., Pistone, V. (2024). DiscoLQA: zero-shot discourse-based legal question answering on European Legislation. ARTIFICIAL INTELLIGENCE AND LAW, First Online, 1-37 [10.1007/s10506-023-09387-2].

DiscoLQA: zero-shot discourse-based legal question answering on European Legislation

Sovrano, Francesco;Palmirani, Monica;Sapienza, Salvatore;Pistone, Vittoria

2024

Abstract

The structures of discourse used by legal and ordinary languages share differences that foster technical issues when applying or fine-tuning general-purpose language models for open-domain question answering on legal resources. For example, longer sentences may be preferred in European laws (i.e., Brussels I bis Regulation EU 1215/2012) to reduce potential ambiguities and improve comprehensibility, dis- tracting a language model trained on ordinary English. In this article, we investi- gate some mechanisms to isolate and capture the discursive patterns of legalese in order to perform zero-shot question answering, i.e., without training on legal docu- ments. Specifically, we use pre-trained open-domain answer retrieval systems and study what happens when changing the type of information to consider for retrieval. Indeed, by selecting only the important parts of discourse (e.g., elementary units of discourse, EDU for short, or abstract representations of meaning, AMR for short), we should be able to help the answer retriever identify the elements of interest. Hence, with this paper, we publish Q4EU, a new evaluation dataset that includes more than 70 questions and 200 answers on 6 different European norms, and study what happens to a baseline system when only EDUs or AMRs are used during infor- mation retrieval. Our results show that the versions using EDUs are overall the best, leading to state-of-the-art F1, precision, NDCG and MRR scores.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Rivista
	
				ARTIFICIAL INTELLIGENCE AND LAW
			
	Codice DOI
	
				https://dx.doi.org/10.1007/s10506-023-09387-2
			
	Citazione
	
				Sovrano, F., Palmirani, M., Sapienza, S., Pistone, V. (2024). DiscoLQA: zero-shot discourse-based legal question answering on European Legislation. ARTIFICIAL INTELLIGENCE AND LAW, First Online, 1-37 [10.1007/s10506-023-09387-2].
			
	Tutti gli autori
	
						Sovrano, Francesco; Palmirani, Monica; Sapienza, Salvatore; Pistone, Vittoria
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
DiscoLQA: zero-shot discourse-based legal question answering on European Legislation.pdf accesso aperto Tipo: Versione (PDF) editoriale / Version Of Record Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 2.04 MB Formato Adobe PDF Visualizza/Apri	2.04 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/952595

Citazioni

ND

8

8

10

social impact