Interpretation of Generalization in Masked Language Models: An Investigation Straddling Quantifiers and Generics

Collacciani C.; Rambelli G.

2023

Abstract

Generics are statements that express generalizations and are used to communicate generalizable knowledge. While generics convey general truths (e.g., Birds can fly), they often allow for exceptions (e.g., penguins do not fly). Nonetheless, generics form the basis of how we communicate our commonsense knowledge about the world. We explored the interpretation of generics in Masked Language Models (MLMs), building on psycholinguistic experimental designs. As this interpretation requires a comparison with overtly quantified sentences, we investigated i) the probability of quantifiers, ii) the internal representation of nouns in generic vs. quantified sentences, and iii) whether the presence of a generic sentence as context influences quantifiers’ probabilities. The results confirm that MLMs are insensitive to quantification; nevertheless, they appear to encode a meaning associated with the generic form, which leads them to reshape the probability associated with various quantifiers when the generic sentence is provided as context.

Year: 2023
Volume title: Proceedings of the 9th Italian Conference on Computational Linguistics - CLiC-it 2023
First page: 1
Last page: 11
Series: CEUR Workshop Proceedings
Citation: Collacciani C., Rambelli G. (2023). Interpretation of Generalization in Masked Language Models: An Investigation Straddling Quantifiers and Generics. Aachen: CEUR-WS.
All authors: Collacciani C.; Rambelli G.
Publication type: 4.01 Contribution in conference proceedings

Files in this item:

File	Access	Type	License	Size	Format
paper17.pdf	Open access	Publisher's version (PDF) / Version of Record	Creative Commons	1.93 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/954561
