CRIS Current Research Information System

Researchers and educators interested in creative writing need a reliable and efficient tool to score the creativity of narratives, such as short stories. Typically, human raters manually assess narrative creativity, but such subjective scoring is limited by labor costs and rater disagreement. Large language models (LLMs) have shown remarkable success on creativity tasks, yet they have not been applied to scoring narratives, including multilingual stories. In the present study, we aimed to test whether narrative originality-a component of creativity-could be automatically scored by LLMs, further evaluating whether a single LLM could predict human originality ratings across multiple languages. We trained three different LLMs to predict the originality of short stories written in 11 languages. Our first monolingual model, trained only on English stories, robustly predicted human originality ratings (r = .81). This same model-trained and tested on multilingual stories translated into English-strongly predicted originality ratings of multilingual narratives (r >= .73). Finally, a multilingual model trained on the same stories, in their original language, reliably predicted human originality scores across all languages (r >= .72). We thus demonstrate that LLMs can successfully score narrative creativity in 11 different languages, surpassing the performance of the best previous automated scoring techniques (e.g., semantic distance). This work represents the first effective, accessible, and reliable solution for the automated scoring of creativity in multilingual narratives.

Luchini, S.A., Moosa, I.M., Patterson, J.D., Johnson, D., Baas, M., Barbot, B., et al. (2025). Automated assessment of creativity in multilingual narratives. PSYCHOLOGY OF AESTHETICS, CREATIVITY, AND THE ARTS, online, 1-18 [10.1037/aca0000725].

Automated assessment of creativity in multilingual narratives

Luchini, Simone A.;Moosa, Ibraheem Muhammad;Patterson, John D.;Johnson, Dan;Baas, Matthijs;Barbot, Baptiste;Bashmakova, Iana;Benedek, Mathias;Chen, Qunlin;Corazza, Giovanni E.;Forthmann, Boris;Goecke, Benjamin;Said-Metwaly, Sameh;Karwowski, Maciej;Kenett, Yoed N.;Lebuda, Izabela;Lubart, Todd;Miroshnik, Kirill G.;Obialo, Felix-Kingsley;Ovando-Tellez, Marcela;Primi, Ricardo;Puente-Díaz, Rogelio;Stevenson, Claire;Volle, Emmanuelle;Zielińska, Aleksandra;van Hell, Janet G.;Yin, Wenpeng;Beaty, Roger E.

2025

Abstract

Researchers and educators interested in creative writing need a reliable and efficient tool to score the creativity of narratives, such as short stories. Typically, human raters manually assess narrative creativity, but such subjective scoring is limited by labor costs and rater disagreement. Large language models (LLMs) have shown remarkable success on creativity tasks, yet they have not been applied to scoring narratives, including multilingual stories. In the present study, we aimed to test whether narrative originality-a component of creativity-could be automatically scored by LLMs, further evaluating whether a single LLM could predict human originality ratings across multiple languages. We trained three different LLMs to predict the originality of short stories written in 11 languages. Our first monolingual model, trained only on English stories, robustly predicted human originality ratings (r = .81). This same model-trained and tested on multilingual stories translated into English-strongly predicted originality ratings of multilingual narratives (r >= .73). Finally, a multilingual model trained on the same stories, in their original language, reliably predicted human originality scores across all languages (r >= .72). We thus demonstrate that LLMs can successfully score narrative creativity in 11 different languages, surpassing the performance of the best previous automated scoring techniques (e.g., semantic distance). This work represents the first effective, accessible, and reliable solution for the automated scoring of creativity in multilingual narratives.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Rivista
	
				PSYCHOLOGY OF AESTHETICS, CREATIVITY, AND THE ARTS
			
	Codice DOI
	
				https://dx.doi.org/10.1037/aca0000725
			
	Citazione
	
				Luchini, S.A., Moosa, I.M., Patterson, J.D., Johnson, D., Baas, M., Barbot, B., et al. (2025). Automated assessment of creativity in multilingual narratives. PSYCHOLOGY OF AESTHETICS, CREATIVITY, AND THE ARTS, online, 1-18 [10.1037/aca0000725].
			
	Tutti gli autori
	
						Luchini, Simone A.; Moosa, Ibraheem Muhammad; Patterson, John D.; Johnson, Dan; Baas, Matthijs; Barbot, Baptiste; Bashmakova, Iana; Benedek, Mathias; ...espandi
						
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Luchini et al_2025_MultilingFTmanuscript.pdf accesso aperto Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review Licenza: Licenza per accesso libero gratuito Dimensione 2.07 MB Formato Adobe PDF Visualizza/Apri	2.07 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1013606

Citazioni

ND

7

7

social impact