CRIS Current Research Information System

Multimodal systems and Large Language Models have shown remarkable capabilities in text-based reasoning, yet their capacity to perceive and interpret visual art remains uncertain. This study examines how CLIP “sees” and understands artworks by comparing their responses to human- and AI-generated paintings in the European tradition from the Renaissance onward. The analysis focuses on its ability to identify style, period and cultural context, as well as potential biases in its perception, evaluated against human judgments.

Asperti, A., Dessi, L., Tonetti, M.C., Wu, N. (2025). Does CLIP Perceive Art the Same Way We Do?. New York : IEEE [10.1109/cbmi66578.2025.11339321].

Does CLIP Perceive Art the Same Way We Do?

Asperti, Andrea;Dessi, Leonardo;Tonetti, Maria Chiara;Wu, Nico

2025

Abstract

Multimodal systems and Large Language Models have shown remarkable capabilities in text-based reasoning, yet their capacity to perceive and interpret visual art remains uncertain. This study examines how CLIP “sees” and understands artworks by comparing their responses to human- and AI-generated paintings in the European tradition from the Renaissance onward. The analysis focuses on its ability to identify style, period and cultural context, as well as potential biases in its perception, evaluated against human judgments.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Titolo del volume
	
				International Conference on Content-Based Multimedia Indexing, CBMI, 2025, Dublin, Ireland, October 22-24, 2025
			
	Pagina iniziale
	
				1
			
	Pagina finale
	
				8
			
	Codice DOI
	
				https://dx.doi.org/10.1109/cbmi66578.2025.11339321
			
	Citazione
	
				Asperti, A., Dessi, L., Tonetti, M.C., Wu, N. (2025). Does CLIP Perceive Art the Same Way We Do?. New York : IEEE [10.1109/cbmi66578.2025.11339321].
			
	Tutti gli autori
	
						Asperti, Andrea; Dessi, Leonardo; Tonetti, Maria Chiara; Wu, Nico
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
CLIP_perception__IEEE_trans_arxiv__compressed.pdf embargo fino al 19/01/2028 Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review Licenza: Licenza per accesso libero gratuito Dimensione 221.4 kB Formato Adobe PDF Visualizza/Apri Contatta l'autore	221.4 kB	Adobe PDF	Visualizza/Apri Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1051463

Citazioni

ND

0

ND

1

social impact