Detecting Smart Contract Vulnerabilities using Transformers and LLMs

Ferretti, S; D'Angelo, G; Ghini, V; Tomasone, Mb

doi:10.1109/PerComWorkshops65533.2025.00033

This study investigates the detection of vulnerabilities in smart contracts using various transformer models and Large Language Model (LLM) systems. We evaluated BERT, CodeBERT, DistilBERT, and the Gemini model, employing techniques such as aggregation of chunks to enhance performance. The results indicate that simple transformers applied to source code generally perform worse than when applied to byte-code. However, the use of aggregation techniques on the source code significantly improved the model performance. We also evaluate the use of meta-classifiers for multimodal data, by stacking multiple transformers working on source code and byte-code. The Random Forest meta-classifier achieved the highest performance but exhibited significant overfitting. The Gemini model demonstrates limited performance, highlighting the necessity of proper training for LLM systems.

Ferretti, S., D'Angelo, G., Ghini, V., Tomasone, M.b. (2025). Detecting Smart Contract Vulnerabilities using Transformers and LLMs. 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA : IEEE COMPUTER SOC [10.1109/PerComWorkshops65533.2025.00033].

Detecting Smart Contract Vulnerabilities using Transformers and LLMs

Ferretti, S;D'Angelo, G;Ghini, V;Tomasone, MB

2025

Abstract

This study investigates the detection of vulnerabilities in smart contracts using various transformer models and Large Language Model (LLM) systems. We evaluated BERT, CodeBERT, DistilBERT, and the Gemini model, employing techniques such as aggregation of chunks to enhance performance. The results indicate that simple transformers applied to source code generally perform worse than when applied to byte-code. However, the use of aggregation techniques on the source code significantly improved the model performance. We also evaluate the use of meta-classifiers for multimodal data, by stacking multiple transformers working on source code and byte-code. The Random Forest meta-classifier achieved the highest performance but exhibited significant overfitting. The Gemini model demonstrates limited performance, highlighting the necessity of proper training for LLM systems.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Titolo del volume
	
				IEEE Annual Conference on Pervasive Computing and Communications Workshops (PerCom)
			
	Pagina iniziale
	
				7
			
	Pagina finale
	
				12
			
	Collana/Serie
	
				IEEE International Conference on Pervasive Computing and Communications workshops
			
	Codice DOI
	
				https://dx.doi.org/10.1109/PerComWorkshops65533.2025.00033
			
	Citazione
	
				Ferretti, S., D'Angelo, G., Ghini, V., Tomasone, M.b. (2025). Detecting Smart Contract Vulnerabilities using Transformers and LLMs. 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA : IEEE COMPUTER SOC [10.1109/PerComWorkshops65533.2025.00033].
			
	Tutti gli autori
	
						Ferretti, S; D'Angelo, G; Ghini, V; Tomasone, Mb

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1027597

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

0

0

CRIS Current Research Information System