
Ferraris, A.F., Audrito, D., Caro, L.D., Poncibò, C. (2025). The architecture of language: Understanding the mechanics behind LLMs. Cambridge Forum on AI: Law and Governance, 1, 1–19. [10.1017/cfl.2024.16]

The architecture of language: Understanding the mechanics behind LLMs

Ferraris, Andrea Filippo; Audrito, Davide; Caro, Luigi Di; Poncibò, Cristina
2025

Abstract

Large language models (LLMs) have significantly advanced artificial intelligence (AI) and natural language processing (NLP) by excelling in tasks like text generation, machine translation, question answering and sentiment analysis, often rivaling human performance. This paper reviews LLMs’ foundations, advancements and applications, beginning with the transformative transformer architecture, which improved on earlier models like recurrent neural networks and convolutional neural networks through self-attention mechanisms that capture long-range dependencies and contextual relationships. Key innovations such as masked language modeling and causal language modeling underpin leading models like Bidirectional Encoder Representations from Transformers (BERT) and the Generative Pre-trained Transformer (GPT) series. The paper highlights scaling laws, model size increases and advanced training techniques that have driven LLMs’ growth. It also explores methodologies to enhance their precision and adaptability, including parameter-efficient fine-tuning and prompt engineering. Challenges like high computational demands, biases and hallucinations are addressed, with solutions such as retrieval-augmented generation to improve factual accuracy. By discussing LLMs’ strengths, limitations and transformative potential, this paper provides researchers, practitioners and students with a comprehensive understanding. It underscores the importance of ongoing research to improve efficiency, manage ethical concerns and shape the future of AI and language technologies.
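As an illustrative sketch only (not drawn from the paper itself), the scaled dot-product self-attention the abstract refers to, together with the causal masking that distinguishes GPT-style causal language modeling from BERT-style bidirectional encoding, can be written in a few lines of NumPy. All names and dimensions below are hypothetical:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv, causal=False):
    """Single-head scaled dot-product self-attention (sketch).

    X          : (seq_len, d_model) token embeddings.
    Wq, Wk, Wv : (d_model, d_k) projection matrices.
    causal=True masks future positions (GPT-style causal LM);
    causal=False lets every token attend to all others
    (BERT-style bidirectional encoding).
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)        # pairwise token affinities
    if causal:
        # Upper-triangular mask: token i cannot attend to tokens j > i.
        mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
        scores = np.where(mask, -np.inf, scores)
    weights = softmax(scores, axis=-1)     # each row sums to 1
    return weights @ V                     # context-mixed representations

# Toy example: 4 tokens, 8-dim embeddings, 4-dim head.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 4)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv, causal=True)
print(out.shape)  # (4, 4)
```

With `causal=True`, the first token can attend only to itself, so its output row is exactly its own value projection — the property that lets decoder-style models be trained to predict the next token without seeing the future.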

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/1042358
