MDX (MultiDimensional Expressions) is the standard language for querying multidimensional data in OLAP systems, but its complex syntax poses challenges for non-expert users. While a lot of research has focused on natural language interfaces for SQL, little attention has been given to MDX. This paper explores the potential of Large Language Models (LLMs), specifically GPT-4o, in translating natural language questions into MDX statements. We investigate whether LLMs can act as full MDX query generators or assistants, and study how the writing style of questions affects output correctness. Through four research questions, we evaluate ChatGPT's basic capabilities and the effectiveness of prompt engineering in improving text-to-MDX performance. Our evaluation confirms that, with ad-hoc prompt engineering, GPT-4o is indeed able to generate complex MDX queries ---particularly when the natural language question is given a structured formulation.

Bimonte, S., Rizzi, S. (2025). Text-to-MDX: LLM-Assisted Generation of MDX Queries from User Questions. Springer Nature [10.1007/978-3-032-08623-5_9].

Text-to-MDX: LLM-Assisted Generation of MDX Queries from User Questions

Stefano Rizzi
2025

Abstract

MDX (MultiDimensional Expressions) is the standard language for querying multidimensional data in OLAP systems, but its complex syntax poses challenges for non-expert users. While a lot of research has focused on natural language interfaces for SQL, little attention has been given to MDX. This paper explores the potential of Large Language Models (LLMs), specifically GPT-4o, in translating natural language questions into MDX statements. We investigate whether LLMs can act as full MDX query generators or assistants, and study how the writing style of questions affects output correctness. Through four research questions, we evaluate ChatGPT's basic capabilities and the effectiveness of prompt engineering in improving text-to-MDX performance. Our evaluation confirms that, with ad-hoc prompt engineering, GPT-4o is indeed able to generate complex MDX queries ---particularly when the natural language question is given a structured formulation.
2025
Conceptual Modeling. 44th International Conference, ER 2025, Poitiers, France, October 20–23, 2025, Proceedings
165
181
Bimonte, S., Rizzi, S. (2025). Text-to-MDX: LLM-Assisted Generation of MDX Queries from User Questions. Springer Nature [10.1007/978-3-032-08623-5_9].
Bimonte, Sandro; Rizzi, Stefano
File in questo prodotto:
File Dimensione Formato  
main.pdf

embargo fino al 18/10/2026

Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review
Licenza: Licenza per accesso libero gratuito
Dimensione 799.1 kB
Formato Adobe PDF
799.1 kB Adobe PDF   Visualizza/Apri   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1029102
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact