Limited hospital resources may prolong patient stays in the Emergency Department (ED), potentially affecting clinical outcomes. This paper investigates the link between overnight Emergency Department (ED) stays and in-hospital mortality, focusing on co-morbidity extraction from clinical records. In Italian healthcare records, comorbidities are typically documented using abbreviations and non-standard clinical slang in unstructured free-text fields. We evaluated two approaches for comorbidity extraction: a rule-based method and a Large Language Model approach. Both were assessed against a dataset of 200 clinical records manually annotated by emergency medical staff. This first result showed that the rule-based strategy outperformed Large Language Models in terms of recall, F1-score, consistency, and reliability. Then, to assess the impact of overnight stays on in-hospital mortality and to identify the most significant predictors, 126,696 ED admissions at the Romagna Local Health Agency in Forl\`{\i}, Italy, between 2017 and 2022 were analysed using several models, with particular emphasis on interpretability. Comorbidity burden, diagnosis severity, age, and infectious, respiratory, and circulatory diseases emerged as the most influential factors
Tascioglu, A.B., Bertini, F., Pistore, L., Fabbri, A., Montesi, D. (2025). Comorbidity Extraction for In-Hospital Mortality Analysis: a Comparison of Regular Expressions and Large Language Models [10.1145/3765612.3767202].
Comorbidity Extraction for In-Hospital Mortality Analysis: a Comparison of Regular Expressions and Large Language Models
Tascioglu, Ayca Begum;Bertini, Flavio;Pistore, Laura;Montesi, Danilo
2025
Abstract
Limited hospital resources may prolong patient stays in the Emergency Department (ED), potentially affecting clinical outcomes. This paper investigates the link between overnight Emergency Department (ED) stays and in-hospital mortality, focusing on co-morbidity extraction from clinical records. In Italian healthcare records, comorbidities are typically documented using abbreviations and non-standard clinical slang in unstructured free-text fields. We evaluated two approaches for comorbidity extraction: a rule-based method and a Large Language Model approach. Both were assessed against a dataset of 200 clinical records manually annotated by emergency medical staff. This first result showed that the rule-based strategy outperformed Large Language Models in terms of recall, F1-score, consistency, and reliability. Then, to assess the impact of overnight stays on in-hospital mortality and to identify the most significant predictors, 126,696 ED admissions at the Romagna Local Health Agency in Forl\`{\i}, Italy, between 2017 and 2022 were analysed using several models, with particular emphasis on interpretability. Comorbidity burden, diagnosis severity, age, and infectious, respiratory, and circulatory diseases emerged as the most influential factors| File | Dimensione | Formato | |
|---|---|---|---|
|
3765612.3767202.pdf
accesso aperto
Tipo:
Versione (PDF) editoriale / Version Of Record
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale - Non opere derivate (CCBYNCND)
Dimensione
1.55 MB
Formato
Adobe PDF
|
1.55 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



