Train characteristics identification serves a vital role in monitoring the traffic on railway infrastructures and correlating it with their structural response. It traditionally relies on camera-based systems, which raise privacy concerns, have high implementation costs, and perform variably depending on visibility conditions. To address these limitations, we present RATTLE, an IoT framework that uses audio fingerprinting to identify trains, count carriages, and estimate transit speed. Our system combines edge devices for data collection, feature extraction, and processing with cloud infrastructure for aggregation and model training. To validate our approach, we collected a comprehensive dataset of video and audio samples, annotated it with ground truth labels, and used it for evaluation in our experiments. We are also making this dataset openly available to the research community to support further advancements in the field. The framework employs a custom Convolutional Neural Network for processing mel-spectrograms, achieving 96% accuracy in train identification, with a mean absolute error of 0.85 for carriage counting and 1.8 m/s for speed estimation. These results were obtained using only 540 training samples per location, with the model requiring less than 1.63 MB of storage, making it suitable for deployment on resource-limited devices.
Ciabattini, L., Esposito, A., Sciullo, L., Zyrianoff, I., Di Felice, M. (2025). RATTLE: A Framework for Train Characterization and Identification Through Audio Fingerprinting. IEEE ACCESS, 13, 211901-211920 [10.1109/ACCESS.2025.3641380].
RATTLE: A Framework for Train Characterization and Identification Through Audio Fingerprinting
Leonardo Ciabattini;Alfonso Esposito;Luca Sciullo;Ivan Zyrianoff;Marco Di Felice
2025
Abstract
Train characteristics identification serves a vital role in monitoring the traffic on railway infrastructures and correlating it with their structural response. It traditionally relies on camera-based systems, which raise privacy concerns, have high implementation costs, and perform variably depending on visibility conditions. To address these limitations, we present RATTLE, an IoT framework that uses audio fingerprinting to identify trains, count carriages, and estimate transit speed. Our system combines edge devices for data collection, feature extraction, and processing with cloud infrastructure for aggregation and model training. To validate our approach, we collected a comprehensive dataset of video and audio samples, annotated it with ground truth labels, and used it for evaluation in our experiments. We are also making this dataset openly available to the research community to support further advancements in the field. The framework employs a custom Convolutional Neural Network for processing mel-spectrograms, achieving 96% accuracy in train identification, with a mean absolute error of 0.85 for carriage counting and 1.8 m/s for speed estimation. These results were obtained using only 540 training samples per location, with the model requiring less than 1.63 MB of storage, making it suitable for deployment on resource-limited devices.| File | Dimensione | Formato | |
|---|---|---|---|
|
RATTLE_A_Framework_for_Train_Characterization_and_Identification_Through_Audio_Fingerprinting.pdf
accesso aperto
Tipo:
Versione (PDF) editoriale / Version Of Record
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
4.85 MB
Formato
Adobe PDF
|
4.85 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


