Anatomical Therapeutic Chemical (ATC) classification of unknown compounds is essential for drug development and research. In this paper, we propose a multi-label classifier system for ATC prediction based on convolutional neural networks (CNNs) and long short-term memory (LSTM) networks. The CNN approach extracts a 1D feature vector from the compounds utilizing information about their chemical–chemical interaction and structural and fingerprint similarities to other compounds belonging to the ATC classes. The 1D vector is then reshaped into a 2D matrix. A CNN is trained on the matrix and used to extract new features. LSTM is trained on the 1D vector and likewise used to extract features. These features are then trained on two general-purpose classifiers designed for multi-label classification, and results are fused. Rigorous experimental evaluation demonstrates the superiority of our method compared to other state-of-the-art approaches.
Nanni L., Brahnam S., Lumini A. (2020). Ensemble of Deep Learning Approaches for ATC Classification. Singapore : Springer [10.1007/978-981-13-9282-5_12].
Ensemble of Deep Learning Approaches for ATC Classification
Lumini A.
2020
Abstract
Anatomical Therapeutic Chemical (ATC) classification of unknown compounds is essential for drug development and research. In this paper, we propose a multi-label classifier system for ATC prediction based on convolutional neural networks (CNNs) and long short-term memory (LSTM) networks. The CNN approach extracts a 1D feature vector from the compounds utilizing information about their chemical–chemical interaction and structural and fingerprint similarities to other compounds belonging to the ATC classes. The 1D vector is then reshaped into a 2D matrix. A CNN is trained on the matrix and used to extract new features. LSTM is trained on the 1D vector and likewise used to extract features. These features are then trained on two general-purpose classifiers designed for multi-label classification, and results are fused. Rigorous experimental evaluation demonstrates the superiority of our method compared to other state-of-the-art approaches.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.