TY - JOUR
T1 - A computational predictor for autophagy-related proteins using interpretable machine learning and genetic algorithm
AU - Nguyen, Ngan Thi Kim
AU - Vo, Thanh Hoa
AU - Lin, Shyh Hsiang
AU - Le, Nguyen Quoc Khanh
N1 - Publisher Copyright:
© 2024 Elsevier Inc.
PY - 2024/3/1
Y1 - 2024/3/1
N2 - Autophagy is a quintessential process for eliminating molecules, subcellular elements, and damaged organelles to enhance homeostasis, differentiation, development, and survival. Therefore, a thorough understanding of the sophisticated mechanism of autophagy can solely contribute to the knowledge of side effects, drug repurposing, and the development of novel poly-pharmacological strategies regarding autophagy-related diseases. Artificial intelligence approaches' broad applicability in system biology has been a promising method in identifying the autophagy-related (Art) protein, which is vitally important to regulate and control various stages of autophagy formation. Underlying explainable XGBoost and SHapley Additive exPlanations (SHAP) models, the important features and predictive model of Art protein were established. Consequently, our model performance achieved a sensitivity of 66.52 %, specificity of 82.77 %, accuracy of 77.32 %, and MCC of 0.430 via a 5-fold cross-validation evaluation. Moreover, we evaluated our model on an independent dataset, and the final results reached a sensitivity of 65.1 %, specificity of 79.1 %, and accuracy of 77.1 %. It is then observed that our model was efficient and strongly recommended for further analysis of physiological mechanisms and the development of drugs regarding autophagy. The model and dataset are freely accessible via https://github.com/khanhlee/art-predictor.
AB - Autophagy is a quintessential process for eliminating molecules, subcellular elements, and damaged organelles to enhance homeostasis, differentiation, development, and survival. Therefore, a thorough understanding of the sophisticated mechanism of autophagy can solely contribute to the knowledge of side effects, drug repurposing, and the development of novel poly-pharmacological strategies regarding autophagy-related diseases. Artificial intelligence approaches' broad applicability in system biology has been a promising method in identifying the autophagy-related (Art) protein, which is vitally important to regulate and control various stages of autophagy formation. Underlying explainable XGBoost and SHapley Additive exPlanations (SHAP) models, the important features and predictive model of Art protein were established. Consequently, our model performance achieved a sensitivity of 66.52 %, specificity of 82.77 %, accuracy of 77.32 %, and MCC of 0.430 via a 5-fold cross-validation evaluation. Moreover, we evaluated our model on an independent dataset, and the final results reached a sensitivity of 65.1 %, specificity of 79.1 %, and accuracy of 77.1 %. It is then observed that our model was efficient and strongly recommended for further analysis of physiological mechanisms and the development of drugs regarding autophagy. The model and dataset are freely accessible via https://github.com/khanhlee/art-predictor.
KW - Autophagy pathway
KW - Explainable AI
KW - Physiological mechanisms
KW - Protein function prediction
KW - Sequence analysis
UR - http://www.scopus.com/inward/record.url?scp=85182706034&partnerID=8YFLogxK
U2 - 10.1016/j.genrep.2023.101876
DO - 10.1016/j.genrep.2023.101876
M3 - Article
AN - SCOPUS:85182706034
SN - 2452-0144
VL - 34
JO - Gene Reports
JF - Gene Reports
M1 - 101876
ER -