Computational approaches for predicting drug-disease associations: a comprehensive review

Zhaoyang HUANG, Zhichao XIAO, Chunyan AO, Lixin GUAN, Liang YU

PDF(1479 KB)
PDF(1479 KB)
Front. Comput. Sci. ›› 2025, Vol. 19 ›› Issue (5) : 195909. DOI: 10.1007/s11704-024-40072-y
Interdisciplinary
REVIEW ARTICLE

Computational approaches for predicting drug-disease associations: a comprehensive review

Author information +
History +

Abstract

In recent decades, traditional drug research and development have been facing challenges such as high cost, long timelines, and high risks. To address these issues, many computational approaches have been proposed for predicting the relationship between drugs and diseases through drug repositioning, aiming to reduce the cost, development cycle and risks associated with developing new drugs. Researchers have explored different computational methods to predict drug-disease associations, including drug side effects-disease associations, drug-target associations, and miRNA-disease associations. In this comprehensive review, we focus on recent advances in predicting drug-disease association methods for drug repositioning. We first categorize these methods into several groups, including neural network-based algorithms, matrix-based algorithms, recommendation algorithms, link-based reasoning algorithms, and text mining and semantic reasoning. Then, we compare the prediction performance of existing drug-disease association prediction algorithms. Lastly, we discuss the current challenges and future perspectives in the field of drug-disease associations.

Graphical abstract

Keywords

drug-disease association / association prediction / drug repositioning / machine learning

Cite this article

Download citation ▾
Zhaoyang HUANG, Zhichao XIAO, Chunyan AO, Lixin GUAN, Liang YU. Computational approaches for predicting drug-disease associations: a comprehensive review. Front. Comput. Sci., 2025, 19(5): 195909 https://doi.org/10.1007/s11704-024-40072-y

Zhaoyang Huang is currently pursuing a doctoral degree at the School of Computer Science and Technology, Xidian University, China. His research endeavors focus on the analysis of single-cell RNA sequencing (scRNA-seq) data, the development of algorithms for predicting cellular differentiation, and the exploration of novel applications for existing drugs through drug repositioning strategies

Zhichao Xiao is currently pursuing a master’s degree in the School of Computer Science and Technology at Xidian University, China. His main research interests are in bioinformatics and fuzzy theory

Chunyan Ao received the PhD degree from the School of Computer Science and Technology, Xidian University, China in 2024. Her research interests are bioinformatics and machine learning

Lixin Guan graduated from the School of Computer Science and Technology of Xidian University, China in 2022 with a master’s degree. Her research interests are computational bioinformatics and association prediction

Liang Yu is a professor and doctoral supervisor at the School of Computer Science and Technology, Xidian University, China. She obtained her PhD degree in engineering from Xidian University, China in June 2011. She visited Harvard Medical School, USA from September 2017 to September 2018. She has led two general projects funded by the National Natural Science Foundation of China, one project from the Youth Fund, and one project from the Natural Science Foundation of Shaanxi Province. As a key participant, she has contributed to two major projects supported by the National Natural Science Foundation of China, one major research project, and one general project. She has published over 60 academic papers in this field as the first or corresponding author

References

[1]
Ashburn T T, Thor K B . Drug repositioning: identifying and developing new uses for existing drugs. Nature Reviews Drug Discovery, 2004, 3( 8): 673–683
[2]
Park K . A review of computational drug repurposing. Translational and Clinical Pharmacology, 2019, 27( 2): 59–63
[3]
Su R, Liu X, Wei L, Zou Q . Deep-resp-forest: a deep forest model to predict anti-cancer drug response. Methods, 2019, 166: 91–102
[4]
Shahreza M L, Ghadiri N, Mousavi S R, Varshosaz J, Green J R . A review of network-based approaches to drug repositioning. Briefings in Bioinformatics, 2018, 19( 5): 878–892
[5]
Jarada T N, Rokne J G, Alhajj R . SNF-NN: computational method to predict drug-disease interactions using similarity network fusion and neural networks. BMC Bioinformatics, 2021, 22( 1): 28
[6]
Xue H Q, Li J, Xie H Z, Wang Y D . Review of drug repositioning approaches and resources. International Journal of Biological Sciences, 2018, 14( 10): 1232–1244
[7]
Liu X W, Shi T Y, Gao D, Ma C Y, Lin H, Yan D, Deng K J . iPADD: a computational tool for predicting potential antidiabetic drugs using machine learning algorithms. Journal of Chemical Information and Modeling, 2023, 63( 15): 4960–4969
[8]
Jourdan J P, Bureau R, Rochais C, Dallemagne P . Drug repositioning: a brief overview. Journal of Pharmacy and Pharmacology, 2020, 72( 9): 1145–1151
[9]
Zeng X, Song X, Ma T, Pan X, Zhou Y, Hou Y, Zhang Z, Li K, Karypis G, Cheng F . Repurpose open data to discover therapeutics for COVID-19 using deep learning. Journal of Proteome Research, 2020, 19( 11): 4624–4636
[10]
Li J, Zheng S, Chen B, Butte A J, Swamidass S J, Lu Z. A survey of current trends in computational drug repositioning. Briefings in Bioinformatics, 2016, 17(1): 2−12
[11]
Pan X, Lin X, Cao D, Zeng X, Yu P S, He L, Nussinov R, Cheng F . Deep learning for drug repurposing: methods, databases, and applications. WIREs Computational Molecular Science, 2022, 12( 4): e1597
[12]
Wang R, Jiang Y, Jin J, Yin C, Yu H, Wang F, Feng J, Su R, Nakai K, Zou Q, Wei L . DeepBIO: an automated and interpretable deep-learning platform for high-throughput biological sequence prediction, functional annotation and visualization analysis. Nucleic Acids Research, 2023, 51( 7): 3017–3029
[13]
Li H, Liu B . BioSeq-diabolo: biological sequence similarity analysis using diabolo. PLoS Computational Biology, 2023, 19( 6): e1011214
[14]
Yang H, Lv H, Ding H, Chen W, Lin H . iRNA-2OM: a sequence-based predictor for identifying 2’-O-methylation sites in Homo sapiens. Journal of Computational Biology, 2018, 25( 11): 1266–1277
[15]
Li H L, Pang Y H, Liu B . BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models. Nucleic Acids Research, 2021, 49( 22): e129
[16]
Luo H, Wang J, Li M, Luo J, Ni P, Zhao K, Wu F X, Pan Y . Computational drug repositioning with random walk on a heterogeneous network. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2019, 16( 6): 1890–1900
[17]
Wang B, Lyu X, Qu J, Sun H, Pan Z, Tang Z,. GNDD: a graph neural network-based method for drug-disease association prediction. In: Proceedings of 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). 2019, 1253−1255
[18]
Yang K, Zhao X, Waxman D, Zhao X M . Predicting drug-disease associations with heterogeneous network embedding. Chaos, 2019, 29( 12): 123109
[19]
Wang Y, Zhai Y, Ding Y, Zou Q. SBSM-pro: support bio-sequence machine for proteins. 2023, arXiv preprint arXiv: 2308.10275
[20]
Ai C, Yang H, Ding Y, Tang J, Guo F . Low rank matrix factorization algorithm based on multi-graph regularization for detecting drug-disease association. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2023, 20( 5): 3033–3043
[21]
Li P, Tiwari P, Xu J, Qian Y, Ai C, Ding Y, Guo F . Sparse regularized joint projection model for identifying associations of non-coding RNAs and human diseases. Knowledge-Based Systems, 2022, 258: 110044
[22]
Zeng X, Xiang H, Yu L, Wang J, Li K, Nussinov R, Cheng F . Accurate prediction of molecular properties and drug targets using a self-supervised image representation learning framework. Nature Machine Intelligence, 2022, 4( 11): 1004–1016
[23]
Su R, Liu X, Xiao G, Wei L . Meta-GDBP: a high-level stacked regression model to improve anticancer drug response prediction. Briefings in Bioinformatics, 2020, 21( 3): 996–1005
[24]
Feng J, Wu S, Yang H, Ai C, Qiao J, Xu J, Guo F . Microbe-bridged disease-metabolite associations identification by heterogeneous graph fusion. Briefings in Bioinformatics, 2022, 23( 6): bbac423
[25]
Agamah F E, Bayjanov J R, Niehues A, Njoku K F, Skelton M, Mazandu G K, Ederveen T H A, Mulder N, Chimusa E R, ’t Hoen P A C . Computational approaches for network-based integrative multi-omics analysis. Frontiers in Molecular Biosciences, 2022, 9: 967205
[26]
Xu J, Xu J, Meng Y, Lu C, Cai L, Zeng X, Nussinov R, Cheng F . Graph embedding and Gaussian mixture variational autoencoder network for end-to-end analysis of single-cell RNA sequencing data. Cell Reports Methods, 2023, 3( 1): 100382
[27]
Moon C, Jin C, Dong X, Abrar S, Zheng W, Chirkova R Y, Tropsha A . Learning drug-disease-target embedding (DDTE) from knowledge graphs to inform drug repurposing hypotheses. Journal of Biomedical Informatics, 2021, 119: 103838
[28]
Han P, Li X, Wang X, Wang S, Gao C, Chen W . Exploring the effects of drug, disease, and protein dependencies on biomedical named entity recognition: a comparative analysis. Frontiers in Pharmacology, 2022, 13: 1020759
[29]
Jin J, Yu Y, Wang R, Zeng X, Pang C, Jiang Y, Li Z, Dai Y, Su R, Zou Q, Nakai K, Wei L . iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations. Genome Biology, 2022, 23( 1): 219
[30]
Tang Y J, Pang Y H, Liu B . IDP-Seq2Seq: identification of intrinsically disordered regions based on sequence to sequence learning. Bioinformatics, 2021, 36( 21): 5177–5186
[31]
Yang Y, Gao D, Xie X, Qin J, Li J, Lin H, Yan D, Deng K . DeepIDC: a prediction framework of injectable drug combination based on heterogeneous information and deep learning. Clinical Pharmacokinetics, 2022, 61( 12): 1749–1759
[32]
Zhang Z Y, Ning L, Ye X, Yang Y H, Futamura Y, Sakurai T, Lin H . iLoc-miRNA: extracellular/intracellular miRNA prediction using deep BiLSTM with attention mechanism. Briefings in Bioinformatics, 2022, 23( 5): bbac395
[33]
Liu B, Gao X, Zhang H . BioSeq-Analysis2. 0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches. Nucleic Acids Research, 2019, 47( 20): e127
[34]
Jiang H J, Huang Y A, You Z H . Predicting drug-disease associations via using gaussian interaction profile and kernel-based autoencoder. BioMed Research International, 2019, 2019: 2426958
[35]
Jiang H-J, You Z-H, Huang Y-A . Predicting drug−disease associations via sigmoid kernel-based convolutional neural networks. Journal of Translational Medicine, 2019, 17( 1): 382
[36]
Jiang H J, Huang Y A, You Z H . SAEROF: an ensemble approach for large-scale drug-disease association prediction by incorporating rotation forest and sparse autoencoder deep neural network. Scientific Reports, 2020, 10( 1): 4972
[37]
Xuan P, Gao L, Sheng N, Zhang T, Nakaguchi T . Graph convolutional autoencoder and fully-connected autoencoder with attention mechanism based method for predicting drug-disease associations. IEEE Journal of Biomedical and Health Informatics, 2021, 25( 5): 1793–1804
[38]
Bang D, Lim S, Lee S, Kim S . Biomedical knowledge graph learning for drug repurposing by extending guilt-by-association to multiple layers. Nature Communications, 2023, 14( 1): 3570
[39]
Ren Z-H, Yu C-Q, Li L-P, You Z-H, Li Z-W, Zhang S-W, Zeng X, Shang Y F . SiSGC: a drug repositioning prediction model based on heterogeneous simplifying graph convolution. Journal of Chemical Information and Modeling, 2024, 64( 1): 238–249
[40]
Wen J, Zhang X, Rush E, Panickan V A, Li X, Cai T, Zhou D, Ho Y L, Costa L, Begoli E, Hong C, Gaziano J M, Cho K, Lu J, Liao K P, Zitnik M, Cai T . Multimodal representation learning for predicting molecule−disease relations. Bioinformatics, 2023, 39( 2): btad085
[41]
Lei S, Lei X, Chen M, Pan Y . Drug repositioning based on deep sparse autoencoder and drug−disease similarity. Interdisciplinary Sciences: Computational Life Sciences, 2024, 16( 1): 160–175
[42]
Nepusz T, Yu H, Paccanaro A . Detecting overlapping protein complexes in protein-protein interaction networks. Nature Methods, 2012, 9( 5): 471–472
[43]
van Driel M A, Bruggeman J, Vriend G, Brunner H G, Leunissen J A M . A text-mining analysis of the human phenome. European Journal of Human Genetics, 2006, 14( 5): 535–542
[44]
Bro R, Smilde A K . Principal component analysis. Analytical Methods, 2014, 6( 9): 2812–2831
[45]
Luo H, Li M, Wang S, Liu Q, Li Y, Wang J . Computational drug repositioning using low-rank matrix approximation and randomized algorithms. Bioinformatics, 2018, 34( 11): 1904–1912
[46]
Jiang Y, Wang R, Feng J, Jin J, Liang S, Li Z, Yu Y, Ma A, Su R, Zou Q, Ma Q, Wei L . Explainable deep hypergraph learning modeling the peptide secondary structure prediction. Advanced Science, 2023, 10( 11): 2206151
[47]
Cui Z, Gao Y L, Liu J X, Wang J, Shang J L, Dai L Y . The computational prediction of drug-disease interactions using the dual-network L2,1-CMF method. BMC Bioinformatics, 2019, 20( 1): 5
[48]
Xie G, Li J, Gu G, Sun Y, Lin Z, Zhu Y, Zhu Y, Wang W . BGMSDDA: a bipartite graph diffusion algorithm with multiple similarity integration for drug-disease association prediction. Molecular Omics, 2021, 17( 6): 997–1011
[49]
Huang F, Qiu Y, Li Q, Liu S, Ni F . Predicting drug-disease associations via multi-task learning based on collective matrix factorization. Frontiers in Bioengineering and Biotechnology, 2020, 8: 218
[50]
Yang X, Zamit L, Liu Y, He J . Additional neural matrix factorization model for computational drug repositioning. BMC Bioinformatics, 2019, 20( 1): 423
[51]
Chang H-T, Lin C-Y, Jheng W-B, Chen S-H, Wu H-H, Tseng F-C, Wang L C . AI, please help me choose a course: building a personalized hybrid course recommendation system to assist students in choosing courses adaptively. Educational Technology & Society, 2023, 26( 1): 203–217
[52]
Wang H, Yu G. Persomalized recommendation system K-neighbor algorithm optimization. In: Proceedings of the 1st International Conference on Information Technologies in Education and Learning. 2016, 105−108
[53]
Lin Y, Zhang J, Lin M, Li J . Drug epositioning algorithm based on collaborative filtering. Journal of Nanjing University: Natural Sciences, 2015, 51( 4): 834–841
[54]
Xu E, Yu Z, Sun Z, Guo B, Yao L . Modeling within-basket auxiliary item recommendation with matchability and ubiquity. ACM Transactions on Intelligent Systems and Technology, 2023, 14( 3): 49
[55]
Colace F, Conte D, De Santo M, Lombardi M, Santaniello D, Valentino C . A content-based recommendation approach based on singular value decomposition. Connection Science, 2022, 34( 1): 2158–2176
[56]
Roy A, Banerjee S, Sarkar M, Darwish A, Elhoseny M, Hassanien A E . Exploring New Vista of intelligent collaborative filtering: a restaurant recommendation paradigm. Journal of Computational Science, 2018, 27: 168–182
[57]
Liu Y Q, Fang S Z, Wang L Y, Huan C, Wang R X . Neural graph collaborative filtering for privacy preservation based on federated transfer learning. The Electronic Library, 2022, 40( 6): 729–742
[58]
Cheng F, Liu C, Jiang J, Lu W, Li W, Liu G, Zhou W, Huang J, Tang Y . Prediction of drug-target interactions and drug repositioning via network-based inference. PLoS Computational Biology, 2012, 8( 5): e1002503
[59]
Zhang J, Li C D, Lin Y J, Shao Y W, Li S Z . Computational drug repositioning using collaborative filtering via multi-source fusion. Expert Systems with Applications, 2017, 84: 281–289
[60]
Jin S, Niu Z, Jiang C, Huang W, Xia F, Jin X, Liu X, Zeng X . HeTDR: drug repositioning based on heterogeneous networks and text mining. Patterns, 2021, 2( 8): 100307
[61]
Jarada T N, Rokne J G, Alhajj R . A review of computational drug repositioning: strategies, approaches, opportunities, challenges, and directions. Journal of Cheminformatics, 2020, 12( 1): 46
[62]
Li J, Zhu X, Chen J Y . Building disease-specific drug-protein connectivity maps from molecular interaction networks and pubmed abstracts. PLoS Computational Biology, 2009, 5( 7): e1000450
[63]
Tari L B, Patel J H. Systematic drug repurposing through text mining. In: Kumar V D, Tipney H J, eds. Biomedical Literature Mining. New York: Humana, 2014, 253−267
[64]
Rastegar-Mojarad M, Elayavilli R K, Li D, Prasad R, Liu H. A new method for prioritizing drug repositioning candidates extracted by literature-based discovery. In: Proceedings of 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015, 669−674
[65]
Brown A S, Patel C J . MeSHDD: literature-based drug-drug similarity for drug repositioning. Journal of the American Medical Informatics Association, 2017, 24( 3): 614–618
[66]
Pavlopoulos G A, Malliarakis D, Papanikolaou N, Theodosiou T, Enright A J, Iliopoulos I . Visualizing genome and systems biology: technologies, tools, implementation techniques and trends, past, present and future. Gigascience, 2015, 4: 38
[67]
Zeng X X, Zhu S Y, Liu X R, Zhou Y D, Nussinov R, Cheng F X . deepDR: a network-based deep learning approach to in silico drug repositioning. Bioinformatics, 2019, 35( 24): 5191–5198
[68]
Han P, Yang P, Zhao P, Shang S, Liu Y, Zhou J, Gao X, Kalnis P. GCN-MF: disease-gene association identification by graph convolutional networks and matrix factorization. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019, 705−713
[69]
Zhu Q, Tao C, Shen F, Chute C G. Exploring the pharmacogenomics knowledge base (PharmGKB) for repositioning breast cancer drugs by leveraging Web ontology language (OWL) and cheminformatics approaches. In: Proceedings of Pacific Symposium on Biocomputing 2014. 2014, 172−182
[70]
Chen B, Ding Y, Wild D J . Assessing drug target association using semantic linked data. PLoS Computational Biology, 2012, 8( 7): e1002574
[71]
Luo H, Wang J, Li M, Luo J, Peng X, Wu F-X, Pan Y . Drug repositioning based on comprehensive similarity measures and Bi-Random walk algorithm. Bioinformatics, 2016, 32( 17): 2664–2671
[72]
Wishart D S, Knox C, Guo A C, Cheng D, Shrivastava S, Tzur D, Gautam B, Hassanali M . DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Research, 2008, 36( S1): D901–D906
[73]
Hamosh A, Scott A F, Amberger J, Bocchini C, Valle D, McKusick V A . Online mendelian inheritance in man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Research, 2002, 30( 1): 52–55
[74]
Bolton E E, Wang Y, Thiessen P A, Bryant S H . PubChem: integrated platform of small molecules and biological activities. Annual Reports in Computational Chemistry, 2008, 4: 217–241
[75]
Sun Z, Huang Q, Yang Y, Li S, Lv H, Zhang Y, Lin H, Ning L . PSnoD: identifying potential snoRNA-disease associations based on bounded nuclear norm regularization. Briefings in Bioinformatics, 2022, 23( 4): bbac240
[76]
Yu J, Xuan Z, Feng X, Zou Q, Wang L . A novel collaborative filtering model for LncRNA-disease association prediction based on the Naïve Bayesian classifier. BMC Bioinformatics, 2019, 20( 1): 396
[77]
Nie R, Li Z, You Z-H, Bao W, Li J . Efficient framework for predicting MiRNA-disease associations based on improved hybrid collaborative filtering. BMC Medical Informatics and Decision Making, 2021, 21( S1): 254
[78]
Wang Y, Chen S, Deng N, Wang Y . Drug repositioning by kernel-based integration of molecular structure, molecular activity, and phenotype data. PLoS One, 2013, 8( 11): e78518
[79]
Gottlieb A, Stein G Y, Ruppin E, Sharan R . PREDICT: a method for inferring novel drug indications with application to personalized medicine. Molecular Systems Biology, 2011, 7: 496

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant Nos. 62072353, 62272065, and 62202081).

Competing interests

The authors declare that they have no competing intersts or financial conflicts to disclose.

RIGHTS & PERMISSIONS

2025 Higher Education Press
AI Summary AI Mindmap
PDF(1479 KB)

Accesses

Citations

Detail

Sections
Recommended

/