DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder

Pijing WEI, Qianqian WANG, Zhen GAO, Ruifen CAO, Chunhou ZHENG

PDF(1671 KB)
PDF(1671 KB)
Front. Comput. Sci. ›› 2024, Vol. 18 ›› Issue (6) : 186912. DOI: 10.1007/s11704-023-3610-y
Interdisciplinary
RESEARCH ARTICLE

DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder

Author information +
History +

Abstract

MicroRNAs (miRNAs) are closely related to numerous complex human diseases, therefore, exploring miRNA-disease associations (MDAs) can help people gain a better understanding of complex disease mechanism. An increasing number of computational methods have been developed to predict MDAs. However, the sparsity of the MDAs may hinder the performance of many methods. In addition, many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes. In this study, we propose a deep matrix factorization model with variational autoencoder (DMFVAE) to predict potential MDAs. DMFVAE first decomposes the original association matrix and the enhanced association matrix, in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method, to obtain sparse vectors and dense vectors, respectively. Then, the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors, and meanwhile, node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors. Finally, sample features are acquired by combining the latent vectors and network structure embedding vectors, and the final prediction is implemented by convolutional neural network with channel attention. To evaluate the performance of DMFVAE, we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well. Furthermore, case studies on lung neoplasms, colon neoplasms, and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.

Graphical abstract

Keywords

miRNA-disease association / deep matrix factorization / self-adjusted nearest neighbor / variational encoder / network structure

Cite this article

Download citation ▾
Pijing WEI, Qianqian WANG, Zhen GAO, Ruifen CAO, Chunhou ZHENG. DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder. Front. Comput. Sci., 2024, 18(6): 186912 https://doi.org/10.1007/s11704-023-3610-y

Pijing Wei received the PhD degree in computer science and technology from Anhui University, China in 2020. She is currently a lecturer in the Institute of Physical Science and Information Technology, Anhui University, China. Her main research interests include bioinformatics, synthetic biology, cancer data mining, and machine learning

Qianqian Wang received the BS degree in science from Anhui University of Science and Technology, China in 2021. She is currently pursuing the MS degree in the School of Computer Science and Technology, Anhui University, China. Her research interests include research of bioinformatics and deep learning

Zhen Gao received the MS degree in computer science from Qufu Normal University, China in 2021. She is currently working toward the PhD degree in the School of Computer Science and Technology, Anhui University, China. Her research interests include research of bioinformatics, deep learning and gene regulatory networks

Ruifen Cao received the PhD degree from Hefei Institute of Physical Sciences, Chinese Academy of Sciences, China in 2009. She is currently an associate professor at the School of Computer Science and Technology, Anhui University, China. Her research interests include artificial intelligence, medical image processing, and multimodal data fusion

Chunhou Zheng received the the PhD degree in pattern recognition and intelligent system in 2006, from University of Science and Technology of China. From February 2007 to June 2009, he worked as a Postdoctoral Fellow in the Hefei Institutes of Physical Sceience, Chinese Academy of Sciences, China. From July 2009 to July 2010, he worked as a Postdoctoral Fellow in the Department of Computing, the Hong Kong Polytechnic University, China. He is currently a Professor in the School of Artificial Intelligence, Anhui University, China. His research interests include pattern recognition, synthetic biology and bioinformatics

References

[1]
Gebert L F R, MacRae I J . Regulation of microRNA function in animals. Nature Reviews Molecular Cell Biology, 2019, 20( 1): 21–37
[2]
Van Meter E N, Onyango J A, Teske K A . A review of currently identified small molecule modulators of microRNA function. European Journal of Medicinal Chemistry, 2020, 188: 112008
[3]
Hammond S M . An overview of microRNAs. Advanced Drug Delivery Reviews, 2015, 87: 3–14
[4]
Patanè S . The complex miRNAs-p53 signaling network in cardiovascular disease. Journal of the American College of Cardiology, 2017, 69( 16): 2099–2100
[5]
Wang X, He Y, Mackowiak B, Gao B . MicroRNAs as regulators, biomarkers and therapeutic targets in liver diseases. Gut, 2021, 70( 4): 784–795
[6]
Niu Y W, Wang G H, Yan G Y, Chen X . Integrating random walk and binary regression to identify novel miRNA-disease association. BMC Bioinformatics, 2019, 20( 1): 59
[7]
Chen X, Xie D, Zhao Q, You Z H . MicroRNAs and complex diseases: from experimental results to computational models. Briefings in Bioinformatics, 2019, 20( 2): 515–539
[8]
Huang L, Zhang L, Chen X . Updated review of advances in microRNAs and complex diseases: taxonomy, trends and challenges of computational models. Briefings in Bioinformatics, 2022, 23( 5): bbac358
[9]
Huang L, Zhang L, Chen X . Updated review of advances in microRNAs and complex diseases: experimental results, databases, webservers and data fusion. Briefings in Bioinformatics, 2022, 23( 6): bbac397
[10]
Huang L, Zhang L, Chen X . Updated review of advances in microRNAs and complex diseases: towards systematic evaluation of computational models. Briefings in Bioinformatics, 2022, 23( 6): bbac407
[11]
Dai L Y, Liu J X, Zhu R, Wang J, Yuan S S . Logistic weighted profile-based bi-random walk for exploring miRNA-disease associations. Journal of Computer Science and Technology, 2021, 36( 2): 276–287
[12]
Xuan P, Wang D, Cui H, Zhang T, Nakaguchi T . Integration of pairwise neighbor topologies and miRNA family and cluster attributes for miRNA−disease association prediction. Briefings in Bioinformatics, 2022, 23( 1): bbab428
[13]
Xu J, Zhu W, Cai L, Liao B, Meng Y, Xiang J, Yuan D, Tian G, Yang J . LRMCMDA: predicting miRNA-disease association by integrating low-rank matrix completion with miRNA and disease similarity information. IEEE Access, 2020, 8: 80728–80738
[14]
Chen X, Yin J, Qu J, Huang L . MDHGI: matrix decomposition and heterogeneous graph inference for miRNA-disease association prediction. PLoS Computational Biology, 2018, 14( 8): e1006418
[15]
Chen X, Sun L G, Zhao Y . NCMCMDA: miRNA–disease association prediction through neighborhood constraint matrix completion. Briefings in Bioinformatics, 2021, 22( 1): 485–496
[16]
Lu X, Li J, Zhu Z, Yuan Y, Chen G, He K . Predicting miRNA-disease associations via combining probability matrix feature decomposition with neighbor learning. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022, 19( 6): 3160–3170
[17]
Zhang Y, Lei X, Fang Z, Pan Y . CircRNA-disease associations prediction based on metapath2vec++ and matrix factorization. Big Data Mining and Analytics, 2020, 3( 4): 280–291
[18]
Xuan P, Sun H, Wang X, Zhang T, Pan S . Inferring the disease-associated miRNAs based on network representation learning and convolutional neural networks. International Journal of Molecular Sciences, 2019, 20( 15): 3648
[19]
Li Z, Li J, Nie R, You Z H, Bao W . A graph auto-encoder model for miRNA-disease associations prediction. Briefings in Bioinformatics, 2021, 22( 4): bbaa240
[20]
Ding Y, Tian L P, Lei X, Liao B, Wu F X . Variational graph auto-encoders for miRNA-disease association prediction. Methods, 2021, 192: 25–34
[21]
Liu W, Lin H, Huang L, Peng L, Tang T, Zhao Q, Yang L . Identification of miRNA–disease associations via deep forest ensemble learning based on autoencoder. Briefings in Bioinformatics, 2022, 23( 3): bbac104
[22]
Zeng M, Lu C, Fei Z, Wu F X, Li Y, Wang J, Li M . DMFLDA: a deep learning framework for predicting lncRNA−disease associations. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2021, 18( 6): 2353–2363
[23]
Lu C, Zeng M, Zhang F, Wu F X, Li M, Wang J . Deep matrix factorization improves prediction of human circRNA-disease associations. IEEE Journal of Biomedical and Health Informatics, 2021, 25( 3): 891–899
[24]
Liu D, Huang Y, Nie W, Zhang J, Deng L . SMALF: miRNA-disease associations prediction based on stacked autoencoder and XGBoost. BMC Bioinformatics, 2021, 22( 1): 219
[25]
Li J, Chen X, Huang Q, Wang Y, Xie Y, Dai Z, Zou X, Li Z . Seq-SymRF: a random forest model predicts potential miRNA-disease associations based on information of sequences and clinical symptoms. Scientific Reports, 2020, 10( 1): 17901
[26]
Li J, Li Z, Nie R, You Z, Bao W . FCGCNMDA: predicting miRNA-disease associations by applying fully connected graph convolutional networks. Molecular Genetics and Genomics, 2020, 295( 5): 1197–1209
[27]
Chu Y, Wang X, Dai Q, Wang Y, Wang Q, Peng S, Wei X, Qiu J, Salahub D R, Xiong Y, Wei D Q . MDA-GCNFTG: identifying miRNA-disease associations based on graph convolutional networks via graph sampling through the feature and topology graph. Briefings in Bioinformatics, 2021, 22( 6): bbab165
[28]
Zhang L, Chen X, Yin J . Prediction of potential miRNA−disease associations through a novel unsupervised deep learning framework with variational autoencoder. Cells, 2019, 8( 9): 1040
[29]
Dai Q, Wang Z, Liu Z, Duan X, Song J, Guo M . Predicting miRNA-disease associations using an ensemble learning framework with resampling method. Briefings in Bioinformatics, 2022, 23( 1): bbab543
[30]
Zhong T, Li Z, You Z H, Nie R, Zhao H . Predicting miRNA−disease associations based on graph random propagation network and attention network. Briefings in Bioinformatics, 2022, 23( 2): bbab589
[31]
Zhou S, Wang S, Wu Q, Azim R, Li W . Predicting potential miRNA-disease associations by combining gradient boosting decision tree with logistic regression. Computational Biology and Chemistry, 2020, 85: 107200
[32]
Jin C, Shi Z, Lin K, Zhang H . Predicting miRNA-disease association based on neural inductive matrix completion with graph autoencoders and self-attention mechanism. Biomolecules, 2022, 12( 1): 64
[33]
Zhao Y, Chen X, Yin J . Adaptive boosting-based computational model for predicting potential miRNA-disease associations. Bioinformatics, 2019, 35( 22): 4730–4738
[34]
Ding Y, Lei X, Liao B, Wu F X . Predicting miRNA-disease associations based on multi-view variational graph auto-encoder with matrix factorization. IEEE Journal of Biomedical and Health Informatics, 2022, 26( 1): 446–457
[35]
Ding Y, Lei X, Liao B, Wu F X . MLRDFM: a multi-view Laplacian regularized DeepFM model for predicting miRNA-disease associations. Briefings in Bioinformatics, 2022, 23( 3): bbac079
[36]
Yang Z, Ren F, Liu C, He S, Sun G, Gao Q, Yao L, Zhang Y, Miao R, Cao Y, Zhao Y, Zhong Y, Zhao H . dbDEMC: a database of differentially expressed miRNAs in human cancers. BMC Genomics, 2010, 11( S4): S5
[37]
Jiang Q, Wang Y, Hao Y, Juan L, Teng M, Zhang X, Li M, Wang G, Liu Y . miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Research, 2009, 37: D98–D104
[38]
Seijo L M, Zulueta J J . Understanding the links between lung cancer, COPD, and emphysema: a key to more effective treatment and screening. Oncology, 2017, 31( 2): 93–102
[39]
Hamamoto J, Soejima K, Yoda S, Naoki K, Nakayama S, Satomi R, Terai H, Ikemura S, Sato T, Yasuda H, Hayashi Y, Sakamoto M, Takebayashi T, Betsuyaku T . Identification of microRNAs differentially expressed between lung squamous cell carcinoma and lung adenocarcinoma. Molecular Medicine Reports, 2013, 8( 2): 456–462
[40]
Ciuleanu T, Brodowicz T, Zielinski C, Kim J H, Krzakowski M, Laack E, Wu Y L, Bover I, Begbie S, Tzekova V, Cucevic B, Pereira J R, Yang S H, Madhavan J, Sugarman K P, Peterson P, John W J, Krejcy K, Belani C P . Maintenance pemetrexed plus best supportive care versus placebo plus best supportive care for non-small-cell lung cancer: a randomised, double-blind, phase 3 study. The Lancet, 2009, 374( 9699): 1432–1440
[41]
Schabath M B, Cote M L . Cancer progress and priorities: lung cancer. Cancer Epidemiology, Biomarkers & Prevention, 2019, 28( 10): 1563–1579
[42]
Cappell M S . Pathophysiology, clinical presentation, and management of colon cancer. Gastroenterology Clinics of North America, 2008, 37( 1): 1–24
[43]
Aslam M I, Taylor K, Pringle J H, Jameson J S . MicroRNAs are novel biomarkers of colorectal cancer. British Journal of Surgery, 2009, 96( 7): 702–710
[44]
Yamada A, Horimatsu T, Okugawa Y, Nishida N, Honjo H, Ida H, Kou T, Kusaka T, Sasaki Y, Yagi M, Higurashi T, Yukawa N, Amanuma Y, Kikuchi O, Muto M, Ueno Y, Nakajima A, Chiba T, Boland C R, Goel A . Serum miR-21, miR-29a, and miR-125b are promising biomarkers for the early detection of colorectal neoplasia. Clinical Cancer Research, 2015, 21( 18): 4234–4242
[45]
El-Serag H B, Sweet S, Winchester C C, Dent J . Update on the epidemiology of gastro-oesophageal reflux disease: a systematic review. Gut, 2014, 63( 6): 871–880
[46]
Sohda M, Kuwano H . Current status and future prospects for esophageal cancer treatment. Annals of Thoracic and Cardiovascular Surgery, 2017, 23( 1): 1–11
[47]
Gramantieri L, Ferracin M, Fornari F, Veronese A, Sabbioni S, Liu C G, Calin G A, Giovannini C, Ferrazzi E, Grazi G L, Croce C M, Bolondi L, Negrini M . Cyclin G1 is a target of miR-122a, a microRNA frequently down-regulated in human hepatocellular carcinoma. Cancer Research, 2007, 67( 13): 6092–6099
[48]
Li Y, Qiu C, Tu J, Geng B, Yang J, Jiang T, Cui Q . HMDD v2.0: a database for experimentally supported human microRNA and disease associations. Nucleic Acids Research, 2014, 42( D1): D1070–D1074
[49]
Huang Z, Shi J, Gao Y, Cui C, Zhang S, Li J, Zhou Y, Cui Q . HMDD v3.0: a database for experimentally supported human microRNA−disease associations. Nucleic Acids Research, 2019, 47( D1): D1013–D1017
[50]
Xuan P, Han K, Guo M, Guo Y, Li J, Ding J, Liu Y, Dai Q, Li J, Teng Z, Huang Y . Prediction of microRNAs associated with human diseases based on weighted k most similar neighbors. PLoS One, 2013, 8( 8): e70204
[51]
Wang D, Wang J, Lu M, Song F, Cui Q . Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases. Bioinformatics, 2010, 26( 13): 1644–1650
[52]
Pasquier C, Gardès J . Prediction of miRNA-disease associations with a vector space model. Scientific Reports, 2016, 6: 27036
[53]
Zhang Z W, Gao Z, Zheng C H, Wang Y T, Qi S M. MELPMDA: a new method based on matrix enhancement and label propagation for predicting miRNA-disease association. In: Proceedings of the 17th International Conference on Intelligent Computing Theories and Application. 2021, 536−548
[54]
Xie F, Yang Z, Song J, Dai Q, Duan X . DHNLDA: a novel deep hierarchical network based method for predicting lncRNA-disease associations. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022, 19( 6): 3395–3403
[55]
Dhillon A, Verma G K . Convolutional neural network: a review of models, methodologies and applications to object detection. Progress in Artificial Intelligence, 2020, 9( 2): 85–112
[56]
Tang X, Luo J, Shen C, Lai Z . Multi-view multichannel attention graph convolutional network for miRNA−disease association prediction. Briefings in Bioinformatics, 2021, 22( 6): bbab174

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant Nos. 62202004, and 62322301), the Natural Science Foundation of Anhui Province (No. 2108085QF267), the University Synergy Innovation Program of Anhui Province (No. GXXT-2021-039), and the Anhui University Outstanding Youth Research Project (No. 2022AH020010).

Supplementary document

Our supplementary document consists of three parts. The first part is the Supplementary Figures related to the experiment, the second part is the Supplementary Tables related to the experiment, and the third part is the Supplementary Formulas related to the experimental results and some methods. The supporting information is available online at journal.hep.com.cn and link.springer.com.

Competing interests

The authors declare that they have no competing interests or financial conflicts to disclose.

Open Access

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
To view a copy of this licence, visit creativecommons.org/licenses/by/4.0/

RIGHTS & PERMISSIONS

2024 The Author(s) 2024. This article is published with open access at link.springer.com and journal.hep.com.cn
AI Summary AI Mindmap
PDF(1671 KB)

Accesses

Citations

Detail

Sections
Recommended

/