DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder
Pijing WEI, Qianqian WANG, Zhen GAO, Ruifen CAO, Chunhou ZHENG
DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder
MicroRNAs (miRNAs) are closely related to numerous complex human diseases, therefore, exploring miRNA-disease associations (MDAs) can help people gain a better understanding of complex disease mechanism. An increasing number of computational methods have been developed to predict MDAs. However, the sparsity of the MDAs may hinder the performance of many methods. In addition, many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes. In this study, we propose a deep matrix factorization model with variational autoencoder (DMFVAE) to predict potential MDAs. DMFVAE first decomposes the original association matrix and the enhanced association matrix, in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method, to obtain sparse vectors and dense vectors, respectively. Then, the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors, and meanwhile, node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors. Finally, sample features are acquired by combining the latent vectors and network structure embedding vectors, and the final prediction is implemented by convolutional neural network with channel attention. To evaluate the performance of DMFVAE, we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well. Furthermore, case studies on lung neoplasms, colon neoplasms, and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.
miRNA-disease association / deep matrix factorization / self-adjusted nearest neighbor / variational encoder / network structure
Pijing Wei received the PhD degree in computer science and technology from Anhui University, China in 2020. She is currently a lecturer in the Institute of Physical Science and Information Technology, Anhui University, China. Her main research interests include bioinformatics, synthetic biology, cancer data mining, and machine learning
Qianqian Wang received the BS degree in science from Anhui University of Science and Technology, China in 2021. She is currently pursuing the MS degree in the School of Computer Science and Technology, Anhui University, China. Her research interests include research of bioinformatics and deep learning
Zhen Gao received the MS degree in computer science from Qufu Normal University, China in 2021. She is currently working toward the PhD degree in the School of Computer Science and Technology, Anhui University, China. Her research interests include research of bioinformatics, deep learning and gene regulatory networks
Ruifen Cao received the PhD degree from Hefei Institute of Physical Sciences, Chinese Academy of Sciences, China in 2009. She is currently an associate professor at the School of Computer Science and Technology, Anhui University, China. Her research interests include artificial intelligence, medical image processing, and multimodal data fusion
Chunhou Zheng received the the PhD degree in pattern recognition and intelligent system in 2006, from University of Science and Technology of China. From February 2007 to June 2009, he worked as a Postdoctoral Fellow in the Hefei Institutes of Physical Sceience, Chinese Academy of Sciences, China. From July 2009 to July 2010, he worked as a Postdoctoral Fellow in the Department of Computing, the Hong Kong Polytechnic University, China. He is currently a Professor in the School of Artificial Intelligence, Anhui University, China. His research interests include pattern recognition, synthetic biology and bioinformatics
[1] |
Gebert L F R, MacRae I J . Regulation of microRNA function in animals. Nature Reviews Molecular Cell Biology, 2019, 20( 1): 21–37
|
[2] |
Van Meter E N, Onyango J A, Teske K A . A review of currently identified small molecule modulators of microRNA function. European Journal of Medicinal Chemistry, 2020, 188: 112008
|
[3] |
Hammond S M . An overview of microRNAs. Advanced Drug Delivery Reviews, 2015, 87: 3–14
|
[4] |
Patanè S . The complex miRNAs-p53 signaling network in cardiovascular disease. Journal of the American College of Cardiology, 2017, 69( 16): 2099–2100
|
[5] |
Wang X, He Y, Mackowiak B, Gao B . MicroRNAs as regulators, biomarkers and therapeutic targets in liver diseases. Gut, 2021, 70( 4): 784–795
|
[6] |
Niu Y W, Wang G H, Yan G Y, Chen X . Integrating random walk and binary regression to identify novel miRNA-disease association. BMC Bioinformatics, 2019, 20( 1): 59
|
[7] |
Chen X, Xie D, Zhao Q, You Z H . MicroRNAs and complex diseases: from experimental results to computational models. Briefings in Bioinformatics, 2019, 20( 2): 515–539
|
[8] |
Huang L, Zhang L, Chen X . Updated review of advances in microRNAs and complex diseases: taxonomy, trends and challenges of computational models. Briefings in Bioinformatics, 2022, 23( 5): bbac358
|
[9] |
Huang L, Zhang L, Chen X . Updated review of advances in microRNAs and complex diseases: experimental results, databases, webservers and data fusion. Briefings in Bioinformatics, 2022, 23( 6): bbac397
|
[10] |
Huang L, Zhang L, Chen X . Updated review of advances in microRNAs and complex diseases: towards systematic evaluation of computational models. Briefings in Bioinformatics, 2022, 23( 6): bbac407
|
[11] |
Dai L Y, Liu J X, Zhu R, Wang J, Yuan S S . Logistic weighted profile-based bi-random walk for exploring miRNA-disease associations. Journal of Computer Science and Technology, 2021, 36( 2): 276–287
|
[12] |
Xuan P, Wang D, Cui H, Zhang T, Nakaguchi T . Integration of pairwise neighbor topologies and miRNA family and cluster attributes for miRNA−disease association prediction. Briefings in Bioinformatics, 2022, 23( 1): bbab428
|
[13] |
Xu J, Zhu W, Cai L, Liao B, Meng Y, Xiang J, Yuan D, Tian G, Yang J . LRMCMDA: predicting miRNA-disease association by integrating low-rank matrix completion with miRNA and disease similarity information. IEEE Access, 2020, 8: 80728–80738
|
[14] |
Chen X, Yin J, Qu J, Huang L . MDHGI: matrix decomposition and heterogeneous graph inference for miRNA-disease association prediction. PLoS Computational Biology, 2018, 14( 8): e1006418
|
[15] |
Chen X, Sun L G, Zhao Y . NCMCMDA: miRNA–disease association prediction through neighborhood constraint matrix completion. Briefings in Bioinformatics, 2021, 22( 1): 485–496
|
[16] |
Lu X, Li J, Zhu Z, Yuan Y, Chen G, He K . Predicting miRNA-disease associations via combining probability matrix feature decomposition with neighbor learning. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022, 19( 6): 3160–3170
|
[17] |
Zhang Y, Lei X, Fang Z, Pan Y . CircRNA-disease associations prediction based on metapath2vec++ and matrix factorization. Big Data Mining and Analytics, 2020, 3( 4): 280–291
|
[18] |
Xuan P, Sun H, Wang X, Zhang T, Pan S . Inferring the disease-associated miRNAs based on network representation learning and convolutional neural networks. International Journal of Molecular Sciences, 2019, 20( 15): 3648
|
[19] |
Li Z, Li J, Nie R, You Z H, Bao W . A graph auto-encoder model for miRNA-disease associations prediction. Briefings in Bioinformatics, 2021, 22( 4): bbaa240
|
[20] |
Ding Y, Tian L P, Lei X, Liao B, Wu F X . Variational graph auto-encoders for miRNA-disease association prediction. Methods, 2021, 192: 25–34
|
[21] |
Liu W, Lin H, Huang L, Peng L, Tang T, Zhao Q, Yang L . Identification of miRNA–disease associations via deep forest ensemble learning based on autoencoder. Briefings in Bioinformatics, 2022, 23( 3): bbac104
|
[22] |
Zeng M, Lu C, Fei Z, Wu F X, Li Y, Wang J, Li M . DMFLDA: a deep learning framework for predicting lncRNA−disease associations. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2021, 18( 6): 2353–2363
|
[23] |
Lu C, Zeng M, Zhang F, Wu F X, Li M, Wang J . Deep matrix factorization improves prediction of human circRNA-disease associations. IEEE Journal of Biomedical and Health Informatics, 2021, 25( 3): 891–899
|
[24] |
Liu D, Huang Y, Nie W, Zhang J, Deng L . SMALF: miRNA-disease associations prediction based on stacked autoencoder and XGBoost. BMC Bioinformatics, 2021, 22( 1): 219
|
[25] |
Li J, Chen X, Huang Q, Wang Y, Xie Y, Dai Z, Zou X, Li Z . Seq-SymRF: a random forest model predicts potential miRNA-disease associations based on information of sequences and clinical symptoms. Scientific Reports, 2020, 10( 1): 17901
|
[26] |
Li J, Li Z, Nie R, You Z, Bao W . FCGCNMDA: predicting miRNA-disease associations by applying fully connected graph convolutional networks. Molecular Genetics and Genomics, 2020, 295( 5): 1197–1209
|
[27] |
Chu Y, Wang X, Dai Q, Wang Y, Wang Q, Peng S, Wei X, Qiu J, Salahub D R, Xiong Y, Wei D Q . MDA-GCNFTG: identifying miRNA-disease associations based on graph convolutional networks via graph sampling through the feature and topology graph. Briefings in Bioinformatics, 2021, 22( 6): bbab165
|
[28] |
Zhang L, Chen X, Yin J . Prediction of potential miRNA−disease associations through a novel unsupervised deep learning framework with variational autoencoder. Cells, 2019, 8( 9): 1040
|
[29] |
Dai Q, Wang Z, Liu Z, Duan X, Song J, Guo M . Predicting miRNA-disease associations using an ensemble learning framework with resampling method. Briefings in Bioinformatics, 2022, 23( 1): bbab543
|
[30] |
Zhong T, Li Z, You Z H, Nie R, Zhao H . Predicting miRNA−disease associations based on graph random propagation network and attention network. Briefings in Bioinformatics, 2022, 23( 2): bbab589
|
[31] |
Zhou S, Wang S, Wu Q, Azim R, Li W . Predicting potential miRNA-disease associations by combining gradient boosting decision tree with logistic regression. Computational Biology and Chemistry, 2020, 85: 107200
|
[32] |
Jin C, Shi Z, Lin K, Zhang H . Predicting miRNA-disease association based on neural inductive matrix completion with graph autoencoders and self-attention mechanism. Biomolecules, 2022, 12( 1): 64
|
[33] |
Zhao Y, Chen X, Yin J . Adaptive boosting-based computational model for predicting potential miRNA-disease associations. Bioinformatics, 2019, 35( 22): 4730–4738
|
[34] |
Ding Y, Lei X, Liao B, Wu F X . Predicting miRNA-disease associations based on multi-view variational graph auto-encoder with matrix factorization. IEEE Journal of Biomedical and Health Informatics, 2022, 26( 1): 446–457
|
[35] |
Ding Y, Lei X, Liao B, Wu F X . MLRDFM: a multi-view Laplacian regularized DeepFM model for predicting miRNA-disease associations. Briefings in Bioinformatics, 2022, 23( 3): bbac079
|
[36] |
Yang Z, Ren F, Liu C, He S, Sun G, Gao Q, Yao L, Zhang Y, Miao R, Cao Y, Zhao Y, Zhong Y, Zhao H . dbDEMC: a database of differentially expressed miRNAs in human cancers. BMC Genomics, 2010, 11( S4): S5
|
[37] |
Jiang Q, Wang Y, Hao Y, Juan L, Teng M, Zhang X, Li M, Wang G, Liu Y . miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Research, 2009, 37: D98–D104
|
[38] |
Seijo L M, Zulueta J J . Understanding the links between lung cancer, COPD, and emphysema: a key to more effective treatment and screening. Oncology, 2017, 31( 2): 93–102
|
[39] |
Hamamoto J, Soejima K, Yoda S, Naoki K, Nakayama S, Satomi R, Terai H, Ikemura S, Sato T, Yasuda H, Hayashi Y, Sakamoto M, Takebayashi T, Betsuyaku T . Identification of microRNAs differentially expressed between lung squamous cell carcinoma and lung adenocarcinoma. Molecular Medicine Reports, 2013, 8( 2): 456–462
|
[40] |
Ciuleanu T, Brodowicz T, Zielinski C, Kim J H, Krzakowski M, Laack E, Wu Y L, Bover I, Begbie S, Tzekova V, Cucevic B, Pereira J R, Yang S H, Madhavan J, Sugarman K P, Peterson P, John W J, Krejcy K, Belani C P . Maintenance pemetrexed plus best supportive care versus placebo plus best supportive care for non-small-cell lung cancer: a randomised, double-blind, phase 3 study. The Lancet, 2009, 374( 9699): 1432–1440
|
[41] |
Schabath M B, Cote M L . Cancer progress and priorities: lung cancer. Cancer Epidemiology, Biomarkers & Prevention, 2019, 28( 10): 1563–1579
|
[42] |
Cappell M S . Pathophysiology, clinical presentation, and management of colon cancer. Gastroenterology Clinics of North America, 2008, 37( 1): 1–24
|
[43] |
Aslam M I, Taylor K, Pringle J H, Jameson J S . MicroRNAs are novel biomarkers of colorectal cancer. British Journal of Surgery, 2009, 96( 7): 702–710
|
[44] |
Yamada A, Horimatsu T, Okugawa Y, Nishida N, Honjo H, Ida H, Kou T, Kusaka T, Sasaki Y, Yagi M, Higurashi T, Yukawa N, Amanuma Y, Kikuchi O, Muto M, Ueno Y, Nakajima A, Chiba T, Boland C R, Goel A . Serum miR-21, miR-29a, and miR-125b are promising biomarkers for the early detection of colorectal neoplasia. Clinical Cancer Research, 2015, 21( 18): 4234–4242
|
[45] |
El-Serag H B, Sweet S, Winchester C C, Dent J . Update on the epidemiology of gastro-oesophageal reflux disease: a systematic review. Gut, 2014, 63( 6): 871–880
|
[46] |
Sohda M, Kuwano H . Current status and future prospects for esophageal cancer treatment. Annals of Thoracic and Cardiovascular Surgery, 2017, 23( 1): 1–11
|
[47] |
Gramantieri L, Ferracin M, Fornari F, Veronese A, Sabbioni S, Liu C G, Calin G A, Giovannini C, Ferrazzi E, Grazi G L, Croce C M, Bolondi L, Negrini M . Cyclin G1 is a target of miR-122a, a microRNA frequently down-regulated in human hepatocellular carcinoma. Cancer Research, 2007, 67( 13): 6092–6099
|
[48] |
Li Y, Qiu C, Tu J, Geng B, Yang J, Jiang T, Cui Q . HMDD v2.0: a database for experimentally supported human microRNA and disease associations. Nucleic Acids Research, 2014, 42( D1): D1070–D1074
|
[49] |
Huang Z, Shi J, Gao Y, Cui C, Zhang S, Li J, Zhou Y, Cui Q . HMDD v3.0: a database for experimentally supported human microRNA−disease associations. Nucleic Acids Research, 2019, 47( D1): D1013–D1017
|
[50] |
Xuan P, Han K, Guo M, Guo Y, Li J, Ding J, Liu Y, Dai Q, Li J, Teng Z, Huang Y . Prediction of microRNAs associated with human diseases based on weighted k most similar neighbors. PLoS One, 2013, 8( 8): e70204
|
[51] |
Wang D, Wang J, Lu M, Song F, Cui Q . Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases. Bioinformatics, 2010, 26( 13): 1644–1650
|
[52] |
Pasquier C, Gardès J . Prediction of miRNA-disease associations with a vector space model. Scientific Reports, 2016, 6: 27036
|
[53] |
Zhang Z W, Gao Z, Zheng C H, Wang Y T, Qi S M. MELPMDA: a new method based on matrix enhancement and label propagation for predicting miRNA-disease association. In: Proceedings of the 17th International Conference on Intelligent Computing Theories and Application. 2021, 536−548
|
[54] |
Xie F, Yang Z, Song J, Dai Q, Duan X . DHNLDA: a novel deep hierarchical network based method for predicting lncRNA-disease associations. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022, 19( 6): 3395–3403
|
[55] |
Dhillon A, Verma G K . Convolutional neural network: a review of models, methodologies and applications to object detection. Progress in Artificial Intelligence, 2020, 9( 2): 85–112
|
[56] |
Tang X, Luo J, Shen C, Lai Z . Multi-view multichannel attention graph convolutional network for miRNA−disease association prediction. Briefings in Bioinformatics, 2021, 22( 6): bbab174
|
/
〈 | 〉 |