Identifying miRNA-disease association based on integrating miRNA topological similarity and functional similarity

Qingfeng Chen , Zhao Zhe , Wei Lan , Ruchang Zhang , Zhiqiang Wang , Cheng Luo , Yi-Ping Pheobe Chen

Quant. Biol. ›› 2019, Vol. 7 ›› Issue (3) : 202 -209.

PDF (395KB)
Quant. Biol. ›› 2019, Vol. 7 ›› Issue (3) : 202 -209. DOI: 10.1007/s40484-019-0176-7
RESEARCH ARTICLE
RESEARCH ARTICLE

Identifying miRNA-disease association based on integrating miRNA topological similarity and functional similarity

Author information +
History +
PDF (395KB)

Abstract

Background: MicroRNAs (miRNAs) are a significant type of non-coding RNAs, which usually were encoded by endogenous genes with about ~22 nt nucleotides. Accumulating biological experiments have shown that miRNAs have close associations with various human diseases. Although traditional experimental methods achieve great successes in miRNA-disease interaction identification, these methods also have some limitations. Therefore, it is necessary to develop computational method to predict miRNA-disease interactions.

Methods: Here, we propose a computational framework (MDVSI) to predict interactions between miRNAs and diseases by integrating miRNA topological similarity and functional similarity. Firstly, the CosRA index is utilized to measure miRNA similarity based on network topological feature. Then, in order to enhance the reliability of miRNA similarity, the functional similarity and CosRA similarity are integrated based on linear weight method. Further, the potential miRNA-disease associations are predicted by using recommendation method. In addition, in order to overcome limitation of recommendation method, for new disease, a new strategy is proposed to predict potential interactions between miRNAs and new disease based on disease functional similarity.

Results: To evaluate the performance of different methods, we conduct ten-fold cross validation and de novo test in experiment and compare MDVSI with two the-state-of-art methods. The experimental result shows that MDVSI achieves an AUC of 0.91, which is at least 0.012 higher than other compared methods.

Conclusions: In summary, we propose a computational framework (MDSVI) for miRNA-disease interaction prediction. The experiment results demonstrate that it outperforms other the-state-of-the-art methods. Case study shows that it can effectively identify potential miRNA-disease interactions.

Graphical abstract

Keywords

miRNA-disease association / CosRA index / miRNA functional similarity / recommendation method

Cite this article

Download citation ▾
Qingfeng Chen, Zhao Zhe, Wei Lan, Ruchang Zhang, Zhiqiang Wang, Cheng Luo, Yi-Ping Pheobe Chen. Identifying miRNA-disease association based on integrating miRNA topological similarity and functional similarity. Quant. Biol., 2019, 7(3): 202-209 DOI:10.1007/s40484-019-0176-7

登录浏览全文

4963

注册一个新账户 忘记密码

INTRODUCTION

MiRNAs are a class of non-coding small RNAs with 20 to 25 nucleotides in length, which play important roles in various biological processes such as cell proliferation, cell differentiation, cell aging, cell development [1]. The biogenesis of miRNA is like that: firstly, the primary miRNA (pri-miRNA) is generally transcribed from introns (or host gene) by RNA polymerase II. Then, the pri-miRNA will be processed into the precursor miRNA (pre-miRNA) with about 60 nucleotides in length by Drosha nuclease [2]. Further, the pre-miRNA is transported to the cytoplasm by Exportin-5. Finally, the mature miRNA, which can target the mRNA to regulate the expression of gene, is formed from pre-miRNA with RNase III enzyme. During the past decades, increasing studies have shown that miRNAs have close relationships with various human diseases such as breast cancer [3], liver cancer [4], and gastric carcinoma [5]. Therefore, identifying associations between miRNAs and diseases may contribute to uncover the pathogenesis of human diseases.

Since the first discovery of miRNA (lin-4) about 30 years ago, many miRNAs have been identified based on experimental methods [6]. However, these methods are time consuming and expensive. In order to overcome these limitations, some computational methods have been proposed to predict miRNA-disease associations [711]. Jiang et al. [12] proposed a computational model based on hypergeometric distribution to predict miRNA-disease association by integrating the miRNA functional similarity, disease phenotype similarity network and known miRNA-disease associations. Xuan et al. [13] presented a computational method named HDMP, to discover the relationships of miRNAs and diseases based on weighted k most similar neighbors. This method calculated the functional similarity by integrating the similarity of information content of disease terms and phenotype similarity between diseases. Mørk et al. [14] developed an approach called miRPD to predict disease-related miRNAs. Chen et al. [15] introduced regularized least squares to discover the potential relationships between miRNAs and diseases. Peng et al. [16] presented a new method to predict miRNA-disease association by integrating microRNA, disease, gene and environment factor networks. Yan et al. [17] proposed a computational approach based on logistic matrix factorization to identify interactions between miRNAs and diseases. Luo et al. [18] developed a computational approach based on unbalanced bi-random walk for predicting microRNA-disease associations. Lan et al. [19] utilized kernelized Bayesian matrix factorization method to infer potential miRNA-disease associations by computing sequence and functional similarity of miRNA, semantic similarity of disease. Liu et al. [20] presented a computational method to predict miRNA-disease interactions based on random walk. Zou et al. [21] proposed two methods, KATZ and CATAPULT, to identify potential miRNA-disease associations based on known similarity network analysis. Although all these existing methods have achieved great successes, the predicting accuracy still needs to be improved.

In this paper, we propose a computational framework to predict associations between miRNAs and diseases based on recommendation method. Firstly, the miRNA-miRNA similarity is calculated based on miRNA-disease topological feature and miRNA functional information. The disease-disease similarity is calculated based on disease-gene association. Then, the miRNA similarities are integrated to improve the reliability of miRNA similarity. Further, the potential miRNA-disease associations are predicted by using recommendation method. In addition, to over the limitation of existing recommendation method, a method is proposed for identifying interactions between miRNAs and new disease. In order to evaluate the performance of our method, the ten-fold cross validation and de novo test are implemented in experiment. We compare our method with other two the-state-of-the-art methods in term of AUC. The experimental results demonstrate that our method outperform other methods in prediction performance. The case study shows that our method can effectively identify potential miRNA-disease associations.

RESULTS

Evaluation

In this paper, ten-fold cross-validation and de novo test are utilized to evaluate the performance of our method. In the ten-fold cross-validation, all known miRNA-disease interactions are randomly divided into ten folds. For each cross validation round, one subset is treated as test samples and the rest nine subsets are regarded as training samples. After completing the test, predicted scores are generated. Then, we rank test samples and unknown miRNA-disease interactions. The corresponding predicted result of test samples is considered as true positive (TP) when the predicted relevance score is greater than the threshold. Otherwise, considered as false negative (FN). Similar, for the unknown miRNA-disease interactions, the corresponding predicted result consider as false positive (FP) when the predicted relevance score is greater than the threshold. Otherwise, considered as true negative (TN). Then, FPR (false positive rates) and TPR (true positive rates) are calculated as follow:
T PR= TPTP+FN,
F PR= TNTN+FP.

Finally, the receiver operating characteristic (ROC) curve is drew and the area under the ROC curve (AUC) is calculated to evaluate the performance of method. The higher AUC value is, the better performance is.

Comparison with different predicted methods using ten-fold cross validation

We compare MDVSI with two the-state-of-the-art algorithms (LRLSLDA [15] and BRWH [18]). The result is shown in Figure 1. It can be observed from Figure 1 that MDVSI, LRLSLDA and BRWH achieve AUC of 0.9346, 0.9069 and 0.8393, respectively. It demonstrates that MDVSI outperforms other two methods in the prediction performance.

Comparison with different predicted methods using de novo test

To evaluate the ability of MDVSI in predicting interactions between miRNAs and new diseases, we conduct the de novo test here. In the de novo test, all the known miRNA-disease associations of one disease are removed. Then, the interactions are predicted by prediction method. It can be found in Figure 2 that MDVSI achieves AUC of 0.7468 which is 0.003 and 0.01 higher than LRLSLDA and BRWH, respectively. It demonstrates that MDVSI is an effective method for predicting interactions between new disease and miRNAs.

The effect of parameter β

We also test the effect of parameter β , which is used to balance the miRNA topological similarity and miRNA functional similarity in miRNA similarity integration process. The result is shown in the Table 1. It is easy to find that the MDVSI achieves the best performance in experimental result whenβ =0.5 .

The effect of parameter k

In order to test the effect of parameter k, which is used to selecte the predicted interactions between miRNAs and new disease. The result is shown in the Table 2. It can be concluded that our method achieves the best performance when k ranges from 17 to 19.

Case studies

In order to validate the reliability of miRNAs predicted by our MDVSI, we select two important human cancers (lung cancer and breast cancer) as two case studies. We choose the top 50 predicted miRNAs of breast cancer and lung cancer, respectively. Then, we confirm these miRNAs by consulting databases and recent literatures.

Lung cancer is a kind of high death rate cancer for human. It has been proved that miRNAs have important roles in lung cancer. For example, the expression of miRNAs (has-miR-205, has-miR-19a, has-miR-19b, has-miR-30b and has-miR-20a) decreased strikingly after lung SCC surgery [22]. The top-50 predicted miRNAs related with lung cancer are shown in Table 3. It can be approved that 46 out of top 50 on the list are verified in recent literatures. We also discover some interesting miRNAs, such as hsa-mir-127, hsa-mir-9 and hsa-mir-146b. It demonstrates that our algorithm can not only effectively predict known relationships, but also can predict the potential relationships between diseases and miRNAs.

Breast cancer accounts for three percent of human malignancies and the mortality rate continues to increase over the past years. It is the most common malignancies in women all over the world. Accumulating researches have shown that miRNAs play key roles in breast cancer. For example, some studies have found that the higher expression of mature miRNAs (hsa-miR-103, hsa-miR-21-5p, hsa-miR-141, hsa-miR-25, hsa-miR-30b, hsa-miR-30c, and hsa-let-7i) contribute to breast cancer survival [23]. As shown in Table 4, 43 out of top 50 are verified by consulting published literatures. The results demonstrate that MDVSI is a useful tool for identifying the potential disease-associated miRNAs.

CONCLUSIONS

Increasing experimental studies have indicated that miRNAs play an important role in various biological processes [24,25]. Therefore, it is important to effectively identify the relationship between miRNA and disease [26,27]. While existing experimental methods for miRNA-disease interaction identification are generally time-consuming and unable to apply to large scale datasets. Therefore, it is demanded to develop computational method for predicting miRNA-disease associations. In the paper, we propose a novel method (MDVSI) to infer relationships between miRNAs and diseases. Firstly, we utilize disease-gene interaction downloaded from HumanNet to calculate disease-disease functional similarity. Then, the miRNA topological similarity is calculated based on miRNA-disease network topological feature. Further, the miRNA functional similarity and miRNA topological similarity are integrated to improve the accuracy of prediction by using recommendation method. In addition, in order to overcome the limitation of recommendation method in predicting interaction of new disease, we propose a new method to predict interactions between miRNAs and new disease based on disease similarity. To evaluate the prediction performance, we compare our method with other two algorithms via using ten-fold cross-validation and de novo test. Experimental results show that our model has superior performance than other compared methods for predicting miRNA-disease associations.

MATERIALS AND METHODS

Datasets

The miRNA-disease association is downloaded from HMDD [28]. In the final, we select 271 miRNAs, 137 diseases and 1,395 miRNA-disease associations as gold dataset. The adjacency matrix A is constructed to represent the relationships between miRNAs and diseases, where A(i, j)=1 if miRNA i is associated with disease j, otherwise A(i ,j)= 0.

We extract disease functional similarity from HumanNet dataset [29], which contains 16,243 genes and 476,399 interactions. The assigned score for paired genes in HumanNet represent the gene functional similarity.

The miRNA functional similarity is downloaded from misim [30]. Then, the miRNA-miRNA functional similarity network is described as FM matrix based on the calculated score. The element of matrix FM(i,j) denotes the similarity score between miRNAs i and j.

Methods

We propose a computational framework, MDVSI, for miRNA-disease association identification. In the first step, we calculate disease functional similarity based on disease-gene interaction. Then, the miRNA functional similarity and miRNA topological similarity are calculated based on HumanNet and miRNA-disease network topological feature, respectively. Further, the miRNA functional similarity and miRNA topological similarity are integrated to enhance the reliability of miRNA similarity. Finally, the potential miRNA-disease associations are predicted by using recommendation method. In addition, to overcome the limitation of recommendation method for predicting interaction of new disease, we predict associations between new disease and miRNAs based on disease functional similarity. The flowchart of MDVSI is shown in Figure 3.

The disease-disease similarity

It has been proved that human disease has close relationships with gene. Therefore, based on the disease-gene association, we calculate the functional similarity of diseases. For the two genes gi andgj, the similarity between g i andg j is defined as follow:
ew( gi,gj)= { 1 cc(gi,g j)0 if gi=gj if gi gj, gi an d gj connected if gigj, gi and gj not connected ,
where ew(gi ,gj) denotes the similarity between gi andgj. c c(g i,gj)denotes log likelihood score between gi andgj which is calculated in HumanNet.

Given two gene sets G1={g11, g12,..., g1m} and G 2= {g21,g 22, ...,g 2n}associated with disease d1 and disease d2, respectively. Then, the score between gene g 2j and disease d1 is defined as follow:
GFd1(g 2j )=max Σi =1m( ew( g1i, g2j)),g1i G1,
GFd2(g1i )=max Σj =1n( ew( g1i,g 2j)) ,g2jG2 ,
In the same way, the G Fd2(g1i) can also be calculated.

Finally, the similarity of diseases d1 and d2 is defined as follow:
SD (d 1,d2)= Σj =1nG Fd 1(g2j)+ Σi=1mG Fd 2(g1i)m+n.

The miRNA-miRNA topological similarity

Considering that miRNA-disease interaction may reflect topological feature, miRNA topological similarity is calculated by using CosRA index [31]. The miRNA topological similarity between miRNA i and j is defined as follows:
T MCosRA(i,j )=1kikjΣx=1p aixajx kx,
where ki and kj denote the degree of miRNA i and j, respectively. kx denotes the degree of disease x. aixrepresents the interaction between miRNA i and disease x (a ix=1 if miRNA i is related with diseasex, otherwise aix=0). ajx denotes the interaction between miRNA j and disease x (a jx=1 if miRNA j is related with diseasex, otherwise ajx=0). p is the sum number of diseases.

Integrate miRNA similarities

In order to improve the reliability of miRNA similarity, we integrate miRNA functional similarity and miRNA topological similarity as follows:
SM=β* TMCosRA + (1-β)* FM,
where FM denotes the functional similarity of miRNA. βis the parameter to balance two similarities in fusion process.

Predict potential miRNA-disease association

We predict potential miRNA-disease association based on recommendation method [32], which is used to calculate the miRNA-target prediction. According to the known relationships between miRNA and disease, the predicted score of potential relationships between miRNA i and disease j is calculated as follows:
Score(i,j )=SM (i,* )×A(*,j),
where SM denotes the miRNA similarity matrix. A denotes the miRNA-disease association matrix. The higher score is, the closer relationship between miRNA i and disease j is.

Predict interaction between miRNA and new disease

It is well known that the recommendation method is unable to predict new entity (i.e., cold start problem). In order to overcome this limitation, we propose a new strategy for predicting potential interactions between miRNAs and new disease i. For new disease i, the top k most similarity diseases are selected based on disease functional similarity. And then it is considered to be associated with miRNA j if over two similar diseases have relationship with the miRNA j. We use these associations as known knowledge and predict the potential relationships between new disease i and miRNAs by using recommendation method.

References

[1]

Ambros, V. (2004) The functions of animal microRNAs. Nature, 431, 350–355

[2]

Akhtar, M. M., Micolucci, L., Islam, M. S., Olivieri, F. and Procopio, A. D. (2016) Bioinformatic tools for microRNA dissection. Nucleic Acids Res., 44, 24–44

[3]

Iorio, M. V., Ferracin, M., Liu, C. G., Veronese, A., Spizzo, R., Sabbioni, S., Magri, E., Pedriali, M., Fabbri, M., Campiglio, M., (2005) MicroRNA gene expression deregulation in human breast cancer. Cancer Res., 65, 7065–7070

[4]

Ashmawy, A. M., Elgeshy, K. M., Abdel Salam, E. T., Ghareeb, M., Kobaisi, M. H., Amin, H. A. A., Sharawy, S. K. and Abdel Wahab, A. H. A. (2017) Crosstalk between liver-related microRNAs and Wnt/b-catenin pathway in hepatocellular carcinoma patients. Arab J. Gastroenterol., 18, 144–150

[5]

Kim, Y. K., Yu, J., Han, T. S., Park, S. Y., Namkoong, B., Kim, D. H., Hur, K., Yoo, M. W., Lee, H. J., Yang, H. K., (2009) Functional links between clustered microRNAs: suppression of cell-cycle inhibitors by microRNA clusters in gastric cancer. Nucleic Acids Res., 37, 1672–1681

[6]

Moss, E. G., Lee, R. C. and Ambros, V. (1997) The cold shock domain protein LIN-28 controls developmental timing in C. elegans and is regulated by the lin-4 RNA. Cell, 88, 637–646

[7]

Lan, W., Chen, Q., Li, T., Yuan, C., Mann, S. and Chen, B. (2014) Identification of important positions within miRNAs by integrating sequential and structural features. Curr. Protein Pept. Sci., 15, 591–597

[8]

Chen, Q., Lan, W., Wang, J. X. (2013) Mining featured patterns of miRNA interaction based on sequence and structure similarity. IEEE/ACM Trans. Comput. Biol. Bioinform., 10, 415–422

[9]

Luo, H., Lan, W., Chen, Q., Wang, Z., Liu, Z., Yue, X. and Zhu, L. (2018) Inferring microRNA-environmental factor interactions based on multiple biological information fusion. Molecules, 23, 2439

[10]

Lan, W., Wang, J., Li, M., Liu, J. and Pan, Y. (2015) Predicting microRNA-disease associations by integrating multiple biological information. In: EEE International Conference on Bioinformatics and Biomedicine (BIBM), pp.183–188

[11]

Peng, W., Lan, W., Zhong, J., Wang, J. and Pan, Y. (2017) A novel method of predicting microRNA-disease associations based on microRNA, disease, gene and environment factor networks. Methods, 124, 69–77

[12]

Jiang, Q., Hao, Y., Wang, G., Juan, L., Zhang, T., Teng, M., Liu, Y. and Wang, Y. (2010) Prioritization of disease microRNAs through a human phenome-microRNAome network. BMC Syst. Biol., 4, S2

[13]

Xuan, P., Han, K., Guo, M., Guo, Y., Li, J., Ding, J., Liu, Y., Dai, Q., Li, J., Teng, Z., (2013) Prediction of microRNAs associated with human diseases based on weighted k most similar neighbors. PLoS One, 8, e70204

[14]

Mørk, S., Pletscher-Frankild, S., Palleja Caro, A., Gorodkin, J. and Jensen, L. J. (2014) Protein-driven inference of miRNA-disease associations. Bioinformatics, 30, 392–397

[15]

Chen, X. and Yan, G. Y. (2014) Semi-supervised learning for potential human microRNA-disease associations inference. Sci. Rep., 4, 5501

[16]

Peng, W., Lan, W., Yu, Z., Wang, J., Pan, Y. (2016) A Framework for integrating multiple biological networks to predict microRNA-disease associations. IEEE Trans. Nanobio., 16, 100–107

[17]

Yan, C., Wang, J., Ni, P., Lan, W., Wu, F. X., Pan, Y. (2019) DNRLMF-MDA: predicting microRNA-disease associations based on similarities of microRNAs and diseases. IEEE/ACM Trans. Comput. Biol. Bioinform., 16, 233–243

[18]

Luo, J. and Xiao, Q. (2017) A novel approach for predicting microRNA-disease associations by unbalanced bi-random walk on heterogeneous network. J. Biomed. Inform., 66, 194–203

[19]

Lan, W., Wang, J., Li, M., Liu, J., Wu, F.X., Pan, Y. (2018) Predicting microRNA-disease associations based on improved microRNA and disease similarities. IEEE/ACM Trans. Comput. Biol. Bioinform., 15, 1774–1782

[20]

Liu, Y., Zeng, X., He, Z., Zou, Q. (2016) Inferring microRNA-disease associations by random walk on a heterogeneous network with multiple data sources. IEEE/ACM Trans. Comput. Biol. Bioinform., 14, 905–915

[21]

Zou, Q., Li, J., Hong, Q., Lin, Z., Wu, Y., Shi, H. and Ju, Y. (2015) Prediction of microRNA-disease associations based on social network analysis methods. BioMed Res. Int., 2015, 810514

[22]

Aushev, V. N., Zborovskaya, I. B., Laktionov, K. K., Girard, N., Cros, M. P., Herceg, Z. and Krutovskikh, V. (2013) Comparisons of microRNA patterns in plasma before and after tumor removal reveal new biomarkers of lung squamous cell carcinoma. PLoS One, 8, e78649

[23]

Chang, J. T., Wang, F., Chapin, W. and Huang, R. S. (2016) Identification of microRNAs as breast cancer prognosis markers through the Cancer Genome Atlas. PLoS One, 11, e0168284

[24]

Köhler, S., Bauer, S., Horn, D. and Robinson, P. N. (2008) Walking the interactome for prioritization of candidate disease genes. Am. J. Hum. Genet., 82, 949–958

[25]

Lan, W., Li, M., Zhao, K., Liu, J., Wu, F. X., Pan, Y. and Wang, J. (2017) LDAP: a web server for lncRNA-disease association prediction. Bioinformatics, 33, 458–460

[26]

Lan, W., Huang, L., Lai, D., and Chen, Q. (2018) Identifying Interactions Between Long Noncoding RNAs and Diseases Based on Computational Methods. In: Computational Systems Biology, pp.205–221, Springer Nature

[27]

Lan, W., Wang, J. X., Li, M., Peng, W. and Wu, F. (2015) Computational approaches for prioritizing candidate disease genes based on PPI networks. Tsinghua Sci. Technol., 20, 500–512

[28]

Li, Y., Qiu, C., Tu, J., Geng, B., Yang, J., Jiang, T. and Cui, Q. (2014) HMDD v2.0: a database for experimentally supported human microRNA and disease associations. Nucleic Acids Res., 42, D1070–D1074

[29]

Hwang, S., Kim, C. Y., Yang, S., Kim, E., Hart, T., Marcotte, E. M. and Lee, I. (2019) HumanNet v2: human gene networks for disease research. Nucleic Acids Res., 47, D573–D580

[30]

Wang, D., Wang, J., Lu, M., Song, F. and Cui, Q. (2010) Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases. Bioinformatics, 26, 1644–1650

[31]

Chen, L. J., Zhang, Z. K., Liu, J. H., Gao, J. and Zhou, T. (2017) A vertex similarity index for better personalized recommendation. Physica A, 466, 607–615

[32]

Jiang, H., Wang, J., Li, M., Lan, W., Wu, F., Pan, Y. (2018) miRTRS: a recommendation algorithm for predicting miRNA targets. IEEE/ACM Trans. Comput. Biol. Bioinform., 1

RIGHTS & PERMISSIONS

Higher Education Press and Springer-Verlag GmbH Germany, part of Springer Nature

AI Summary AI Mindmap
PDF (395KB)

1411

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/