Overlap maximum matching ratio (OMMR): a new measure to evaluate overlaps of essential modules
Xiao-xia ZHANG, Qiang-hua XIAO, Bin LI, Sai HU, Hui-jun XIONG, Bi-hai ZHAO
Overlap maximum matching ratio (OMMR): a new measure to evaluate overlaps of essential modules
Protein complexes are the basic units of macro-molecular organizations and help us to understand the cell’s mechanism. The development of the yeast two-hybrid, tandem affinity purification, and mass spectrometry high-throughput proteomic techniques supplies a large amount of protein-protein interaction data, which make it possible to predict overlapping complexes through computational methods. Research shows that overlapping complexes can contribute to identifying essential proteins, which are necessary for the organism to survive and reproduce, and for life’s activities. Scholars pay more attention to the evaluation of protein complexes. However, few of them focus on predicted overlaps. In this paper, an evaluation criterion called overlap maximum matching ratio (OMMR) is proposed to analyze the similarity between the identified overlaps and the benchmark overlap modules. Comparison of essential proteins and gene ontology (GO) analysis are also used to assess the quality of overlaps. We perform a comprehensive comparison of serveral overlapping complexes prediction approaches, using three yeast protein-protein interaction (PPI) networks. We focus on the analysis of overlaps identified by these algorithms. Experimental results indicate the important of overlaps and reveal the relationship between overlaps and identification of essential proteins.
Protein-protein interaction network / Essential protein modules / Overlap / Overlap maximum matching ratio
[1] |
Adamcsek, B., Palla, G., Farkas, I.J.,
CrossRef
Google scholar
|
[2] |
Bader, G.D., Hogue, C.W.V., 2003. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform., 4: 2.1-2.27. [
CrossRef
Google scholar
|
[3] |
Boyle, E.I., Weng, S., Gollub, J.,
CrossRef
Google scholar
|
[4] |
Chen, B., Shi, J., Zhang, S.,
CrossRef
Google scholar
|
[5] |
Cherry, J.M., Adler, C., Ball, C.,
CrossRef
Google scholar
|
[6] |
Dezső, Z., Oltvai, Z.N., Barabási, A.L., 2003. Bioinformatics analysis of experimentally determined protein complexes in the yeast Saccharomyces cerevisiae. Genome Res., 13: 2450-2454. [
CrossRef
Google scholar
|
[7] |
Enright, A.J., van Dongen, S., Ouzounis, C.A., 2002. An efficient algorithm for large-scale detection of protein families. Nucl. Acids Res., 30(7): 1575-1584. [
CrossRef
Google scholar
|
[8] |
Gavin, A.C., Aloy, P., Grandi, P.,
CrossRef
Google scholar
|
[9] |
Han, J.D., Bertin, N., Hao, T.,
CrossRef
Google scholar
|
[10] |
Hart, G.T., Lee, I., Marcotte, E.M., 2007. A high-accuracy consensus map of yeast protein complexes reveals modular nature of gene essentiality. BMC Bioinform., 8: 236.1-236.11. [
CrossRef
Google scholar
|
[11] |
Hu, H., Yan, X., Huang, Y.,
CrossRef
Google scholar
|
[12] |
Jiang, P., Singh, M., 2010. SPICi: a fast clustering algorithm for large biological networks. Bioinformatics, 26(8): 1105-1111. [
CrossRef
Google scholar
|
[13] |
Krogan, N., Cagney, G., Yu, H.,
CrossRef
Google scholar
|
[14] |
Lei, X., Wu, S., Ge, L.,
CrossRef
Google scholar
|
[15] |
Leung, H.C.M., Xiang, Q., Yiu, S.M.,
CrossRef
Google scholar
|
[16] |
Liu, G., Wong, L., Chua, H.N., 2009. Complex discovery from weighted PPI networks. Bioinformatics, 25(15): 1891-1897. [
CrossRef
Google scholar
|
[17] |
Macropol, K., Can, T., Singh, A.K., 2009. RRW: repeated random walks on genome-scale protein networks for local cluster discovery. BMC Bioinform., 10: 283.1-283.10.[
CrossRef
Google scholar
|
[18] |
Maraziotis, I.A., Dimitrakopoulou, K., Bezerianos, A., 2007. Growing functional modules from a seed protein via integration of protein interaction and gene expression data. BMC Bioinform., 8: 408.1-408.15. [
CrossRef
Google scholar
|
[19] |
Mewes, H.W., Frishman, D., Mayer, K.F.X.,
CrossRef
Google scholar
|
[20] |
Nepusz, T., Yu, H., Paccanaro, A., 2012. Detecting overlapping protein complexes in protein-protein interaction networks. Nat. Methods, 9(5): 471-472. [
CrossRef
Google scholar
|
[21] |
Ni, W.Y., Xiong, H.J., Zhao, B.H.,
CrossRef
Google scholar
|
[22] |
Palla, G., Derényi, I., Farkas, I.,
CrossRef
Google scholar
|
[23] |
Pu, S., Wong, J., Turner, B.,
CrossRef
Google scholar
|
[24] |
Shih, Y.K., Parthasarathy, S., 2012. Identifying functional modules in interaction networks through overlapping Markov clustering. Bioinformatics, 28(18): i473-i479. [
CrossRef
Google scholar
|
[25] |
Stark, C., Breitkreutz, B.J., Reguly, T.,
CrossRef
Google scholar
|
[26] |
Wu, M., Li, X., Kwoh, C.K.,
CrossRef
Google scholar
|
[27] |
Zhang, R., Lin, Y., 2009. DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes. Nucl. Acids Res., 37(suppl 1): D455-D458. [
CrossRef
Google scholar
|
[28] |
Zhao, B., Wang, J., Li, M.,
CrossRef
Google scholar
|
/
〈 | 〉 |