Functional annotation from the genome sequence of the giant panda

Tong Huo1,3, Yinjie Zhang1,3, Jianping Lin2,3()

PDF(479 KB)
PDF(479 KB)
Protein Cell ›› 2012, Vol. 3 ›› Issue (8) : 602-608. DOI: 10.1007/s13238-012-2914-8
COMMUNICATION
COMMUNICATION

Functional annotation from the genome sequence of the giant panda

  • Tong Huo1,3, Yinjie Zhang1,3, Jianping Lin2,3()
Author information +
History +

Abstract

The giant panda is one of the most critically endangered species due to the fragmentation and loss of its habitat. Studying the functions of proteins in this animal, especially specific trait-related proteins, is therefore necessary to protect the species. In this work, the functions of these proteins were investigated using the genome sequence of the giant panda. Data on 21,001 proteins and their functions were stored in the Giant Panda Protein Database, in which the proteins were divided into two groups: 20,179 proteins whose functions can be predicted by GeneScan formed the known-function group, whereas 822 proteins whose functions cannot be predicted by GeneScan comprised the unknownfunction group. For the known-function group, we further classified the proteins by molecular function, biological process, cellular component, and tissue specificity. For the unknown-function group, we developed a strategy in which the proteins were filtered by cross- Blast to identify panda-specific proteins under the assumption that proteins related to the panda-specific traits in the unknown-function group exist. After this filtering procedure, we identified 32 proteins (2 of which are membrane proteins) specific to the giant panda genome as compared against the dog and horse genomes. Based on their amino acid sequences, these 32 proteins were further analyzed by functional classification using SVM-Prot, motif prediction using MyHits, and interacting protein prediction using the Database of Interacting Proteins. Nineteen proteins were predicted to be zinc-binding proteins, thus affecting the activities of nucleic acids. The 32 panda-specific proteins will be further investigated by structural and functional analysis.

Keywords

Giant panda / GPPD / cross-Blast

Cite this article

Download citation ▾
Tong Huo, Yinjie Zhang, Jianping Lin. Functional annotation from the genome sequence of the giant panda. Prot Cell, 2012, 3(8): 602‒608 https://doi.org/10.1007/s13238-012-2914-8

References

[1] Birney, E., Clamp, M., and Durbin, R. (2004). GeneWise and Genomewise. Genome Res 14, 988-995 .10.1101/gr.1865504
[2] Bocca, S. N., Magioli, C., Mangeon, A., Junqueira, R. M., Cardeal, V., Margis, R., Sachetto-Martins, G. (2005). Survey of glycine-rich proteins (GRPs) in the Eucalyptus expressed sequence tag database (ForEST). Genet Mole Biol 28, 608-624 .10.1590/S1415-47572005000400016
[3] Burge, C., and Karlin, S. (1997). Prediction of complete gene structures in human genomic DNA. J Mol Biol 268, 78-94 .10.1006/jmbi.1997.0951
[4] Cai, C.Z., Han, L.Y., Ji, Z.L., Chen, X., and Chen, Y.Z. (2003). SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence. Nucleic Acids Res 31, 3692-3697 .10.1093/nar/gkg600
[5] Hao, Y.Z., Hou, W.R., Hou, Y.L., Du, Y.J., Zhang, T., and Peng, Z.S. (2009). cDNA, genomic sequence cloning and overexpression of ribosomal protein S25 gene (RPS25) from the Giant Panda. Mol Biol Rep 36, 2139-2145 .10.1007/s11033-008-9427-9
[6] Hama, N., Kanemitsu, H., Tanikawa, M., Shibaya, M., Sakamoto, K., Oyama, Y., Acosta, T.J., Ishikawa, O., Pengyan, W., and Okuda, K. (2009). Development of an enzyme immunoassay for urinary pregnanediol-3-glucuronide in a female giant panda (Ailuropoda melanoleuca). J Vet Med Sci 71, 879-884 .10.1292/jvms.71.879
[7] Krause, J., Unger, T., Nocon, A., Malaspinas, A.S., Kolokotronis, S.O., Stiller, M., Soibelzon, L., Spriggs, H., Dear, P.H., Briggs, A.W., . (2008). Mitochondrial genomes reveal an explosive radiation of extinct and extant bears near the Miocene-Pliocene boundary. BMC Evol Biol 8, 220.10.1186/1471-2148-8-220
[8] Krogh, A., Larsson, B., von Heijne, G., and Sonnhammer, E.L. (2001). Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305, 567-580 .10.1006/jmbi.2000.4315
[9] Li, R., Fan, W., Tian, G., Zhu, H., He, L., Cai, J., Huang, Q., Cai, Q., Li, B., Bai, Y., . (2010). The sequence and de novo assembly of the giant panda genome. Nature 463, 311-317 .10.1038/nature08696
[10] Magrane, M., and Consortium, U. (2011). UniProt Knowledgebase: a hub of integrated protein data. Database (Oxford) 2011 , bar009.10.1093/database/bar009
[11] Nash, W.G., Wienberg, J., Ferguson-Smith, M.A., Menninger, J.C., and O'Brien, S.J. (1998). Comparative genomics: tracking chromosome evolution in the family ursidae using reciprocal chromosome painting. Cytogenet Cell Genet 83, 182-192 .10.1159/000015176
[12] Pages, M., Calvignac, S., Klein, C., Paris, M., Hughes, S., and Hanni, C. (2008). Combined analysis of fourteen nuclear genes refines the Ursidae phylogeny. Mol Phylogenet Evol 47, 73-83 .10.1016/j.ympev.2007.10.019
[13] Pagni, M., Ioannidis, V., Cerutti, L., Zahn-Zabal, M., Jongeneel, C.V., Hau, J., Martin, O., Kuznetsov, D., and Falquet, L. (2007). MyHits: improvements to an interactive resource for analyzing protein sequences. Nucleic Acids Res 35, W433-437 .10.1093/nar/gkm352
[14] She, R., Chu, J.S., Wang, K., Pei, J., and Chen, N. (2009). Gen-BlastA: enabling BLAST to identify homologous gene sequences. Genome Res 19, 143-149 .10.1101/gr.082081.108
[15] Stanke, M., and Waack, S. (2003). Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics 19 Suppl2, ii215-225 .10.1093/bioinformatics/btg1080
[16] Williamson, M.P. (1994). The structure and function of proline-rich regions in proteins. Biochem J 297 (Pt 2), 249-260 .
[17] Xenarios, I., Salwinski, L., Duan, X.J., Higney, P., Kim, S.M., and Eisenberg, D. (2002). DIP, the database of interacting proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Res 30, 303-305 .10.1093/nar/30.1.303
[18] Zhan, X., Li, M., Zhang, Z., Goossens, B., Chen, Y., Wang, H., Bruford, M.W., and Wei, F. (2006). Molecular censusing doubles giant panda population estimate in a key nature reserve. Curr Biol 16, R451-452 .10.1016/j.cub.2006.05.042
AI Summary AI Mindmap
PDF(479 KB)

Accesses

Citations

Detail

Sections
Recommended

/