Strategic planning for national biomedical big data infrastructure in China
Zhen Wang, Zefeng Wang, Yixue Li
Strategic planning for national biomedical big data infrastructure in China
The promise that big data will revolutionize scientific discovery and technology innovation is now being widely recognized. With the explosive growth of biomedical data, life science is being transformed into a digital science in which novel insights are gained from in-depth data analysis and modeling. Extensive and innovative utilization of biomedical big data is a key to the success of precision medicine. Therefore, constructing a centralized national-level biomedical big data infrastructure becomes crucial and urgent for China. Such infrastructure should achieve superb capacity of safe data storage, standardized data processing and quality control, systematic data integration across multiple types, and in-depth data mining and effective data sharing. Full data chain service including information retrieval, knowledge discovery and technology support can be provided to data centers, research institutes and healthcare industries. Relying on Shanghai Institutes for Biological Sciences, agreements have been signed that a main node of the infrastructure will be located in Shanghai, and a backup node will be set up in Guizhou Province. After a construction period of five years, the infrastructure should greatly enhance China’s core competence in collection, interpretation and application of biomedical big data.
biomedical big data / national infrastructure / precision medicine
[1] |
Mayer-Schönberger, V. and Cukier, K. (2013) Big Data: A Revolution That Will Transform How We Live, Work, and Think. Boston: Houghton Mifflin Harcourt
|
[2] |
Chouard, T. (2016) The Go Files: AI computer wraps up 4-1 victory against human champion. Nature doi: 10.1038/nature. 2016.19575.
|
[3] |
Gray, J. (2009) Jim Gray on eScience: A Transformed Scientific Method. Hey, T., Tansley, S., and Tolle, K. M. eds. In The Fourth Paradigm: Data-intensive Scientific Discovery. Redmond, WA: Microsoft Research, xix
|
[4] |
Hey, T., Tansley, S. and Tolle, K. M. (2009) The Fourth Paradigm: Data-intensive Scientific Discovery. Redmond, WA: Microsoft Research
|
[5] |
Hood, L. and Rowen, L. (2013) The Human Genome Project: big science transforms biology and medicine. Genome Med., 5, 79
CrossRef
Pubmed
Google scholar
|
[6] |
Stephens, Z. D. , Lee, S. Y. , Faghri, F. , Campbell, R. H. , Zhai, C. , Efron, M. J. , Iyer, R. , Schatz, M. C. , Sinha, S. and Robinson, G. E. (2015) Big data: astronomical or genomical? PLoS Biol., 13, e1002195.
CrossRef
Pubmed
Google scholar
|
[7] |
Gomez-Cabrero, D., Abugessaisa, I., Maier, D. , Teschendorff, A. , Merkenschlager, M. , Gisel, A. , Ballestar, E. , Bongcam-Rudloff, E. , Conesa, A. and Tegnér, J. (2014) Data integration in the era of omics: current and future challenges. BMC Syst. Biol., 8, I1.
CrossRef
Pubmed
Google scholar
|
[8] |
Ashley, E. A. (2016) Towards precision medicine. Nat. Rev. Genet., 17, 507–522.
CrossRef
Pubmed
Google scholar
|
[9] |
Gligorijević, V. , Malod-Dognin, N. and Pržulj, N. (2016) Integrative methods for analyzing big data in precision medicine. Proteomics, 16, 741–758.
CrossRef
Pubmed
Google scholar
|
[10] |
Cochrane, G., Karsch-Mizrachi, I., Takagi, T. and the International Nucleotide Sequence Database Collaboration. (2016) The International Nucleotide Sequence Database Collaboration. Nucleic Acids Res., 44, D48–D50.
CrossRef
Pubmed
Google scholar
|
[11] |
Wheeler, D. L. , Barrett, T. , Benson, D. A. , Bryant, S. H. , Canese, K. , Chetvernin, V. , Church, D. M. , DiCuccio, M. , Edgar, R. , Federhen, S . (2016) Database resources of the national center for biotechnology information. Nucleic Acids Res., 44, D7–D19.
CrossRef
Pubmed
Google scholar
|
[12] |
Zhan, Q. and Qian, H. (2016) Opportunities and Advantages for The Development of Precision Medicine in China. In: Precision Medicine in China. Sanders, S. and Oberst, J. eds, pp. 6–9. Washington, DC: Science/AAAS
|
/
〈 | 〉 |