a catalog of biological databases
|Description:||Virtual Chinese Genome Database is a dynamic genome database of Chinese population. VCGDB is a big data solution based on public data released by 1000 Genomes Project.|
|University/Institution:||Beijing Institute of Genomics, Chinese Academy of Sciences|
|Address:||No.1 Beichen West Road, Chaoyang District|
|Contact name (PI/Team):||Jingfa Xiao|
|Contact email (PI/Helpdesk):||email@example.com|
[Database resources of the reference genome and genetic variation maps for the Chinese population]. [PMID: 30465539]
With the implementation of the international human genome project and 1000 genome project, hundreds of Chinese individual genome sequences have been published. Establishing a high-precision Chinese population reference genome and identifying the unique genome variations are fundamental for future precision medicine research in China. To further meet the needs of scientific management and deep mining on the rapidly growing Chinese genomic data, Beijing Institute of Genomics, Chinese Academy of Sciences, has developed a Virtual Chinese Genome Database (VCGDB, http://bigd.big.ac.cn/vcg/) and Genome Variation Map (GVM, http://bigd.big.ac.cn/gvm/) based on the public whole genome sequencing data, which provides the worldwide services of data retrieval, sharing, downloading and online analysis. This paper presents the brief introduction of characteristics and functions of the two databases, as well as their future development and application prospects, aiming to provide useful information for the promotion and development of the reference genome and genome variation map database in China.
VCGDB: a dynamic genome database of the Chinese population. [PMID: 24708222]
The data released by the 1000 Genomes Project contain an increasing number of genome sequences from different nations and populations with a large number of genetic variations. As a result, the focus of human genome studies is changing from single and static to complex and dynamic. The currently available human reference genome (GRCh37) is based on sequencing data from 13 anonymous Caucasian volunteers, which might limit the scope of genomics, transcriptomics, epigenetics, and genome wide association studies. We used the massive amount of sequencing data published by the 1000 Genomes Project Consortium to construct the Virtual Chinese Genome Database (VCGDB), a dynamic genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. VCGDB provides dynamic genomic information, which contains 35 million single nucleotide variations (SNVs), 0.5 million insertions/deletions (indels), and 29 million rare variations, together with genomic annotation information. VCGDB also provides a highly interactive user-friendly virtual Chinese genome browser (VCGBrowser) with functions like seamless zooming and real-time searching. In addition, we have established three population-specific consensus Chinese reference genomes that are compatible with mainstream alignment software. VCGDB offers a feasible strategy for processing big data to keep pace with the biological data explosion by providing a robust resource for genomics studies; in particular, studies aimed at finding regions of the genome associated with diseases.