The Genome Variation Map: A Data Repository of Genome Variations in BIG Data Center

ADMIN  Mar 17, 2018


The genome sequence variation is genetic variation of genomic DNA molecules. It is the basis of species diversity, and is the most valuable genetic resources for studying the evolution of species, molecular breeding, human diseases etc. Recently, a world leading and the largest domestic genome sequence variation data resource (GVM) was released by the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences. This work has been published online in Nucleic Acids Research on October 24, 2017.

 

GVM is a public data repository of genome variations, which dedicates to collect, integrate and visualize genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research activities.

 

In the current implementation of GVM, it houses a total of ∼4.9 billion variants for 19 species including chicken, dog, goat, human, poplar, rice and tomato, incorporates 8,669 individual genotypes and 13,262 manually curated high-quality genotype-to-phenotype associations for non-human species.

 

In addition, GVM provides friendly intuitive web interfaces for data submission, browse, search and visualization.

 

Collectively, GVM serves as an important resource for archiving genomic variation data, and is helpful for better understanding population genetic diversity and deciphering complex mechanisms associated with different phenotypes.

 

This research was supported by Strategic Priority Research Program of the Chinese Academy of Sciences; National Key Research & Development Program of China; National Key Research Program of China; National Programs for High Technology Research and Development; The Youth Innovation Promotion Association of Chinese Academy of Science etc.