Genome Sequence Archive

The Genome Sequence Archive (GSA) is a data repository for archiving raw sequence reads. It accepts data submissions from all over the world and provides free access to all publicly available data for global scientific communities.

China Genomic Data Sharing Initiative
Data Statistics
How to Cite?

When you have successfully submitted data to GSA, please consider to use the following words to describe data deposition in your manuscript.

The raw sequence data reported in this paper have been deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics 2017) in National Genomics Data Center (Nucleic Acids Res 2020), Beijing Institute of Genomics (China National Center for Bioinformation), Chinese Academy of Sciences, under accession number (s) CRAxxxxxx (, CRAyyyyyy) that are publicly accessible at https://bigd.big.ac.cn/gsa.

Please cite the following required publications.

GSA: Genome Sequence Archive. Genomics, Proteomics & Bioinformatics 2017. [PMID=28387199]
Database Resources of the National Genomics Data Center in 2020. Nucleic Acids Res 2020, 48(D1):D24–D33. [PMID=31702008]

New  2019-nCov Raw Sequences

New  2019-nCoV Data Resources

  Help & Support

If you have any question or would like to give us any suggestion/comment or report a bug, please feel free to contact us.

Email: gsa@big.ac.cn

QQ group: 548170081

We highly appreciate your comments and suggestions for further improvements.

  GSA-supported Deposition
  • Data submissions to GSA have been reported by multiple high-profile journals. GSA has been designated as supported data repository by Elsevier.
  •    gsa    gsa    gsa
  •    gsa    gsa    gsa
  •    gsa    gsa    gsa
  •   gsa    gsa    gsa
  •    gsa    gsa    gsa
  •    gsa    gsa    gsa
more >>
  Latest Released GSA
AccessionDescription
  Related Databases