URL: | https://portal.gdc.cancer.gov |
Full name: | The Cancer Genome Atlas |
Description: | A landmark cancer genomics program, molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types. |
Year founded: | 2013 |
Last update: | |
Version: | |
Accessibility: | |
Country/Region: | United States |
Data type: | |
Data object: |
NA
|
Database category: | |
Major species: | |
Keywords: |
University/Institution: | National Cancer Institute |
Address: | Center for Cancer Genomics 31 Center Drive, Room 10A11 Bethesda, MD 20892 |
City: | Bethesda |
Province/State: | |
Country/Region: | United States |
Contact name (PI/Team): | Harold Varmus |
Contact email (PI/Helpdesk): | tcga@mail.nih.gov |
Before and After: Comparison of Legacy and Harmonized TCGA Genomic Data Commons' Data. [PMID: 31344359]
We present a systematic analysis of the effects of synchronizing a large-scale, deeply characterized, multi-omic dataset to the current human reference genome, using updated software, pipelines, and annotations. For each of 5 molecular data platforms in The Cancer Genome Atlas (TCGA)-mRNA and miRNA expression, single nucleotide variants, DNA methylation and copy number alterations-comprehensive sample, gene, and probe-level studies were performed, towards quantifying the degree of similarity between the 'legacy' GRCh37 (hg19) TCGA data and its GRCh38 (hg38) version as 'harmonized' by the Genomic Data Commons. We offer gene lists to elucidate differences that remained after controlling for confounders, and strategies to mitigate their impact on biological interpretation. Our results demonstrate that the hg19 and hg38 TCGA datasets are very highly concordant, promote informed use of either legacy or harmonized omics data, and provide a rubric that encourages similar comparisons as new data emerge and reference data evolve. |
The Cancer Genome Atlas: Creating Lasting Value beyond Its Data. [PMID: 29625045]
The Cancer Genome Atlas (TCGA) team now presents the Pan-Cancer Atlas, investigating different aspects of cancer biology by analyzing the data generated during the 10+ years of the TCGA project. |
An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. [PMID: 29625055]
For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale. |
The NCI Genomic Data Commons as an engine for precision medicine. [PMID: 28600341]
The National Cancer Institute Genomic Data Commons (GDC) is an information system for storing, analyzing, and sharing genomic and clinical data from patients with cancer. The recent high-throughput sequencing of cancer genomes and transcriptomes has produced a big data problem that precludes many cancer biologists and oncologists from gleaning knowledge from these data regarding the nature of malignant processes and the relationship between tumor genomic profiles and treatment response. The GDC aims to democratize access to cancer genomic data and to foster the sharing of these data to promote precision medicine approaches to the diagnosis and treatment of cancer. |
Toward a Shared Vision for Cancer Genomic Data. [PMID: 27653561]
N Engl J Med. 2016:375(12)
| 756 Citations (from Europe
PMC, 2024-04-20)
|
The Cancer Genome Atlas Pan-Cancer analysis project. [PMID: 24071849]
The Cancer Genome Atlas (TCGA) Research Network has profiled and analyzed large numbers of human tumors to discover molecular aberrations at the DNA, RNA, protein and epigenetic levels. The resulting rich data provide a major opportunity to develop an integrated picture of commonalities, differences and emergent themes across tumor lineages. The Pan-Cancer initiative compares the first 12 tumor types profiled by TCGA. Analysis of the molecular aberrations and their functional roles across tumor types will teach us how to extend therapies effective in one cancer type to others with a similar genomic profile. |