[汇编资源]肿瘤相关数据库资源汇编合集
欢迎来到医科研,这里是白介素2的读书笔记,跟我一起聊临床与科研的故事, 生物医学数据挖掘,R语言,TCGA、GEO, SEER数据挖掘。
Cancer-related Data Resources
ArrayMap
URL http://www.arraymap.org
What you get copy number data from 8594 breast cancer samples (database includes other tissue types)
What you can do Search samples, search publications, gene CNV frequencies, process user provided data sets
Literature & tutorials PLoS One. (2012) 7(5), e36944. doi:10.1371/journal.pone.0036944
Nucl. Acids Res. (28 January 2015) 43 (D1): D825-D830. doi: 10.1093/nar/gku1123
Update Approximately biannually
BCCTBbp:the Breast Cancer Campaign Tissue Bank - - bioinformatics portal
Now called the BCNTBbp: Breast Cancer Now Tissue Bank bioinformatics portal
URL http://bioinformatics.breastcancertissuebank.org/
What you get Data from genomics, methylomics, transcriptomics, proteomics and microRNA experiments mined from literature, external and internal (Breast Cancer Now Tissue Bank) sources
What you can do Search for data and access via external links. Perform analysis on pathways and variation. (The bioinformatics portal was undergoing migration at the time of this writing, so a more complete description could not be written.)
Literature & tutorials Nucl. Acids Res. (28 January 2015) 43 (D1): D831-D836. doi: 10.1093/nar/gku984
BreCAN-Database of Breakpoint profiles of Cancer Genomes
URL http://14.139.32.56/
What you get Somatic DNA breakpoints; 15 (15.2%) breast cancers
What you can do Visualize breakpoints, upload breakpoint profiles to compare with breakpoint hotspots and profiles in database
Literature & tutorials Nucl. Acids Res. (04 January 2016) 44 (D1): D952-D958. doi: 10.1093/nar/gkv1264;
“Help” link available on home page
Updated October 2015
Cancer RNA-Seq Nexus
URL http://syslab4.nchu.edu.tw/
What you get Differential expression of coding and lncRNA of various combinations. Data derived from cell lines, tumors and adjacent normal deposited in TCGA and 6 GEO submissions. Total of 1537 breast specimens. mRNA-lncRNA coexpression network available for each pair.
What you can do Download differentially expressed transcripts of the subset pairs
Literature & tutorials Nucl. Acids Res. (04 January 2016) 44 (D1): D944-D951. doi: 10.1093/nar/gkv1282;
Tutorial available on website
Cancer3D
URL http://www.cancer3d.org
What you get Cancer mutations mapped onto ribbon diagrams of proteins, mutated AA colored according to their mutation frequency in TCGA; activity of selected drug as a function of location on the protein
What you can do Search on gene or drug; link to protein interaction partnersLiterature & tutorials Bioinformatics. (01 November 2014) 30(21):3109-14
Nucl. Acids Res. (28 January 2015) 43 (D1): D968-D973. doi: 10.1093/nar/gku1140
PLoS Comput Biol. (08 January 2015) 11(1):e1004024
PLoS Comput Biol. (20 October 2015) 11(10):e1004518Tutorial available on website
Updated at release in 2014
Cancer PPD
URL http://crdd.osdd.net/raghava/cancerppd/
What you get Lists of proteins and peptides that have demonstrated anti-cancer activities. Searches include protein, peptide, tissue and cell line.
What you can do Blast, align, map
Literature & tutorials Nucl. Acids Res. (28 January 2015) 43 (D1): D837-D843. doi: 10.1093/nar/gku892
Tutorial available on website
Updated October 2014
CanGEM
URL http://www.cangem.org/
What you get Gene copy number changes
What you can do Free text search or search on clinical attributes of the specimens of interest, includes stage. Searches can also be performed on the copy number status of a gene.
Literature & tutorials Nucl. Acids Res. (2008) 36 (suppl 1): D830-D835. doi: 10.1093/nar/gkm802
Tutorial available on website
Updated not current
CaSNP
URL http://cistrome.dfci.harvard.edu/CaSNP/index/
What you get Copy number alteration based on data from Affymetrix SNP arrays (10K to 6.0) available in GEO . Additional data from TCGA, a few individual publications and GlaxoSmithKlein cancer cell lines.
What you can do Search on genomic position, refSeq ID, coordinate range or miRNA ID, cancer type and specific GEO study(ies)
Literature & tutorials Nucl. Acids Res. (2011) 39 (suppl 1): D968-D974. doi: 10.1093/nar/gkq997
Tutorial available on website
Update Last updated in 2011
cBioPortal
URL http://www.cbioportal.org/public-portal/
What you get Breast genomic data sets from TCGA, Sanger, Broad and British Columbia; cell line from Cancer Cell Line Encyclopedia and NCI-60
What you can do Search by mutations, CNA, mRNA and protein expression as available. Identify co-expressed genes and mutation; as well as copy-number, mRNA and protein enrichments. Plot mutation, survival and networks. Generate oncoprints and lollipop plots from investigator data.
Literature & tutorials Sci Signal. (02 April 2013) 6(269):pl1.
Cancer Discovery. (2012) May 1; 2: 401
No abstract in Nucleic Acids Research Database Issue, but this database is NAR Molecular Biology Database Collection entry number 1757
Tutorial available on website
Updated frequently
CellLineNavigator
URL http://medicalgenomics.org/celllinenavigator/data
What you get Array-based expression data (Affymetrix HG-U133 Plus2 GeneChip) from ArrayExpress experiment E-MTAB-37
What you can do Search by gene for relative expression across cell lines from 28 tissue types (19 breast cancer cell lines). Search by combining gene, KEGG Pathway, GO, cell line, organism part and/or disease state (although breast cancer not included among disease states). Gene lists generated in CellLineNavigator may automatically be transferred to the DAVID analysis tool (Bioinformatics 2012;28:1805-1806).
Literature & tutorials Nucl. Acids Res. (2013) 41: D942-D948. doi: 10.1093/nar/gks1012
Basic users’ information available on website
Updated Static dataset
COLT-Cancer
URL http://dpsc.ccbr.utoronto.ca/cancer/index.html
What you get Genes essential for cancer cell proliferation and survival identified by shRNA screen of 29 breast cancer cell lines. [Data from colon, pancreas and ovarian cell lines also available]
What you can do Search by gene, cell line or essential genes across all cell lines or within a tumor type
Literature & tutorials Nucl. Acids Res. (2012) 40 (D1): D957-D963. doi: 10.1093/nar/gkr959
Documentation available on website
Updated Copyright through 2014
COSMIC: Catalog of Somatic Mutations in Cancer
URL http://cancer.sanger.ac.uk/cancergenome/projects/cosmic/
What you get Annotations of mutations in cancer curated from biomedical literature, as well as noncoding mutations, gene fusions, genome rearrangements, abnormal copy number segments and abnormal expression variants annotated to the human genome and correlated across disease types
What you can do Search by gene or keyword. Retrieve results by gene, sample, mutations and literature (PubMed). Full datasets are available for download via SFTP site. Tools include a cancer browser, genome browser, CONAN (copy number analysis tool), Beacon (a tool to search for a mutation at any position in the human genome).
Literature & tutorials Nucl. Acids Res. (28 January 2015) 43 (D1): D805-D811. doi: 10.1093/nar/gku1075
Documentation and tutorials available on website
Updated Updated every 3 months
Database of Germline p53 Mutations
URL http://stary.lf2.cuni.cz/projects/germline_mut_p53.htm
What you get p53 mutation (type of the mutation, exon and codon affected by the mutation, nucleotide and amino acid change), each family (family history of cancer, diagnosis of LFS), each affected individual (sex, generation, p53 status, from which parent the mutation was inherited) and each tumor (type, age of onset, p53 status (loss of heterozygosity and immunostaining)). Each entry contains the original reference(s).
What you can do View and search Excel spreadsheet
Literature & tutorials Nucl. Acids Res. (1998) 26 (1): 214-215. doi: 10.1093/nar/26.1.214
Documentation available on website
Updated Every 4 months
intOGen (Integrative Onco Genomics)
URL http://www.intogen.org/search
What you get Mutational cancer driver genes currently numbering 459
What you can do Search gene to determine if gene is driver and, if so, in which cancer types. Output includes mode of action, mutation frequency and mutation distribution (lollipop diagram). Search by cancer: Breast cancer 184 driver genes identified to date. Analyze list of somatic mutations for a cohort of tumors to identify driver mutations, genes and pathways. Analyze list of somatic mutations from a single tumor to obtain rank ordered list based on implication in ca development
Literature & tutorials Cancer Cell (09 March 2015) 27: 382-396
Nature Methods (15 September 2013) 10: 1081–1083. doi:10.1038/nmeth.2642
No abstract in Nucleic Acids Research Database Issue, but this database is NAR Molecular Biology Database Collection entry number 1909
Updated Last release number: 2014.12
Lnc2Cancer
URL http://www.bio-bigdata.com/lnc2cancer/home.jsp
What you get lncRNAs associated to 93 cancers, method of experimental association, expression pattern, links to Pubmed, sequence, lncrnadb, LncRNA2Function, Co-LncRNA among others
What you can do Search by lncRNA, cancer; download Excel or text file of all experimentally supported LncRNA-cancer association data in database
Literature & tutorials Nucl. Acids Res. (04 January 2016) 44 (D1): D980-D985. doi: 10.1093/nar/gkv1094
Documentation available on website
Updated quarterly
MethHC
URL http://methhc.mbc.nctu.edu.tw/php/index.php
What you get DNA methylation, gene expression, microRNA methylation, microRNA expression, and the correlation of methylation and gene expression. Data derived from TCGA
What you can do Search by specific cancer (18), gene, genes in KEGG pathwayLiterature & tutorials Nucl. Acids Res. (28 January 2015) 43 (D1): D856-D861. doi: 10.1093/nar/gku1151
Documentation available on website
Updated Last update 2014
Mutations, Oncogenes, Knowledge & Cancer (MOKCa) formerly Mutations of Kinases in Cancer
URL http://strubiol.icr.ac.uk/extra/mokca/
What you get Structural and functional annotations of mutations of proteins implicated in cancer. When possible, the database makes predictions on the phenotypic consequences of the mutations.
What you can do Search by COSMIC or Uniprot gene name or accession number. Browse by several categories: whole genome, DNA damage response (DDR), protein kinases, Cancer Gene Census (CGC) molecularly dominant oncogenes, CGC tumor suppressors (molecularly recessive) and drug targets. Retrieve the location of the mutation in the protein sequence and amino acid variant, type of mutation, frequency, domain, modification of residues within 3 residues of the mutation, any PDB structure mapping (if available) and tissue specificity.
Literature & tutorials Nucl. Acids Res. (2009) 37 (suppl 1): D824-D831. doi: 10.1093/nar/gkn832
No help or tutorials available
Updated Last modified Dec. 1, 2015
Mouse Tumor Biology
URL http://tumor.informatics.jax.org
What you get Information on tumors in mice as a model system of hereditary cancer, including endogenous spontaneous tumors and induced tumors, genetic factors associated with susceptibility, and tumor pathology reports.
What you can do Use the Quick Organ/Tissue Search to search by tissue, or use the Search Forms on the left to search by tumor, strain, pathology images, genetics or other criteria. Retrieve tumor name, organ affected, treatment type (if any), mouse strain name, tumor frequency, metastasis sites, images (if any) and a tumor summary.
Literature & tutorials Exp Mol Pathol. (2015) Dec;99(3):533-6.
“Help” link on home page, left navigation menu
Updated Current and actively updated
MutationAligner
URL http://www.mutationaligner.org/
What you get Mutation “hotspots” identified in protein domains from over 5000 patients and across 22 cancer types. Hotspots are identified from results of multiple sequence alignment of domains that contain missense mutations between analogous residues.
What you can do Search by protein domain, gene or cancer type. View results in a graphical format displaying mutations by residue across the gene for all known mutations or by cancer type. A multiple sequence alignment of similar domains across other genes is also displayed. A full list of mutated positions shown in the alignment is displayed in table format. Download data that underlies all positions.
Literature & tutorials Nucl. Acids Res. (04 January 2016) 44 (D1): D986-D991. doi: 10.1093/nar/gkv1132
Cell Systems (2015) 1(3), 197–209
Link to “FAQ” on home page
Updated Current and actively updated
Network of Cancer Genes
URL http://ncg.kcl.ac.uk/index.php
What you get Duplicated loci, evolutionary appearance, expression and network properties of protein-coding cancer genes for multiple cancer types
What you can do Search single or multiple cancer genes by symbol, NCBI or Ensembl identifiers; search miRNA-cancer gene interactions with a target gene or miRNA identifier; search for cancer genes in a genomic region. Retrieve information on mutations, orthology, gene expression in normal tissues and cancer cell lines, protein function, duplicated loci, network interactions or miRNA interactions (both viewable in Cytoscape). Browse lists of cancer genes, screenings and possible false positives. Download full list of protein-coding cancer genes.
Literature & tutorials Nucl. Acids Res. (2016) 44 (D1): D992-D999. doi:10.1093/nar/gkv1123
Link to “Help” on home page
Updated Current and actively updated
SNP500Cancer
URL http://snp500cancer.nci.nih.gov/
What you get 102 resequenced samples to find known or newly discovered single nucleotide polymorphisms (SNPs) which are of immediate importance to molecular epidemiology studies in cancer
What you can do Search by assay ID or Hugo gene symbol, retrieve lists of SNPs on gene, chromosome and start-stop coordinates; click on CGF ID to view annotated sequence and allelic frequency in populations
Literature & tutorials Nucl. Acids Res. (2006 )34 (suppl 1): D617-D621. doi: 10.1093/nar/gkj151
“Help” link on home page
Updated Last dbSNP build used: 130; human genome 36.3 (not current)
Stem Cell Commons
URL http://stemcellcommons.org/
What you get Open source environment that brings together stem cell datasets, online tools and codes with experiments and their results.
What you can do Browse by organism, disease, cell type or assays; search by keyword; analyze and perform functional pathway analysis in the Refinery platform (still in private beta); visualize NGS data; download code for development and use. Researchers can register as members of the commons to upload their own data to add to the commons.
Literature & tutorials AMIA Jt Summits Transl Sci Proc. 2013 Mar 18;2013:70. eCollection 2013
Help links on home page under development
Updated Current and actively updated
Stem Cell Discovery Engine
URL http://discovery.hsci.harvard.edu/
What you get Much of the data is being migrated to Stem Cell Commons. See entry for that database.
What you can do At SCDE, users can still use the custom SCDE Galaxy instance with sample data and special tools created by SCDE for pathway fingerprints and gene list comparison.
Literature & tutorials Nucl. Acids Res. (2012) 40 (D1): D984-D991. doi: 10.1093/nar/gkr1051
Link to screencast tutorials on home page
Updated Last modified 05/13/2014
SynLethDB
URL http://histone.sce.ntu.edu.sg/SynLethDB/
What you get Information on synthetic lethality gene pairs for 5 species (human, mouse, fruit fly, worm and yeast) collected from literature, experimental assay data and computational prediction
What you can do Enter gene symbol or Entrez gene ID or upload a list of identifiers. Retrieve networks of SL pairs, view interactions and evidence for each pair, retrieve gene set enrichment analysis results based on enrichment results for SL pairs for a given gene, download SL/SDL pair interaction data, launch statistical analysis of drug interactions from SL pair data in tables
Literature & tutorials Nucl. Acids Res. (04 January 2016) 44 (D1): D1011-D1017. doi: 10.1093/nar/gkv1108
Manual available from link on home page
Updated Last modified 10/02/2015
TCGA Splice Seq
URL http://projects.insilico.us.com/TCGASpliceSeq/
What you get Cross-tumor and tumor-normal alterations in mRNA splicing patterns of The Cancer Genome Atlas project (TCGA) RNASeq data
What you can do Select from among 33 TCGA tumor types, view splice events for a single gene or for tumor/tumor differences or tumor/normal differences, view splice events in a gene graph or in UniProt sequence display, download PSI data for user-selected splice events by tumor types or for specific genes
Literature & tutorials Nucl. Acids Res. (04 January 2016) 44 (D1): D1018-D1022. doi: 10.1093/nar/gkv1288
Tutorial on site under “FAQ”
Updated Last modified 09/10/2015
Tumor Associated Gene database
URL http://www.binfo.ncku.edu.tw/TAG/GeneDoc.php
What you get Functional and structural information on tumor associated genes
What you can do Text search, search for oncogenic domains by protein sequence input, perform consensus analysis on two or more results of a search, browse by chromosome.
Literature & tutorials Bioinformatics 2013 29: 420-427.
Tutorial available from home page of site
Updated Last modified 10/03/2014 (not current)
Tumor Gene Family Databases (TGDBs)
URL http://www.tumor-gene.org/tgdf.html
What you get Information on tumor genes collected from OMIM, GeneCards and literature.
What you can do Search by gene names or symbols or by locus. Search across the entire tumor gene database, or search the breast cancer or oral cancer specific databases.
Literature & tutorials No links to help or tutorials on home page
No abstract summary paper in Nucleic Acids Research Database Issue, but it is NAR Molecular Biology Database Collection entry number 155
Updated Last modified 11/25/2007 (not current)
UCSC Cancer Genome Browser
URL https://genome-cancer.ucsc.edu/
What you get Integrated cancer genomics and clinical data in a heat map-style browser view. Site also includes the deployable Xena browser, which supports analysis of user data with Galaxy for visualization in Xena.
What you can do Select from available data to display, upload experimental data, download data, view and sort data by chromosome, gene or clinical features, display Kaplan-Meier survival plots, upload custom signatures
Literature & tutorials Nucl. Acids Res. (28 January 2015) 43 (D1): D812-D817. doi: 10.1093/nar/gku1073
User guide and tutorial available on site
Updated Continuously updated
UMD BRCA1/BRCA2 databases (Now BRCA Share)
URL http://www.umd.be/BRCA2/ and http://www.umd.be/BRCA1/
What you get Genomic variants on the BRCA1 and BRCA2 genes
What you can do Users must register for access. Government, non-profit and academic users can register for free. You must register for EACH (BRCA1 and BRCA2) database separately. Once registered, browse using the “Variants Classification” link, which shows all variants with validated classification in the last 6 months. Use the “Mutations” link on the left to view and access all mutations by phenotype, significance or location. Results display the location of the mutation, protein change, functional domain (if any), biological significance, validation, date and number of records supporting the mutation. Users can also perform insertion and deletion analyses.
Literature & tutorials Nucl. Acids Res. (2012) 40 (D1): D992-D1002. doi: 10.1093/nar/gkr1160
No links to help or tutorials on home page
Updated Since BRCA Share is a data sharing platform, data are shared and updated by users continuously.
本期内容就到这里,我是白介素2,下期再见,点击下方框框留言。