The Genomic Data Commons (GDC) maintains standardized genomic, clinical, and biospecimen data from National Cancer Institute (NCI) programs including The Cancer Genome Atlas (TCGA) and Therapeutically Applicable Research To Generate Effective Treatments (TARGET), It also accepts high quality datasets from non-NCI supported cancer research programs, such as genomic data from the Foundation Medicine.
GDCRNATools
is an R package which provides a standard,
easy-to-use and comprehensive pipeline for downloading, organizing, and
integrative analyzing RNA expression data in the GDC portal with an
emphasis on deciphering the lncRNA-mRNA related ceRNA regulatory network
in cancer.
Many analyses can be perfomed using GDCRNATools, including differential gene expression analysis (limma, edgeR, and DESeq2), univariate survival analysis (CoxPH and KM), competing endogenous RNA network analysis (hypergeometric test, Pearson correlation analysis, regulation similarity analysis, sensitivity Pearson partial correlation), and functional enrichment analysis(GO, KEGG, DO). Besides some routine visualization methods such as volcano plot, scatter plot, and bubble plot, etc., three simple shiny apps are developed in GDCRNATools allowing users visualize the results on a local webpage.
This user-friendly package allows researchers perform the analysis by simply running a few functions and integrate their own pipelines such as molecular subtype classification, weighted correlation network analysis (WGCNA), and TF-miRNA co-regulatory network analysis, etc. into the workflow easily.
GDCRNATools
is now under review in Bioconductor. Users
can install the package from github temporarily using the
install_github()
function in devtools
package.
Code to install and load devtools
:
Code to install and load GDCRNATools
:
A simply manual of GDCRNATools
is available online GDCRNATools
Manual. Users are also highly recommended to download the
comprhensive manual in .html format and view on local computer
GDCRNATools
Manual