Title: | Download datasets from EMBL-EBI Expression Atlas |
---|---|
Description: | This package is for searching for datasets in EMBL-EBI Expression Atlas, and downloading them into R for further analysis. Each Expression Atlas dataset is represented as a SimpleList object with one element per platform. Sequencing data is contained in a SummarizedExperiment object, while microarray data is contained in an ExpressionSet or MAList object. |
Authors: | Maria Keays [aut] , Pedro Madrigal [cre] |
Maintainer: | Pedro Madrigal <[email protected]> |
License: | GPL (>= 3) |
Version: | 1.35.0 |
Built: | 2024-10-30 07:16:51 UTC |
Source: | https://github.com/bioc/ExpressionAtlas |
This is a SimpleList object containing dummy data from some Expression Atlas
experiments, to demonstrate a typical example of the reesults of using the
getAtlasData()
function. The dummy data objects only contain the
first 10 rows of the real data, to save disk space. To get the real data,
download it using getAtlasData()
.
data( "allExps" )
data( "allExps" )
A SimpleList with one element per Expression Atlas dataset.
This is a DataFrame object listing some Expression Atlas experiments, to
demonstrate a typical example of the results of using the
searchAtlasExperiments()
function.
data( "atlasRes" )
data( "atlasRes" )
A DataFrame with one row per Expression Atlas dataset.
This function downloads Expression Atlas experiment summary SimpleList objects based on a vector of ArrayExpress/BioStudies experiment accessions, and returns a list containing these objects.
getAtlasData( experimentAccessions )
getAtlasData( experimentAccessions )
experimentAccessions |
Vector of ArrayExpress/BioStudies accessions for experiments to be downloaded. |
A list with one entry per experiment summary SimpleList object. Entries are named using the ArrayExpres/BioStudies accession of the respective experiment.
# Download some Expression Atlas data into a list. myExperimentSummaries <- getAtlasData( c( "E-GEOD-11175", "E-MTAB-3007", "E-GEOD-21070" ) )
# Download some Expression Atlas data into a list. myExperimentSummaries <- getAtlasData( c( "E-GEOD-11175", "E-MTAB-3007", "E-GEOD-21070" ) )
This function downloads and returns a SimpleList object representing a single Expression Atlas experiment, based on the ArrayExpress accession of the experiment.
getAtlasExperiment( experimentAccession )
getAtlasExperiment( experimentAccession )
experimentAccession |
ArrayExpress experiment accession e.g. "E-GEOD-11175" |
A SimpleList object representing a single Expression Atlas experiment. The SimpleList contains one entry per platform used in the experiment. For sequencing experiments, there is a single entry in the list. For microarray experiments, there is one entry per array design used. Currently Expression Atlas does not support multi-technology (e.g. microarray and RNA-seq) experiments.
For a single-channel microarray experiment, each entry of the list is an ExpressionSet object. For a sequencing experiment, the single entry is a SummarizedExperiment object. Please refer to the relevant documentation on these classes for more information about them.
RNA-seq data
Each SummarizedExperiment object contains the following:
Matrix of raw counts (not normalized), in the assays slot, in a counts element.
Sample annotations, in the colData slot.
Brief outline of methods, from QC of FASTQ files to production of raw counts, in the exptData slot.
Single-channel microarray data
Each ExpressionSet object contains the following:
Matrix of normalized intensity values, in the assayData, accessed via: exprs( expressionSet )
Sample annotations, in the phenoData, accessed via: pData( expressionSet )
Brief outline of normalization method applied, in the experimentData slot, accessed via: preproc( experimentData( expressionSet ) )
# Download the experiment summary for E-GEOD-11175 geod11175 <- getAtlasExperiment( "E-GEOD-11175" ) # See the entries available (in this case array design accessions) names( geod11175 ) # Prints out the following: # [1] "A-AFFY-126" # Get the only ExpressionSet object from this experiment. eset <- geod11175[[ "A-AFFY-126" ]]
# Download the experiment summary for E-GEOD-11175 geod11175 <- getAtlasExperiment( "E-GEOD-11175" ) # See the entries available (in this case array design accessions) names( geod11175 ) # Prints out the following: # [1] "A-AFFY-126" # Get the only ExpressionSet object from this experiment. eset <- geod11175[[ "A-AFFY-126" ]]
This is a SimpleList object containing dummy data from an RNA-seq Expression
Atlas experiment, to demonstrate a typical example of the results of using
the getAtlasData()
function for a subset of results obtained using
searchAtlasExperiments
. This object contains the first 10 rows of
the original data, to save disk space. To get the full dataset, download it
using getAtlasData()
.
data( "rnaseqExps" )
data( "rnaseqExps" )
A SimpleList with one element per Expression Atlas dataset.
This function accepts a vector of sample properties, and optionally a species name, and then searches for matching Expression Atlas experiments.
searchAtlasExperiments( properties, species = NULL )
searchAtlasExperiments( properties, species = NULL )
properties |
Character vector of sample properties to search Atlas for. These can be biological characteristics, experimental treatments, etc. |
species |
Optional. The name of a species to limit results to. If not provided, search is performed across all species in Expression Atlas. |
A DataFrame containing the ArrayExpress/BioStudies accessions, the species, experiment types, and titles of Expression Atlas experiments matching the query.
# Search for experiments on salt in oryza (rice) atlasRes <- searchAtlasExperiments( properties = "salt", species = "oryza" ) # Download data for first experiment found. if ( nrow( atlasRes ) == 1 ) { atlasData <- getAtlasData( atlasRes$Accession ) } else { atlasData <- getAtlasData( atlasRes$Accession[1] ) }
# Search for experiments on salt in oryza (rice) atlasRes <- searchAtlasExperiments( properties = "salt", species = "oryza" ) # Download data for first experiment found. if ( nrow( atlasRes ) == 1 ) { atlasData <- getAtlasData( atlasRes$Accession ) } else { atlasData <- getAtlasData( atlasRes$Accession[1] ) }