Title: | R functions for the normalization of Exiqon miRNA array data |
---|---|
Description: | This package contains functions for reading raw data in ImaGene TXT format obtained from Exiqon miRCURY LNA arrays, annotating them with appropriate GAL files, and normalizing them using a spike-in probe-based method. Other platforms and data formats are also supported. |
Authors: | Sylvain Gubian <[email protected]>, Alain Sewer <[email protected]>, PMP SA |
Maintainer: | Sylvain Gubian <[email protected]> |
License: | GPL-2 |
Version: | 2.49.0 |
Built: | 2024-11-19 03:40:05 UTC |
Source: | https://github.com/bioc/ExiMiR |
This package contains functions for reading raw data in ImaGene TXT format obtained from Exiqon miRCURY LNA arrays, annotating them with appropriate GAL files, and normalizing them using a spike-in probe-based method. Other platforms and data formats are also supported by ExiMiR.
Package: | ExiMiR |
Type: | Package |
Version: | 1.99.0 |
Date: | 2012-04-24 |
License: | GPL-2 |
LazyLoad: | yes |
Sylvain Gubian, Alain Sewer, PMP SA
Maintainer: [email protected]
This function performs background correction on an AffyBatch object.
The methods supported by bg.correct.miR are provided by the affy or limma packages, depending on whether the input AffyBatch object has been created with ReadAffy or ReadExi/createAB, respectively.
bg.correct.miR(abatch, bgcorrect.method='auto', bgcorrect.param=list(), verbose=FALSE)
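abatch | An AffyBatch object. |
bgcorrect.method | Character vector. It contains the name of the background correction method. Running NormiR.bgcorrect.methods(abatch) lists the available methods. The default value is 'auto'. |
bgcorrect.param | A list of parameters passed to the background correction method, for example list(offset=50) with 'normexp'. The default value is list(). |
verbose | Logical. The default value is FALSE. |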
See accompanying vignette.
An AffyBatch object containing the background-corrected raw expression data.
Sylvain Gubian, Alain Sewer, PMP SA
NormiR.bgcorrect.methods, NormiR.
data(galenv)
data(GSE20122)
NormiR.bgcorrect.methods(GSE20122)
GSE20122.bgcorrected <- bg.correct.miR(GSE20122, bgcorrect.method='normexp', bgcorrect.param=list(offset=50))
This function creates an AffyBatch object from a limma object (RGList, EListRaw, MAList) or from any appropriate list object.
createAB(object, verbose=TRUE, ref.channel="R", genes.block=NULL, genes.row=NULL, genes.col=NULL, genes.id=NULL, genes.name=NULL, galname=NULL, env.overwrite=TRUE, ...)
object | An appropriate limma object (RGList, EListRaw, MAList) or list object. |
verbose | Logical. The default value is TRUE. |
genes.block |
Optional character vector in case the platform is neither ImaGene, Exiqon nor Agilent. The name of the column
in the |
genes.row |
Optional character vector in case the platform is neither ImaGene, Exiqon nor Agilent. The name of the column
in the |
genes.col |
Optional character vector in case the platform is neither ImaGene, Exiqon nor Agilent. The name of the column
in the |
genes.id |
Optional character vector in case the platform is neither ImaGene, Exiqon nor Agilent. The name of the column
in the |
genes.name |
Optional character vector in case the platform is neither ImaGene, Exiqon nor Agilent. The name of the column
in the |
ref.channel |
Character vector. The value of the reference channel for two-color arrays ('R' or 'G') |
galname | Character vector. The default value is NULL. |
env.overwrite | Logical. The default value is TRUE. |
... |
Any additional argument that can be given to the |
See accompanying vignette.
An AffyBatch object containing the raw expression data.
Sylvain Gubian, Alain Sewer, PMP SA
ReadExi, make.gal.env.
The galenv environment is a hash table for the annotation of the Exiqon miRCURY LNA arrays used in the GEO series GSE20122 (Exiqon miRCURY LNA array v.11).
See accompanying vignette.
make.gal.env.
The AffyBatch object GSE19183 contains the raw expression data from the CEL files of the GEO series GSE19183, obtained from the Affymetrix miRNA-1_0 platform. The annotation is included in the AffyBatch object.
See accompanying vignette.
The AffyBatch object GSE20122 contains the raw expression data from the ImaGene TXT files of the GEO series GSE20122, obtained from the Exiqon miRCURY LNA platform v.11.
See accompanying vignette.
Reads an Exiqon GAL file and creates an annotation environment used as a hash table for the probeset mapping location.
make.gal.env(galname=NULL, filename=NULL, gal.path=getwd(), verbose=FALSE)
galname |
Character vector. Name to be used for the annotation environment. |
filename |
Character vector. Name of the GAL file. |
gal.path |
Character vector. Path to the GAL file. |
verbose | Logical. The default value is FALSE. |
This function is designed similarly to make.cdf.env from the makecdfenv package.
If no filename is provided as argument, the function tries to read the first GAL file in the input path.
The returned environment is a hash table. For each probeset name it stores a matrix with 2 columns: the first column contains the PM locations and the second column the MM locations. For PM-only chips the MM column contains NAs.
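The layout described above can be pictured with a small base R sketch. It does not call make.gal.env; the probeset names and location indices are made up for illustration, and the column names pm and mm are an assumption rather than something the package guarantees.

```r
# Hypothetical probeset names and location indices, for illustration only.
toy_env <- new.env(hash = TRUE)

# PM-only probesets: the MM column is filled with NA.
assign("probeset_A",
       cbind(pm = c(101L, 102L, 103L, 104L), mm = NA_integer_),
       envir = toy_env)
assign("probeset_B",
       cbind(pm = c(205L, 206L), mm = NA_integer_),
       envir = toy_env)

# Look up one probeset and list all stored probeset names.
locs <- get("probeset_A", envir = toy_env)
print(locs)
print(sort(ls(toy_env)))
```

Each lookup by probeset name returns the two-column matrix of locations, which is the access pattern a hash-table environment is meant to make cheap.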
None.
Sylvain Gubian, Alain Sewer, PMP SA
# The folder 'Exiqon' contains a GAL file
## Not run:
make.gal.env(galname='galenv', gal.path='Exiqon')
This function performs low-level normalization on an AffyBatch object and returns the result in a new AffyBatch object.
By default, it applies the spike-in probe-based normalization method. If the spike-in probe-based method cannot be applied, a median normalization is executed instead. However, several options make it possible to force the execution of the spike-in probe-based normalization and to fine-tune the resulting correction functions.
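To give an idea of what a median normalization does, here is a minimal base R sketch. It shows one common variant (shifting each array on the log2 scale so that all array medians coincide) and is not taken from the ExiMiR source; the function name median_normalize and the intensity values are hypothetical.

```r
# Toy intensity matrix: rows are probes, columns are arrays (made-up values).
raw <- matrix(c(100, 200, 400,
                150, 300, 600,
                 50, 100, 200),
              nrow = 3,
              dimnames = list(paste0("probe", 1:3), paste0("array", 1:3)))

# Hypothetical helper, not part of ExiMiR.
median_normalize <- function(x) {
  log_x <- log2(x)
  array_medians <- apply(log_x, 2, median)
  # Shift each array so that its median matches the mean of all array medians.
  shifted <- sweep(log_x, 2, array_medians - mean(array_medians))
  2^shifted
}

normalized <- median_normalize(raw)
print(apply(normalized, 2, median))
```

After the shift, every array has the same median intensity, while the relative differences between probes within an array are unchanged.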
norm.miR(abatch, normalize.method="spikein", normalize.param=list(), verbose=TRUE, ...)
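abatch | An AffyBatch object. |
normalize.method | Character vector. It contains the name of the normalization method. By default, the spike-in probe-based method ('spikein') is used. |
normalize.param | A list of parameters for the normalization method. NormiR.spikein.args() lists the argument names available for the spike-in probe-based method. The default value is list(). |
verbose | Logical. The default value is TRUE. |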
... |
Any additional argument. Used for backward compatibility. |
See accompanying vignette.
An AffyBatch object containing the normalized (but not summarized) expression data.
Sylvain Gubian, Alain Sewer, PMP SA
NormiR.normalize.methods, NormiR.spikein.args, NormiR.
data(galenv)
data(GSE20122)
GSE20122.normalized <- norm.miR(GSE20122, normalize.param=list(figures.show=FALSE))
# Apply the affy method hist on the generated AffyBatch object GSE20122.normalized
layout(matrix(c(1,2), 1, 2, byrow = TRUE))
hist(GSE20122)
hist(GSE20122.normalized)
layout(1)
This function applies a standard raw data normalization pipeline (i.e. background correction, normalization, PM correction if needed, and summarization) to the input AffyBatch object and returns the result in an ExpressionSet object.
The methods supported by NormiR for the background correction are provided by the affy or limma packages, depending on whether the input AffyBatch object has been created with ReadAffy or ReadExi/createAB, respectively.
By default, it applies the spike-in probe-based method for the second step, normalization. If the spike-in probe-based method cannot be applied, a median normalization is executed instead. However, several options make it possible to force the execution of the spike-in probe-based normalization and to fine-tune the resulting correction functions.
The next step, PM correction, is enabled only when numerical values are available for the MM probes of the input AffyBatch object. In this case the methods proposed by NormiR are provided by the affy package.
The methods supported by NormiR for the last step, summarization, are also provided by the affy package. They do not depend on how the input AffyBatch object has been created.
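The summarization step can be illustrated outside the package with Tukey's median polish from base R (stats::medpolish). This is only a sketch of the idea behind a 'medianpolish' summary, assuming log2 intensities for the probes of a single probeset; the values are made up and the code does not reproduce the affy implementation.

```r
# Toy log2 intensities for one probeset: rows are probes, columns are arrays.
probes <- matrix(c(8.0, 8.5, 7.5,
                   9.0, 9.5, 8.5,
                   8.2, 8.7, 7.7),
                 nrow = 3,
                 dimnames = list(paste0("probe", 1:3), paste0("array", 1:3)))

# Fit the additive model: value = overall + probe effect + array effect + residual.
fit <- medpolish(probes, trace.iter = FALSE)

# One summarized value per array: overall level plus the array (column) effect.
summary_values <- fit$overall + fit$col
print(summary_values)
```

The probe (row) effects absorb systematic differences between probes, so the per-array values are robust to a few outlying probes.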
NormiR(abatch,
       # background correction
       bg.correct=TRUE,
       bgcorrect.method='auto',
       bgcorrect.param=list(),
       # normalize
       normalize=TRUE,
       normalize.method='spikein',
       normalize.param=list(),
       # pm correction (enabled only when MM-values are available)
       pmcorrect.method='pmonly',
       pmcorrect.param=list(),
       # expression values
       summary.method='medianpolish',
       summary.param=list(),
       summary.subset=NULL,
       # misc.
       verbose=FALSE,
       ...)
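abatch | An AffyBatch object. |
bg.correct | Logical. Default is TRUE. |
bgcorrect.method | Character vector. It contains the name of the background correction method. Running NormiR.bgcorrect.methods(abatch) lists the available methods. The default value is 'auto'. |
bgcorrect.param | A list of parameters passed to the background correction method. The default value is list(). |
normalize | Logical. Default is TRUE. |
normalize.method | Character vector. It contains the name of the normalization method. By default, the spike-in probe-based method ('spikein') is used. |
normalize.param | A list of parameters for the normalization method. NormiR.spikein.args() lists the argument names available for the spike-in probe-based method. The default value is list(). |
pmcorrect.method | Character vector. It contains the name of the PM correction method, which is enabled only when numerical values are available for the MM probes of the input AffyBatch object. Running NormiR.pmcorrect.methods(abatch) lists the available methods. The default value is 'pmonly'. |
pmcorrect.param | A list of parameters for the PM correction method. The default value is list(). |
summary.method | Character vector. It contains the name of the summarization method. Running NormiR.summary.methods() lists the available methods. The default value is 'medianpolish'. |
summary.param | A list of parameters for the summarization method. The default value is list(). |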
summary.subset |
A |
verbose | Logical. The default value is FALSE. |
... |
Any additional argument. Used for backward compatibility. |
See accompanying vignette.
An ExpressionSet object containing the normalized expression data.
Sylvain Gubian, Alain Sewer, PMP SA
bg.correct.miR, NormiR.bgcorrect.methods, norm.miR, NormiR.normalize.methods, NormiR.spikein.args, NormiR.pmcorrect.methods, summarize.miR, NormiR.summary.methods.
data(galenv)
data(GSE20122)
eset.spike <- NormiR(GSE20122, bg.correct=FALSE, normalize.method='spikein', summary.method='medianpolish')
These functions enumerate the names of methods or arguments of the low-level functions for miRNA raw data normalization (i.e. background correction, (spike-in probe-based) normalization, PM correction, summarization). They take into account how the input AffyBatch object was created as well as the underlying array type.
NormiR.bgcorrect.methods(object)
NormiR.normalize.methods(object)
NormiR.pmcorrect.methods(object)
NormiR.summary.methods()
NormiR.spikein.args()
object | An AffyBatch object. |
See accompanying vignette.
List of strings containing the names of the methods or arguments available for the input AffyBatch object.
Sylvain Gubian, Alain Sewer, PMP SA
NormiR, bg.correct.miR, norm.miR, summarize.miR.
This function reads Exiqon raw data in ImaGene file format and creates an AffyBatch object.
ReadExi(txtfile.path=getwd(), galname=NULL, description=NULL, notes='', rm.background=FALSE, verbose=TRUE)
txtfile.path | Character vector. It contains the path to the folder containing the ImaGene TXT files and the samplesinfo.txt file. The default value is getwd(). |
galname | Character vector. The default value is NULL. |
description |
Object of class MIAME, as specified in the documentation of the |
notes |
Character vector, as specified in the documentation of the |
rm.background |
Logical. This option is kept for compatibility reasons but it is not used anymore. See
the |
verbose | Logical. The default value is TRUE. |
The Exiqon miRNA raw expression data are normally in ImaGene TXT file format and accompanied by a samplesinfo.txt description file, which enumerates the names of the sample files for each channel. Therefore the txtfile.path argument of ReadExi must be a folder that contains both the ImaGene files and the samplesinfo.txt file. If this is not the case, ReadExi stops.
The galname argument of ReadExi must be the name of a GAL annotation environment created with the make.gal.env or the ReadExi functions. If galname is NULL, which is the default, a minimal GAL annotation environment is created based on the annotation contained in the ImaGene TXT files.
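Because ReadExi stops when the folder is incomplete, it can be convenient to inspect the folder first. The following base R sketch uses a hypothetical helper, check_exiqon_folder, which is not part of ExiMiR; it assumes that the raw data files carry a .txt extension and only checks for their presence, not their content.

```r
# Hypothetical helper, not part of ExiMiR: inspects a folder before calling ReadExi.
check_exiqon_folder <- function(path) {
  has_info <- file.exists(file.path(path, "samplesinfo.txt"))
  txt_files <- list.files(path, pattern = "\\.txt$", ignore.case = TRUE)
  # Everything except the description file is counted as a raw data file.
  data_files <- setdiff(txt_files, "samplesinfo.txt")
  list(has_samplesinfo = has_info, n_data_files = length(data_files))
}

# Demonstration on a temporary folder with empty placeholder files.
demo_dir <- file.path(tempdir(), "Exiqon_demo")
dir.create(demo_dir, showWarnings = FALSE)
file.create(file.path(demo_dir, c("samplesinfo.txt", "sample1_Hy3.txt")))
status <- check_exiqon_folder(demo_dir)
print(status)
```

A check of this kind only guards against a missing description file or an empty folder; whether the files listed in samplesinfo.txt match the data files is still verified by ReadExi itself.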
An AffyBatch object containing the raw expression data.
The image method of the AffyBatch object might not work properly when the galname argument of ReadExi has not been assigned.
Sylvain Gubian, Alain Sewer, PMP SA
make.gal.env, createAB.
# The folder 'Exiqon' contains the file 'samplesinfo.txt' and the corresponding raw data files in ImaGene format
## Not run:
ebatch <- ReadExi(txtfile.path='Exiqon')
# If the GAL environment has already been created by the function make.gal.env
## Not run:
ebatch <- ReadExi(galname='galenv', txtfile.path='Exiqon')
This function performs summarization on an AffyBatch object using a GAL or CDF annotation environment and generates an ExpressionSet object containing the results.
summarize.miR(abatch, pmcorrect.method='pmonly', pmcorrect.param=list(), summary.method='medianpolish', summary.param=list(), summary.subset=NULL)
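abatch | An AffyBatch object. |
pmcorrect.method | Character vector. It contains the name of the PM correction method, which is enabled only when numerical values are available for the MM probes of the input AffyBatch object. Running NormiR.pmcorrect.methods(abatch) lists the available methods. The default value is 'pmonly'. |
pmcorrect.param | A list of parameters for the PM correction method. The default value is list(). |
summary.method | Character vector. It contains the name of the summarization method. Running NormiR.summary.methods() lists the available methods. The default value is 'medianpolish'. |
summary.param | A list of parameters for the summarization method. The default value is list(). |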
summary.subset |
A |
An ExpressionSet object containing the summarized expression data.
Sylvain Gubian, Alain Sewer, PMP SA
NormiR.pmcorrect.methods, NormiR.summary.methods, NormiR.
data(galenv)
data(GSE20122)
eset <- summarize.miR(GSE20122, summary.method="medianpolish")