Title: | Normalization of array CGH data with imbalanced aberrations. |
---|---|
Description: | Normalization and centralization of array comparative genomic hybridization (aCGH) data. The algorithm uses an iterative procedure that effectively eliminates the influence of imbalanced copy numbers. This leads to a more reliable assessment of copy number alterations (CNAs). |
Authors: | Bart P.P. van Houte, Thomas W. Binsl, Hannes Hettling |
Maintainer: | Bart P.P. van Houte <[email protected]> |
License: | GPL (>= 3) |
Version: | 1.61.0 |
Built: | 2024-10-30 04:35:15 UTC |
Source: | https://github.com/bioc/CGHnormaliter |
CGHnormaliter is an iterative algorithm for normalization of aCGH data displaying imbalanced aberrations. First, provisory balanced copy numbers are identified and subsequently used for normalization based on LOWESS. These two steps are then iterated to refine the normalization. The assumption here is that the temporary exclusion of aberrations allows for a more appropriate calculation of the LOWESS regression curve. As a result, after normalization, the log2 intensity ratios of the normals will generally be closer to zero and better reflect the biological reality.
Package: | CGHnormaliter |
Type: | Package |
Version: | 1.5.6 |
Date: | 2011-04-06 |
License: | GPL (>= 3) |
The package contains two public functions. The function CGHnormaliter
performs the iterative normalization of aCGH data, while the function CGHnormaliter.write.table
prints normalized aCGH data to a file. See function documentation for details.
Bart P.P. van Houte, Thomas W. Binsl, Hannes Hettling
Bart P.P. van Houte, Thomas W. Binsl, Hannes Hettling, Walter Pirovano and Jaap Heringa. CGHnormaliter: an iterative strategy to enhance normalization of array CGH data with imbalanced aberrations. BMC Genomics, 10:401, 2009.
CGHcall
,
DNAcopy
Implementation of an iterative algorithm for normalization of aCGH data displaying imbalanced aberrations.
CGHnormaliter(data, nchrom = 24, cellularity = 1, max.losses = 0.3, plot.MA = TRUE, ...)
CGHnormaliter(data, nchrom = 24, cellularity = 1, max.losses = 0.3, plot.MA = TRUE, ...)
data |
either a dataframe or character string containing a filename. See section Details for the format. |
nchrom |
number of chromosomes. |
cellularity |
vector of cellularities ranging from 0 to 1 to define the contamination of your sample with healthy cells (1 = no contamination). See Details for more information. |
max.losses |
vector of maximum fraction of losses in your samples, ranging from 0 to 1. In case this fraction is exceeded after calling, losses are deemed normals and centralization will be around the (lower) peak at the left of the log2 ratio distribution. See Details for more information. |
plot.MA |
logical. If ‘plot.MA = TRUE’, MA-plots before and after normalization are generated of each sample. The plots are saved into a PDF file. |
... |
arguments for |
The input should be either a data.frame
or the file name of a tabseparated text file (text files must contain a header). The first four columns should contain the name, chromosome and the start and end position in bp for each array target respectively. The position columns must contain numbers only. Following these are two columns with the raw test and reference intensities for each of your samples. These intensities must be numeric as well. If the input type is a text file, missing values should be represented as 'NA' or an empty field.
The cellularity and max.losses parameters should both be a vector of length n
where n
is the number of
samples in your dataset. Each vector is recycled if there are not enough values in it, or truncated if there are too many.
There is a CGHnormaliter.write.table
method that prints the results in a tabular format.
This function returns a matrix of objects of class cghCall
with dimension (number of clones) * (number of samples). Each object contains the following components (See section Examples on how to access them):
normalized data |
A matrix with the normalized log2 intensity ratios for each profile. |
segments |
A matrix with the segments for each profile. |
calls |
A |
probabilities |
A |
Bart P.P. van Houte, Thomas W. Binsl, Hannes Hettling
Bart P.P. van Houte, Thomas W. Binsl, Hannes Hettling, Walter Pirovano and Jaap Heringa. CGHnormaliter: an iterative strategy to enhance normalization of array CGH data with imbalanced aberrations. BMC Genomics, 10:401, 2009.
data(Leukemia) ## Normalize the raw intensity values of the first 4 chromosomes. result <- CGHnormaliter(Leukemia, nchrom=4) ## Get the normalized log2 intensity ratios, segments and calls normalized.data <- copynumber(result) segmented.data <- segmented(result) called.data <- calls(result) ## Plot the normalization result of sample 2 plot(result[, 2]) ## Write the normalized log2 intensity ratios to file CGHnormaliter.write.table(result)
data(Leukemia) ## Normalize the raw intensity values of the first 4 chromosomes. result <- CGHnormaliter(Leukemia, nchrom=4) ## Get the normalized log2 intensity ratios, segments and calls normalized.data <- copynumber(result) segmented.data <- segmented(result) called.data <- calls(result) ## Plot the normalization result of sample 2 plot(result[, 2]) ## Write the normalized log2 intensity ratios to file CGHnormaliter.write.table(result)
This function stores the results of a CGHnormaliter normalization into a plain text file.
CGHnormaliter.write.table(input, data.type=c("normalized","segmented","called"), file=paste(data.type,".txt", sep=""))
CGHnormaliter.write.table(input, data.type=c("normalized","segmented","called"), file=paste(data.type,".txt", sep=""))
input |
result object of a CGHnormaliter normalization. |
data.type |
type of aCGH data to store. |
file |
file name to store the data in. |
Available aCGH data types are:
normalized: The normalized log2 intensity ratios
segmented: The segmented log2 intensity ratios (after normalization)
called: The calls (after normalization)
The selected data are stored into a tabseparated file with the specified file name.
Bart P.P. van Houte, Thomas W. Binsl, Hannes Hettling
Bart P.P. van Houte, Thomas W. Binsl, Hannes Hettling, Walter Pirovano and Jaap Heringa. CGHnormaliter: an iterative strategy to enhance normalization of array CGH data with imbalanced aberrations. BMC Genomics, 10:401, 2009.
data(Leukemia) ## Normalize the intensity ratios ## Not run: result <- CGHnormaliter(Leukemia) ## Write the result to a file ## Not run: CGHnormaliter.write.table(result) ## Not run: CGHnormaliter.write.table(result, type="segmented") ## Not run: CGHnormaliter.write.table(result, type="called", file="mycalls.txt")
data(Leukemia) ## Normalize the intensity ratios ## Not run: result <- CGHnormaliter(Leukemia) ## Write the result to a file ## Not run: CGHnormaliter.write.table(result) ## Not run: CGHnormaliter.write.table(result, type="segmented") ## Not run: CGHnormaliter.write.table(result, type="called", file="mycalls.txt")
A dataframe containing 30180 rows and 10 columns, representing the array CGH data of 3 ALL samples.
Leukemia
Leukemia
A dataframe containing the following 10 columns:
The unique identifiers of array elements.
Chromosome number of each array element.
Chromosomal start position in bp of each array element.
Chromosomal end position in bp of each array element.
Background corrected test intensity values for sample 1.
Background corrected reference intensity values for sample 1.
Background corrected test intensity values for sample 2.
Background corrected reference intensity values for sample 2.
Background corrected test intensity values for sample 3.
Background corrected reference intensity values for sample 3.
Provided by the authors (see references).
Paulsson K, Heidenblad M, Morse H, Borg A, Fioretos T, Johansson B: Identication of cryptic aberrations and characterization of translocation breakpoints using array CGH in high hyperdiploid childhood acute lymphoblastic leukemia. Leukemia 2006, 20:2002-2007.