Title: | Dimension Reduction for Array CGH Data with Minimal Information Loss. |
---|---|
Description: | Dimension Reduction for Array CGH Data with Minimal Information Loss |
Authors: | Sjoerd Vosse & Mark van de Wiel |
Maintainer: | Sjoerd Vosse <[email protected]> |
License: | GPL (http://www.gnu.org/copyleft/gpl.html) |
Version: | 1.65.0 |
Built: | 2024-10-30 04:34:55 UTC |
Source: | https://github.com/bioc/CGHregions |
CGHregions takes as input array CGH data (log2-ratios) that have been segmented (i.e., split into chromosomal segments of similar log2-ratios) and called (i.e., a copy number assigned to each segment) on a per-sample basis and adjusts the segmentation so that break-points that are in similar locations across multiple samples are set to be in identical locations. Segmented and called data can be obtained by using the CGHcall package. The resulting dimensionality reduction facilitates downstream analysis in a variety of ways (e.g., reduces severity of multiple hypothesis testing, facilitates clustering and visualization, reduces computer memory requirements).
Package: | CGHregions |
Type: | Package |
Version: | 0.99.0 |
Date: | 2008-04-29 |
License: | GPL |
Mark van de Wiel and Sjoerd Vosse Maintainer: Mark van de Wiel <[email protected]>
Mark A. van de Wiel and Wessel N. van Wieringen (2007). CGHregions: Dimension Reduction for Array CGH Data with Minimal Information Loss. Cancer Informatics, 2, 55-63.
Dimension Reduction for Array CGH Data with Minimal Information Loss
CGHregions(input, averror=0.01)
CGHregions(input, averror=0.01)
input |
An object of class |
averror |
Maximal information loss allowed. |
Please read the article and the supplementary information for detailed information on the algorithm.
If the input is not an object of class cghCall
it should be either a dataframe or a tabseparated textfile (textfiles must contain a header). The first three columns should contain the name, chromosome and position in bp for each array target respectively. The chromosome and position column must contain numbers only. Following these is a column with log2 ratios for each of your samples. If the input type is a textfile, missing values should be represented as 'NA' or an empty field.
The algorithm reduces the call matrix to a smaller matrix that contains regions rather than individual clones.
The regions consist of consequtive clones the signatures of which are very much alike. The dimension reduction
is potentially for testing and clustering puposes. The amount of information lost by this dimension
reduction is controlled by averror
. The larger averror
, the less regions will result.
This function returns an object of class cghRegions
Mark van de Wiel and Sjoerd Vosse Maintainer: Mark van de Wiel <[email protected]>
Mark A. van de Wiel and Wessel N. van Wieringen (2007). CGHregions: Dimension Reduction for Array CGH Data with Minimal Information Loss. Cancer Informatics, 2, 55-63.
data(WiltingCalled) result <- CGHregions(WiltingCalled)
data(WiltingCalled) result <- CGHregions(WiltingCalled)