Title: | Tools for parsing Affymetrix data files |
---|---|
Description: | Routines for parsing Affymetrix data files based upon file format information. Primary focus is on accessing the CEL and CDF file formats. |
Authors: | Ben Bolstad <[email protected]> |
Maintainer: | Ben Bolstad <[email protected]> |
License: | LGPL (>= 2) |
Version: | 1.77.0 |
Built: | 2024-10-30 03:25:04 UTC |
Source: | https://github.com/bioc/affyio |
This function returns a text string giving the file format for the supplied filename
check.cdf.type(filename)
check.cdf.type(filename)
filename |
fullpath to a cdf file |
Returns a string which is currently one of:
text |
the cdf file is of the text format |
xda |
the cdf file is of the binary format used in GCOS |
unknown |
the parser can not handle this format or does not recognize this file as a CDF file |
B. M. Bolstad <[email protected]>
This function reads the header information for a series of CEL files then extracts and returns the dates.
get.celfile.dates(filenames, ...)
get.celfile.dates(filenames, ...)
filenames |
a vector of characters with the CEL filenames. May be fully pathed. |
... |
further arguments passed on to |
The function uses read.celfile.header
to read in the header of each file. The ScanDate
component is then parsed to extract the date.
Note that an assumption is made about the format. Namely, that dates are in the Y-m-d or m/d/y format.
A vector of class Date
with one date for each celfile.
Rafael A. Irizarry <[email protected]>
See Also as read.celfile.header
.
This function reads the entire contents of a cdf file into an R list structure
read.cdffile.list(filename, cdf.path = getwd())
read.cdffile.list(filename, cdf.path = getwd())
filename |
name of CDF file |
cdf.path |
path to cdf file |
Note that this function can be very memory intensive with large CDF files.
returns a list
structure. The exact contents may vary
depending on the file format of the cdf file (see check.cdf.type
)
B. M. Bolstad <[email protected]>
This function reads the entire contents of a CEL file into an R list structure
read.celfile(filename,intensity.means.only=FALSE)
read.celfile(filename,intensity.means.only=FALSE)
filename |
name of CEL file |
intensity.means.only |
If |
The list has four main items. HEADER, INTENSITY, MASKS, OUTLIERS. Note that INTENSITY is a list of three vectors MEAN, STDEV, NPIXELS. HEADER is also a list. Both of MASKS and OUTLIERS are matrices.
returns a list
structure. The exact contents may vary
depending on the file format of the CEL file
B. M. Bolstad <[email protected]>
This function reads some of the header information (appears before probe intensity data) from the supplied cel file.
read.celfile.header(filename,info=c("basic","full"),verbose=FALSE)
read.celfile.header(filename,info=c("basic","full"),verbose=FALSE)
filename |
name of CEL file. May be fully pathed |
info |
A string. |
verbose |
a |
A list
data structure.
B. M. Bolstad <[email protected]>
This function reads PM, MM or both types of intensities into matrices. These matrices have all the probes for a probeset in adjacent rows
read.celfile.probeintensity.matrices(filenames, cdfInfo, rm.mask=FALSE, rm.outliers=FALSE, rm.extra=FALSE, verbose=FALSE, which= c("pm","mm","both"))
read.celfile.probeintensity.matrices(filenames, cdfInfo, rm.mask=FALSE, rm.outliers=FALSE, rm.extra=FALSE, verbose=FALSE, which= c("pm","mm","both"))
filenames |
a character vector of filenames |
cdfInfo |
a list with items giving PM and MM locations for
desired probesets. In same structure as returned by |
rm.mask |
a |
rm.outliers |
a |
.
rm.extra |
a |
.
verbose |
a |
which |
a string specifing which probe type to return |
returns a list
of matrix
items. One
matrix contains PM probe intensities, with probes in rows and arrays
in columns
B. M. Bolstad <[email protected]>