Package 'TransView' reference manual

Title:	Read density map construction and accession. Visualization of ChIPSeq and RNASeq data sets
Description:	This package provides efficient tools to generate, access and display read densities of sequencing based data sets such as from RNA-Seq and ChIP-Seq.
Authors:	Julius Muller
Maintainer:	Julius Muller <[email protected]>
License:	GPL-3
Version:	1.51.2
Built:	2025-03-05 03:32:14 UTC
Source:	https://github.com/bioc/TransView

Read density map construction and accession. Visualization of ChIPSeq and RNASeq data sets.

Description

This package provides efficient tools to generate, access and display read densities of sequencing based data sets such as from RNA-Seq and ChIP-Seq.

Details

Package:	TransView
Type:	Package
Version:	1.7.4
URL:	http://bioconductor.org/packages/release/bioc/html/TransView.html
License:	GPL-3
LazyLoad:	yes
Depends:	methods,GenomicRanges
Imports:	gplots,IRanges
Suggests:	RUnit,pasillaBamSubset
biocViews:	Bioinformatics,DNAMethylation,GeneExpression,Transcription, Microarray,Sequencing,HighThroughputSequencing,ChIPseq,RNAseq, Methylseq,DataImport,Visualization,Clustering,MultipleComparisons
LinkingTo:	Rhtslib

Index:

DensityContainer-class
                        Class '"DensityContainer"'
TVResults-class         Class '"TVResults"'
TransView-package       The TransView package: Construction and
                        visualisation of read density maps.
annotatePeaks           Associates peaks to TSS
gtf2gr                  GTF file parsing
histogram-methods       Histogram of the read distribution
macs2gr                 Convenience function for MACS output conversion
parseReads              User configurable efficient assembly of read
                        density maps
peak2tss                Changes the peak center to the TSS
plotTV                  Plot and cluster global read densities
plotTVData              Summarize plotTV results
rmTV                    Free space occupied by DensityContainer
slice1                  Slice read densities from a TransView dataset
slice1T                 Slice read densities of whole transcripts from
                        a TransView DensityContainer
tvStats-methods         DensityContainer accessor function

Further information is available in the following vignettes:

`TransView`	An introduction to TransView (source, pdf)

Author(s)

Julius Muller

Maintainer: Julius Muller <[email protected]>

Examples

#see vignette
#see vignette

Associates peaks to TSS

Description

A convenience function to associate the peak center to a TSS or gene body provided by a gtf file.

Usage

annotatePeaks(peaks, gtf, limit=c(-10e3,10e3), remove_unmatched=T, unifyBy=F, unify_fun="mean", min_genelength=0, reference="tss")
annotatePeaks(peaks, gtf, limit=c(-10e3,10e3), remove_unmatched=T, unifyBy=F, unify_fun="mean", min_genelength=0, reference="tss")

Arguments

`peaks`	A GRanges object.
`gtf`	A GRanges object with a meta data column ‘transcript_id’ and ‘exon_id’ like e.g. from `gtf2gr`.
`limit`	Maximal distance range for a peak - TSS association in base pairs.
`remove_unmatched`	If TRUE, only TSS associated peaks will be returned.
`unifyBy`	If a transcript has multiple isoforms, the peak will be associated arbitrarily to the first ID found. In order associate a peak to an isoform with specific characteristics, a `DensityContainer` can be provided. The choice of the returned isoform will be made based on unify_fun.
`unify_fun`	A function which will choose the isoform in case of non unique peak - TSS associations. Defaults to the isoform with the highest mean score `function(x){mean(x)}`.
`min_genelength`	Genes with a total sum of all exons smaller than this value will not be associated to a peak.
`reference`	If set to ‘tss’, the transcript with the smallest distance from the TSS to the peak center will be returned. If set to ‘gene_body’ the transcript with the smallest distance from the gene body (TSS or TES) to the peak center will be returned and the distance will be zero if the peak center is located within the gene body.

Details

Convenience function to annotate a GRanges object having one row per peak from e.g. macs2gr. The resulting peak - TSS associations can be customized by the restricting the distance and resolving multiple matches using unify_fun.

Value

GRanges object with row names according to the peak names provided and an added or updated meta data column ‘transcript_id’ with the associated transcript IDs and distances.

`gtf_file`	Character string with the filename of the gtf file. Fileformats from USCS and ENSEMBL are supported and gzip compression is supported.
`chromosomes`	A character vector with the chromosomes. Restricts the output to the case insensitive matching chromosomes.
`refseq_nm`	An option for GTF files based on RefSeq annotation. If TRUE only identifiers beginning with NM_ will be used.
`gtf_feature`	Defines the GTF feature types to be returned.
`transcript_id`	Defines name of the attribute within the attribute list which should be used as transcript IDs.
`gene_id`	Defines name of the attribute within the attribute list which should be used as gene IDs.

`macs_peaks_xls`	Full path to the file ending with ‘_peaks.xls’ located in the output folder of a MACS run.
`psize`	An integer setting the total length of the peaks. Setting psize to ‘preserve’ will keep the original peak lengths from the output file and override `peak_mid`. Note that this is not compatible with `plotTV`
`amount`	Amount of peaks returned. If an integer is provided, the returned peaks will be limited to this amount after sorting by pile up score.
`min_pileup`	Minimum pile up.
`log10qval`	Minimal log10 q-value
`log10pval`	Minimal log10 p-value
`fenrichment`	Minimal enrichment.
`peak_mid`	If set to ‘summit’, the peaks with length `psize` will centered on the peak summit. If set to ‘center’, the mid point of start and end will be used.

`...`	DensityContainer objects
`region`	Can be one entry of the annotated output of annotatePeaks or a GRanges object with one entry and with a transcript_id and distance metadata column.
`control`	An optional vector of DensityContainer objects, that have to match the order of experiments passed as a first argument. E.g. `plotTV(ex1.ChIP,ex2.ChIP,control=c(ex1.Input,ex2.Input)`. The content will be treated as background densities and subtracted from the matching experiment.
`peak_windows`	If set to an integer greater than 0, all binding profiles will be interpolated into this amount of windows by the method specified by `bin_method`.
`bin_method`	Specifies the function used to summarize the bins specified by nbins. Possible methods are ‘max’, ‘mean’, ‘median’ or ‘approx’ for linear interpolation.
`rpm`	If set to `TRUE`, all sample groups will be normalized to Reads Per Million mapped reads after quality filtering according to the filtered_reads slot of the DensityContainer. Should not be set in truncated density maps!
`smooth`	If greater than 0, smooth defines the smoother span as described in the function `lowess`. This function will be applied to reads or RPM values, depending on `rpm` and the results will be stored in the column ‘Smooth’.

`filename`	Character string with the filename of the bam file. The bam file must be sorted according to genomic position.
`spliced`	This option will mark the object to be treated like a data set with spliced reads. Can be switched off also for spliced experiments for special purposes. If `TRUE`, switches off `extendreads` and `readthrough_pairs`.
`read_stranded`	0 will read tags from both strands. 1 will skip all tags from the ‘-’ strand and -1 will only utilize tags from the ‘-’ strand
`paired_only`	If `TRUE`, any reads which are not members of a proper pair according to the 0x0002 FLAG will be discarded. If `FALSE` all reads will be used individually.
`set_filter`	Optional GRanges object or data.frame with similar structure: data.frame(chromosomes,start,end). Providing this filter will limit density maps to these regions.
`min_quality`	Phred-scaled mapping quality threshold. If 0, all reads will pass this filter.
`extendreads`	If greater 0, this amount of base pairs will be added into the strand direction of each read during density map generation.
`unique_only`	If TRUE, only unique reads with no multiple alignments will be used. This filter relies on the aligner to use the corresponding flag (0x100).
`max_dups`	If greater 0, maximally this amount of reads are allowed per start position and read direction.
`description`	An optional character string describing the experiment for labeling purposes.
`hwindow`	A numeric defining the window size used to compute the histogram. This value cannot be bigger than `compression`
`compression`	Should be left at the default value. Defines the minimal threshold in base pairs which triggers indexing and collapsing of read free regions. A smaller value leads to faster slicing at the cost of a higher memory footprint.
`readthrough_pairs`	Currently experimental. If `TRUE`, `parseReads` will attempt to use the region from the left to the right read of the pair for density map assembly. Requires ISIZE to be set within the BAM/SAM file.
`verbose`	Verbosity level

`peaks`	An annotated GRanges object with a meta data column ‘transcript_id’ and ‘exon_id’ like e.g. from `gtf2gr`.
`gtf`	A GRanges object with a meta data column ‘transcript_id’ like e.g. from `annotatePeaks`.
`peak_len`	The desired total size of the region with the TSS located in the middle.

`...`	Depending on the combination of arguments and limited by the layout up to 20 DensityContainer and maximally one `matrix` can be supplied. The elements will be plotted in the order they were passed with the expression profiles and the peak profiles on the right hand and the left hand side respectively. The spliced slot determines about the kind of plot. If a `matrix` is provided, it will be plotted as a heatmap.
`regions`	GRanges object with uniformly sized regions used for plotting or character vector with IDs matching column ‘transcript_id’ in the GTF.
`gtf`	A GRanges object with a meta data column ‘transcript_id’ and ‘exon_id’ like e.g. from `gtf2gr`.
`scale`	A character string that determines the row scaling of the colors. Defaults to ‘global’ which results in a global maximum and minimum read value to be plotted across experiments. Alternative is ‘individual’ for individual scaling.
`cluster`	Sets the clustering method of the read densities. Defaults to ‘none’. If an integer is passed, kmeans clustering will be performed with `cluster` defining the amount of clusters. A colour coded bar will be plotted to the left. For hierarchical clustering the options ‘hc_sp’ and ‘hc_pe’ for spearman or pearson correlation coefficient based distances respectively, or ‘hc_rm’ for distances based on row means are accepted and the results will be displayed as a dendrogram.
`control`	A vector of DensityContainer objects, matching the order of experiments passed as a first argument. E.g. `plotTV(ex1.ChIP,ex2.ChIP,ex3.RNA_KO,control=c(ex1.Input,ex2.Input,ex3.RNA_WT)`. The content will be treated as background densities and subtracted from the matching experiment.
`show_names`	If `TRUE`, peak labels and transcript IDs will be displayed on the left and the right of the plot respectively.
`label_size`	Font size of the row and axis labels.
`zero_alpha`	Determines the alpha level of the line indicating the zero point within the peaks.
`colr`	A vector containing the 3 colors used for the lowest, middle and highest values respectively.
`colr_df`	Determines the color in case a `matrix` is provided and uses `greenred(100)` from gplots by default. If changed, the arguments should be formatted analogous to `colr`.
`colour_spread`	sets the distance of the maximum and minimum value to the saturation levels of the plot. The first value for the left side (Peak profiles) and the right for the expression plots. Can be used to adjust the contrast.
`key_limit`	If left at the default, the upper and lower saturation levels the peak profile colour keys will be automatically determined based on colour_spread. Can be manually overridden by a numeric vector with upper and lower levels.
`key_limit_rna`	If left at the default, the upper and lower saturation levels the transcript profile colour keys will be automatically determined based on colour_spread. Can be manually overridden by a numeric vector with upper and lower levels.
`set_zero`	if set to an integer, it determines the zero point of the x axis below the plot. E.g. a value of 250 will scale the x-axis of a 500bp peak from -250 to +250.
`rowv`	If a numeric vector is provided, no clustering will be performed and all rows will be ordered based on the values of this vector. Alternatively a TVResults object can be provided to reproduce previous k-means clustering.
`peak_windows`	If set to an integer greater than 0, all binding profiles will be interpolated into this amount of windows by the method specified by `bin_method`.
`ex_windows`	An integer that determines the amount of points at which the read densities of an expression experiment will get interpolated by the method specified by `bin_method`.
`bin_method`	Specifies the function used to summarize the bins specified by nbins. Possible methods are ‘max’, ‘mean’, ‘median’ or ‘approx’ for linear interpolation.
`gclust`	If `cluster` is not set to ‘none’, this character string determines the cluster group. If set to ‘expression’ or ‘peaks’, only the expression profile or peak profile data sets will be used to perform the clustering respectively. All data sets passed will be reordered based on the results of the clustering. If set to ‘both’, all data sets will be treated as one matrix and clustered altogether.
`norm_readc`	If set to `TRUE`, all sample groups will be normalized based on the map mass which is defined here as all mapped reads after quality filtering multiplied by their individual read length.
`no_key`	If `TRUE`, no color keys will be displayed.
`stranded_peak`	If `TRUE` and strand informations are provided in `regions`, peak profiles will flipped if located on the negative strand.
`ck_size`	Determines the size of the colour key in the form `c(height,width)`
`remove_lowex`	Numeric that sets the threshold for the average read density per base pair for expression data sets. Transcripts not passing will be filtered out and a message will be displayed.
`verbose`	Verbosity level
`showPlot`	If `FALSE`, plotting will be suppressed and only the TVResults will be returned.
`name_width`	Determines the width of the space for the peak and gene names.
`pre_mRNA`	All expression data will be plotted from the start of the first exon to the end of the last exon including all introns.

`dc`	Source DensityContainer object
`chrom`	A case sensitive string of the chromosome
`start`, `end`	Genomic start and end of the slice
`ranges`	A GRanges object or a data.frame.
`toRle`	The return values will be converted to a `RleList`.
`control`	An optional DensityContainer which will used as control and by default subtracted from `dc`.
`input_method`	Defines the handling of the optional control DensityContainer. ‘-’ will subtract the control from the actual data and ‘/’ will return log2 fold change ratios with an added pseudo count of 1 read.
`treads_norm`	If `TRUE`, the input densities are normalized to the read counts of the data set. Should not be used if one of the `DensityContainer` objects does not contain the whole amount of reads by e.g. placing a filter in `parseReads`.
`nbins`	If all input regions have equal length and nbins greater than 0, all densities will be summarized using the method specified by bin_method into nbins windows of approximately equal size.
`bin_method`	Character string that specifies the function used to summarize or expand the bins specified by nbins. Valid methods are ‘max’, ‘mean’ or ‘median’.

`dc`	Source DensityContainer object
`tname`, `tnames`	A character string or a character vector with matching identifiers of the provided gtf
`gtf`	A GRanges object with a meta data column ‘transcript_id’ and ‘exon_id’ like e.g. from `gtf2gr`.
`toRle`	The return values will be converted to a `RleList`.
`control`	An optional DensityContainer which will used as control and by default subtracted from `dc`.
`input_method`	Defines the handling of the optional control DensityContainer. ‘-’ will subtract the control from the actual data and ‘/’ will return log2 fold change ratios with an added pseudo count of 1 read.
`concatenate`	Logical that determines whether exons will be concatenated to one numeric vector (default) or returned as a list of vectors per exon.
`stranded`	If TRUE, the resulting vector will be reversed for reads on the reverse strand.
`treads_norm`	If `TRUE`, the input densities are normalized to the read counts of the data set. Should not be used if one of the `DensityContainer` objects does not contain the whole amount of reads by e.g. placing a filter in `parseReads`.
`nbins`	If all input regions have equal length and nbins greater than 0, all densities will be summarized using the method specified by bin_method into nbins windows of approximately equal size.
`bin_method`	Character string that specifies the function used to summarize or expand the bins specified by nbins. Valid methods are ‘max’, ‘mean’ or ‘median’.

Package 'TransView'

Help Index

Read density map construction and accession. Visualization of ChIPSeq and RNASeq data sets.

Description

Details

Author(s)

Examples

Associates peaks to TSS

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Class "DensityContainer"

Description

Objects from the Class

Accessors

Slice Methods

Convenience Methods

Extends

Note

Author(s)

See Also

Examples

GTF file parsing

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Histogram of the read distribution

Description

Usage

Arguments

Details

Value

Author(s)

Convenience function for MACS output conversion

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Convenience function which returns a data frame with normalized peak densities suitable for plotting with ggplot2

Description

Usage

Arguments

Details

Value

Author(s)

Examples

User configurable efficient assembly of read density maps

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Changes the peak center to the next TSS according to previous annotation

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Plot and cluster global read densities

Description

Usage

Arguments

Details

Value

Author(s)

Class `"DensityContainer"`

Class `"TVResults"`