Package: wavClusteR 2.39.0

Federico Comoglio

wavClusteR: Sensitive and highly resolved identification of RNA-protein interaction sites in PAR-CLIP data

The package provides an integrated pipeline for the analysis of PAR-CLIP data. PAR-CLIP-induced transitions are first discriminated from sequencing errors, SNPs and additional non-experimental sources by a non- parametric mixture model. The protein binding sites (clusters) are then resolved at high resolution and cluster statistics are estimated using a rigorous Bayesian framework. Post-processing of the results, data export for UCSC genome browser visualization and motif search analysis are provided. In addition, the package allows to integrate RNA-Seq data to estimate the False Discovery Rate of cluster detection. Key functions support parallel multicore computing. Note: while wavClusteR was designed for PAR-CLIP data analysis, it can be applied to the analysis of other NGS data obtained from experimental procedures that induce nucleotide substitutions (e.g. BisSeq).

Authors:Federico Comoglio and Cem Sievers

wavClusteR_2.39.0.tar.gz
wavClusteR_2.39.0.zip(r-4.5)wavClusteR_2.39.0.zip(r-4.4)wavClusteR_2.39.0.zip(r-4.3)
wavClusteR_2.39.0.tgz(r-4.4-any)wavClusteR_2.39.0.tgz(r-4.3-any)
wavClusteR_2.39.0.tar.gz(r-4.5-noble)wavClusteR_2.39.0.tar.gz(r-4.4-noble)
wavClusteR_2.39.0.tgz(r-4.4-emscripten)wavClusteR_2.39.0.tgz(r-4.3-emscripten)
wavClusteR.pdf |wavClusteR.html
wavClusteR/json (API)
NEWS

# Install 'wavClusteR' in R:
install.packages('wavClusteR', repos = c('https://bioc.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Datasets:
  • model - Components of the non-parametric mixture moodel fitted on Ago2 PAR-CLIP data

On BioConductor:wavClusteR-2.39.0(bioc 3.20)wavClusteR-2.38.0(bioc 3.19)

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

bioconductor-package

18 exports 0.91 score 128 dependencies 4 mentions

Last updated 2 months agofrom:949b87b1fe

Exports:annotateClustersestimateFDRexportClustersexportCoverageexportHighConfSubexportSequencesfilterClustersfitMixtureModelgetAllSubgetClustersgetExpIntervalgetHighConfSubgetMetaGenegetMetaTSSplotSizeDistributionplotStatisticsplotSubstitutionsreadSortedBam

Dependencies:abindade4AnnotationDbiaskpassbackportsbase64encBHBiobaseBiocGenericsBiocIOBiocParallelBiostringsbitbit64bitopsblobbslibcachemcheckmatecliclustercodetoolscolorspacecpp11crayoncurldata.tableDBIDelayedArraydigestevaluatefansifarverfastmapfontawesomeforeachforeignformatRFormulafsfutile.loggerfutile.optionsGenomeInfoDbGenomeInfoDbDataGenomicAlignmentsGenomicFeaturesGenomicRangesggplot2gluegridExtragtablehighrHmischtmlTablehtmltoolshtmlwidgetshttrIRangesisobanditeratorsjquerylibjsonliteKEGGRESTknitrlabelinglambda.rlatticelifecyclemagrittrMASSMatrixMatrixGenericsmatrixStatsmclustmemoisemgcvmimemunsellnlmennetopensslpillarpixmappkgconfigplogrpngR6rappdirsRColorBrewerRcppRcppArmadilloRCurlrestfulrRhtslibrjsonrlangrmarkdownrpartRsamtoolsRSQLiterstudioapirtracklayerS4ArraysS4VectorssassscalessegmentedseqinrsnowspSparseArraystringistringrSummarizedExperimentsystibbletinytexUCSC.utilsutf8vctrsviridisviridisLitewithrxfunXMLXVectoryamlzlibbioc

wavClusteR: a workflow for PAR-CLIP data analysis

Rendered fromwavCluster_vignette.Rmdusingknitr::rmarkdownon Jun 25 2024.

Last update: 2020-10-17
Started: 2015-12-14

Readme and manuals

Help Manual

Help pageTopics
A comprehensive pipeline for the analysis of PAR-CLIP data. PAR-CLIP-induced transitions are first discriminated from sequencing errors, SNPs and additional non- experimental sources by a non-parametric mixture model. The protein binding sites (clusters) are then resolved at high resolution and cluster statistics are estimated using a rigorous Bayesian framework. Post-processing of the results, data export for UCSC genome browser visualization and motif search analysis are provided. In addition, the package allows to integrate RNA-Seq data to estimate the False Discovery Rate of cluster detection. Key functions support parallel multicore computing. Note: while wavClusteR was designed for PAR-CLIP data analysis, it can be applied to the analysis of other NGS data obtained from experimental procedures that induce nucleotide substitutions (e.g. BisSeq).wavClusteR-package wavClusteR
Annotate clusters with respect to transcript featuresannotateClusters
Estimate False Discovery Rate within the relative substitution frequency support by integrating PAR-CLIP data and RNA-Seq dataestimateFDR
Export clusters as BED trackexportClusters
Export coverage as BigWig trackexportCoverage
Export high-confidence substitutions as BED trackexportHighConfSub
Export cluster sequences for motif search analysisexportSequences
Merge clusters and compute all relevant cluster statisticsfilterClusters
Fit a non-parametric mixture model from all identified substitutionsfitMixtureModel
Identify all substitutions observed across genomic positions exhibiting a specified minimum coveragegetAllSub
Identify clusters containing high-confidence substitutions and resolve boundaries at high resolutiongetClusters
Identify the interval of relative substitution frequencies dominated by experimental induction.getExpInterval
Classify substitutions based on identified RSF interval and return high confidence transitionsgetHighConfSub
Compute and plot distribution of average coverage or relative log-odds as metagene profile using identified clustersgetMetaCoverage
Compute and plot metagene profile using identified clustersgetMetaGene
Compute and plot read densities in genomic regions around transcription start sitesgetMetaTSS
Components of the non-parametric mixture moodel fitted on Ago2 PAR-CLIP datamodel
Plot the distribution of cluster sizesplotSizeDistribution
Pairs plot visualization of clusters statisticsplotStatistics
Barplot visualization of the number of genomic positions exhibiting a given substitution and, if model provided, additional diagnostic plots.plotSubstitutions
Load a sorted BAM filereadSortedBam