Biostrings - Efficient manipulation of biological strings

Memory efficient string containers, string matching algorithms, and other utilities, for fast manipulation of large biological sequences or sets of sequences.

Last updated 15 days ago

sequencematching alignment sequencing genetics dataimport datarepresentation infrastructure bioconductor-package core-package

17.79 score 57 stars 1.2k packages 8.9k scripts 96k downloads

DESeq2 - Differential gene expression analysis based on the negative binomial distribution

Estimate variance-mean dependence in count data from high-throughput sequencing assays and test for differential expression based on a model using the negative binomial distribution.

Last updated 7 days ago

sequencing rnaseq chipseq geneexpression transcription normalization differentialexpression bayesian regression principalcomponent clustering immunooncology

16.02 score 360 stars 118 packages 16k scripts 36k downloads
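
The variance-mean dependence mentioned in the DESeq2 description can be illustrated with the usual negative binomial parameterization, where the variance equals the mean plus a dispersion term times the squared mean. The sketch below is a minimal Python illustration of that relationship only; the function name `nb_variance` and the dispersion value are assumptions chosen for the example and are not part of the package.

```python
def nb_variance(mean: float, dispersion: float) -> float:
    """Variance of a negative binomial count: mean + dispersion * mean^2.

    With dispersion = 0 this reduces to the Poisson case (variance == mean).
    """
    if mean < 0 or dispersion < 0:
        raise ValueError("mean and dispersion must be non-negative")
    return mean + dispersion * mean ** 2


if __name__ == "__main__":
    # Hypothetical dispersion value, used only to show how the variance
    # grows faster than the mean as counts get larger.
    for mu in (1.0, 10.0, 100.0, 1000.0):
        print(mu, nb_variance(mu, dispersion=0.25))
```

For low counts the variance stays close to the mean, while for high counts the quadratic term dominates, which is why a single variance estimate across all genes would be misleading.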

BiocGenerics - S4 generic functions used in Bioconductor

The package defines many S4 generic functions used in Bioconductor.

Last updated 7 days ago

infrastructure bioconductor-package core-package

14.04 score 12 stars 2.2k packages 606 scripts 108k downloads

limma - Linear Models for Microarray and Omics Data

Data analysis, linear models and differential expression for omics data.

Last updated 11 days ago

exonarray geneexpression transcription alternativesplicing differentialexpression differentialsplicing genesetenrichment dataimport bayesian clustering regression timecourse microarray micrornaarray mrnamicroarray onechannel proprietaryplatforms twochannel sequencing rnaseq batcheffect multiplecomparison normalization preprocessing qualitycontrol biomedicalinformatics cellbiology cheminformatics epigenetics functionalgenomics genetics immunooncology metabolomics proteomics systemsbiology transcriptomics

13.78 score 584 packages 15k scripts 60k downloads

minfi - Analyze Illumina Infinium DNA methylation arrays

Tools to analyze & visualize Illumina Infinium methylation arrays.

Last updated 2 days ago

immunooncology dnamethylation differentialmethylation epigenetics microarray methylationarray multichannel twochannel dataimport normalization preprocessing qualitycontrol

13.12 score 60 stars 34 packages 1.1k scripts 4.4k downloads

mixOmics - Omics Data Integration Project

Multivariate methods are well suited to large omics data sets where the number of variables (e.g. genes, proteins, metabolites) is much larger than the number of samples (patients, cells, mice). They have the appealing properties of reducing the dimension of the data by using instrumental variables (components), which are defined as combinations of all variables. Those components are then used to produce useful graphical outputs that enable better understanding of the relationships and correlation structures between the different data sets that are integrated. mixOmics offers a wide range of multivariate methods for the exploration and integration of biological datasets with a particular focus on variable selection. The package proposes several sparse multivariate models we have developed to identify the key variables that are highly correlated, and/or explain the biological outcome of interest. The data that can be analysed with mixOmics may come from high throughput sequencing technologies, such as omics data (transcriptomics, metabolomics, proteomics, metagenomics etc.) but also beyond the realm of omics (e.g. spectral imaging). The methods implemented in mixOmics can also handle missing values without having to delete entire rows with missing data. A non-exhaustive list of methods includes variants of generalised Canonical Correlation Analysis, sparse Partial Least Squares and sparse Discriminant Analysis. Recently we implemented integrative methods to combine multiple data sets: N-integration with variants of Generalised Canonical Correlation Analysis and P-integration with variants of multi-group Partial Least Squares.

Last updated 23 days ago

immunooncology microarray sequencing metabolomics metagenomics proteomics geneprediction multiplecomparison classification regression bioconductor genomics genomics-data genomics-visualization multivariate-analysis multivariate-statistics omics r-pkg r-project

13.06 score 159 stars 20 packages 1.2k scripts 4.0k downloads

MSnbase - Base Functions and Classes for Mass Spectrometry and Proteomics

MSnbase provides infrastructure for manipulation, processing and visualisation of mass spectrometry and proteomics data, ranging from raw to quantitative and annotated data.

Last updated 9 days ago

immunooncology infrastructure proteomics massspectrometry qualitycontrol dataimport bioconductor bioinformatics mass-spectrometry proteomics-data visualisation

12.80 score 125 stars 35 packages 712 scripts 5.6k downloads

SingleR - Reference-Based Single-Cell RNA-Seq Annotation

Performs unbiased cell type recognition from single-cell RNA sequencing data, by leveraging reference transcriptomic datasets of pure cell types to infer the cell of origin of each single cell independently.

Last updated 3 days ago

software singlecell geneexpression transcriptomics classification clustering annotation bioconductor singler

12.45 score 177 stars 1 package 2.0k scripts 6.2k downloads

infercnv - Infer Copy Number Variation from Single-Cell RNA-Seq Data

Using single-cell RNA-Seq expression to visualize CNV in cells.

Last updated 23 days ago

software copynumbervariation variantdetection structuralvariation genomicvariation genetics transcriptomics statisticalmethod bayesian hiddenmarkovmodel singlecell

11.00 score 561 stars 666 scripts 2.4k downloads

destiny - Creates diffusion maps

Create and plot diffusion maps.

Last updated 3 days ago

cellbiology cellbasedassays clustering software visualization diffusion-maps dimensionality-reduction

10.95 score 77 stars 738 scripts 1.1k downloads

tximeta - Transcript Quantification Import with Automatic Metadata

Transcript quantification import from Salmon and other quantifiers with automatic attachment of transcript ranges and release information, and other associated metadata. De novo transcriptomes can be linked to the appropriate sources with linkedTxomes and shared for computational reproducibility.

Last updated 23 days ago

annotation genomeannotation dataimport preprocessing rnaseq singlecell transcriptomics transcription geneexpression functionalgenomics reproducibleresearch reportwriting immunooncology

10.66 score 67 stars 1 package 450 scripts 2.0k downloads

scuttle - Single-Cell RNA-Seq Analysis Utilities

Provides basic utility functions for performing single-cell analyses, focusing on simple normalization, quality control and data transformations. Also provides some helper functions to assist development of other packages.

Last updated 23 days ago

immunooncology singlecell rnaseq qualitycontrol preprocessing normalization transcriptomics geneexpression sequencing software dataimport

10.14 score 76 packages 1.6k scripts 16k downloads

graphite - GRAPH Interaction from pathway Topological Environment

Graph objects from pathway topology derived from KEGG, Panther, PathBank, PharmGKB, Reactome, SMPDB and WikiPathways databases.

Last updated 21 days ago

pathways thirdpartyclient graphandnetwork network reactome kegg metabolomics bioinformatics mirror pathway-analysis

10.09 score 6 stars 22 packages 119 scripts 4.1k downloads

tidybulk - Brings transcriptomics to the tidyverse

This is a collection of utility functions for exploring and performing calculations on RNA sequencing data in a modular, pipe-friendly and tidy fashion.

Last updated 23 days ago

assaydomain infrastructure rnaseq differentialexpression geneexpression normalization clustering qualitycontrol sequencing transcription transcriptomics bioconductor bulk-transcriptional-analyses deseq2 differential-expression edger ensembl-ids entrez gene-symbols gsea mds-dimensions pca pipe redundancy tibble tidy tidy-data tidyverse transcripts tsne

9.53 score 165 stars 1 package 163 scripts 546 downloads

Rsubread - Mapping, quantification and variant analysis of sequencing data

Alignment, quantification and analysis of RNA sequencing data (including both bulk RNA-seq and scRNA-seq) and DNA sequencing data (including ATAC-seq, ChIP-seq, WGS, WES etc.). Includes functionality for read mapping, read counting, SNP calling, structural variant detection and gene fusion discovery. Can be applied to all major sequencing technologies and to both short and long sequence reads.

Last updated 21 days ago

sequencing alignment sequencematching rnaseq chipseq singlecell geneexpression generegulation genetics immunooncology snp geneticvariability preprocessing qualitycontrol genomeannotation genefusiondetection indeldetection variantannotation variantdetection multiplesequencealignment

9.04 score 10 packages 844 scripts 3.6k downloads

bambu - Context-Aware Transcript Quantification from Long Read RNA-Seq data

bambu is an R package for multi-sample transcript discovery and quantification using long read RNA-Seq data. You can use bambu after read alignment to obtain expression estimates for known and novel transcripts and genes. The output from bambu can be used directly for visualisation and downstream analysis such as differential gene expression or transcript usage.

Last updated 23 days ago

alignment coverage differentialexpression featureextraction geneexpression genomeannotation genomeassembly immunooncology longread multiplecomparison normalization rnaseq regression sequencing software transcription transcriptomics bambu bioconductor long-reads nanopore nanopore-sequencing rna-seq rna-seq-analysis transcript-quantification transcript-reconstruction

8.72 score 189 stars 1 package 77 scripts 657 downloads

EWCE - Expression Weighted Celltype Enrichment

Used to determine which cell types are enriched within gene lists. The package provides tools for testing enrichments within simple gene lists (such as human disease associated genes) and those resulting from differential expression studies. The package does not depend upon any particular Single Cell Transcriptome dataset and user defined datasets can be loaded in and used in the analyses.

Last updated 23 days ago

geneexpression transcription differentialexpression genesetenrichment genetics microarray mrnamicroarray onechannel rnaseq biomedicalinformatics proteomics visualization functionalgenomics singlecell deconvolution single-cell single-cell-rna-seq transcriptomics

8.39 score 54 stars 93 scripts 501 downloads

igvR - igvR: integrative genomics viewer

Access to igv.js, the Integrative Genomics Viewer running in a web browser.

Last updated 23 days ago

visualization thirdpartyclient genomebrowsers

8.30 score 42 stars 118 scripts 281 downloads

fishpond - Fishpond: downstream methods and tools for expression data

Fishpond contains methods for differential transcript and gene expression analysis of RNA-seq data using inferential replicates for uncertainty of abundance quantification, as generated by Gibbs sampling or bootstrap sampling. The package also contains a number of utilities for working with Salmon and Alevin quantification files.

Last updated 23 days ago

sequencing rnaseq geneexpression transcription normalization regression multiplecomparison batcheffect visualization differentialexpression differentialsplicing alternativesplicing singlecell bioconductor gene-expression genomics salmon scrnaseq statistics transcriptomics

8.23 score 27 stars 1 package 151 scripts 595 downloads

hypeR - An R Package For Geneset Enrichment Workflows

An R Package for Geneset Enrichment Workflows.

Last updated 23 days ago

genesetenrichment annotation pathways bioinformatics computational-biology geneset-enrichment-analysis

8.19 score 76 stars 135 scripts 218 downloads

rrvgo - Reduce + Visualize GO

Reduce and visualize lists of Gene Ontology terms by identifying redundancy based on semantic similarity.

Last updated 23 days ago

annotation clustering go network pathways software

7.75 score 21 stars 168 scripts 880 downloads

pathwayPCA - Integrative Pathway Analysis with Modern PCA Methodology and Gene Selection

pathwayPCA is an integrative analysis tool that implements the principal component analysis (PCA) based pathway analysis approaches described in Chen et al. (2008), Chen et al. (2010), and Chen (2011). pathwayPCA allows users to: (1) Test pathway association with binary, continuous, or survival phenotypes. (2) Extract relevant genes in the pathways using the SuperPCA and AES-PCA approaches. (3) Compute principal components (PCs) based on the selected genes. These estimated latent variables represent pathway activities for individual subjects, which can then be used to perform integrative pathway analysis, such as multi-omics analysis. (4) Extract relevant genes that drive pathway significance as well as data corresponding to these relevant genes for additional in-depth analysis. (5) Perform analyses with enhanced computational efficiency with parallel computing and enhanced data safety with S4-class data objects. (6) Analyze studies with complex experimental designs, with multiple covariates, and with interaction effects, e.g., testing whether pathway association with clinical phenotype is different between male and female subjects. Citations: Chen et al. (2008) <https://doi.org/10.1093/bioinformatics/btn458>; Chen et al. (2010) <https://doi.org/10.1002/gepi.20532>; and Chen (2011) <https://doi.org/10.2202/1544-6115.1697>.

Last updated 23 days ago

copynumbervariation dnamethylation geneexpression snp transcription geneprediction genesetenrichment genesignaling genetarget genomewideassociation genomicvariation cellbiology epigenetics functionalgenomics genetics lipidomics metabolomics proteomics systemsbiology transcriptomics classification dimensionreduction featureextraction principalcomponent regression survival multiplecomparison pathways

7.74 score 11 stars 42 scripts 170 downloads

RBioFormats - R interface to Bio-Formats

An R package which interfaces the OME Bio-Formats Java library to allow reading of proprietary microscopy image data and metadata.

Last updated 23 days ago

dataimport bio-formats bioconductor image-processing

7.49 score 23 stars 1 package 50 scripts 302 downloads

netSmooth - Network smoothing for scRNAseq

netSmooth is an R package for network smoothing of single cell RNA sequencing data. Using biological networks such as protein-protein interactions as priors for gene co-expression, netSmooth improves cell type identification from noisy, sparse scRNAseq data.

Last updated 23 days ago

network graphandnetwork singlecell rnaseq geneexpression sequencing transcriptomics normalization preprocessing clustering dimensionreduction bioinformatics genomics single-cell

7.41 score 27 stars 4 scripts 212 downloads

Wrench - Wrench normalization for sparse count data

Wrench is a package for normalizing sparse genomic count data, like that arising from 16S metagenomic surveys.

Last updated 23 days ago

normalization sequencing software

7.39 score 6 stars 10 packages 9 scripts 2.3k downloads

mbkmeans - Mini-batch K-means Clustering for Single-Cell RNA-seq

Implements the mini-batch k-means algorithm for large datasets, including support for on-disk data representation.

Last updated 23 days ago

clustering geneexpression rnaseq software transcriptomics sequencing singlecell human-cell-atlas

7.37 score 9 stars 2 packages 54 scripts 804 downloads
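
The mini-batch k-means idea named in the mbkmeans description can be sketched in a few lines: each iteration draws a small random batch, assigns its points to the nearest center, and nudges each center toward its assigned points with a per-center learning rate of one over the number of points it has seen. The Python below is a minimal sketch of that general algorithm, not the package's implementation; the function names, the in-memory list of points and the default parameters are assumptions made for the example.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points of equal length."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mini_batch_kmeans(points, initial_centers, batch_size=20, n_iter=50, seed=0):
    """Refine the given centers using small random batches of points."""
    rng = random.Random(seed)
    centers = [list(c) for c in initial_centers]
    counts = [0] * len(centers)  # points seen so far by each center
    for _ in range(n_iter):
        batch = rng.sample(points, min(batch_size, len(points)))
        # Assign every batch point to its nearest center before updating.
        assigned = [
            min(range(len(centers)), key=lambda k: squared_distance(p, centers[k]))
            for p in batch
        ]
        # Move each center toward its points with a shrinking step size.
        for p, k in zip(batch, assigned):
            counts[k] += 1
            rate = 1.0 / counts[k]
            centers[k] = [c + rate * (x - c) for c, x in zip(centers[k], p)]
    return centers


if __name__ == "__main__":
    data = [(0.0, 0.0)] * 50 + [(10.0, 10.0)] * 50
    print(mini_batch_kmeans(data, [(1.0, 1.0), (9.0, 9.0)]))
```

Because each iteration touches only one batch, the full dataset never has to be held in memory at once, which is what makes the approach a natural fit for on-disk data representations.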

GeomxTools - NanoString GeoMx Tools

Tools for NanoString Technologies GeoMx Technology. Package provides functions for reading in DCC and PKC files based on an ExpressionSet derived object. Normalization and QC functions are also included.

Last updated 23 days ago

geneexpression transcription cellbasedassays dataimport transcriptomics proteomics mrnamicroarray proprietaryplatforms rnaseq sequencing experimentaldesign normalization spatial

7.25 score 3 packages 189 scripts 748 downloads

TnT - Interactive Visualization for Genomic Features

An R interface to the TnT JavaScript library (https://github.com/tntvis) to provide interactive and flexible visualization of track-based genomic data.

Last updated 23 days ago

infrastructure visualization bioconductor genome-browser htmlwidgets shiny

7.15 score 14 stars 17 scripts 102 downloads

iSEEu - iSEE Universe

iSEEu (the iSEE universe) contains diverse functionality to extend the usage of the iSEE package, including additional classes for the panels, or modes allowing easy configuration of iSEE applications.

Last updated 23 days ago

immunooncology visualization gui dimensionreduction featureextraction clustering transcription geneexpression transcriptomics singlecell cellbasedassays hacktoberfest

7.15 score 9 stars 1 package 35 scripts 265 downloads

systemPipeShiny - systemPipeShiny: An Interactive Framework for Workflow Management and Visualization

systemPipeShiny (SPS) extends the widely used systemPipeR (SPR) workflow environment with a versatile graphical user interface provided by a Shiny App. This allows non-R users, such as experimentalists, to run many of systemPipeR’s workflow design, control, and visualization functionalities interactively without requiring knowledge of R. Most importantly, SPS has been designed as a general purpose framework for interacting with other R packages in an intuitive manner. Like most Shiny Apps, SPS can be used on local computers as well as in centralized server-based deployments that can be accessed remotely as a public web service for using SPR’s functionalities with community and/or private data. The framework can integrate many core packages from the R/Bioconductor ecosystem. Examples of SPS’ current functionalities include: (a) interactive creation of experimental designs and metadata using an easy to use tabular editor or file uploader; (b) visualization of workflow topologies combined with auto-generation of R Markdown preview for interactively designed workflows; (c) access to a wide range of data processing routines; and (d) an extendable set of visualization functionalities. Complex visual results can be managed on a 'Canvas Workbench', allowing users to organize and compare plots efficiently, combined with a session snapshot feature to continue work at a later time. The present suite of pre-configured visualization examples. The modular design of SPR makes it easy to design custom functions without any knowledge of Shiny, as well as extending the environment in the future with contributions from the community.

Last updated 23 days ago

shinyapps infrastructure dataimport sequencing qualitycontrol reportwriting experimentaldesign clustering bioconductor bioconductor-package data-visualization shiny systempiper

7.04 score 34 stars 36 scripts 192 downloads

BridgeDbR - Code for using BridgeDb identifier mapping framework from within R

Use BridgeDb functions and load identifier mapping databases in R. If you use this package to download identifier mapping files, it retrieves them from GitHub, Zenodo, and Figshare.

Last updated 15 days ago

software annotation metabolomics cheminformatics bioconductor-package bridgedb genes identifiers life-sciences metabolites proteins

6.97 score 4 stars 43 scripts 200 downloads

distinct - distinct: a method for differential analyses via hierarchical permutation tests

distinct is a statistical method to perform differential testing between two or more groups of distributions; differential testing is performed via hierarchical non-parametric permutation tests on the cumulative distribution functions (cdfs) of each sample. While most methods for differential expression target differences in the mean abundance between conditions, distinct, by comparing full cdfs, identifies both differential patterns involving changes in the mean and more subtle variations that do not involve the mean (e.g., unimodal vs. bi-modal distributions with the same mean). distinct is a general and flexible tool: due to its fully non-parametric nature, which makes no assumptions on how the data was generated, it can be applied to a variety of datasets. It is particularly suitable to perform differential state analyses on single cell data (i.e., differential analyses within sub-populations of cells), such as single cell RNA sequencing (scRNA-seq) and high-dimensional flow or mass cytometry (HDCyto) data. To use distinct one needs data from two or more groups of samples (i.e., experimental conditions), with at least 2 samples (i.e., biological replicates) per group.

Last updated 23 days ago

genetics rnaseq sequencing differentialexpression geneexpression multiplecomparison software transcription statisticalmethod visualization singlecell flowcytometry genetarget

6.90 score 11 stars 1 package 34 scripts 508 downloads
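
The idea of comparing full cdfs with a permutation test, as described for distinct, can be illustrated with a deliberately simplified sketch. The Python below compares two flat groups of values using the largest gap between their empirical cdfs and shuffles group labels to obtain a p-value. It is not the package's hierarchical procedure (which permutes across samples within groups); the choice of test statistic, the function names and the defaults are all assumptions made for illustration.

```python
import random
from bisect import bisect_right


def ecdf_distance(a, b):
    """Largest absolute gap between the empirical cdfs of a and b."""
    a_sorted, b_sorted = sorted(a), sorted(b)
    return max(
        abs(bisect_right(a_sorted, x) / len(a_sorted)
            - bisect_right(b_sorted, x) / len(b_sorted))
        for x in a_sorted + b_sorted
    )


def permutation_p_value(a, b, n_perm=999, seed=0):
    """P-value for the observed cdf gap under random relabelling of values."""
    rng = random.Random(seed)
    observed = ecdf_distance(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if ecdf_distance(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    # Add one to numerator and denominator so the p-value is never zero.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    unimodal = [4.0, 4.5, 5.0, 5.0, 5.0, 5.0, 5.5, 6.0]
    bimodal = [1.0, 1.0, 1.5, 1.5, 8.5, 8.5, 9.0, 9.0]  # same mean of 5.0
    print(permutation_p_value(unimodal, bimodal))
```

The two example groups share the same mean, so a test on means alone would see no difference, whereas a statistic computed on the whole cdf can still separate them.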

BioTIP - BioTIP: An R package for characterization of Biological Tipping-Point

Adopting tipping-point theory to transcriptome profiles to unravel disease regulatory trajectory.

Last updated 23 days ago

sequencing rnaseq geneexpression transcription software

6.84 score 18 stars 37 scripts 204 downloads

spatialHeatmap - spatialHeatmap

The spatialHeatmap package offers the primary functionality for visualizing cell-, tissue- and organ-specific assay data in spatial anatomical images. Additionally, it provides extended functionalities for large-scale data mining routines and co-visualizing bulk and single-cell data.

Last updated 23 days ago

spatial visualization microarray sequencing geneexpression datarepresentation network clustering graphandnetwork cellbasedassays atacseq dnaseq tissuemicroarray singlecell cellbiology genetarget

6.56 score 5 stars 12 scripts 446 downloads

NanoStringNCTools - NanoString nCounter Tools

Tools for NanoString Technologies nCounter Technology. Provides support for reading RCC files into an ExpressionSet derived object. Also includes methods for QC and normalization of NanoString data.

Last updated 23 days ago

geneexpression transcription cellbasedassays dataimport transcriptomics proteomics mrnamicroarray proprietaryplatforms rnaseq

6.44 score 4 packages 77 scripts 618 downloads

ViSEAGO - ViSEAGO: a Bioconductor package for clustering biological functions using Gene Ontology and semantic similarity

The main objective of the ViSEAGO package is to carry out data mining of biological functions and establish links between genes involved in the study. We developed ViSEAGO in R to facilitate functional Gene Ontology (GO) analysis of complex experimental designs with multiple comparisons of interest. It allows large-scale datasets to be studied together and GO profiles to be visualized to capture biological knowledge. The acronym stands for three major concepts of the analysis: Visualization, Semantic similarity and Enrichment Analysis of Gene Ontology. It provides access to the latest GO annotations, which are retrieved from one of the NCBI EntrezGene, Ensembl or Uniprot databases for several species. Using available R packages and novel developments, ViSEAGO extends classical functional GO analysis to focus on functional coherence by aggregating closely related biological themes while studying multiple datasets at once. It provides both a synthetic and detailed view using interactive functionalities respecting the GO graph structure and ensuring functional coherence supplied by semantic similarity. ViSEAGO has been successfully applied to several datasets from different species with a variety of biological questions. Results can be easily shared between bioinformaticians and biologists, enhancing reporting capabilities while maintaining reproducibility.

Last updated 1 day ago

software annotation go genesetenrichment multiplecomparison clustering visualization

6.40 score 21 scripts 241 downloads

gwasurvivr - gwasurvivr: an R package for genome wide survival analysis

gwasurvivr is a package to perform survival analysis using Cox proportional hazard models on imputed genetic data.

Last updated 23 days ago

genomewideassociation survival regression genetics snp geneticvariability pharmacogenomics biomedicalinformatics

6.39 score 11 stars 75 scripts 278 downloads

artMS - Analytical R tools for Mass Spectrometry

artMS provides a set of tools for the analysis of proteomics label-free datasets. It takes as input the MaxQuant search result output (evidence.txt file) and performs quality control, relative quantification using MSstats, downstream analysis and integration. artMS also provides a set of functions to re-format the data for compatibility with other analytical tools, including SAINTq, SAINTexpress, Phosfate, and PHOTON. Check [http://artms.org](http://artms.org) for details.

Last updated 23 days ago

proteomics differentialexpression biomedicalinformatics systemsbiology massspectrometry annotation qualitycontrol genesetenrichment clustering normalization immunooncology multiplecomparison analysis analytical ap-ms bioconductor bioinformatics mass-spectrometry phosphoproteomics post-translational-modification quantitative-analysis

6.37 score 14 stars 12 scripts 206 downloads

BSgenomeForge - Forge your own BSgenome data package

A set of tools to forge BSgenome data packages. Supersedes the old seed-based tools from the BSgenome software package. This package allows the user to create a BSgenome data package in one function call, simplifying the old seed-based process.

Last updated 23 days ago

infrastructure datarepresentation genomeassembly annotation genomeannotation sequencing alignment dataimport sequencematching bioconductor-package core-package

6.30 score 4 stars 4 scripts 244 downloads

CopyNumberPlots - Create Copy-Number Plots using karyoploteR functionality

CopyNumberPlots provides a set of functions extending karyoploteR's functionality to create beautiful, customizable and flexible plots of copy-number related data.

Last updated 23 days ago

visualization copynumbervariation coverage onechannel dataimport sequencing dnaseq bioconductor bioconductor-package bioinformatics copy-number-variation genomics genomics-visualization

6.24 score 6 stars 2 packages 16 scripts 261 downloads

Moonlight2R - Identify oncogenes and tumor suppressor genes from omics data

Understanding cancer mechanisms requires the identification of genes playing a role in the development of the pathology and the characterization of their role (notably oncogenes and tumor suppressors). We present an updated version of the R/Bioconductor package called MoonlightR, namely Moonlight2R, which returns a list of candidate driver genes for specific cancer types on the basis of omics data integration. The Moonlight framework contains a primary layer where gene expression data and information about biological processes are integrated to predict genes called oncogenic mediators, divided into putative tumor suppressors and putative oncogenes. This is done through functional enrichment analyses, gene regulatory networks and upstream regulator analyses to score the importance of well-known biological processes with respect to the studied cancer type. By evaluating the effect of the oncogenic mediators on biological processes or through random forests, the primary layer predicts two putative roles for the oncogenic mediators: i) tumor suppressor genes (TSGs) and ii) oncogenes (OCGs). As gene expression data alone is not enough to explain the deregulation of the genes, a second layer of evidence is needed. We have automated the integration of a secondary mutational layer through new functionalities in Moonlight2R. These functionalities analyze mutations in the cancer cohort and classify them into driver and passenger mutations using the driver mutation prediction tool CScape-somatic. Those oncogenic mediators with at least one driver mutation are retained as the driver genes. As a consequence, this methodology not only identifies genes playing a dual role (e.g. TSG in one cancer type and OCG in another) but also helps in elucidating the biological processes underlying their specific roles. In particular, Moonlight2R can be used to discover OCGs and TSGs in the same cancer type. This may, for instance, help in answering the question of whether some genes change role between early stages (I, II) and late stages (III, IV). In the future, this analysis could be useful to determine the causes of different resistances to chemotherapeutic treatments.

Last updated 23 days ago

dnamethylation differentialmethylation generegulation geneexpression methylationarray differentialexpression pathways network survival genesetenrichment networkenrichment

6.23 score 5 stars 42 scripts 136 downloads

MOMA - Multi Omic Master Regulator Analysis

This package implements the inference of candidate master regulator proteins from multi-omics data (MOMA) algorithm, as well as ancillary analysis and visualization functions.

Last updated 23 days ago

software networkenrichment networkinference network featureextraction clustering functionalgenomics transcriptomics systemsbiology

6.19 score 6 stars 13 scripts 141 downloads

IgGeneUsage - Differential gene usage in immune repertoires

Detection of biases in the usage of immunoglobulin (Ig) genes is an important task in immune repertoire profiling. IgGeneUsage detects aberrant Ig gene usage between biological conditions using a probabilistic model which is analyzed computationally by Bayesian inference. In doing so, IgGeneUsage also avoids some common problems related to the current practice of null-hypothesis significance testing.

Last updated 23 days ago

differentialexpression regression genetics bayesian biomedicalinformatics immunooncology mathematicalbiology b-cell-receptor bcr-repertoire differential-analysis differential-gene-expression high-throughput-sequencing immune-repertoire immune-repertoire-analysis immune-repertoires immunogenomics immunoglobulin immunoinformatics immunological-bioinformatics immunology tcr-repertoire vdj-recombination

6.19 score 6 stars 1 script 143 downloads

bayNorm - Single-cell RNA sequencing data normalization

bayNorm is used for normalizing single-cell RNA-seq data.

Last updated 23 days ago

immunooncology normalization rnaseq singlecell sequencing scrnaseq

6.11 score 9 stars 36 scripts 326 downloads

crisprViz - Visualization Functions for CRISPR gRNAs

Provides functionalities to visualize and contextualize CRISPR guide RNAs (gRNAs) on genomic tracks across nucleases and applications. Works in conjunction with the crisprBase and crisprDesign Bioconductor packages. Plots are produced using the Gviz framework.

Last updated 23 days ago

crispr functionalgenomics genetarget bioconductor bioconductor-package crispr-analysis crispr-design grna grna-sequence grna-sequences sgrna sgrna-design visualization

6.08 score 5 stars 2 packages 6 scripts 190 downloads

metaseqR2 - An R package for the analysis and result reporting of RNA-Seq data by combining multiple statistical algorithms

Provides an interface to several normalization and statistical testing packages for RNA-Seq gene expression data. Additionally, it creates several diagnostic plots, performs meta-analysis by combining the results of several statistical tests and reports the results in an interactive way.

Last updated 23 days ago

software geneexpression differentialexpression workflowstep preprocessing qualitycontrol normalization reportwriting rnaseq transcription sequencing transcriptomics bayesian clustering cellbiology biomedicalinformatics functionalgenomics systemsbiology immunooncology alternativesplicing differentialsplicing multiplecomparison timecourse dataimport atacseq epigenetics regression proprietaryplatforms genesetenrichment batcheffect chipseq

6.05 score 7 stars 4 scripts 202 downloads

omicsViewer - Interactive and explorative visualization of SummarizedExperiment or ExpressionSet using omicsViewer

omicsViewer visualizes ExpressionSet (or SummarizedExperiment) in an interactive way. The omicsViewer has a separate back- and front-end. In the back-end, users need to prepare an ExpressionSet that contains all the necessary information for the downstream data interpretation. Some extra requirements on the headers of phenotype data or feature data are imposed so that the provided information can be clearly recognized by the front-end while keeping modifications to the existing ExpressionSet object to a minimum. The pure dependency on R/Bioconductor guarantees maximum flexibility in the statistical analysis in the back-end. Once the ExpressionSet is prepared, it can be visualized using the front-end, implemented with shiny and plotly. Both features and samples can be selected from (data) tables or graphs (scatter plot/heatmap). Different types of analyses, such as enrichment analysis (using the Bioconductor package fgsea or Fisher's exact test) and STRING network analysis, are performed on the fly and the results are visualized simultaneously. When a subset of samples and a phenotype variable is selected, a significance test on means (t-test or rank-based test; when the phenotype variable is quantitative) or a test of independence (chi-square or Fisher's exact test; when the phenotype data is categorical) is performed to test the association between the phenotype of interest and the selected samples. Additionally, other analyses can be easily added as extra shiny modules. Therefore, omicsViewer greatly facilitates data exploration: many different hypotheses can be explored in a short time without the need for knowledge of R. In addition, the resulting data can be easily shared using a shiny server. Otherwise, a standalone version of omicsViewer together with designated omics data can be easily created by integrating it with portable R, which can be shared with collaborators or submitted as supplementary data together with a manuscript.

Last updated 23 days ago

softwarevisualizationgenesetenrichmentdifferentialexpressionmotifdiscoverynetworknetworkenrichment

6.02 score 4 stars 22 scripts 168 downloads

SCOPE - A normalization and copy number estimation method for single-cell DNA sequencing

Whole genome single-cell DNA sequencing (scDNA-seq) enables characterization of copy number profiles at the cellular level. This circumvents the averaging effects associated with bulk-tissue sequencing and has increased resolution yet decreased ambiguity in deconvolving cancer subclones and elucidating cancer evolutionary history. ScDNA-seq data is, however, sparse, noisy, and highly variable even within a homogeneous cell population, due to the biases and artifacts that are introduced during the library preparation and sequencing procedure. Here, we propose SCOPE, a normalization and copy number estimation method for scDNA-seq data. The distinguishing features of SCOPE include: (i) utilization of cell-specific Gini coefficients for quality controls and for identification of normal/diploid cells, which are further used as negative control samples in a Poisson latent factor model for normalization; (ii) modeling of GC content bias using an expectation-maximization algorithm embedded in the Poisson generalized linear models, which accounts for the different copy number states along the genome; (iii) a cross-sample iterative segmentation procedure to identify breakpoints that are shared across cells from the same genetic background.

Last updated 23 days ago

singlecellnormalizationcopynumbervariationsequencingwholegenomecoveragealignmentqualitycontroldataimportdnaseq

5.88 score 75 scripts 208 downloads
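
As a rough illustration of the Gini-based quality control idea mentioned in the SCOPE description, the sketch below computes a Gini coefficient for a vector of per-bin read counts from one cell. It uses the generic textbook formula in Python and is not SCOPE's own R implementation; the function name `gini` and the example counts are hypothetical.

```python
def gini(counts):
    """Gini coefficient of non-negative values (0 means perfectly even coverage)."""
    values = sorted(counts)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # Standard formula on ascending values x_1..x_n:
    # G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 2.0 * weighted / (n * total) - (n + 1) / n


# Hypothetical per-bin read counts for two cells.
even_cell = [5, 5, 5, 5]      # uniform coverage
uneven_cell = [0, 0, 0, 10]   # all reads in one bin
print(gini(even_cell), gini(uneven_cell))
```

A cell with uniform coverage scores near 0 and a cell with reads concentrated in few bins scores closer to 1, which is why such a coefficient can serve as a per-cell evenness measure.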

ENmix - Quality control and analysis tools for Illumina DNA methylation BeadChip

Tools for quality control, analysis and visualization of Illumina DNA methylation array data.

Last updated 2 days ago

dnamethylationpreprocessingqualitycontroltwochannelmicroarrayonechannelmethylationarraybatcheffectnormalizationdataimportregressionprincipalcomponentepigeneticsmultichanneldifferentialmethylationimmunooncology

5.84 score 98 scripts 627 downloads

APAlyzer - A toolkit for APA analysis using RNA-seq data

Perform 3'UTR APA, Intronic APA and gene expression analysis using RNA-seq data.

Last updated 23 days ago

sequencingrnaseqdifferentialexpressiongeneexpressiongeneregulationannotationdataimportsoftwareative-polyadenylationbioinformatics-toolrna-seq

5.75 score 7 stars 5 scripts 188 downloads

ppcseq - Probabilistic Outlier Identification for RNA Sequencing Generalized Linear Models

Relative transcript abundance has proven to be a valuable tool for understanding the function of genes in biological systems. For the differential analysis of transcript abundance using RNA sequencing data, the negative binomial model is by far the most frequently adopted. However, common methods that are based on a negative binomial model are not robust to extreme outliers, which we found to be abundant in public datasets. So far, no rigorous and probabilistic methods for detection of outliers have been developed for RNA sequencing data, leaving the identification mostly to visual inspection. Recent advances in Bayesian computation allow large-scale comparison of observed data against its theoretical distribution given in a statistical model. Here we propose ppcseq, a key quality-control tool for identifying transcripts that include outlier data points in differential expression analysis, which do not follow a negative binomial distribution. Applying ppcseq to analyse several publicly available datasets using popular tools, we show that from 3 to 10 percent of differentially abundant transcripts across algorithms and datasets had statistics inflated by the presence of outliers.

Last updated 23 days ago

rnaseqdifferentialexpressiongeneexpressionnormalizationclusteringqualitycontrolsequencingtranscriptiontranscriptomicsbayesian-inferencedeseq2edgernegative-binomialoutlierstan

5.65 score 7 stars 16 scripts 175 downloads

planet - Placental DNA methylation analysis tools

This package contains R functions to predict biological variables from placental DNA methylation data generated from Infinium arrays. This includes inferring ethnicity/ancestry, gestational age, and cell composition from placental DNA methylation array (450k/850k) data.

Last updated 23 days ago

softwaredifferentialmethylationepigeneticsmicroarraymethylationarraydnamethylationcpgislandancestrydna-methylation-datageneticsinferencemachine-learningplacenta

5.64 score 4 stars 1 packages 12 scripts 316 downloads

breakpointR - Find breakpoints in Strand-seq data

This package implements functions for finding breakpoints, plotting and export of Strand-seq data.

Last updated 23 days ago

softwaresequencingdnaseqsinglecellcoverage

5.60 score 8 stars 10 scripts 301 downloads

MetID - Network-based prioritization of putative metabolite IDs

This package uses an innovative network-based approach that will enhance our ability to determine the identities of significant ions detected by LC-MS.

Last updated 23 days ago

assaydomainbiologicalquestioninfrastructureresearchfieldstatisticalmethodtechnologyworkflowstepnetworkkegg

5.54 score 1 stars 70 scripts 150 downloads

IsoBayes - IsoBayes: Single Isoform protein inference Method via Bayesian Analyses

IsoBayes is a Bayesian method to perform inference on single protein isoforms. Our approach infers the presence/absence of protein isoforms, and also estimates their abundance; additionally, it provides a measure of the uncertainty of these estimates, via: i) the posterior probability that a protein isoform is present in the sample; ii) a posterior credible interval of its abundance. IsoBayes takes liquid chromatography mass spectrometry (MS) data as input, and can work with both PSM counts and intensities. When available, transcript isoform abundances (i.e., TPMs) are also incorporated: TPMs are used to formulate an informative prior for the respective protein isoform relative abundance. We further identify isoforms where the relative abundance of proteins and transcripts significantly differ. We use a two-layer latent variable approach to model two sources of uncertainty typical of MS data: i) peptides may be erroneously detected (even when absent); ii) many peptides are compatible with multiple protein isoforms. In the first layer, we sample the presence/absence of each peptide based on its estimated probability of being mistakenly detected, also known as PEP (i.e., posterior error probability). In the second layer, for peptides that were estimated as being present, we allocate their abundance across the protein isoforms they map to. These two steps allow us to recover the presence and abundance of each protein isoform.

Last updated 23 days ago

statisticalmethodbayesianproteomicsmassspectrometryalternativesplicingsequencingrnaseqgeneexpressiongeneticsvisualizationsoftware

5.50 score 7 stars 10 scripts 170 downloads

cicero - Predict cis-co-accessibility from single-cell chromatin accessibility data

Cicero computes putative cis-regulatory maps from single-cell chromatin accessibility data. It also extends monocle 2 for use in chromatin accessibility data.

Last updated 23 days ago

sequencingclusteringcellbasedassaysimmunooncologygeneregulationgenetargetepigeneticsatacseqsinglecell

5.50 score 315 scripts 541 downloads

GRaNIE - GRaNIE: Reconstruction of cell type specific gene regulatory networks including enhancers using single-cell or bulk chromatin accessibility and RNA-seq data

Genetic variants associated with diseases often affect non-coding regions, thus likely having a regulatory role. To understand the effects of genetic variants in these regulatory regions, identifying genes that are modulated by specific regulatory elements (REs) is crucial. The effect of gene regulatory elements, such as enhancers, is often cell-type specific, likely because the combinations of transcription factors (TFs) that are regulating a given enhancer have cell-type specific activity. This TF activity can be quantified with existing tools such as diffTF and captures differences in binding of a TF in open chromatin regions. Collectively, this forms a gene regulatory network (GRN) with cell-type and data-specific TF-RE and RE-gene links. Here, we reconstruct such a GRN using single-cell or bulk RNAseq and open chromatin (e.g., using ATACseq or ChIPseq for open chromatin marks) and optionally (Capture) Hi-C data. Our network contains different types of links, connecting TFs to regulatory elements, which in turn are connected to genes in the vicinity or within the same chromatin domain (TAD). We use a statistical framework to assign empirical FDRs and weights to all links using a permutation-based approach.

Last updated 23 days ago

softwaregeneexpressiongeneregulationnetworkinferencegenesetenrichmentbiomedicalinformaticsgeneticstranscriptomicsatacseqrnaseqgraphandnetworkregressiontranscriptionchipseq

5.49 score 23 scripts 226 downloads

DEWSeq - Differentially Expressed Windows Based on Negative Binomial Distribution

DEWSeq is a sliding window approach for the analysis of differentially enriched binding regions in eCLIP or iCLIP next-generation sequencing data.

Last updated 23 days ago

sequencinggeneregulationfunctionalgenomicsdifferentialexpressionbioinformaticseclipngs-analysis

5.48 score 5 stars 4 scripts 154 downloads

tRNAdbImport - Importing from tRNAdb and mitotRNAdb as GRanges objects

tRNAdbImport imports the entries of the tRNAdb and mitotRNAdb (http://trna.bioinf.uni-leipzig.de) as GRanges objects.

Last updated 23 days ago

softwarevisualizationdataimportbioconductorsequencesstructurestrnatrna-genestrna-sequencestrnadb

5.35 score 1 stars 1 packages 3 scripts 342 downloads

MEAT - Muscle Epigenetic Age Test

This package estimates epigenetic age in skeletal muscle, using DNA methylation data generated with the Illumina Infinium technology (HM27, HM450 and HMEPIC).

Last updated 23 days ago

epigeneticsdnamethylationmicroarraynormalizationbiomedicalinformaticsmethylationarraypreprocessing

5.30 score 4 scripts 152 downloads

SGCP - SGCP: A semi-supervised pipeline for gene clustering using self-training approach in gene co-expression networks

SGC is a semi-supervised pipeline for gene clustering in gene co-expression networks. SGC consists of multiple novel steps that enable the computation of highly enriched modules in an unsupervised manner. But unlike all existing frameworks, it further incorporates a novel step that leverages Gene Ontology information in a semi-supervised clustering method that further improves the quality of the computed modules.

Last updated 23 days ago

geneexpressiongenesetenrichmentnetworkenrichmentsystemsbiologyclassificationclusteringdimensionreductiongraphandnetworkneuralnetworknetworkmrnamicroarrayrnaseqvisualizationbioinformaticsgenecoexpressionnetworkgraphsnetworkclusteringnetworksself-trainingsemi-supervised-learningunsupervised-learning

5.12 score 2 stars 44 scripts 245 downloads

densvis - Density-Preserving Data Visualization via Non-Linear Dimensionality Reduction

Implements the density-preserving modification to t-SNE and UMAP described by Narayan et al. (2020) <doi:10.1101/2020.05.12.077776>. The non-linear dimensionality reduction techniques t-SNE and UMAP enable users to summarise complex high-dimensional sequencing data such as single cell RNAseq using lower dimensional representations. These lower dimensional representations enable the visualisation of discrete transcriptional states, as well as continuous trajectory (for example, in early development). However, these methods focus on the local neighbourhood structure of the data. In some cases, this results in misleading visualisations, where the density of cells in the low-dimensional embedding does not represent the transcriptional heterogeneity of data in the original high-dimensional space. den-SNE and densMAP aim to enable more accurate visual interpretation of high-dimensional datasets by producing lower-dimensional embeddings that accurately represent the heterogeneity of the original high-dimensional space, enabling the identification of homogeneous and heterogeneous cell states. This accuracy is accomplished by including in the optimisation process a term which considers the local density of points in the original high-dimensional space. This can help to create visualisations that are more representative of heterogeneity in the original high-dimensional space.

Last updated 23 days ago

dimensionreductionvisualizationsoftwaresinglecellsequencing

5.12 score 2 stars 9 scripts 2.6k downloads

RcwlPipelines - Bioinformatics pipelines based on Rcwl

A collection of Bioinformatics tools and pipelines based on R and the Common Workflow Language.

Last updated 23 days ago

softwareworkflowstepalignmentpreprocessingqualitycontroldnaseqrnaseqdataimportimmunooncology

5.07 score 1 packages 26 scripts 233 downloads

atSNP - Affinity test for identifying regulatory SNPs

atSNP performs affinity tests of motif matches with the SNP or the reference genomes and SNP-led changes in motif matches.

Last updated 23 days ago

softwarechipseqgenomeannotationmotifannotationvisualization

5.03 score 1 stars 36 scripts 317 downloads

RSeqAn - R SeqAn

Headers and some wrapper functions from the SeqAn C++ library for ease of usage in R.

Last updated 23 days ago

infrastructuresoftware

4.95 score 3 stars 1 packages 2 scripts 231 downloads

MBQN - Mean/Median-balanced quantile normalization

Modified quantile normalization for omics or other matrix-like data distorted in location and scale.

Last updated 23 days ago

normalizationpreprocessingproteomicssoftware

4.92 score 2 stars 14 scripts 154 downloads

SurfR - Surface Protein Prediction and Identification

Identify Surface Protein coding genes from a list of candidates. Systematically download data from GEO and TCGA or use your own data. Perform DGE on bulk RNAseq data. Perform Meta-analysis. Descriptive enrichment analysis and plots.

Last updated 10 days ago

softwaresequencingrnaseqgeneexpressiontranscriptiondifferentialexpressionprincipalcomponentgenesetenrichmentpathwaysbatcheffectfunctionalgenomicsvisualizationdataimportfunctionalpredictiongenepredictiongodgeenrichment-analysismetaanalysisplotsproteinspublic-datasurfacesurfaceome

4.85 score 1 stars 3 scripts 122 downloads

evaluomeR - Evaluation of Bioinformatics Metrics

Evaluating the reliability of your own metrics and the measurements done on your own datasets by analysing the stability and goodness of the classifications of such metrics.

Last updated 23 days ago

clusteringclassificationfeatureextractionassessmentclustering-evaluationevaluomeevaluomermetrics

4.82 score 33 scripts 199 downloads

DeepPINCS - Protein Interactions and Networks with Compounds based on Sequences using Deep Learning

The identification of novel compound-protein interactions (CPI) is important in drug discovery. Revealing unknown compound-protein interactions is useful for designing a new drug for a target protein by screening candidate compounds. Accurate CPI prediction supports an effective drug discovery process. To identify potential CPI effectively, prediction methods based on machine learning and deep learning have been developed. Data for sequences are provided as discrete symbolic data. In the data, compounds are represented as SMILES (simplified molecular-input line-entry system) strings and proteins are sequences in which the characters are amino acids. The outcome is defined as a variable that indicates how strongly two molecules interact with each other or whether there is an interaction between them. In this package, a deep-learning based model that takes only sequence information of both compounds and proteins as input and the outcome as output is used to predict CPI. The model is implemented by using compound and protein encoders with useful features. The CPI model also supports other modeling tasks, including protein-protein interaction (PPI), chemical-chemical interaction (CCI), or single compounds and proteins. Although the model is designed for proteins, DNA and RNA can be used if they are represented as sequences.

Last updated 23 days ago

softwarenetworkgraphandnetworkneuralnetwork

4.78 score 2 packages 4 scripts 153 downloads

rmelting - R Interface to MELTING 5

R interface to the MELTING 5 program (https://www.ebi.ac.uk/biomodels/tools/melting/) to compute melting temperatures of nucleic acid duplexes along with other thermodynamic parameters.

Last updated 23 days ago

biomedicalinformaticscheminformaticsbioconductorbioinformaticsmelting-temperature

4.78 score 2 stars 10 scripts 139 downloads

MatrixQCvis - Shiny-based interactive data-quality exploration for omics data

Data quality assessment is an integral part of preparatory data analysis to ensure sound biological information retrieval. We present here the MatrixQCvis package, which provides shiny-based interactive visualization of data quality metrics at the per-sample and per-feature level. It is broadly applicable to quantitative omics data types that come in matrix-like format (features x samples). It enables the detection of low-quality samples, drifts, outliers and batch effects in data sets. Visualizations include, among others, bar and violin plots of the (count/intensity) values, mean vs standard deviation plots, MA plots, empirical cumulative distribution function (ECDF) plots, visualizations of the distances between samples, and multiple types of dimension reduction plots. Furthermore, MatrixQCvis allows for differential expression analysis based on the limma (moderated t-tests) and proDA (Wald tests) packages. MatrixQCvis builds upon the popular Bioconductor SummarizedExperiment S4 class and thus enables easy integration into existing workflows. The package is especially tailored towards metabolomics and proteomics mass spectrometry data, but also allows assessment of the data quality of other data types that can be represented in a SummarizedExperiment object.

Last updated 23 days ago

visualizationshinyappsguiqualitycontroldimensionreductionmetabolomicsproteomicstranscriptomics

4.74 score 4 scripts 374 downloads

RESOLVE - RESOLVE: An R package for the efficient analysis of mutational signatures from cancer genomes

Cancer is a genetic disease caused by somatic mutations in genes controlling key biological functions such as cellular growth and division. Such mutations may arise through both cell-intrinsic and exogenous processes, generating characteristic mutational patterns over the genome named mutational signatures. The study of mutational signatures has become a standard component of modern genomics studies, since it can reveal which (environmental and endogenous) mutagenic processes are active in a tumor, and may highlight markers for therapeutic response. The computational analysis of mutational signatures presents many pitfalls. First, the task of determining the number of signatures is very complex and depends on heuristics. Second, several signatures have no clear etiology, raising the possibility that they are computational artifacts rather than the result of mutagenic processes. Last, approaches for signature assignment are greatly influenced by the set of signatures used for the analysis. To overcome these limitations, we developed RESOLVE (Robust EStimation Of mutationaL signatures Via rEgularization), a framework that allows the efficient extraction and assignment of mutational signatures. RESOLVE implements a novel algorithm that enables (i) the efficient extraction, (ii) exposure estimation, and (iii) confidence assessment during the computational inference of mutational signatures.

Last updated 23 days ago

biomedicalinformaticssomaticmutation

4.60 score 1 stars 3 scripts 147 downloads

MAGAR - MAGAR: R-package to compute methylation Quantitative Trait Loci (methQTL) from DNA methylation and genotyping data

"Methylation-Aware Genotype Association in R" (MAGAR) computes methQTL from DNA methylation and genotyping data from matched samples. MAGAR uses a linear modeling strategy to call CpGs/SNPs that are methQTLs. MAGAR accounts for the local correlation structure of CpGs.

Last updated 23 days ago

regressionepigeneticsdnamethylationsnpgeneticvariabilitymethylationarraymicroarraycpgislandmethylseqsequencingmrnamicroarraypreprocessingcopynumbervariationtwochannelimmunooncologydifferentialmethylationbatcheffectqualitycontroldataimportnetworkclusteringgraphandnetwork

4.60 score 3 scripts 274 downloads

DegNorm - DegNorm: degradation normalization for RNA-seq data

This package performs degradation normalization in bulk RNA-seq data to improve differential expression analysis accuracy.

Last updated 23 days ago

rnaseqnormalizationgeneexpressionalignmentcoveragedifferentialexpressionbatcheffectsoftwaresequencingimmunooncologyqualitycontroldataimport

4.60 score 1 stars 3 scripts 165 downloads

NoRCE - NoRCE: Noncoding RNA Sets Cis Annotation and Enrichment

While some non-coding RNAs (ncRNAs) are assigned critical regulatory roles, most remain functionally uncharacterized. This presents a challenge whenever an interesting set of ncRNAs needs to be analyzed in a functional context. Transcripts located close by on the genome are often regulated together. This genomic proximity can hint at a functional association. We present a tool, NoRCE, that performs cis enrichment analysis for a given set of ncRNAs. Enrichment is carried out using the functional annotations of the coding genes located proximal to the input ncRNAs. Other biologically relevant information such as topologically associating domain (TAD) boundaries, co-expression patterns, and miRNA target prediction information can be incorporated to conduct a richer enrichment analysis. To this end, NoRCE includes several relevant datasets as part of its data repository, including cell-line specific TAD boundaries, functional gene sets, and expression data for coding genes and ncRNAs specific to cancer. Additionally, users can utilize custom data files in their investigation. Enrichment results can be retrieved in a tabular format or visualized in several different ways. NoRCE is currently available for the following species: human, mouse, rat, zebrafish, fruit fly, worm, and yeast.

Last updated 23 days ago

biologicalquestiondifferentialexpressiongenomeannotationgenesetenrichmentgenetargetgenomeassemblygo

4.60 score 1 stars 6 scripts 158 downloads

brendaDb - The BRENDA Enzyme Database

R interface for importing and analyzing enzyme information from the BRENDA database.

Last updated 23 days ago

thirdpartyclientannotationdataimportbrendadatabaseenzymehacktoberfest

4.60 score 2 stars 4 scripts 153 downloads

Dune - Improving replicability in single-cell RNA-Seq cell type discovery

Given a set of clustering labels, Dune merges pairs of clusters to increase mean ARI between labels, improving replicability.

Last updated 23 days ago

clusteringgeneexpressionrnaseqsoftwaresinglecelltranscriptomicsvisualization

4.59 score 39 scripts 140 downloads
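
The Dune description relies on the adjusted Rand index (ARI) between clustering label sets. The sketch below is a plain-Python implementation of the standard ARI formula with a toy demonstration that merging two clusters can raise the ARI against a second labeling. It is an illustration only, not Dune's R code; the function name and the toy labels are hypothetical.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand index between two labelings of the same items."""
    if len(labels_a) != len(labels_b):
        raise ValueError("label vectors must have the same length")
    total_pairs = comb(len(labels_a), 2)
    if total_pairs == 0:
        return 1.0
    # Pairs of items placed together in both labelings, in A, and in B.
    together_both = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    together_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    together_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    expected = together_a * together_b / total_pairs
    maximum = (together_a + together_b) / 2
    if maximum == expected:
        return 1.0
    return (together_both - expected) / (maximum - expected)


# Toy example: labeling B splits one cluster of A into clusters 1 and 2.
labels_a = [0, 0, 1, 1]
labels_b = [0, 0, 1, 2]
# Merging clusters 1 and 2 of B makes the two labelings agree.
labels_b_merged = [1 if label == 2 else label for label in labels_b]
print(adjusted_rand_index(labels_a, labels_b))
print(adjusted_rand_index(labels_a, labels_b_merged))
```

In this toy case the merge increases the ARI, which is the kind of improvement the description says Dune seeks when it merges pairs of clusters.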

RNAseqCovarImpute - Impute Covariate Data in RNA Sequencing Studies

The RNAseqCovarImpute package makes linear model analysis for RNA sequencing read counts compatible with multiple imputation (MI) of missing covariates. A major problem with implementing MI in RNA sequencing studies is that the outcome data must be included in the imputation prediction models to avoid bias. This is difficult in omics studies with high-dimensional data. The first method we developed in the RNAseqCovarImpute package surmounts the problem of high-dimensional outcome data by binning genes into smaller groups to analyze pseudo-independently. This method implements covariate MI in gene expression studies by 1) randomly binning genes into smaller groups, 2) creating M imputed datasets separately within each bin, where the imputation predictor matrix includes all covariates and the log counts per million (CPM) for the genes within each bin, 3) estimating gene expression changes using the `limma::voom` function followed by `limma::lmFit`, separately on each of the M imputed datasets within each gene bin, 4) un-binning the gene sets and stacking the M sets of model results before applying the `limma::squeezeVar` function to apply a variance-shrinking Bayesian procedure to each of the M sets of model results, 5) pooling the results with Rubin's rules to produce combined coefficients, standard errors, and P-values, and 6) adjusting P-values for multiplicity to control the false discovery rate (FDR). A faster method uses principal component analysis (PCA) to avoid binning genes while still retaining outcome information in the MI models. Binning genes into smaller groups requires that the MI and limma-voom analyses be run many times (typically hundreds).
The more computationally efficient MI PCA method implements covariate MI in gene expression studies by 1) performing PCA on the log CPM values for all genes using the Bioconductor `PCAtools` package, 2) creating M imputed datasets where the imputation predictor matrix includes all covariates and the optimum number of PCs to retain (e.g., based on Horn’s parallel analysis or the number of PCs that account for >80% explained variation), 3) conducting the standard limma-voom pipeline with the `voom` followed by `lmFit` followed by `eBayes` functions on each of the M imputed datasets, 4) pooling the results with Rubin's rules to produce combined coefficients, standard errors, and P-values, and 5) adjusting P-values for multiplicity to control the false discovery rate (FDR).

Last updated 23 days ago

rnaseqgeneexpressiondifferentialexpressionsequencing

4.48 score 1 stars 4 scripts 106 downloads
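
The pooling step named in the RNAseqCovarImpute description, Rubin's rules, can be sketched for a single coefficient as follows. This is the generic textbook form (pooled estimate, within- and between-imputation variance) written in Python; it is not the package's R code, and the function name and the numbers are hypothetical.

```python
from math import sqrt


def pool_rubin(estimates, variances):
    """Pool one coefficient across M imputed datasets with Rubin's rules.

    estimates: per-imputation coefficient estimates
    variances: per-imputation squared standard errors
    Returns (pooled_estimate, pooled_standard_error).
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations and matching lengths")
    pooled = sum(estimates) / m
    within = sum(variances) / m                                      # average sampling variance
    between = sum((q - pooled) ** 2 for q in estimates) / (m - 1)    # variance across imputations
    total = within + (1 + 1 / m) * between
    return pooled, sqrt(total)


# Hypothetical log fold-change estimates for one gene from M = 3 imputed datasets.
estimate, standard_error = pool_rubin([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])
print(estimate, standard_error)
```

The between-imputation term is what makes the pooled standard error reflect uncertainty about the missing covariates rather than only sampling error.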

LRcell - Differential cell type change analysis using Logistic/linear Regression

The goal of LRcell is to identify specific sub-cell types that drive the changes observed in a bulk RNA-seq differential gene expression experiment. To achieve this, LRcell utilizes sets of cell marker genes acquired from single-cell RNA-sequencing (scRNA-seq) as indicators for various cell types in the tissue of interest. Next, for each cell type, using its marker genes as indicators, we apply Logistic Regression on the complete set of genes with differential expression p-values to calculate a cell-type significance p-value. Finally, these p-values are compared to predict which cell type(s) are likely to be responsible for the differential gene expression pattern observed in the bulk RNA-seq experiments. LRcell is inspired by the LRpath algorithm developed by Sartor et al., originally designed for pathway/gene set enrichment analysis. LRcell contains three major components: LRcell analysis, plot generation and marker gene selection. All modules in this package are written in R. This package also provides marker genes in the Prefrontal Cortex (pFC) human brain region, human PBMC and nine mouse brain regions (Frontal Cortex, Cerebellum, Globus Pallidus, Hippocampus, Entopeduncular, Posterior Cortex, Striatum, Substantia Nigra and Thalamus).

Last updated 23 days ago

singlecellgenesetenrichmentsequencingregressiongeneexpressiondifferentialexpressionenrichmentmarker-genes

4.48 score 3 stars 5 scripts 139 downloads

PDATK - Pancreatic Ductal Adenocarcinoma Tool-Kit

Pancreatic ductal adenocarcinoma (PDA) has a relatively poor prognosis and is one of the most lethal cancers. Molecular classification of gene expression profiles holds the potential to identify meaningful subtypes which can inform therapeutic strategy in the clinical setting. The Pancreatic Ductal Adenocarcinoma Tool-Kit (PDATK) provides an S4 class-based interface for performing unsupervised subtype discovery, cross-cohort meta-clustering, gene-expression-based classification, and subsequent survival analysis to identify prognostically useful subtypes in pancreatic cancer and beyond. Two novel methods, Consensus Subtypes in Pancreatic Cancer (CSPC) and Pancreatic Cancer Overall Survival Predictor (PCOSP) are included for consensus-based meta-clustering and overall-survival prediction, respectively. Additionally, four published subtype classifiers and three published prognostic gene signatures are included to allow users to easily recreate published results, apply existing classifiers to new data, and benchmark the relative performance of new methods. The use of existing Bioconductor classes as input to all PDATK classes and methods enables integration with existing Bioconductor datasets, including the 21 pancreatic cancer patient cohorts available in the MetaGxPancreas data package. PDATK has been used to replicate results from Sandhu et al. (2019) [https://doi.org/10.1200/cci.18.00102] and an additional paper is in the works using CSPC to validate subtypes from the included published classifiers, both of which use the data available in MetaGxPancreas. The inclusion of subtype centroids and prognostic gene signatures from these and other publications will enable researchers and clinicians to classify novel patient gene expression data, allowing the direct clinical application of the classifiers included in PDATK.
Overall, PDATK provides a rich set of tools to identify and validate useful prognostic and molecular subtypes based on gene-expression data, benchmark new classifiers against existing ones, and apply discovered classifiers on novel patient data to inform clinical decision making.

Last updated 23 days ago

geneexpressionpharmacogeneticspharmacogenomicssoftwareclassificationsurvivalclusteringgeneprediction

4.31 score 1 stars 17 scripts 372 downloads

RNAdecay - Maximum Likelihood Decay Modeling of RNA Degradation Data

RNA degradation is monitored through measurement of RNA abundance after inhibiting RNA synthesis. This package has functions and example scripts to facilitate (1) data normalization, (2) data modeling using constant decay rate or time-dependent decay rate models, (3) the evaluation of treatment or genotype effects, and (4) plotting of the data and models. Data Normalization: functions and scripts simplify normalization to the initial (T0) RNA abundance, and provide a method to correct for artificial inflation of Reads per Million (RPM) abundance in global assessments as the total size of the RNA pool decreases. Modeling: Normalized data is then modeled using maximum likelihood to fit parameters. For making treatment or genotype comparisons (up to four), the modeling step models all possible treatment effects on each gene by repeating the modeling with constraints on the model parameters (e.g., the decay rates of treatments A and B are modeled once constrained to be equal and again allowed to vary independently). Model Selection: The AICc value is calculated for each model, and the model with the lowest AICc is chosen. Modeling results of selected models are then compiled into a single data frame. Graphical Plotting: functions are provided to easily visualize decay data, models, or half-life distributions using ggplot2 package functions.

Last updated 23 days ago

immunooncologysoftwaregeneexpressiongeneregulationdifferentialexpressiontranscriptiontranscriptomicstimecourseregressionrnaseqnormalizationworkflowstep

4.30 score 2 scripts 150 downloads
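
The model selection step in the RNAdecay description (compute AICc for each fitted model, keep the lowest) can be sketched with the standard small-sample corrected AIC formula. The Python below is a generic illustration, not the package's R code; the function names, model names, log-likelihoods, and parameter counts are hypothetical.

```python
def aicc(log_likelihood, n_params, n_obs):
    """Small-sample corrected AIC: 2k - 2 ln L + 2k(k + 1) / (n - k - 1)."""
    if n_obs - n_params - 1 <= 0:
        raise ValueError("AICc requires n_obs > n_params + 1")
    aic = 2 * n_params - 2 * log_likelihood
    return aic + 2 * n_params * (n_params + 1) / (n_obs - n_params - 1)


def select_model(fits, n_obs):
    """Return the name of the model with the lowest AICc.

    fits maps a model name to (maximized log-likelihood, number of parameters).
    """
    return min(fits, key=lambda name: aicc(fits[name][0], fits[name][1], n_obs))


# Hypothetical fits for one gene: the extra parameters of the second model
# buy only a small gain in log-likelihood.
fits = {
    "constant_decay": (-12.0, 2),
    "time_dependent_decay": (-11.5, 4),
}
best = select_model(fits, n_obs=10)
print(best)
```

The correction term grows quickly when the number of parameters approaches the number of observations, so richer decay models are only selected when they improve the fit substantially.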

betaHMM - A Hidden Markov Model Approach for Identifying Differentially Methylated Sites and Regions for Beta-Valued DNA Methylation Data

A novel approach that uses a homogeneous hidden Markov model to effectively model untransformed beta values and identify DMCs while considering the spatial correlation of adjacent CpG sites.

Last updated 23 days ago

dnamethylationdifferentialmethylationimmunooncologybiomedicalinformaticsmethylationarraysoftwaremultiplecomparisonsequencingspatialcoveragegenetargethiddenmarkovmodelmicroarray

4.30 score 106 downloads

netboost - Network Analysis Supported by Boosting

Boosting supported network analysis for high-dimensional omics applications. This package comes bundled with the MC-UPGMA clustering package by Yaniv Loewenstein.

Last updated 23 days ago

softwarestatisticalmethodgraphandnetworknetworkclusteringdimensionreductionbiomedicalinformaticsepigeneticsmetabolomicstranscriptomics

4.18 score 1 scripts 136 downloads

LoomExperiment - LoomExperiment container

The LoomExperiment package provides a means to easily convert the Bioconductor "Experiment" classes to loom files and vice versa.

Last updated 23 days ago

immunooncologydatarepresentationdataimportinfrastructuresinglecell

4.17 score 74 scripts 908 downloads

qPLEXanalyzer - Tools for quantitative proteomics data analysis

Tools for TMT based quantitative proteomics data analysis.

Last updated 23 days ago

immunooncologyproteomicsmassspectrometrynormalizationpreprocessingqualitycontroldataimport

4.08 score 1 stars 9 scripts 187 downloads

GSEAmining - Make Biological Sense of Gene Set Enrichment Analysis Outputs

Gene Set Enrichment Analysis is a very powerful and interesting computational method that allows an easy correlation between differentially expressed genes and biological processes. Unfortunately, although it was designed to help researchers interpret gene expression data, it can generate huge amounts of results whose biological meaning can be difficult to interpret. Many available tools rely on the hierarchically structured Gene Ontology (GO) classification to reduce redundancy in the results. However, due to the popularity of GSEA, many more gene set collections, such as those in the Molecular Signatures Database, are emerging. Since these collections are not organized like those in GO, their usage for GSEA does not always give a straightforward answer or, in other words, getting all the meaningful information can be challenging with the currently available tools. For these reasons, GSEAmining was born to be an easy tool to create reproducible reports to help researchers make biological sense of GSEA outputs. Given the results of GSEA, GSEAmining clusters the different gene set collections based on the presence of the same genes in the leading edge (core) subset. Leading edge subsets are those genes that contribute most to the enrichment score of each collection of genes or gene sets. For this reason, gene sets that participate in similar biological processes should share genes in common and in turn cluster together. After that, GSEAmining is able to identify and represent for each cluster: - The most enriched terms in the names of gene sets (as wordclouds) - The most enriched genes in the leading edge subsets (as bar plots). In each case, positive and negative enrichments are shown in different colors so it is easy to distinguish biological processes or genes that may be of interest in that particular study.
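
As a rough illustration of clustering gene sets by shared leading-edge genes, the sketch below scores pairwise overlap with the Jaccard index. This is an assumption made for the example: the description does not state which similarity measure or clustering method GSEAmining uses, and the gene set names and members are hypothetical.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard index: shared genes divided by all genes in either set."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical leading-edge subsets for three gene sets.
leading_edge = {
    "SET_A": {"TP53", "MDM2", "CDKN1A"},
    "SET_B": {"TP53", "CDKN1A", "BAX"},
    "SET_C": {"ACTB", "MYH9"},
}

# Pairwise similarity; sets sharing many core genes would cluster together.
pairwise = {
    (x, y): jaccard(leading_edge[x], leading_edge[y])
    for x, y in combinations(sorted(leading_edge), 2)
}
for pair, score in pairwise.items():
    print(pair, score)
```

A similarity matrix of this kind could then be passed to any hierarchical clustering routine, so that SET_A and SET_B group together while SET_C stays apart.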

Last updated 23 days ago

genesetenrichmentclusteringvisualization

4.00 score 7 scripts 152 downloads

scTGIF - Cell type annotation for unannotated single-cell RNA-Seq data

scTGIF connects the cells and the related gene functions without cell type labels.

Last updated 23 days ago

dimensionreductionqualitycontrolsinglecellsoftwaregeneexpression

4.00 score 2 scripts 414 downloads

SCANVIS - SCANVIS - a tool for SCoring, ANnotating and VISualizing splice junctions

SCANVIS is a set of annotation-dependent tools for analyzing splice junctions and their read support as predetermined by an alignment tool of choice (for example, STAR aligner). SCANVIS assesses each junction's relative read support (RRS) by relating to the context of local split reads aligning to annotated transcripts. SCANVIS also annotates each splice junction by indicating whether the junction is supported by annotation or not, and if not, what type of junction it is (e.g. exon skipping, alternative 5' or 3' events, Novel Exons). Unannotated junctions are also further annotated by indicating whether they induce a frame shift or not. SCANVIS includes a visualization function to generate static sashimi-style plots depicting relative read support and number of split reads using arc thickness and arc heights, making it easy for users to spot well-supported junctions. These plots also clearly delineate unannotated junctions from annotated ones using designated color schemes, and users can also highlight splice junctions of choice. Variants and/or a read profile are also incorporated into the plot if the user supplies variants in bed format and/or the BAM file. One further feature of the visualization function is that users can submit multiple samples of a certain disease or cohort to generate a single plot - this occurs via a "merge" function wherein junction details over multiple samples are merged to generate a single sashimi plot, which is useful when contrasting cohorts (e.g. disease vs control).

Last updated 23 days ago

softwareresearchfieldtranscriptomicsworkflowstepannotationvisualization

4.00 score 1 scripts 157 downloads

fobitools - Tools for Manipulating the FOBI Ontology

A set of tools for interacting with the Food-Biomarker Ontology (FOBI). A collection of basic manipulation tools for biological significance analysis, graphs, and text mining strategies for annotating nutritional data.

Last updated 23 days ago

massspectrometrymetabolomicssoftwarevisualizationbiomedicalinformaticsgraphandnetworkannotationcheminformaticspathwaysgenesetenrichmentbiological-intrerpretationbiological-knowledgebiological-significance-analysisenrichment-analysisfood-biomarker-ontologyknowledge-graphnutritionobofoundryontologytext-mining

3.70 score 1 stars 5 scripts 187 downloads

ssrch - a simple search engine

Demonstrate tokenization and a search gadget for collections of CSV files.

Last updated 23 days ago

infrastructure

3.60 score 20 scripts 114 downloads

SpatialOmicsOverlay - Spatial Overlay for Omic Data from Nanostring GeoMx Data

Tools for NanoString Technologies GeoMx Technology. Package to easily graph on top of an OME-TIFF image. Plotting annotations can range from tissue segment to gene expression.

Last updated 23 days ago

geneexpressiontranscriptioncellbasedassaysdataimporttranscriptomicsproteomicsproprietaryplatformsrnaseqspatialdatarepresentationvisualization

3.48 score 7 scripts 174 downloads

LRBaseDbi - DBI to construct LRBase-related package

Interface to construct LRBase package (LRBase.XXX.eg.db).

Last updated 23 days ago

infrastructure

3.48 score 1 scripts 217 downloads

sampleClassifier - Sample Classifier

The package is designed to classify microarray and RNA-seq gene expression profiles.

Last updated 23 days ago

immunooncologyclassificationmicroarrayrnaseqgeneexpression

3.30 score 222 downloads

optimalFlow - optimalFlow

Optimal-transport techniques applied to supervised flow cytometry gating.

Last updated 23 days ago

softwareflowcytometrytechnology

3.30 score 1 scripts 162 downloads

ggtree - an R package for visualization of tree and annotation data

'ggtree' extends the 'ggplot2' plotting system, which implements the grammar of graphics. 'ggtree' is designed for visualization and annotation of phylogenetic trees and other tree-like structures with their annotation data.

Last updated 23 days ago

alignmentannotationclusteringdataimportmultiplesequencealignmentphylogeneticsreproducibleresearchsoftwarevisualizationannotationsggplot2phylogenetic-trees

16.90 score 839 stars 108 packages 5.0k scripts 34k downloads

maftools - Summarize, Analyze and Visualize MAF Files

Analyze and visualize Mutation Annotation Format (MAF) files from large scale sequencing studies. This package provides various functions to perform the most commonly used analyses in cancer genomics and to create feature-rich, customizable visualizations with minimal effort.

Last updated 23 days ago

datarepresentationdnaseqvisualizationdrivermutationvariantannotationfeatureextractionclassificationsomaticmutationsequencingfunctionalgenomicssurvivalbioinformaticscancer-genome-atlascancer-genomicsgenomicsmaf-filestcga

14.66 score 447 stars 18 packages 904 scripts 3.9k downloads

scran - Methods for Single-Cell RNA-Seq Data Analysis

Implements miscellaneous functions for interpretation of single-cell RNA-seq data. Methods are provided for assignment of cell cycle phase, detection of highly variable and significantly correlated genes, identification of marker genes, and other common tasks in routine single-cell analysis workflows.

Last updated 23 days ago

immunooncologynormalizationsequencingrnaseqsoftwaregeneexpressiontranscriptomicssinglecellclusteringbioconductor-packagehuman-cell-atlassingle-cell-rna-seq

13.21 score 40 stars 36 packages 7.6k scripts 8.2k downloads

MAST - Model-based Analysis of Single Cell Transcriptomics

Methods and models for handling zero-inflated single cell assay data.

Last updated 23 days ago

geneexpressiondifferentialexpressiongenesetenrichmentrnaseqtranscriptomicssinglecell

12.78 score 228 stars 5 packages 2.0k scripts 3.3k downloads

Cardinal - A mass spectrometry imaging toolbox for statistical analysis

Implements statistical & computational tools for analyzing mass spectrometry imaging datasets, including methods for efficient pre-processing, spatial segmentation, and classification.

Last updated 23 days ago

softwareinfrastructureproteomicslipidomicsmassspectrometryimagingmassspectrometryimmunooncologynormalizationclusteringclassificationregression

10.26 score 42 stars 187 scripts 504 downloads

pcaExplorer - Interactive Visualization of RNA-seq Data Using a Principal Components Approach

This package provides functionality for interactive visualization of RNA-seq datasets based on Principal Components Analysis. The methods provided allow for quick information extraction and effective data exploration. A Shiny application encapsulates the whole analysis.

Last updated 23 days ago

immunooncologyvisualizationrnaseqdimensionreductionprincipalcomponentqualitycontrolguireportwritingshinyappsbioconductorprincipal-componentsreproducible-researchrna-seq-analysisrna-seq-datashinytranscriptomeuser-friendly

9.60 score 55 stars 171 scripts 794 downloads

ProtGenerics - Generic infrastructure for Bioconductor mass spectrometry packages

S4 generic functions and classes needed by Bioconductor proteomics packages.

Last updated 23 days ago

infrastructureproteomicsmassspectrometrybioconductormass-spectrometrymetabolomics

9.38 score 8 stars 187 packages 4 scripts 19k downloads

AneuFinder - Analysis of Copy Number Variation in Single-Cell-Sequencing Data

AneuFinder implements functions for copy-number detection, breakpoint detection, and karyotype and heterogeneity analysis in single-cell whole genome sequencing and strand-seq data.

Last updated 23 days ago

immunooncologysoftwaresequencingsinglecellcopynumbervariationgenomicvariationhiddenmarkovmodelwholegenome

7.70 score 17 stars 37 scripts 424 downloads

ropls - PCA, PLS(-DA) and OPLS(-DA) for multivariate analysis and feature selection of omics data

Latent variable modeling with Principal Component Analysis (PCA) and Partial Least Squares (PLS) provides powerful methods for visualization, regression, classification, and feature selection of omics data where the number of variables exceeds the number of samples and where variables are multicollinear. Orthogonal Partial Least Squares (OPLS) makes it possible to separately model the variation correlated with the factor of interest (predictive) and the uncorrelated (orthogonal) variation. While performing similarly to PLS, OPLS facilitates interpretation. Successful applications of these chemometrics techniques include spectroscopic data such as Raman spectroscopy, nuclear magnetic resonance (NMR), mass spectrometry (MS) in metabolomics and proteomics, but also transcriptomics data. In addition to scores, loadings and weights plots, the package provides metrics and graphics to determine the optimal number of components (e.g. with the R2 and Q2 coefficients), check the validity of the model by permutation testing, detect outliers, and perform feature selection (e.g. with Variable Importance in Projection or regression coefficients). The package can be accessed via a user interface on the Workflow4Metabolomics.org online resource for computational metabolomics (built upon the Galaxy environment).

Last updated 23 days ago

regressionclassificationprincipalcomponenttranscriptomicsproteomicsmetabolomicslipidomicsmassspectrometryimmunooncology

7.69 score 8 packages 191 scripts 1.8k downloads

sevenbridges - Seven Bridges Platform API Client and Common Workflow Language Tool Builder in R

R client and utilities for Seven Bridges platform API, from Cancer Genomics Cloud to other Seven Bridges supported platforms.

Last updated 23 days ago

softwaredataimportthirdpartyclientapi-clientbioconductorbioinformaticscloudcommon-workflow-languagesevenbridges

7.40 score 35 stars 24 scripts 198 downloads

isomiRs - Analyze isomiRs and miRNAs from small RNA-seq

Characterization of miRNAs and isomiRs, clustering and differential expression.

Last updated 23 days ago

mirnarnaseqdifferentialexpressionclusteringimmunooncologyanalyze-isomirsbioconductorisomirs

7.05 score 8 stars 39 scripts 309 downloads

RnBeads - RnBeads

RnBeads facilitates comprehensive analysis of various types of DNA methylation data at the genome scale.

Last updated 23 days ago

dnamethylationmethylationarraymethylseqepigeneticsqualitycontrolpreprocessingbatcheffectdifferentialmethylationsequencingcpgislandimmunooncologytwochanneldataimport

6.93 score 1 packages 158 scripts 639 downloads

DEFormats - Differential gene expression data formats converter

Convert between different data formats used by differential gene expression analysis tools.

Last updated 23 days ago

immunooncologydifferentialexpressiongeneexpressionrnaseqsequencingtranscription

6.79 score 4 stars 1 packages 74 scripts 414 downloads

kebabs - Kernel-Based Analysis of Biological Sequences

The package provides functionality for kernel-based analysis of DNA, RNA, and amino acid sequences via SVM-based methods. As core functionality, kebabs implements the following sequence kernels: spectrum kernel, mismatch kernel, gappy pair kernel, and motif kernel. Apart from an efficient implementation of standard position-independent functionality, the kernels are extended in a novel way to take the position of patterns into account for the similarity measure. Because of the flexibility of the kernel formulation, other kernels like the weighted degree kernel or the shifted weighted degree kernel with constant weighting of positions are included as special cases. An annotation-specific variant of the kernels uses annotation information placed along the sequence together with the patterns in the sequence. The package allows for the generation of a kernel matrix or an explicit feature representation in dense or sparse format for all available kernels, which can be used with methods implemented in other R packages. With a focus on SVM-based methods, kebabs provides a framework which simplifies the usage of existing SVM implementations in kernlab, e1071, and LiblineaR. Binary and multi-class classification as well as regression tasks can be used in a unified way without having to deal with the different functions, parameters, and formats of the selected SVM. As support for choosing hyperparameters, the package provides cross validation (including grouped cross validation), grid search, and model selection functions. For easier biological interpretation of the results, the package computes feature weights for all SVMs and prediction profiles which show the contribution of individual sequence positions to the prediction result and indicate the relevance of sequence sections for the learning result and the underlying biological functions.

Last updated 23 days ago

supportvectormachineclassificationclusteringregression

6.68 score 3 packages 44 scripts 480 downloads

dupRadar - Assessment of duplication rates in RNA-Seq datasets

Duplication rate quality control for RNA-Seq datasets.

Last updated 23 days ago

technologysequencingrnaseqqualitycontrolimmunooncology

6.48 score 1 stars 60 scripts 296 downloads

csaw - ChIP-Seq Analysis with Windows

Detection of differentially bound regions in ChIP-seq data with sliding windows, with methods for normalization and proper FDR control.

Last updated 23 days ago

multiplecomparisonchipseqnormalizationsequencingcoveragegeneticsannotationdifferentialpeakcalling

6.48 score 7 packages 474 scripts 893 downloads

subSeq - Subsampling of high-throughput sequencing count data

Subsampling of high throughput sequencing count data for use in experiment design and analysis.

Last updated 23 days ago

immunooncologysequencingtranscriptionrnaseqgeneexpressiondifferentialexpression

6.36 score 19 stars 20 scripts 202 downloads

DiffLogo - DiffLogo: A comparative visualisation of biooligomer motifs

DiffLogo is an easy-to-use tool to visualize motif differences.

Last updated 23 days ago

softwaresequencematchingmultiplecomparisonmotifannotationvisualizationalignment

5.81 score 8 stars 27 scripts 196 downloads

R3CPET - 3CPET: Finding Co-factor Complexes in Chia-PET experiment using a Hierarchical Dirichlet Process

The package provides a method to infer the set of proteins that are most likely to work together to maintain chromatin interactions, given the results of a ChIA-PET experiment.

Last updated 23 days ago

networkinferencegenepredictionbayesiangraphandnetworknetworkgeneexpressionhicchia-petchromatin-interactiondirichlet-process-mixturestranscription-facto

5.62 score 4 stars 5 scripts 196 downloads

BEclear - Correction of batch effects in DNA methylation data

Provides functions to detect and correct for batch effects in DNA methylation data. The core function is based on latent factor models and can also be used to predict missing values in any other matrix containing real numbers.

Last updated 23 days ago

batcheffectdnamethylationsoftwarepreprocessingstatisticalmethodbatch-effectsbioconductor-packagedna-methylationlatent-factor-modelmethylationmissing-datamissing-valuesstochastic-gradient-descent

5.38 score 4 stars 10 scripts 298 downloads

globalSeq - Global Test for Counts

The method may be conceptualised as a test of overall significance in regression analysis, where the response variable is overdispersed and the number of explanatory variables exceeds the sample size. Useful for testing for association between RNA-Seq and high-dimensional data.

Last updated 23 days ago

geneexpressionexonarraydifferentialexpressiongenomewideassociationtranscriptomicsdimensionreductionregressionsequencingwholegenomernaseqexomeseqmirnamultiplecomparison

5.32 score 1 stars 4 scripts 176 downloads

RCyjs - Display and manipulate graphs in cytoscape.js

Interactive viewing and exploration of graphs, connecting R to Cytoscape.js, using websockets.

Last updated 23 days ago

visualizationgraphandnetworkthirdpartyclient

4.68 score 48 scripts 208 downloads

GeneBreak - Gene Break Detection

Recurrent breakpoint gene detection on copy number aberration profiles.

Last updated 23 days ago

acghcopynumbervariationdnaseqgeneticssequencingwholegenomevisualization

4.60 score 2 stars 6 scripts 204 downloads

conumee - Enhanced copy-number variation analysis using Illumina DNA methylation arrays

This package contains a set of processing and plotting methods for performing copy-number variation (CNV) analysis using Illumina 450k or EPIC methylation arrays.

Last updated 23 days ago

copynumbervariationdnamethylationmethylationarraymicroarraynormalizationpreprocessingqualitycontrolsoftware

4.46 score 29 scripts 394 downloads

ChIPComp - Quantitative comparison of multiple ChIP-seq datasets

ChIPComp detects differentially bound sharp binding sites across multiple conditions considering matching control.

Last updated 23 days ago

chipseqsequencingtranscriptiongeneticscoveragemultiplecomparisondataimport

4.01 score 51 scripts 316 downloads

SNPhood - SNPhood: Investigate, quantify and visualise the epigenomic neighbourhood of SNPs using NGS data

To date, thousands of single nucleotide polymorphisms (SNPs) have been found to be associated with complex traits and diseases. However, the vast majority of these disease-associated SNPs lie in the non-coding part of the genome, and are likely to affect regulatory elements, such as enhancers and promoters, rather than the function of a protein. Thus, to understand the molecular mechanisms underlying genetic traits and diseases, it becomes increasingly important to study the effect of a SNP on nearby molecular traits such as chromatin environment or transcription factor (TF) binding. Towards this aim, we developed SNPhood, a user-friendly Bioconductor R package to investigate and visualize the local neighborhood of a set of SNPs of interest for NGS data such as chromatin marks or transcription factor binding sites from ChIP-Seq or RNA-Seq experiments. SNPhood comprises a set of easy-to-use functions to extract, normalize and summarize reads for a genomic region, perform various data quality checks, normalize read counts using additional input files, and to cluster and visualize the regions according to the binding pattern. The regions around each SNP can be binned in a user-defined fashion to allow for analysis of very broad patterns as well as a detailed investigation of specific binding shapes. Furthermore, SNPhood supports the integration with genotype information to investigate and visualize genotype-specific binding patterns. Finally, SNPhood can be employed for determining, investigating, and visualizing allele-specific binding patterns around the SNPs of interest.

Last updated 23 days ago

software

3.90 score 1 scripts 196 downloads

ExperimentHubData - Add resources to ExperimentHub

Functions to add metadata to ExperimentHub db and resource files to AWS S3 buckets.

Last updated 23 days ago

infrastructuredataimportguithirdpartyclient

3.78 score 1 packages 6 scripts 622 downloads

iGC - An integrated analysis package of Gene expression and Copy number alteration

This package is intended to identify differentially expressed genes driven by Copy Number Alterations from samples with both gene expression and CNA data.

Last updated 23 days ago

softwarebiological questiondifferentialexpressiongenomicvariationassaydomaincopynumbervariationgeneexpressionresearchfieldgeneticstechnologymicroarraysequencingworkflowstepmultiplecomparison

3.78 score 1 stars 1 scripts 208 downloads

pandaR - PANDA Algorithm

Runs PANDA, an algorithm for discovering novel network structure by combining information from multiple complementary data sources.

Last updated 23 days ago

statisticalmethodgraphandnetworkmicroarraygeneregulationnetworkinferencegeneexpressiontranscriptionnetwork

3.78 score 1 packages 8 scripts 246 downloads

CONFESS - Cell OrderiNg by FluorEScence Signal

Single Cell Fluidigm Spot Detector.

Last updated 23 days ago

immunooncologygeneexpressiondataimportcellbiologyclusteringrnaseqqualitycontrolvisualizationtimecourseregressionclassification

3.60 score 2 scripts 210 downloads

MethTargetedNGS - Perform Methylation Analysis on Next Generation Sequencing Data

Perform step by step methylation analysis of Next Generation Sequencing data.

Last updated 23 days ago

researchfieldgeneticssequencingalignmentsequencematchingdataimport

3.48 score 1 scripts 194 downloads

sscu - Strength of Selected Codon Usage

The package calculates indexes of selective strength in codon usage in bacterial species. (1) The package can calculate the strength of selected codon usage bias (sscu, also named s_index) based on Paul Sharp's method. The method takes into account the background mutation rate and focuses only on four pairs of codons with universal translational advantages in all bacterial species. Thus the sscu index is comparable among different species. (2) The package can detect the strength of translational accuracy selection by Akashi's test. The test tabulates all codons into four categories according to conserved/variable amino acids and optimal/non-optimal codons. (3) Optimal codon lists (selected codons) can be calculated by either the op_highly function (which compares the highly expressed genes with all genes to identify optimal codons), or the op_corre_CodonW/op_corre_NCprime functions (which use the correlative method developed by Hershberg & Petrov). Users will have a list of optimal codons for further analysis, such as input to Akashi's test. (4) Detailed codon usage information, such as the RSCU value, the number of optimal codons in the highly expressed/all gene sets, as well as the genomic gc3 value, can be calculated by the optimal_codon_statistics and genomic_gc3 functions. (5) Furthermore, we added one test function, low_frequency_op, to the package. The function tries to find the low frequency optimal codons among all the optimal codons identified by the op_highly function.
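
The RSCU value mentioned in item (4) can be sketched from its standard definition: the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. This is a generic illustration and not the package's implementation; the function name `rscu` and the codon counts are hypothetical.

```python
def rscu(codon_counts: dict, synonymous: list) -> dict:
    """Relative synonymous codon usage for one amino acid.

    codon_counts maps codon -> observed count;
    synonymous lists all codons encoding the same amino acid.
    """
    total = sum(codon_counts.get(c, 0) for c in synonymous)
    if total == 0:
        # Amino acid absent from the gene set: no usage to report.
        return {c: 0.0 for c in synonymous}
    expected = total / len(synonymous)  # count under equal usage
    return {c: codon_counts.get(c, 0) / expected for c in synonymous}


# Hypothetical counts for the two phenylalanine codons.
counts = {"TTT": 30, "TTC": 10}
phe = rscu(counts, ["TTT", "TTC"])
print(phe)
```

Values above 1 indicate a codon used more often than expected under equal usage, and values below 1 indicate one used less often.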

Last updated 23 days ago

geneticsgeneexpressionwholegenome

2.30 score 1 scripts 162 downloads

ISoLDE - Integrative Statistics of alleLe Dependent Expression

This package provides ISoLDE, a new method for identifying imprinted genes. This method is dedicated to data arising from RNA sequencing technologies. The ISoLDE package implements original statistical methodology described in the publication below.

Last updated 23 days ago

immunooncologygeneexpressiontranscriptiongenesetenrichmentgeneticssequencingrnaseqmultiplecomparisonsnpgeneticvariabilityepigeneticsmathematicalbiologygeneregulation

2.30 score 2 scripts 140 downloads

scater - Single-Cell Analysis Toolkit for Gene Expression Data in R

A collection of tools for doing various analyses of single-cell RNA-seq gene expression data, with a focus on quality control and visualization.

Last updated 23 days ago

immunooncologysinglecellrnaseqqualitycontrolpreprocessingnormalizationvisualizationdimensionreductiontranscriptomicsgeneexpressionsequencingsoftwaredataimportdatarepresentationinfrastructurecoverage

11.05 score 39 packages 11k scripts 12k downloads

SC3 - Single-Cell Consensus Clustering

A tool for unsupervised clustering and analysis of single cell RNA-Seq data.

Last updated 23 days ago

immunooncologysinglecellsoftwareclassificationclusteringdimensionreductionsupportvectormachinernaseqvisualizationtranscriptomicsdatarepresentationguidifferentialexpressiontranscriptionbioconductor-packagehuman-cell-atlassingle-cell-rna-seq

10.06 score 119 stars 1 packages 354 scripts 618 downloads

GenVisR - Genomic Visualizations in R

Produce highly customizable publication quality graphics for genomic data primarily at the cohort level.

Last updated 23 days ago

infrastructuredatarepresentationclassificationdnaseq

9.79 score 210 stars 70 scripts 519 downloads

ROTS - Reproducibility-Optimized Test Statistic

Calculates the Reproducibility-Optimized Test Statistic (ROTS) for differential testing in omics data.

Last updated 23 days ago

softwaregeneexpressiondifferentialexpressionmicroarrayrnaseqproteomicsimmunooncology

6.19 score 3 packages 86 scripts 466 downloads

iCARE - Individualized Coherent Absolute Risk Estimation (iCARE)

An R package to build, validate and apply absolute risk models

Last updated 23 days ago

softwarestatisticalmethodgenomewideassociation

4.78 score 8 scripts 319 downloads

chromPlot - Global visualization tool of genomic data

Package designed to visualize genomic data along the chromosomes, where the vertical chromosomes are sorted by number, with sex chromosomes at the end.

Last updated 23 days ago

datarepresentationfunctionalgenomicsgeneticssequencingannotationvisualization

4.53 score 24 scripts 446 downloads

cellity - Quality Control for Single-Cell RNA-seq Data

A support vector machine approach to identifying and filtering low quality cells from single-cell RNA-seq datasets.

Last updated 23 days ago

immunooncologyrnaseqqualitycontrolpreprocessingnormalizationvisualizationdimensionreductiontranscriptomicsgeneexpressionsequencingsoftwaresupportvectormachine

4.00 score 9 scripts 202 downloads

Chicago - CHiCAGO: Capture Hi-C Analysis of Genomic Organization

A pipeline for analysing Capture Hi-C data.

Last updated 23 days ago

epigeneticshicsequencingsoftware

3.78 score 30 scripts 308 downloads

RGraph2js - Convert a Graph into a D3js Script

Generator of web pages which display interactive network/graph visualizations with D3js, jQuery and Raphael.

Last updated 23 days ago

visualizationnetworkgraphandnetworkthirdpartyclient

3.30 score 1 scripts 184 downloads

transcriptR - An Integrative Tool for ChIP- And RNA-Seq Based Primary Transcripts Detection and Quantification

The differences in the RNA types being sequenced have an impact on the resulting sequencing profiles. mRNA-seq data is enriched with reads derived from exons, while GRO-, nucRNA- and chrRNA-seq demonstrate substantially broader coverage of both exonic and intronic regions. The presence of intronic reads in GRO-seq type data makes it possible to computationally identify and quantify all de novo continuous regions of transcription distributed across the genome. This type of data, however, is more challenging to interpret and less commonly used than mRNA-seq. One of the challenges for primary transcript detection concerns the simultaneous transcription of closely spaced genes, which needs to be properly divided into individually transcribed units. The R package transcriptR combines RNA-seq data with ChIP-seq data of histone modifications that mark active Transcription Start Sites (TSSs), such as H3K4me3 or H3K9/14Ac, to overcome this challenge. The advantage of this approach over the use of, for example, gene annotations is that it is data driven and therefore also able to deal with novel and case-specific events. Furthermore, the integration of ChIP- and RNA-seq data allows the identification of all known and novel active transcription start sites within a given sample.

Last updated 23 days ago

immunooncologytranscriptionsoftwaresequencingrnaseqcoverage

3.30 score 2 scripts 300 downloads

profileScoreDist - Profile score distributions

Regularization and score distributions for position count matrices.

Last updated 23 days ago

softwaregeneregulationstatisticalmethod

3.30 score 1 scripts 182 downloads

CausalR - Causal network analysis methods

Causal network analysis methods for regulator prediction and network reconstruction from genome scale data.

Last updated 23 days ago

immunooncologysystemsbiologynetworkgraphandnetworknetwork inferencetranscriptomicsproteomicsdifferentialexpressionrnaseqmicroarray

3.30 score 7 scripts 170 downloads

Biobase - Biobase: Base functions for Bioconductor

Functions that are needed by many other packages or which replace R functions.

Last updated 23 days ago

infrastructurebioconductor-packagecore-package

16.48 score 9 stars 1.8k packages 7.0k scripts 94k downloads

CNEr - CNE Detection and Visualization

Large-scale identification and advanced visualization of sets of conserved noncoding elements.

Last updated 23 days ago

generegulationvisualizationdataimport

9.27 score 3 stars 19 packages 34 scripts 5.9k downloads

Rcpi - Molecular Informatics Toolkit for Compound-Protein Interaction in Drug Discovery

A molecular informatics toolkit with an integration of bioinformatics and chemoinformatics tools for drug discovery.

Last updated 23 days ago

softwaredataimportdatarepresentationfeatureextractioncheminformaticsbiomedicalinformaticsproteomicsgosystemsbiologybioconductorbioinformaticsdrug-discoveryfeature-extractionfingerprintmolecular-descriptorsprotein-sequences

7.78 score 36 stars 28 scripts 290 downloads

rols - An R interface to the Ontology Lookup Service

The rols package is an interface to the Ontology Lookup Service (OLS) to access and query hundreds of ontologies directly from R.

Last updated 23 days ago

immunooncologysoftwareannotationmassspectrometrygo

8.57 score 11 stars 5 packages 84 scripts 722 downloads

AnnotationForge - Tools for building SQLite-based annotation data packages

Provides code for generating Annotation packages and their databases. Packages produced are intended to be used with AnnotationDbi.

Last updated 23 days ago

annotationinfrastructurebioconductor-packagecore-package

8.41 score 4 stars 20 packages 125 scripts 3.3k downloads

hpar - Human Protein Atlas in R

The hpar package provides a simple R interface to and data from the Human Protein Atlas project.

Last updated 23 days ago

proteomicscellbiologydataimportfunctionalgenomicssystemsbiologyexperimenthubsoftware

5.41 score 1 packages 17 scripts 632 downloads

MiRaGE - MiRNA Ranking by Gene Expression

The package contains functions for inference of target gene regulation by miRNA, based only on target gene expression profiles.

Last updated 23 days ago

immunooncologymicroarraygeneexpressionrnaseqsequencingsage

3.85 score 35 scripts 237 downloads

goseq - Gene Ontology analyser for RNA-seq and other length biased data

Detects Gene Ontology and/or other user defined categories which are over/under represented in RNA-seq data.

Last updated 23 days ago

immunooncologysequencinggogeneexpressiontranscriptionrnaseqdifferentialexpressionannotationgenesetenrichmentkeggpathwayssoftware

9.71 score 9 packages 556 scripts 2.1k downloads

PICS - Probabilistic inference of ChIP-seq

Probabilistic inference of ChIP-Seq using an empirical Bayes mixture model approach.

Last updated 23 days ago

clusteringvisualizationsequencingchipseq

6.26 score 1 packages 7 scripts 310 downloads

frma - Frozen RMA and Barcode

Preprocessing and analysis for single microarrays and microarray batches.

Last updated 23 days ago

softwaremicroarraypreprocessing

4.72 score 1 packages 87 scripts 354 downloads

DEGseq - Identify Differentially Expressed Genes from RNA-seq data

DEGseq is an R package to identify differentially expressed genes from RNA-Seq data.

Last updated 23 days ago

rnaseqpreprocessinggeneexpressiondifferentialexpressionimmunooncology

4.16 score 48 scripts 332 downloads

frmaTools - Frozen RMA Tools

Tools for advanced use of the frma package.

Last updated 23 days ago

softwaremicroarraypreprocessing

3.90 score 6 scripts 286 downloads

hyperdraw - Visualizing Hypergraphs

Functions for visualizing hypergraphs.

Last updated 23 days ago

visualizationgraphandnetwork

3.78 score 1 packages 3 scripts 354 downloads

genomes - Genome sequencing project metadata

Download genome and assembly reports from NCBI

Last updated 23 days ago

annotationgenetics

3.48 score 15 scripts 253 downloads

GSRI - Gene Set Regulation Index

The GSRI package estimates the number of differentially expressed genes in gene sets, utilizing the concept of the Gene Set Regulation Index (GSRI).

Last updated 23 days ago

microarraytranscriptiondifferentialexpressiongenesetenrichmentgeneregulation

3.30 score 2 scripts 268 downloads

qpgraph - Estimation of genetic and molecular regulatory networks from high-throughput genomics data

Estimate gene and eQTL networks from high-throughput expression and genotyping assays.

Last updated 23 days ago

microarraygeneexpressiontranscriptionpathwaysnetworkinferencegraphandnetworkgeneregulationgeneticsgeneticvariabilitysnpsoftware

7.08 score 3 packages 20 scripts 421 downloads

RPA - RPA: Robust Probabilistic Averaging for probe-level analysis

Probabilistic analysis of probe reliability and differential gene expression on short oligonucleotide arrays.

Last updated 23 days ago

geneexpressionmicroarraypreprocessingqualitycontrol

5.78 score 1 packages 20 scripts 286 downloads

chipseq - chipseq: A package for analyzing chipseq data

Tools for helping process short read data for chipseq experiments.

Last updated 23 days ago

chipseqsequencingcoveragequalitycontroldataimport

5.43 score 5 packages 90 scripts 926 downloads

HilbertVis - Hilbert curve visualization

Functions to visualize long vectors of integer data by means of Hilbert curves

Last updated 23 days ago

visualization

4.40 score 1 packages 14 scripts 359 downloads

flagme - Analysis of Metabolomics GC/MS Data

Fragment-level analysis of gas chromatography-mass spectrometry metabolomics data.

Last updated 23 days ago

differentialexpressionmassspectrometry

4.30 score 2 scripts 230 downloads

BicARE - Biclustering Analysis and Results Exploration

Biclustering Analysis and Results Exploration.

Last updated 23 days ago

microarraytranscriptionclustering

4.08 score 2 packages 2 scripts 388 downloads

AgiMicroRna - Processing and Differential Expression Analysis of Agilent microRNA chips

Processing and Analysis of Agilent microRNA data

Last updated 23 days ago

microarrayagilentchiponechannelpreprocessingdifferentialexpression

3.30 score 9 scripts 452 downloads

edgeR - Empirical Analysis of Digital Gene Expression Data in R

Differential expression analysis of RNA-seq expression profiles with biological replication. Implements a range of statistical methodology based on the negative binomial distribution, including empirical Bayes estimation, exact tests, generalized linear models and quasi-likelihood tests. As well as RNA-seq, it can be applied to differential signal analysis of other types of genomic data that produce read counts, including ChIP-seq, ATAC-seq, Bisulfite-seq, SAGE and CAGE.

Last updated 23 days ago

geneexpressiontranscriptionalternativesplicingcoveragedifferentialexpressiondifferentialsplicingdifferentialmethylationgenesetenrichmentpathwaysgeneticsdnamethylationbayesianclusteringchipseqregressiontimecoursesequencingrnaseqbatcheffectsagenormalizationqualitycontrolmultiplecomparisonbiomedicalinformaticscellbiologyfunctionalgenomicsepigeneticsimmunooncologysystemsbiologytranscriptomicssinglecell

13.47 score 252 packages 16k scripts 43k downloads

minet - Mutual Information NETworks

This package implements various algorithms for inferring mutual information networks from data.

Last updated 23 days ago

microarraygraphandnetworknetworknetworkinference

6.14 score 16 packages 116 scripts 1.2k downloads

goProfiles - goProfiles: an R package for the statistical analysis of functional profiles

The package implements methods to compare lists of genes based on comparing the corresponding 'functional profiles'.

Last updated 23 days ago

annotationgogeneexpressiongenesetenrichmentgraphandnetworkmicroarraymultiplecomparisonpathwayssoftware

5.48 score 1 packages 6 scripts 340 downloads

HELP - Tools for HELP data analysis

The package contains a modular pipeline for analysis of HELP microarray data, and includes graphical and mathematical tools with more general applications.

Last updated 23 days ago

cpgislanddnamethylationmicroarraytwochanneldataimportqualitycontrolpreprocessingvisualization

5.15 score 70 scripts 248 downloads

SIM - Integrated Analysis on two human genomic datasets

Finds associations between two human genomic datasets.

Last updated 23 days ago

microarrayvisualization

4.30 score 3 scripts 274 downloads

ITALICS - ITALICS

A method to normalize Affymetrix GeneChip Human Mapping 100K and 500K sets

Last updated 23 days ago

microarraycopynumbervariation

4.08 score 337 downloads

CGHregions - Dimension Reduction for Array CGH Data with Minimal Information Loss.

Dimension Reduction for Array CGH Data with Minimal Information Loss

Last updated 23 days ago

microarraycopynumbervariationvisualization

3.72 score 26 scripts 284 downloads

CGHbase - CGHbase: Base functions and classes for arrayCGH data analysis.

Contains functions and classes that are needed by arrayCGH packages.

Last updated 23 days ago

infrastructuremicroarraycopynumbervariation

3.68 score 8 packages 3 scripts 738 downloads

KCsmart - Multi sample aCGH analysis package using kernel convolution

Multi sample aCGH analysis package using kernel convolution

Last updated 23 days ago

copynumbervariationvisualizationacghmicroarray

3.60 score 1 scripts 266 downloads

microRNA - Data and functions for dealing with microRNAs

Different data resources for microRNAs and some functions for manipulating them.

Last updated 23 days ago

infrastructuregenomeannotationsequencematching

3.30 score 7 scripts 374 downloads

PLPE - Local Pooled Error Test for Differential Expression with Paired High-throughput Data

This package performs tests for paired high-throughput data.

Last updated 23 days ago

proteomicsmicroarraydifferentialexpression

3.30 score 7 scripts 263 downloads

RBioinf - RBioinf

Functions and datasets and examples to accompany the monograph R For Bioinformatics.

Last updated 23 days ago

geneexpressionmicroarraypreprocessingqualitycontrolclassificationclusteringmultiplecomparisonannotation

3.30 score 2 scripts 250 downloads

xcms - LC-MS and GC-MS Data Analysis

Framework for processing and visualization of chromatographically separated and single-spectra mass spectral data. Imports from AIA/ANDI NetCDF, mzXML, mzData and mzML files. Preprocesses data for high-throughput, untargeted analyte profiling.

Last updated 23 days ago

immunooncologymassspectrometrymetabolomicsbioconductorfeature-detectionmass-spectrometrypeak-detection

14.35 score 185 stars 11 packages 832 scripts 3.3k downloads

AnnotationDbi - Manipulation of SQLite-based annotations in Bioconductor

Implements a user-friendly interface for querying SQLite-based annotation data packages.

Last updated 23 days ago

annotationmicroarraysequencinggenomeannotationbioconductor-packagecore-package

14.16 score 9 stars 770 packages 3.2k scripts 67k downloads

phyloseq - Handling and analysis of high-throughput microbiome census data

phyloseq provides a set of classes and tools to facilitate the import, storage, analysis, and graphical display of microbiome census data.

Last updated 23 days ago

immunooncologysequencingmicrobiomemetagenomicsclusteringclassificationmultiplecomparisongeneticvariability

13.86 score 584 stars 37 packages 8.2k scripts 9.1k downloads

BSgenome - Software infrastructure for efficient representation of full genomes and their SNPs

Infrastructure shared by all the Biostrings-based genome data packages.

Last updated 23 days ago

geneticsinfrastructuredatarepresentationsequencematchingannotationsnpbioconductor-packagecore-package

13.17 score 9 stars 268 packages 1.1k scripts 25k downloads

microbiome - Microbiome Analytics

Utilities for microbiome analysis.

Last updated 23 days ago

metagenomicsmicrobiomesequencingsystemsbiologyhitchiphitchip-atlashuman-microbiomemicrobiologymicrobiome-analysisphyloseqpopulation-study

12.36 score 289 stars 4 packages 1.8k scripts 2.5k downloads

ReactomePA - Reactome Pathway Analysis

This package provides functions for pathway analysis based on REACTOME pathway database. It implements enrichment analysis, gene set enrichment analysis and several functions for visualization. This package is not affiliated with the Reactome team.

Last updated 23 days ago

pathwaysvisualizationannotationmultiplecomparisongenesetenrichmentreactomeenrichment-analysisreactome-pathway-analysisreactomepa

12.19 score 37 stars 7 packages 1.0k scripts 4.0k downloads

preprocessCore - A collection of pre-processing functions

A library of core preprocessing routines.

Last updated 23 days ago

infrastructure

12.00 score 17 stars 214 packages 1.8k scripts 21k downloads

bumphunter - Bump Hunter

Tools for finding bumps in genomic data

Last updated 23 days ago

dnamethylationepigeneticsinfrastructuremultiplecomparisonimmunooncology

11.82 score 16 stars 50 packages 210 scripts 4.7k downloads

GenomicDataCommons - NIH / NCI Genomic Data Commons Access

Programmatically access the NIH / NCI Genomic Data Commons RESTful service.

Last updated 23 days ago

dataimportsequencingapi-clientbioconductorbioinformaticscancercore-servicesdata-sciencegenomicsncitcgavignette

11.67 score 84 stars 10 packages 181 scripts 1.6k downloads

VariantAnnotation - Annotation of Genetic Variants

Annotate variants, compute amino acid coding changes, predict coding outcomes.

Last updated 23 days ago

dataimportsequencingsnpannotationgeneticsvariantannotation

11.58 score 157 packages 1.8k scripts 16k downloads

graph - graph: A package to handle graph data structures

A package that implements some simple graph handling capabilities.

Last updated 23 days ago

graphandnetwork

11.55 score 342 packages 756 scripts 31k downloads

systemPipeR - systemPipeR: Workflow Environment for Data Analysis and Report Generation

systemPipeR is a multipurpose data analysis workflow environment that unifies R with command-line tools. It enables scientists to analyze many types of large- or small-scale data on local or distributed computer systems with a high level of reproducibility, scalability and portability. At its core is a command-line interface (CLI) that adopts the Common Workflow Language (CWL). This design allows users to choose for each analysis step the optimal R or command-line software. It supports both end-to-end and partial execution of workflows with built-in restart functionalities. Efficient management of complex analysis tasks is accomplished by a flexible workflow control container class. Handling of large numbers of input samples and experimental designs is facilitated by consistent sample annotation mechanisms. As a multi-purpose workflow toolkit, systemPipeR enables users to run existing workflows, customize them or design entirely new ones while taking advantage of widely adopted data structures within the Bioconductor ecosystem. Another important core functionality is the generation of reproducible scientific analysis and technical reports. For result interpretation, systemPipeR offers a wide range of plotting functionality, while an associated Shiny App offers many useful functionalities for interactive result exploration. The vignettes linked from this page include (1) a general introduction, (2) a description of technical details, and (3) a collection of workflow templates.

Last updated 23 days ago

geneticsinfrastructuredataimportsequencingrnaseqriboseqchipseqmethylseqsnpgeneexpressioncoveragegenesetenrichmentalignmentqualitycontrolimmunooncologyreportwritingworkflowstepworkflowmanagement

11.52 score 52 stars 3 packages 332 scripts 1.9k downloads

Rhdf5lib - hdf5 library as an R package

Provides C and C++ hdf5 libraries.

Last updated 23 days ago

infrastructurebioconductorhdf5hdf5-library

11.43 score 6 stars 338 packages 24 scripts 39k downloads

genefilter - genefilter: methods for filtering genes from high-throughput experiments

Some basic functions for filtering genes.

Last updated 23 days ago

microarray

11.31 score 152 packages 2.4k scripts 21k downloads

XVector - Foundation of external vector representation and manipulation in Bioconductor

Provides memory efficient S4 classes for storing sequences "externally" (e.g. behind an R external pointer, or on disk).

Last updated 23 days ago

infrastructuredatarepresentationbioconductor-packagecore-package

11.06 score 2 stars 1.7k packages 67 scripts 94k downloads

annotate - Annotation for microarrays

Using R environments for annotation.

Last updated 23 days ago

annotationpathwaysgo

10.74 score 256 packages 792 scripts 33k downloads

flowCore - flowCore: Basic structures for flow cytometry data

Provides S4 data structures and basic functions to deal with flow cytometry data.

Last updated 23 days ago

immunooncologyinfrastructureflowcytometrycellbasedassays

10.66 score 59 packages 1.8k scripts 4.7k downloads

GWASTools - Tools for Genome Wide Association Studies

Classes for storing very large GWAS data sets and annotation, and functions for GWAS data cleaning and analysis.

Last updated 23 days ago

snpgeneticvariabilityqualitycontrolmicroarray

10.45 score 16 stars 5 packages 380 scripts 1.1k downloads

illuminaio - Parsing Illumina Microarray Output Files

Tools for parsing Illumina's microarray output files, including IDAT.

Last updated 23 days ago

infrastructuredataimportmicroarrayproprietaryplatformsbioconductor

10.39 score 6 stars 44 packages 50 scripts 4.5k downloads

pRoloc - A unifying bioinformatics framework for spatial proteomics

The pRoloc package implements machine learning and visualisation methods for the analysis and interrogation of quantitative mass spectrometry data to reliably infer protein sub-cellular localisation.

Last updated 23 days ago

immunooncologyproteomicsmassspectrometryclassificationclusteringqualitycontrolbioconductorproteomics-dataspatial-proteomicsvisualisation

10.36 score 15 stars 2 packages 100 scripts 471 downloads

oligo - Preprocessing tools for oligonucleotide arrays

A package to analyze oligonucleotide arrays (expression/SNP/tiling/exon) at probe-level. It currently supports Affymetrix (CEL files) and NimbleGen arrays (XYS files).

Last updated 23 days ago

microarrayonechanneltwochannelpreprocessingsnpdifferentialexpressionexonarraygeneexpressiondataimport

10.29 score 3 stars 11 packages 532 scripts 2.5k downloads

GSEABase - Gene set enrichment data structures and methods

This package provides classes and methods to support Gene Set Enrichment Analysis (GSEA).

Last updated 23 days ago

geneexpressiongenesetenrichmentgraphandnetworkgokegg

10.27 score 80 packages 1.5k scripts 13k downloads

RUVSeq - Remove Unwanted Variation from RNA-Seq Data

This package implements the remove unwanted variation (RUV) methods of Risso et al. (2014) for the normalization of RNA-Seq read counts between samples.

Last updated 23 days ago

immunooncologydifferentialexpressionpreprocessingrnaseqsoftware

9.88 score 12 stars 5 packages 498 scripts 1.4k downloads

DEGreport - Report of DEG analysis

Creation of ready-to-share figures of differential expression analyses of count data. It integrates some of the code mentioned in DESeq2 and edgeR vignettes, and reports a ranked list of genes according to the mean and variability of the fold changes for each selected gene.

Last updated 23 days ago

differentialexpressionvisualizationrnaseqreportwritinggeneexpressionimmunooncologybioconductordifferential-expressionqcreportrna-seqsmallrna

9.53 score 24 stars 1 packages 340 scripts 1.1k downloads

DECIPHER - Tools for curating, analyzing, and manipulating biological sequences

A toolset for deciphering and managing biological sequences.

Last updated 23 days ago

clusteringgeneticssequencingdataimportvisualizationmicroarrayqualitycontrolqpcralignmentwholegenomemicrobiomeimmunooncologygeneprediction

9.38 score 13 packages 828 scripts 4.0k downloads

BASiCS - Bayesian Analysis of Single-Cell Sequencing data

Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model to perform statistical analyses of single-cell RNA sequencing datasets in the context of supervised experiments (where the groups of cells of interest are known a priori, e.g. experimental conditions or cell types). BASiCS performs built-in data normalisation (global scaling) and technical noise quantification (based on spike-in genes). BASiCS provides an intuitive detection criterion for highly (or lowly) variable genes within a single group of cells. Additionally, BASiCS can compare gene expression patterns between two or more pre-specified groups of cells. Unlike traditional differential expression tools, BASiCS quantifies changes in expression that lie beyond comparisons of means, also allowing the study of changes in cell-to-cell heterogeneity. The latter can be quantified via a biological over-dispersion parameter that measures the excess of variability that is observed with respect to Poisson sampling noise, after normalisation and technical noise removal. Due to the strong mean/over-dispersion confounding that is typically observed for scRNA-seq datasets, BASiCS also tests for changes in residual over-dispersion, defined by residual values with respect to a global mean/over-dispersion trend.

Last updated 23 days ago

immunooncologynormalizationsequencingrnaseqsoftwaregeneexpressiontranscriptomicssinglecelldifferentialexpressionbayesiancellbiologybioconductor-packagegene-expressionrcpprcpparmadilloscrna-seqsingle-cell

9.36 score 84 stars 1 packages 366 scripts 405 downloads

IsoformSwitchAnalyzeR - Identify, Annotate and Visualize Isoform Switches with Functional Consequences from both short- and long-read RNA-seq data

Analysis of alternative splicing and isoform switches with predicted functional consequences (e.g. gain/loss of protein domains etc.) from quantification of all types of RNASeq by tools such as Kallisto, Salmon, StringTie, Cufflinks/Cuffdiff etc.

Last updated 23 days ago

geneexpressiontranscriptionalternativesplicingdifferentialexpressiondifferentialsplicingvisualizationstatisticalmethodtranscriptomevariantbiomedicalinformaticsfunctionalgenomicssystemsbiologytranscriptomicsrnaseqannotationfunctionalpredictiongenepredictiondataimportmultiplecomparisonbatcheffectimmunooncology

9.20 score 101 stars 117 scripts 592 downloads

scone - Single Cell Overview of Normalized Expression data

SCONE is an R package for comparing and ranking the performance of different normalization schemes for single-cell RNA-seq and other high-throughput analyses.

Last updated 23 days ago

immunooncologynormalizationpreprocessingqualitycontrolgeneexpressionrnaseqsoftwaretranscriptomicssequencingsinglecellcoverage

9.00 score 53 stars 104 scripts 373 downloads

impute - impute: Imputation for microarray data

Imputation for microarray data (currently KNN only)

Last updated 23 days ago

microarray

8.97 score 131 packages 808 scripts 15k downloads

GenomicScores - Infrastructure to work with genomewide position-specific scores

Provide infrastructure to store and access genomewide position-specific scores within R and Bioconductor.

Last updated 23 days ago

infrastructuregeneticsannotationsequencingcoverageannotationhubsoftware

8.78 score 8 stars 6 packages 83 scripts 1.2k downloads

SeqVarTools - Tools for variant data

An interface to the fast-access storage format for VCF data provided in SeqArray, with tools for common operations and analysis.

Last updated 23 days ago

snpgeneticvariabilitysequencinggenetics

8.75 score 3 stars 2 packages 368 scripts 759 downloads

SCnorm - Normalization of single cell RNA-seq data

This package implements SCnorm — a method to normalize single-cell RNA-seq data.

Last updated 23 days ago

normalizationrnaseqsinglecellimmunooncology

8.48 score 47 stars 80 scripts 280 downloads

geneplotter - Graphics related functions for Bioconductor

Functions for plotting genomic data

Last updated 23 days ago

visualization

8.46 score 10 packages 252 scripts 9.5k downloads

TitanCNA - Subclonal copy number and LOH prediction from whole genome sequencing of tumours

Hidden Markov model to segment and predict regions of subclonal copy number alterations (CNA) and loss of heterozygosity (LOH), and estimate cellular prevalence of clonal clusters in tumour whole genome sequencing data.

Last updated 23 days ago

sequencingwholegenomednaseqexomeseqstatisticalmethodcopynumbervariationhiddenmarkovmodelgeneticsgenomicvariationimmunooncology10x-genomicscopy-number-variationgenome-sequencinghmmtumor-heterogeneity

8.45 score 94 stars 67 scripts 254 downloads

survcomp - Performance Assessment and Comparison for Survival Analysis

Assessment and Comparison for Performance of Risk Prediction (Survival) Models.

Last updated 23 days ago

geneexpressiondifferentialexpressionvisualization

8.45 score 12 packages 444 scripts 1.8k downloads

piano - Platform for integrative analysis of omics data

Piano performs gene set analysis using various statistical methods, from different gene level statistics and a wide range of gene-set collections. Furthermore, the Piano package contains functions for combining the results of multiple runs of gene set analyses.

Last updated 23 days ago

microarraypreprocessingqualitycontroldifferentialexpressionvisualizationgeneexpressiongenesetenrichmentpathwaysbioconductorbioconductor-packagebioinformaticsgene-set-enrichmenttranscriptomics

8.29 score 13 stars 7 packages 179 scripts 796 downloads

MSstats - Protein Significance Analysis in DDA, SRM and DIA for Label-free or Label-based Proteomics Experiments

A set of tools for statistical relative protein significance analysis in DDA, SRM and DIA experiments.

Last updated 23 days ago

immunooncologymassspectrometryproteomicssoftwarenormalizationqualitycontroltimecourse

8.19 score 6 packages 142 scripts 806 downloads

wateRmelon - Illumina DNA methylation array normalization and metrics

15 flavours of betas and three performance metrics, with methods for objects produced by methylumi and minfi packages.

Last updated 23 days ago

dnamethylationmicroarraytwochannelpreprocessingqualitycontrol

8.11 score 4 packages 239 scripts 1.5k downloads

apeglm - Approximate posterior estimation for GLM coefficients

apeglm provides Bayesian shrinkage estimators for effect sizes for a variety of GLM models, using approximation of the posterior for individual coefficients.

Last updated 23 days ago

immunooncologysequencingrnaseqdifferentialexpressiongeneexpressionbayesian

7.58 score 9 packages 648 scripts 5.5k downloads

openCyto - Hierarchical Gating Pipeline for flow cytometry data

This package is designed to facilitate automated gating methods in a sequential way to mimic the manual gating strategy.

Last updated 23 days ago

immunooncologyflowcytometrydataimportpreprocessingdatarepresentation

7.54 score 1 packages 350 scripts 1.1k downloads

metaMS - MS-based metabolomics annotation pipeline

MS-based metabolomics data processing and compound annotation pipeline.

Last updated 23 days ago

immunooncologymassspectrometrymetabolomics

7.47 score 15 stars 14 scripts 417 downloads

EBSeq - An R package for gene and isoform differential expression analysis of RNA-seq data

Differential Expression analysis at both gene and isoform level using RNA-seq data

Last updated 23 days ago

immunooncologystatisticalmethoddifferentialexpressionmultiplecomparisonrnaseqsequencing

7.46 score 6 packages 159 scripts 680 downloads

shinyMethyl - Interactive visualization for Illumina methylation arrays

Interactive tool for visualizing Illumina methylation array data. Both the 450k and EPIC array are supported.

Last updated 23 days ago

dnamethylationmicroarraytwochannelpreprocessingqualitycontrolmethylationarray

7.42 score 5 stars 38 scripts 422 downloads

flowViz - Visualization for flow cytometry

Provides visualization tools for flow cytometry data.

Last updated 23 days ago

immunooncologyinfrastructureflowcytometrycellbasedassaysvisualization

7.42 score 12 packages 226 scripts 1.6k downloads

orthogene - Interspecies gene mapping

orthogene is an R package for easy mapping of orthologous genes across hundreds of species. It pulls up-to-date gene ortholog mappings across 700+ organisms. It also provides various utility functions to aggregate/expand common objects (e.g. data.frames, gene expression matrices, lists) using 1:1, many:1, 1:many or many:many gene mappings, both within- and between-species.

Last updated 23 days ago

geneticscomparativegenomicspreprocessingphylogeneticstranscriptomicsgeneexpressionanimal-modelsbioconductorbioconductor-packagebioinformaticsbiomedicinecomparative-genomicsevolutionary-biologygenesgenomicsontologiestranslational-research

7.41 score 40 stars 1 packages 24 scripts 524 downloads

gcrma - Background Adjustment Using Sequence Information

Background adjustment using sequence information

Last updated 23 days ago

microarrayonechannelpreprocessing

7.33 score 12 packages 162 scripts 1.8k downloads

cytolib - C++ infrastructure for representing and interacting with the gated cytometry data

This package provides the core data structure and API to represent and interact with the gated cytometry data.

Last updated 23 days ago

immunooncologyflowcytometrydataimportpreprocessingdatarepresentation

7.29 score 60 packages 6 scripts 4.3k downloads

ROC - utilities for ROC, with microarray focus

Provide utilities for ROC, with microarray focus.

Last updated 23 days ago

differentialexpression

7.07 score 10 packages 70 scripts 1.9k downloads

Category - Category Analysis

A collection of tools for performing category (gene set enrichment) analysis.

Last updated 23 days ago

annotationgopathwaysgenesetenrichment

7.07 score 19 packages 204 scripts 2.5k downloads

GOstats - Tools for manipulating GO and microarrays

A set of tools for interacting with GO and microarray data. A variety of basic manipulation tools for graphs, hypothesis testing and other simple calculations.

Last updated 23 days ago

annotationgomultiplecomparisongeneexpressionmicroarraypathwaysgenesetenrichmentgraphandnetwork

6.99 score 14 packages 516 scripts 2.3k downloads

viper - Virtual Inference of Protein-activity by Enriched Regulon analysis

Inference of protein activity from gene expression data, including the VIPER and msVIPER algorithms

Last updated 23 days ago

systemsbiologynetworkenrichmentgeneexpressionfunctionalpredictiongeneregulation

6.95 score 5 packages 324 scripts 1.0k downloads

regionReport - Generate HTML or PDF reports for a set of genomic regions or DESeq2/edgeR results

Generate HTML or PDF reports to explore a set of regions such as the results from annotation-agnostic expression analysis of RNA-seq data at base-pair resolution performed by derfinder. You can also create reports for DESeq2 or edgeR results.

Last updated 23 days ago

differentialexpressionsequencingrnaseqsoftwarevisualizationtranscriptioncoveragereportwritingdifferentialmethylationdifferentialpeakcallingimmunooncologyqualitycontrolbioconductorderfinderdeseq2edgerregionreportrmarkdown

6.90 score 9 stars 44 scripts 306 downloads

FlowSOM - Using self-organizing maps for visualization and interpretation of cytometry data

FlowSOM offers visualization options for cytometry data, by using Self-Organizing Map clustering and Minimal Spanning Trees.

Last updated 23 days ago

cellbiologyflowcytometryclusteringvisualizationsoftwarecellbasedassays

6.69 score 10 packages 466 scripts 1.8k downloads

BioCor - Functional similarities

Calculates functional similarities based on the pathways described on KEGG and REACTOME or in gene sets. These similarities can be calculated for pathways or gene sets, genes, or clusters and combined with other similarities. They can be used to improve networks, gene selection, testing relationships...

Last updated 23 days ago

statisticalmethodclusteringgeneexpressionnetworkpathwaysnetworkenrichmentsystemsbiologybioconductor-packagesbioinformaticsfunctional-similaritygenegene-setspathway-analysissimilaritysimilarity-measurement

6.59 score 14 stars 330 downloads

vidger - Create rapid visualizations of RNAseq data in R

The aim of vidger is to rapidly generate information-rich visualizations for the interpretation of differential gene expression results from three widely-used tools: Cuffdiff, DESeq2, and edgeR.

Last updated 23 days ago

immunooncologyvisualizationrnaseqdifferentialexpressiongeneexpressiondata-mungingdifferential-expressiongene-expressionrna-seq-analysis

6.59 score 18 stars 27 scripts 270 downloads

chimeraviz - Visualization tools for gene fusions

chimeraviz manages data from fusion gene finders and provides useful visualization tools.

Last updated 23 days ago

infrastructurealignment

6.59 score 37 stars 14 scripts 288 downloads

MoonlightR - Identify oncogenes and tumor suppressor genes from omics data

Motivation: The understanding of cancer mechanism requires the identification of genes playing a role in the development of the pathology and the characterization of their role (notably oncogenes and tumor suppressors). Results: We present an R/bioconductor package called MoonlightR which returns a list of candidate driver genes for specific cancer types on the basis of TCGA expression data. The method first infers gene regulatory networks and then carries out a functional enrichment analysis (FEA) (implementing an upstream regulator analysis, URA) to score the importance of well-known biological processes with respect to the studied cancer type. Eventually, by means of random forests, MoonlightR predicts two specific roles for the candidate driver genes: i) tumor suppressor genes (TSGs) and ii) oncogenes (OCGs). As a consequence, this methodology does not only identify genes playing a dual role (e.g. TSG in one cancer type and OCG in another) but also helps in elucidating the biological processes underlying their specific roles. In particular, MoonlightR can be used to discover OCGs and TSGs in the same cancer type. This may help in answering the question whether some genes change role between early stages (I, II) and late stages (III, IV) in breast cancer. In the future, this analysis could be useful to determine the causes of different resistances to chemotherapeutic treatments.

Last updated 23 days ago

dnamethylationdifferentialmethylationgeneregulationgeneexpressionmethylationarraydifferentialexpressionpathwaysnetworksurvivalgenesetenrichmentnetworkenrichment

6.57 score 17 stars 224 downloads

GENIE3 - GEne Network Inference with Ensemble of trees

This package implements the GENIE3 algorithm for inferring gene regulatory networks from expression data.

Last updated 23 days ago

networkinferencesystemsbiologydecisiontreeregressionnetworkgraphandnetworkgeneexpression

6.57 score 4 packages 166 scripts 1.9k downloads

M3C - Monte Carlo Reference-based Consensus Clustering

M3C is a consensus clustering algorithm that uses a Monte Carlo simulation to eliminate overestimation of K and can reject the null hypothesis K=1.

Last updated 23 days ago

clusteringgeneexpressiontranscriptionrnaseqsequencingimmunooncology

6.53 score 1 packages 152 scripts 1.1k downloads

DMRcate - Methylation array and sequencing spatial analysis methods

De novo identification and extraction of differentially methylated regions (DMRs) from the human genome using Whole Genome Bisulfite Sequencing (WGBS) and Illumina Infinium Array (450K and EPIC) data. Provides functionality for filtering probes possibly confounded by SNPs and cross-hybridisation. Includes GRanges generation and plotting functions.

Last updated 23 days ago

differentialmethylationgeneexpressionmicroarraymethylationarraygeneticsdifferentialexpressiongenomeannotationdnamethylationonechanneltwochannelmultiplecomparisonqualitycontroltimecoursesequencingwholegenomeepigeneticscoveragepreprocessingdataimport

6.47 score 1 packages 302 scripts 1.9k downloads

OmaDB - R wrapper for the OMA REST API

A package for downloading orthology prediction data from the OMA database.

Last updated 23 days ago

softwarecomparativegenomicsfunctionalgenomicsgeneticsannotationgofunctionalprediction

6.23 score 2 stars 5 scripts 484 downloads

BrowserViz - BrowserViz: interactive R/browser graphics using websockets and JSON

Interactive graphics in a web browser from R, using websockets and JSON.

Last updated 23 days ago

visualizationthirdpartyclient

6.23 score 2 stars 2 packages 20 scripts 314 downloads

CNORode - ODE add-on to CellNOptR

Logic based ordinary differential equation (ODE) add-on to CellNOptR.

Last updated 23 days ago

immunooncologycellbasedassayscellbiologyproteomicsbioinformaticstimecourse

6.22 score 1 packages 37 scripts 242 downloads

mpra - Analyze massively parallel reporter assays

Tools for data management, count preprocessing, and differential analysis in massively parallel reporter assays (MPRA).

Last updated 23 days ago

softwaregeneregulationsequencingfunctionalgenomics

6.14 score 5 stars 13 scripts 212 downloads

seqcombo - Visualization Tool for Genetic Reassortment

Provides useful functions for visualizing virus reassortment events.

Last updated 23 days ago

alignmentsoftwarevisualization

6.10 score 21 stars 4 scripts 175 downloads

DiffBind - Differential Binding Analysis of ChIP-Seq Peak Data

Compute differentially bound sites from multiple ChIP-seq experiments using affinity (quantitative) data. Also enables occupancy (overlap) analysis and plotting functions.

Last updated 23 days ago

sequencingchipseqatacseqdnaseseqmethylseqripseqdifferentialpeakcallingdifferentialmethylationgeneregulationhistonemodificationpeakdetectionbiomedicalinformaticscellbiologymultiplecomparisonnormalizationreportwritingepigeneticsfunctionalgenomics

6.06 score 2 packages 460 scripts 2.1k downloads

tkWidgets - R based tk widgets

Widgets to provide user interfaces. tcltk must be installed for the widgets to run.

Last updated 23 days ago

infrastructure

6.03 score 6 packages 72 scripts 2.1k downloads

PROcess - Ciphergen SELDI-TOF Processing

A package for processing protein mass spectrometry data.

Last updated 23 days ago

immunooncologymassspectrometryproteomics

6.02 score 528 scripts 294 downloads

rpx - R Interface to the ProteomeXchange Repository

The rpx package implements an interface to proteomics data submitted to the ProteomeXchange consortium.

Last updated 23 days ago

immunooncologyproteomicsmassspectrometrydataimportthirdpartyclientbioconductordatamass-spectrometryproteomexchange

6.00 score 5 stars 20 scripts 496 downloads

switchde - Switch-like differential expression across single-cell trajectories

Inference and detection of switch-like differential expression across single-cell RNA-seq trajectories.

Last updated 23 days ago

immunooncologysoftwaretranscriptomicsgeneexpressionrnaseqregressiondifferentialexpressionsinglecellgene-expressiongenomicssingle-cell

5.98 score 19 stars 6 scripts 185 downloads

bioCancer - Interactive Multi-Omics Cancers Data Visualization and Analysis

This package is a Shiny app for interactively visualizing and analysing multi-assay cancer genomic data.

Last updated 23 days ago

guidatarepresentationnetworkmultiplecomparisonpathwaysreactomevisualizationgeneexpressiongenetargetanalysisbiocancer-interfacecancercancer-studiesrmarkdown

5.95 score 20 stars 7 scripts 201 downloads

cqn - Conditional quantile normalization

A normalization tool for RNA-Seq data, implementing the conditional quantile normalization method.

Last updated 23 days ago

immunooncologyrnaseqpreprocessingdifferentialexpression

5.94 score 4 packages 240 scripts 508 downloads

MetaNeighbor - Single cell replicability analysis

MetaNeighbor allows users to quantify cell type replicability across datasets using neighbor voting.

Last updated 23 days ago

immunooncologygeneexpressiongomultiplecomparisonsinglecelltranscriptomics

5.89 score 77 scripts 322 downloads

regsplice - L1-regularization based methods for detection of differential splicing

Statistical methods for detection of differential splicing (differential exon usage) in RNA-seq and exon microarray data, using L1-regularization (lasso) to improve power.

Last updated 23 days ago

immunooncologyalternativesplicingdifferentialexpressiondifferentialsplicingsequencingrnaseqmicroarrayexonarrayexperimentaldesignsoftware

5.86 score 3 stars 27 scripts 171 downloads

rexposome - Exposome exploration and outcome data analysis

A package for exploring the exposome and performing association analyses between exposures and health outcomes.

Last updated 23 days ago

softwarebiologicalquestioninfrastructuredataimportdatarepresentationbiomedicalinformaticsexperimentaldesignmultiplecomparisonclassificationclustering

5.79 score 1 packages 23 scripts 222 downloads

netresponse - Functional Network Analysis

Algorithms for functional network analysis. Includes an implementation of a variational Dirichlet process Gaussian mixture model for nonparametric mixture modeling.

Last updated 23 days ago

cellbiologyclusteringgeneexpressiongeneticsnetworkgraphandnetworkdifferentialexpressionmicroarraynetworkinferencetranscription

5.64 score 3 stars 21 scripts 245 downloads

statTarget - Statistical Analysis of Molecular Profiles

A streamlined tool that provides a graphical user interface for quality control based signal drift correction (QC-RFSC), integration of data from multi-batch MS-based experiments, and comprehensive statistical analysis in metabolomics and proteomics.

Last updated 23 days ago

immunooncologymetabolomicsproteomicsmachine learninglipidomicsmassspectrometryqualitycontrolnormalizationqc-rfsccombatdifferentialexpressionbatcheffectvisualizationmultiplecomparisonpreprocessingsoftware

5.64 score 24 scripts 241 downloads

pepStat - Statistical analysis of peptide microarrays

Statistical analysis of peptide microarrays

Last updated 23 days ago

microarraypreprocessing

5.62 score 7 stars 4 scripts 217 downloads

qusage - qusage: Quantitative Set Analysis for Gene Expression

This package is an implementation of the Quantitative Set Analysis for Gene Expression (QuSAGE) method described in (Yaari G. et al, Nucl Acids Res, 2013). This is a novel Gene Set Enrichment-type test, which is designed to provide a faster, more accurate, and easier to understand test for gene expression studies. qusage accounts for inter-gene correlations using the Variance Inflation Factor technique proposed by Wu et al. (Nucleic Acids Res, 2012). In addition, rather than simply evaluating the deviation from a null hypothesis with a single number (a P value), qusage quantifies gene set activity with a complete probability density function (PDF). From this PDF, P values and confidence intervals can be easily extracted. Preserving the PDF also allows for post-hoc analysis (e.g., pair-wise comparisons of gene set activity) while maintaining statistical traceability. Finally, while qusage is compatible with individual gene statistics from existing methods (e.g., LIMMA), a Welch-based method is implemented that is shown to improve specificity. The QuSAGE package also includes a mixed effects model implementation, as described in (Turner JA et al, BMC Bioinformatics, 2015), and a meta-analysis framework as described in (Meng H, et al. PLoS Comput Biol. 2019). For questions, contact Chris Bolen or Steven Kleinstein.

Last updated 23 days ago

genesetenrichmentmicroarrayrnaseqsoftwareimmunooncology

5.60 score 1 packages 167 scripts 632 downloads

omicade4 - Multiple co-inertia analysis of omics datasets

This package performs multiple co-inertia analysis of omics datasets.

Last updated 23 days ago

softwareclusteringclassificationmultiplecomparison

5.47 score 1 packages 49 scripts 426 downloads

GeneOverlap - Test and visualize gene overlaps

Test two sets of gene lists and visualize the results.

Last updated 23 days ago

multiplecomparisonvisualization

5.41 score 258 scripts 884 downloads

eiR - Accelerated similarity searching of small molecules

The eiR package provides utilities for accelerated structure similarity searching of very large small molecule data sets using an embedding and indexing approach.

Last updated 23 days ago

cheminformaticsbiomedicalinformaticspharmacogeneticspharmacogenomicsmicrotitreplateassaycellbasedassaysvisualizationinfrastructuredataimportclusteringproteomicsmetabolomics

5.33 score 3 stars 12 scripts 221 downloads

nucleR - Nucleosome positioning package for R

Nucleosome positioning for Tiling Arrays and NGS experiments.

Last updated 23 days ago

nucleosomepositioningcoveragechipseqmicroarraysequencinggeneticsqualitycontroldataimport

5.32 score 21 scripts 285 downloads

GlobalAncova - Global test for groups of variables via model comparisons

The association between a variable of interest (e.g. two groups) and the global pattern of a group of variables (e.g. a gene set) is tested via a global F-test. We give the following arguments in support of the GlobalAncova approach: After appropriate normalisation, gene expression data appear rather symmetrical and outliers are no real problem, so least squares should be rather robust. ANCOVA with interaction yields saturated data modelling, e.g. different means per group and gene. Covariate adjustment can help to correct for possible selection bias. Variance homogeneity and uncorrelated residuals cannot be expected. Application of ordinary least squares gives unbiased, but no longer optimal estimates (Gauss-Markov-Aitken). Therefore, using the classical F-test is inappropriate, due to correlation. The test statistic, however, mirrors deviations from the null hypothesis. In combination with a permutation approach, empirical significance levels can be approximated. Alternatively, an approximation yields asymptotic p-values. The framework is generalized to groups of categorical variables or even mixed data by a likelihood ratio approach. Closed and hierarchical testing procedures are supported. This work was supported by the NGFN grant 01 GR 0459, BMBF, Germany and BMBF grant 01ZX1309B, Germany.

Last updated 23 days ago

microarrayonechanneldifferentialexpressionpathwaysregression

5.31 score 1 packages 9 scripts 1.7k downloads

muscle - Multiple Sequence Alignment with MUSCLE

MUSCLE performs multiple sequence alignments of nucleotide or amino acid sequences.

Last updated 23 days ago

multiplesequencealignmentalignmentsequencinggeneticssequencematchingdataimport

5.21 score 82 scripts 544 downloads

flowDensity - Sequential Flow Cytometry Data Gating

This package provides tools for automated sequential gating analogous to the manual gating strategy based on the density of the data.

Last updated 23 days ago

bioinformaticsflowcytometrycellbiologyclusteringcancerflowcytdatadatarepresentationstemcelldensitygating

5.17 score 3 packages 83 scripts 426 downloads

MouseFM - In-silico methods for genetic finemapping in inbred mice

This package provides methods for genetic finemapping in inbred mice by taking advantage of their very high homozygosity rate (>95%).

Last updated 23 days ago

geneticssnpgenetargetvariantannotationgenomicvariationmultiplecomparisonsystemsbiologymathematicalbiologypatternlogicgenepredictionbiomedicalinformaticsfunctionalgenomicsfinemapgene-candidatesinbred-miceinbred-strainsmouseqtlqtl-mapping

5.13 score 5 scripts 359 downloads

ROntoTools - R Onto-Tools suite

Suite of tools for functional analysis.

Last updated 23 days ago

networkanalysismicroarraygraphsandnetworks

5.10 score 2 packages 15 scripts 350 downloads

MANOR - CGH Micro-Array NORmalization

Importation, normalization, visualization, and quality control functions to correct identified sources of variability in array-CGH experiments.

Last updated 23 days ago

microarraytwochanneldataimportqualitycontrolpreprocessingcopynumbervariationnormalization

5.10 score 1 scripts 352 downloads

tRNAscanImport - Importing a tRNAscan-SE result file as GRanges object

The package imports the result of tRNAscan-SE as a GRanges object.

Last updated 23 days ago

softwaredataimportworkflowsteppreprocessingvisualizationbioconductorsequencesstructurestrnatrnascantrnascan-se

5.08 score 2 stars 3 scripts 228 downloads

EGSEA - Ensemble of Gene Set Enrichment Analyses

This package implements the Ensemble of Gene Set Enrichment Analyses (EGSEA) method for gene set testing. The EGSEA algorithm uses the analysis results of twelve prominent GSE algorithms in the literature to calculate collective significance scores for each gene set.

Last updated 23 days ago

immunooncologydifferentialexpressiongogeneexpressiongenesetenrichmentgeneticsmicroarraymultiplecomparisononechannelpathwaysrnaseqsequencingsoftwaresystemsbiologytwochannelmetabolomicsproteomicskegggraphandnetworkgenesignalinggenetargetnetworkenrichmentnetworkclassification

5.08 score 60 scripts 509 downloads

epivizrData - Data Management API for epiviz interactive visualization app

Serve data from Bioconductor Objects through a WebSocket connection.

Last updated 23 days ago

infrastructurevisualization

5.08 score 1 stars 4 packages 4 scripts 230 downloads

widgetTools - Creates an interactive tcltk widget

This package contains tools to support the construction of tcltk widgets.

Last updated 23 days ago

infrastructure

5.03 score 8 packages 11 scripts 2.0k downloads

ReadqPCR - Read qPCR data

The package provides functions to read raw RT-qPCR data of different platforms.

Last updated 23 days ago

dataimportmicrotitreplateassaygeneexpressionqpcr

4.98 score 1 packages 16 scripts 367 downloads

cummeRbund - Analysis, exploration, manipulation, and visualization of Cufflinks high-throughput sequencing data.

Allows for persistent storage, access, exploration, and manipulation of Cufflinks high-throughput sequencing data. In addition, provides numerous plotting functions for commonly used visualizations.

Last updated 23 days ago

highthroughputsequencinghighthroughputsequencingdatarnaseqrnaseqdatageneexpressiondifferentialexpressioninfrastructuredataimportdatarepresentationvisualizationbioinformaticsclusteringmultiplecomparisonsqualitycontrol

4.92 score 209 scripts 582 downloads

MAGeCKFlute - Integrative Analysis Pipeline for Pooled CRISPR Functional Genetic Screens

CRISPR (clustered regularly interspaced short palindromic repeats) coupled with nuclease Cas9 (CRISPR/Cas9) screens represent a promising technology to systematically evaluate gene functions. Data analysis for CRISPR/Cas9 screens is a critical process that includes identifying screen hits and exploring biological functions for these hits in downstream analysis. We have previously developed two algorithms, MAGeCK and MAGeCK-VISPR, to analyze CRISPR/Cas9 screen data in various scenarios. These two algorithms allow users to perform quality control, read count generation and normalization, and calculate beta scores to evaluate gene selection performance. In downstream analysis, biological functional analysis is required for understanding the biological functions of the identified genes under different screening purposes. Here, we developed MAGeCKFlute to support downstream analysis. MAGeCKFlute provides several strategies to remove potential biases within sgRNA-level read counts and gene-level beta scores. The downstream analysis with the package includes identifying essential, non-essential, and target-associated genes, and performing biological functional category analysis, pathway enrichment analysis and protein complex enrichment analysis of these genes. The package also visualizes genes in multiple ways to help users explore screening data. Collectively, MAGeCKFlute enables accurate identification of essential, non-essential, and targeted genes, as well as their related biological functions. This vignette explains the use of the package and demonstrates typical workflows.

Last updated 23 days ago

functionalgenomicscrisprpooledscreensqualitycontrolnormalizationgenesetenrichmentpathwaysvisualizationgenetargetkegg

4.89 score 1 packages 52 scripts 740 downloads

ctc - Cluster and Tree Conversion.

Tools for exporting and importing classification trees and clusters to and from other programs.

Last updated 23 days ago

microarrayclusteringclassificationdataimportvisualization

4.87 score 2 packages 62 scripts 554 downloads

bioDist - Different distance measures

A collection of software tools for calculating distance measures.

Last updated 23 days ago

clusteringclassification

4.86 score 2 packages 61 scripts 529 downloads

bigmelon - Illumina methylation array analysis for large experiments

Methods for working with Illumina arrays using gdsfmt.

Last updated 23 days ago

dnamethylationmicroarraytwochannelpreprocessingqualitycontrolmethylationarraydataimportcpgisland

4.86 score 24 scripts 302 downloads

IMAS - Integrative analysis of Multi-omics data for Alternative Splicing

Integrative analysis of Multi-omics data for Alternative splicing.

Last updated 23 days ago

immunooncologyalternativesplicingdifferentialexpressiondifferentialsplicinggeneexpressiongeneregulationregressionrnaseqsequencingsnpsoftwaretranscription

4.85 score 1 scripts 210 downloads

MethylSeekR - Segmentation of Bis-seq data

This is a package for the discovery of regulatory regions from Bis-seq data

Last updated 23 days ago

sequencingmethylseqdnamethylation

4.82 score 33 scripts 350 downloads

interactiveDisplayBase - Base package for enabling powerful shiny web displays of Bioconductor objects

The interactiveDisplayBase package contains the basic methods needed to generate interactive Shiny-based display methods for Bioconductor objects.

Last updated 23 days ago

gogeneexpressionmicroarraysequencingclassificationnetworkqualitycontrolvisualizationgeneticsdatarepresentationguiannotationdatashinyapps

4.78 score 1 packages 5 scripts 10k downloads

IVAS - Identification of genetic Variants affecting Alternative Splicing

Identification of genetic variants affecting alternative splicing.

Last updated 23 days ago

immunooncologyalternativesplicingdifferentialexpressiondifferentialsplicinggeneexpressiongeneregulationregressionrnaseqsequencingsnpsoftwaretranscription

4.78 score 1 packages 1 scripts 226 downloads

SPEM - S-system parameter estimation method

This package can optimize the parameters of S-system models given time series data.

Last updated 23 days ago

networknetworkinferencesoftware

4.78 score 1 packages 4 scripts 262 downloads

GeneMeta - MetaAnalysis for High Throughput Experiments

A collection of meta-analysis tools for analysing high throughput experimental data

Last updated 23 days ago

sequencinggeneexpressionmicroarray

4.78 score 1 packages 2 scripts 343 downloads

MLSeq - Machine Learning Interface for RNA-Seq Data

This package applies several machine learning methods, including SVM, bagSVM, Random Forest and CART to RNA-Seq data.

Last updated 23 days ago

immunooncologysequencingrnaseqclassificationclustering

4.76 score 1 packages 24 scripts 277 downloads

RNASeqPower - Sample size for RNAseq studies

RNA-seq, sample size

Last updated 23 days ago

immunooncologyrnaseq

4.75 score 28 scripts 322 downloads

convert - Convert Microarray Data Objects

Define coerce methods for microarray data objects.

Last updated 23 days ago

infrastructuremicroarraytwochannel

4.74 score 1 packages 91 scripts 348 downloads

ASpli - Analysis of Alternative Splicing Using RNA-Seq

Integrative pipeline for the analysis of alternative splicing using RNAseq.

Last updated 23 days ago

immunooncologygeneexpressiontranscriptionalternativesplicingcoveragedifferentialexpressiondifferentialsplicingtimecoursernaseqgenomeannotationsequencingalignment

4.73 score 1 packages 45 scripts 344 downloads

SBMLR - SBML-R Interface and Analysis Tools

This package contains a systems biology markup language (SBML) interface to R.

Last updated 23 days ago

graphandnetworkpathwaysnetwork

4.71 score 43 scripts 256 downloads

rTRM - Identification of Transcriptional Regulatory Modules from Protein-Protein Interaction Networks

rTRM identifies transcriptional regulatory modules (TRMs) from protein-protein interaction networks.

Last updated 23 days ago

transcriptionnetworkgeneregulationgraphandnetworkbioconductorbioinformatics

4.68 score 2 stars 1 packages 3 scripts 305 downloads

geneClassifiers - Application of gene classifiers

This package aims to make classifiers that have been published in the literature easy to apply, using an ExpressionSet as input.

Last updated 23 days ago

geneexpressionbiomedicalinformaticsclassificationsurvivalmicroarray

4.62 score 1 stars 35 scripts 183 downloads

TargetDecoy - Diagnostic Plots to Evaluate the Target Decoy Approach

A first step in the data analysis of Mass Spectrometry (MS) based proteomics data is to identify peptides and proteins. In this respect, the huge number of experimental mass spectra typically has to be assigned to theoretical peptides derived from a sequence database. Search engines are used for this purpose. These tools compare each of the observed spectra to all candidate theoretical spectra derived from the sequence database and calculate a score for each comparison. The observed spectrum is then assigned to the theoretical peptide with the best score, which is also referred to as the peptide to spectrum match (PSM). It is of course crucial for the downstream analysis to evaluate the quality of these matches. Therefore False Discovery Rate (FDR) control is used to return a reliable list of PSMs. The FDR, however, requires a good characterisation of the score distribution of PSMs that are matched to the wrong peptide (bad target hits). In proteomics, the target decoy approach (TDA) is typically used for this purpose. The TDA method matches the spectra to a database of real (targets) and nonsense peptides (decoys). A popular approach to generate these decoys is to reverse the target database. Hence, all the PSMs that match to a decoy are known to be bad hits and the distribution of their scores is used to estimate the distribution of the bad scoring target PSMs. A crucial assumption of the TDA is that the decoy PSM hits have similar properties to bad target hits, so that the decoy PSM scores are a good simulation of the bad target PSM scores. Users, however, typically do not evaluate these assumptions. To this end we developed TargetDecoy to generate diagnostic plots to evaluate the quality of the target decoy method.

Last updated 23 days ago

massspectrometryproteomicsqualitycontrolsoftwarevisualizationbioconductormass-spectrometry

4.60 score 1 stars 9 scripts 185 downloads
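
The target decoy approach described in the TargetDecoy entry above can be illustrated with a minimal Python sketch. It assumes a concatenated target/decoy search and uses the common estimator (decoy hits divided by target hits at or above a score threshold). The function name `decoy_fdr` and the toy scores are hypothetical and are not part of the TargetDecoy package, which is an R package that produces diagnostic plots.

```python
def decoy_fdr(scores, is_decoy, threshold):
    """Estimate the FDR among target PSMs scoring at or above `threshold`.

    Assumes higher scores are better and that decoy hits model the
    score distribution of bad target hits (the core TDA assumption).
    """
    n_decoy = sum(1 for s, d in zip(scores, is_decoy) if d and s >= threshold)
    n_target = sum(1 for s, d in zip(scores, is_decoy) if not d and s >= threshold)
    if n_target == 0:
        return 0.0  # no accepted targets, so nothing to be falsely discovered
    return min(1.0, n_decoy / n_target)


# Toy data (hypothetical): eight PSM scores, three of them decoy hits.
scores = [9.1, 8.7, 8.2, 7.9, 7.5, 6.8, 6.1, 5.4]
is_decoy = [False, False, False, True, False, False, True, True]

print(decoy_fdr(scores, is_decoy, 7.0))
```

If decoy scores do not resemble bad target scores, this ratio is biased, which is the assumption the diagnostic plots are meant to check.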

methInheritSim - Simulating Whole-Genome Inherited Bisulphite Sequencing Data

Simulate a multigeneration methylation case versus control experiment with inheritance relation using a real control dataset.

Last updated 23 days ago

biologicalquestionepigeneticsdnamethylationdifferentialmethylationmethylseqsoftwareimmunooncologystatisticalmethodwholegenomesequencingbisulphite-sequencinginheritancemethylationsimulation

4.60 score 1 stars 1 scripts 236 downloads

methylInheritance - Permutation-Based Analysis associating Conserved Differentially Methylated Elements Across Multiple Generations to a Treatment Effect

Permutation analysis, based on Monte Carlo sampling, for testing the hypothesis that the number of conserved differentially methylated elements, between several generations, is associated with an effect inherited from a treatment and that a stochastic effect can be dismissed.

Last updated 23 days ago

biologicalquestionepigeneticsdnamethylationdifferentialmethylationmethylseqsoftwareimmunooncologystatisticalmethodwholegenomesequencinganalysisbioconductorbioinformaticscpgdifferentially-methylated-elementsinheritancemonte-carlo-samplingpermutation

4.60 score 1 scripts 194 downloads

limmaGUI - GUI for limma Package With Two Color Microarrays

A Graphical User Interface for differential expression analysis of two-color microarray data using the limma package.

Last updated 23 days ago

guigeneexpressiondifferentialexpressiondataimportbayesianregressiontimecoursemicroarraymrnamicroarraytwochannelbatcheffectmultiplecomparisonnormalizationpreprocessingqualitycontrol

4.60 score 1 scripts 297 downloads

massiR - massiR: MicroArray Sample Sex Identifier

Predicts the sex of samples in gene expression microarray datasets

Last updated 23 days ago

softwaremicroarraygeneexpressionclusteringclassificationqualitycontrol

4.59 score 13 scripts 205 downloads

meshr - Tools for conducting enrichment analysis of MeSH

A set of annotation maps describing the entire MeSH assembled using data from MeSH.

Last updated 23 days ago

annotationdatafunctionalannotationbioinformaticsstatisticsannotationmultiplecomparisonsmeshdb

4.56 score 1 packages 9 scripts 279 downloads

VariantTools - Tools for Exploratory Analysis of Variant Calls

Explore, diagnose, and compare variant calls using filters.

Last updated 23 days ago

geneticsgeneticvariabilitysequencing

4.56 score 1 packages 40 scripts 399 downloads

rRDP - Interface to the RDP Classifier

This package installs and interfaces the naive Bayesian classifier for 16S rRNA sequences developed by the Ribosomal Database Project (RDP). With this package the classifier trained with the standard training set can be used or a custom classifier can be trained.

Last updated 23 days ago

geneticssequencinginfrastructureclassificationmicrobiomeimmunooncologyalignmentsequencematchingdataimportbayesianbioconductorbioinformatics

4.54 score 1 stars 3 scripts 210 downloads

flowClean - flowClean

A quality control tool for flow cytometry data based on compositional data analysis.

Last updated 23 days ago

flowcytometryqualitycontrolimmunooncology

4.53 score 17 scripts 352 downloads

msgbsR - msgbsR: methylation sensitive genotyping by sequencing (MS-GBS) R functions

Pipeline for the analysis of an MS-GBS experiment.

Last updated 23 days ago

immunooncologydifferentialmethylationdataimportepigeneticsmethylseq

4.48 score 1 scripts 209 downloads

MGFM - Marker Gene Finder in Microarray gene expression data

The package is designed to detect marker genes from Microarray gene expression data sets

Last updated 23 days ago

geneticsgeneexpressionmicroarray

4.48 score 1 packages 1 scripts 217 downloads

Pviz - Peptide Annotation and Data Visualization using Gviz

Pviz adapts the Gviz package for protein sequences and data.

Last updated 23 days ago

visualizationproteomicsmicroarray

4.48 score 4 scripts 250 downloads

ChIPQC - Quality metrics for ChIPseq data

Quality metrics for ChIPseq data.

Last updated 23 days ago

sequencingchipseqqualitycontrolreportwriting

4.44 score 137 scripts 634 downloads

GEM - GEM: fast association study for the interplay of Gene, Environment and Methylation

Tools for genome-wide analysis of EWAS, methQTL and GxE.

Last updated 23 days ago

methylseqmethylationarraygenomewideassociationregressiondnamethylationsnpgeneexpressiongui

4.43 score 27 scripts 164 downloads

groHMM - GRO-seq Analysis Pipeline

A pipeline for the analysis of GRO-seq data.

Last updated 23 days ago

sequencingsoftware

4.43 score 1 stars 25 scripts 317 downloads

GSAR - Gene Set Analysis in R

Gene set analysis using specific alternative hypotheses. Tests for differential expression, scale and net correlation structure.

Last updated 23 days ago

softwarestatisticalmethoddifferentialexpression

4.38 score 7 scripts 252 downloads

a4Core - Automated Affymetrix Array Analysis Core Package

Utility functions for the Automated Affymetrix Array Analysis set of packages.

Last updated 23 days ago

microarrayclassification

4.38 score 4 packages 2 scripts 453 downloads

protGear - Protein Micro Array Data Management and Interactive Visualization

A generic three-step pre-processing package for protein microarray data. This package contains different data pre-processing procedures to allow comparison of their performance. These steps are background correction, the coefficient of variation (CV) based filtering, batch correction and normalization.

Last updated 23 days ago

microarrayonechannelpreprocessingbiomedicalinformaticsproteomicsbatcheffectnormalizationbayesianclusteringregressionsystemsbiologyimmunooncologybackground-correctionmicroarray-datanormalisationproteomics-datashinyshinydashboard

4.30 score 1 stars 6 scripts 258 downloads

iCNV - Integrated Copy Number Variation detection

Integrative copy number variation (CNV) detection from multiple platforms and experimental designs.

Last updated 23 days ago

immunooncologyexomeseqwholegenomesnpcopynumbervariationhiddenmarkovmodel

4.30 score 5 scripts 206 downloads

goSTAG - A tool to use GO Subtrees to Tag and Annotate Genes within a set

Gene lists derived from the results of genomic analyses are rich in biological information. For instance, differentially expressed genes (DEGs) from a microarray or RNA-Seq analysis are related functionally in terms of their response to a treatment or condition. Gene lists can vary in size, up to several thousand genes, depending on the robustness of the perturbations or how widely different the conditions are biologically. Systematically associating biological relatedness among hundreds or thousands of genes is impractical when done by manually curating the annotation and function of each gene. Over-representation analysis (ORA) of genes was developed to identify biological themes. Given a Gene Ontology (GO) and an annotation of genes that indicates the categories each one fits into, the significance of the over-representation of the genes within the ontological categories is determined by a Fisher's exact test or by modeling according to a hypergeometric distribution. Comparing a small number of enriched biological categories for a few samples is manageable using Venn diagrams or other means for assessing overlaps. However, with hundreds of enriched categories and many samples, the comparisons are laborious. Furthermore, if there are enriched categories that are shared between samples, trying to represent a common theme across them is highly subjective. goSTAG uses GO subtrees to tag and annotate genes within a set. goSTAG visualizes the similarities between the over-representation of DEGs by clustering the p-values from the enrichment statistical tests and labels clusters with the GO term that has the most paths to the root within the subtree generated from all the GO terms in the cluster.

Last updated 23 days ago

geneexpressiondifferentialexpressiongenesetenrichmentclusteringmicroarraymrnamicroarrayrnaseqvisualizationgoimmunooncology

4.30 score 1 scripts 210 downloads

ChIPexoQual - ChIPexoQual

Package with a quality control pipeline for ChIP-exo/nexus data.

Last updated 23 days ago

chipseqsequencingtranscriptionvisualizationqualitycontrolcoveragealignment

4.30 score 1 stars 5 scripts 329 downloads

RJMCMCNucleosomes - Bayesian hierarchical model for genome-wide nucleosome positioning with high-throughput short-read data (MNase-Seq)

This package performs nucleosome positioning using an informative Multinomial-Dirichlet prior in a t-mixture with reversible jump estimation of nucleosome positions for genome-wide profiling.

Last updated 23 days ago

biologicalquestionchipseqnucleosomepositioningsoftwarestatisticalmethodbayesiansequencingcoveragebayesian-t-mixturebioconductorc-plus-plusgenome-wide-profilingmultinomial-dirichlet-priornucleosome-positioningnucleosomesreversible-jump-mcmc

4.30 score 1 scripts 210 downloads

SELEX - Functions for analyzing SELEX-seq data

Tools for quantifying DNA binding specificities based on SELEX-seq data.

Last updated 23 days ago

softwaremotifdiscoverymotifannotationgeneregulationtranscription

4.30 score 8 scripts 217 downloads

MultiMed - Testing multiple biological mediators simultaneously

Implements methods for testing multiple mediators

Last updated 23 days ago

multiplecomparisonstatisticalmethodsoftware

4.30 score 8 scripts 201 downloads

CAFE - Chromosomal Aberrations Finder in Expression data

Detection and visualizations of gross chromosomal aberrations using Affymetrix expression microarrays as input

Last updated 23 days ago

geneexpressionmicroarrayonechannelgenesetenrichment

4.30 score 2 scripts 243 downloads

BEAT - BEAT - BS-Seq Epimutation Analysis Toolkit

Model-based analysis of single-cell methylation data

Last updated 23 days ago

immunooncologygeneticsmethylseqsoftwarednamethylationepigenetics

4.30 score 3 scripts 213 downloads

PREDA - Position Related Data Analysis

Package for the position related analysis of quantitative functional genomics data.

Last updated 23 days ago

softwarecopynumbervariationgeneexpressiongenetics

4.30 score 9 scripts 258 downloads

mogsa - Multiple omics data integrative clustering and gene set analysis

This package provides a method for doing gene set analysis based on multiple omics data.

Last updated 23 days ago

geneexpressionprincipalcomponentstatisticalmethodclusteringsoftware

4.28 score 48 scripts 439 downloads

altcdfenvs - alternative CDF environments (aka probeset mappings)

Convenience data structures and functions to handle cdfenvs

Last updated 23 days ago

microarrayonechannelqualitycontrolpreprocessingannotationproprietaryplatformstranscription

4.26 score 1 packages 5 scripts 414 downloads

GRmetrics - Calculate growth-rate inhibition (GR) metrics

Functions for calculating and visualizing growth-rate inhibition (GR) metrics.

Last updated 23 days ago

immunooncologycellbasedassayscellbiologysoftwaretimecoursevisualization

4.23 score 1 stars 17 scripts 218 downloads

ivygapSE - A SummarizedExperiment for Ivy-GAP data

Define a SummarizedExperiment and exploratory app for Ivy-GAP glioblastoma image, expression, and clinical data.

Last updated 23 days ago

transcriptionsoftwarevisualizationsurvivalgeneexpressionsequencing

4.20 score 16 scripts 190 downloads

tenXplore - ontological exploration of scRNA-seq of 1.3 million mouse neurons from 10x Genomics

Perform ontological exploration of scRNA-seq of 1.3 million mouse neurons from 10x Genomics.

Last updated 23 days ago

immunooncologydimensionreductionprincipalcomponenttranscriptomicssinglecell

4.18 score 7 scripts 140 downloads

bnbc - Bandwise normalization and batch correction of Hi-C data

Tools to normalize (several) Hi-C data from replicates.

Last updated 23 days ago

hicpreprocessingnormalizationsoftware

4.18 score 1 stars 15 scripts 438 downloads

erccdashboard - Assess Differential Gene Expression Experiments with ERCC Controls

Technical performance metrics for differential gene expression experiments using External RNA Controls Consortium (ERCC) spike-in ratio mixtures.

Last updated 23 days ago

immunooncologygeneexpressiontranscriptionalternativesplicingdifferentialexpressiondifferentialsplicinggeneticsmicroarraymrnamicroarrayrnaseqbatcheffectmultiplecomparisonqualitycontrol

4.18 score 4 scripts 220 downloads

ssize - Estimate Microarray Sample Size

Functions for computing and displaying sample size information for gene expression arrays.

Last updated 23 days ago

microarraydifferentialexpression

4.18 score 15 scripts 318 downloads

STATegRa - Classes and methods for multi-omics data integration

Classes and tools for multi-omics data integration.

Last updated 23 days ago

softwarestatisticalmethodclusteringdimensionreductionprincipalcomponent

4.15 score 3 scripts 220 downloads

flowCyBar - Analyze flow cytometric data using gate information

A package to analyze flow cytometric data using gate information to follow population/community dynamics

Last updated 23 days ago

immunooncologycellbasedassaysclusteringflowcytometrysoftwarevisualization

4.15 score 1 scripts 183 downloads

methimpute - Imputation-guided re-construction of complete methylomes from WGBS data

This package implements functions for calling methylation for all cytosines in the genome.

Last updated 23 days ago

immunooncologysoftwarednamethylationepigeneticshiddenmarkovmodelsequencingcoverage

4.11 score 13 scripts 204 downloads

panelcn.mops - CNV detection tool for targeted NGS panel data

CNV detection tool for targeted NGS panel data. Extension of the cn.mops package.

Last updated 23 days ago

sequencingcopynumbervariationcellbiologygenomicvariationvariantdetectiongenetics

4.08 score 12 scripts 245 downloads

KEGGlincs - Visualize all edges within a KEGG pathway and overlay LINCS data

See what is going on 'under the hood' of KEGG pathways by explicitly re-creating the pathway maps from information obtained from KGML files.

Last updated 23 days ago

networkinferencegeneexpressiondatarepresentationthirdpartyclientcellbiologygraphandnetworkpathwayskeggnetwork

4.00 score 3 scripts 220 downloads

GAprediction - Prediction of gestational age with Illumina HumanMethylation450 data

GAprediction predicts gestational age using Illumina HumanMethylation450 CpG data.

Last updated 23 days ago

immunooncologydnamethylationepigeneticsregressionbiomedicalinformatics

4.00 score 167 downloads

ctsGE - Clustering of Time Series Gene Expression data

Methodology for supervised clustering of potentially many predictor variables, such as genes, in time series datasets. Provides functions that help the user assign genes to a predefined set of model profiles.

Last updated 23 days ago

immunooncologygeneexpressiontranscriptiondifferentialexpressiongenesetenrichmentgeneticsbayesianclusteringtimecoursesequencingrnaseq

4.00 score 1 stars 6 scripts 188 downloads

Rnits - R Normalization and Inference of Time Series data

R/Bioconductor package for normalization, curve registration and inference in time course gene expression data.

Last updated 23 days ago

geneexpressionmicroarraytimecoursedifferentialexpressionnormalization

4.00 score 1 scripts 200 downloads

IntEREst - Intron-Exon Retention Estimator

This package performs Intron-Exon Retention analysis on RNA-seq data (.bam files).

Last updated 23 days ago

softwarealternativesplicingcoveragedifferentialsplicingsequencingrnaseqalignmentnormalizationdifferentialexpressionimmunooncology

3.98 score 12 scripts 236 downloads

EGAD - Extending guilt by association by degree

The package implements a series of highly efficient tools to calculate functional properties of networks based on guilt by association methods.

Last updated 23 days ago

softwarefunctionalgenomicssystemsbiologygenepredictionfunctionalpredictionnetworkenrichmentgraphandnetworknetwork

3.92 score 83 scripts 206 downloads

DynDoc - Dynamic document tools

A set of functions to create and interact with dynamic documents and vignettes.

Last updated 23 days ago

reportwritinginfrastructure

3.92 score 7 packages 8 scripts 2.0k downloads

Icens - NPMLE for Censored and Truncated Data

Many functions for computing the NPMLE for censored and truncated data.

Last updated 23 days ago

infrastructure

3.83 score 7 packages 16 scripts 857 downloads

DMCHMM - Differentially Methylated CpG using Hidden Markov Model

A pipeline for identifying differentially methylated CpG sites using Hidden Markov Model in bisulfite sequencing data. DNA methylation studies have enabled researchers to understand methylation patterns and their regulatory roles in biological processes and disease. However, only a limited number of statistical approaches have been developed to provide formal quantitative analysis. Specifically, a few available methods do identify differentially methylated CpG (DMC) sites or regions (DMR), but they suffer from limitations that arise mostly due to challenges inherent in bisulfite sequencing data. These challenges include: (1) read-depths vary considerably among genomic positions and are often low; (2) both methylation and autocorrelation patterns change as regions change; and (3) CpG sites are distributed unevenly. Furthermore, there are several methodological limitations: almost none of these tools is capable of comparing multiple groups and/or working with missing values, and only a few allow continuous or multiple covariates. The last of these is of great interest among researchers, as the goal is often to find which regions of the genome are associated with several exposures and traits. To tackle these issues, we have developed an efficient DMC identification method based on Hidden Markov Models (HMMs) called “DMCHMM”, which is a three-step approach (model selection, prediction, testing) aiming to address the aforementioned drawbacks.

Last updated 23 days ago

differentialmethylationsequencinghiddenmarkovmodelcoverage

3.78 score 3 scripts 192 downloads

MGFR - Marker Gene Finder in RNA-seq data

The package is designed to detect marker genes from RNA-seq data.

Last updated 23 days ago

immunooncologygeneticsgeneexpressionrnaseq

3.78 score 1 packages 2 scripts 202 downloads

flowCHIC - Analyze flow cytometric data using histogram information

A package to analyze flow cytometric data of complex microbial communities based on histogram images

Last updated 23 days ago

immunooncologycellbasedassaysclusteringflowcytometrysoftwarevisualization

3.78 score 1 scripts 200 downloads

MVCClass - Model-View-Controller (MVC) Classes

Creates classes used in model-view-controller (MVC) design

Last updated 23 days ago

visualizationinfrastructuregraphandnetwork

3.78 score 1 packages 365 downloads

mimager - mimager: The Microarray Imager

Easily visualize and inspect microarrays for spatial artifacts.

Last updated 23 days ago

infrastructurevisualizationmicroarraybioconductorbioinformatics

3.70 score 3 scripts 226 downloads

clstutils - Tools for performing taxonomic assignment

Tools for performing taxonomic assignment based on phylogeny using pplacer and clst.

Last updated 23 days ago

sequencingclassificationvisualizationqualitycontrol

3.64 score 11 scripts 205 downloads

ssviz - A small RNA-seq visualizer and analysis toolkit

Small RNA sequencing viewer

Last updated 23 days ago

immunooncologysequencingrnaseqvisualizationmultiplecomparisongenetics

3.60 score 2 scripts 229 downloads

pathRender - Render molecular pathways

Build graphs from pathway databases and render them with Rgraphviz.

Last updated 23 days ago

graphandnetworkpathwaysvisualization

3.60 score 2 scripts 317 downloads

apComplex - Estimate protein complex membership using AP-MS protein data

Functions to estimate a bipartite graph of protein complex membership using AP-MS data.

Last updated 23 days ago

immunooncologynetworkinferencemassspectrometrygraphandnetwork

3.60 score 8 scripts 411 downloads

OLINgui - Graphical user interface for OLIN

Graphical user interface for the OLIN package

Last updated 23 days ago

microarraytwochannelqualitycontrolpreprocessingvisualization

3.60 score 1 scripts 278 downloads

affylmGUI - GUI for limma Package with Affymetrix Microarrays

A Graphical User Interface (GUI) for analysis of Affymetrix microarray gene expression data using the affy and limma packages.

Last updated 23 days ago

guigeneexpressiontranscriptiondifferentialexpressiondataimportbayesianregressiontimecoursemicroarraymrnamicroarrayonechannelproprietaryplatformsbatcheffectmultiplecomparisonnormalizationpreprocessingqualitycontrol

3.60 score 3 scripts 442 downloads

HTSeqGenie - An NGS analysis pipeline.

Libraries to perform NGS analysis.

Last updated 23 days ago

3.48 score 4 scripts 176 downloads

TFEA.ChIP - Analyze Transcription Factor Enrichment

Package to analyze transcription factor enrichment in a gene set using data from ChIP-Seq experiments.

Last updated 23 days ago

transcriptiongeneregulationgenesetenrichmenttranscriptomicssequencingchipseqrnaseqimmunooncology

3.45 score 14 scripts 226 downloads

GeneStructureTools - Tools for spliced gene structure manipulation and analysis

GeneStructureTools can be used to create in silico alternative splicing events and analyse the potential effects these have on functional gene products.

Last updated 23 days ago

immunooncologysoftwaredifferentialsplicingfunctionalpredictiontranscriptomicsalternativesplicingrnaseq

3.32 score 21 scripts 260 downloads

DEScan2 - Differential Enrichment Scan 2

Integrated peak and differential caller, specifically designed for broad epigenomic signals.

Last updated 23 days ago

immunooncologypeakdetectionepigeneticssoftwaresequencingcoverage

3.30 score 2 scripts 207 downloads

diggit - Inference of Genetic Variants Driving Cellular Phenotypes

Inference of Genetic Variants Driving Cellular Phenotypes by the DIGGIT algorithm

Last updated 23 days ago

systemsbiologynetworkenrichmentgeneexpressionfunctionalpredictiongeneregulation

3.30 score 3 scripts 196 downloads

IMPCdata - Retrieves data from IMPC database

Package contains methods for data retrieval from IMPC Database.

Last updated 23 days ago

experimentdata

3.30 score 4 scripts 189 downloads

FRGEpistasis - Epistasis Analysis for Quantitative Traits by Functional Regression Model

A Tool for Epistasis Analysis Based on Functional Regression Model

Last updated 23 days ago

geneticsnetworkinferencegeneticvariabilitysoftware

3.30 score 8 scripts 181 downloads

Clomial - Infers clonal composition of a tumor

Clomial fits binomial distributions to counts obtained from Next Gen Sequencing data of multiple samples of the same tumor. The trained parameters can be interpreted to infer the clonal structure of the tumor.

Last updated 23 days ago

geneticsgeneticvariabilitysequencingclusteringmultiplecomparisonbayesiandnaseqexomeseqtargetedresequencingimmunooncology

3.30 score 3 scripts 178 downloads

metaSeq - Meta-analysis of RNA-Seq count data in multiple studies

The probabilities from one-sided NOISeq are combined using Fisher's method or Stouffer's method.
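The two combination rules named in the metaSeq description can be sketched in a few lines. This is a minimal illustration using only the Python standard library, not metaSeq's API; the function names are hypothetical, the Stouffer variant shown is the unweighted one (an assumption, since the description does not say whether weights are used), and the input p-values are made up.

```python
from math import exp, factorial, log, sqrt
from statistics import NormalDist


def fisher_combine(p_values):
    """Fisher's method: -2 * sum(ln p) follows a chi-square with 2k df."""
    k = len(p_values)
    half_stat = -sum(log(p) for p in p_values)  # half of the chi-square statistic
    # Closed-form chi-square survival function for an even number (2k) of df.
    return exp(-half_stat) * sum(half_stat**i / factorial(i) for i in range(k))


def stouffer_combine(p_values):
    """Unweighted Stouffer's method: average the z-scores of one-sided p-values."""
    std_normal = NormalDist()
    z = sum(std_normal.inv_cdf(1.0 - p) for p in p_values) / sqrt(len(p_values))
    return 1.0 - std_normal.cdf(z)


# Illustrative one-sided p-values for a single gene across three studies.
study_p = [0.04, 0.10, 0.03]
print("Fisher:  ", fisher_combine(study_p))
print("Stouffer:", stouffer_combine(study_p))
```

Both functions require every p-value to lie strictly between 0 and 1. Fisher's method is more sensitive to a single very small p-value, while Stouffer's method rewards consistent moderate evidence across studies.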

Last updated 23 days ago

rnaseqdifferentialexpressionsequencingimmunooncology

3.30 score 2 scripts 221 downloads

interactiveDisplay - Package for enabling powerful shiny web displays of Bioconductor objects

The interactiveDisplay package contains the methods needed to generate interactive Shiny-based displays of Bioconductor objects.

Last updated 23 days ago

gogeneexpressionmicroarraysequencingclassificationnetworkqualitycontrolvisualizationgeneticsdatarepresentationguiannotationdatashinyapps

3.30 score 4 scripts 434 downloads

rTRMui - A shiny user interface for rTRM

This package provides a web interface to compute transcriptional regulatory modules with rTRM.

Last updated 23 days ago

transcriptionnetworkgeneregulationgraphandnetworkgui

3.30 score 1 stars 1 scripts 221 downloads

SigFuge - SigFuge

Algorithm for testing significance of clustering in RNA-seq data.

Last updated 23 days ago

clusteringvisualizationrnaseqimmunooncology

3.30 score 3 scripts 239 downloads

BaseSpaceR - R SDK for BaseSpace RESTful API

A rich R interface to Illumina's BaseSpace cloud computing environment, enabling the fast development of data analysis and visualisation tools.

Last updated 23 days ago

infrastructuredatarepresentationconnecttoolssoftwaredataimporthighthroughputsequencingsequencinggenetics

3.30 score 9 scripts 312 downloads

deltaGseg - deltaGseg

Identifying distinct subpopulations through multiscale time series analysis

Last updated 23 days ago

proteomicstimecoursevisualizationclustering

3.30 score 2 scripts 187 downloads

iBMQ - integrated Bayesian Modeling of eQTL data

integrated Bayesian Modeling of eQTL data

Last updated 23 days ago

microarraypreprocessinggeneexpressionsnp

3.30 score 1 scripts 164 downloads

AGDEX - Agreement of Differential Expression Analysis

A tool to evaluate agreement of differential expression for cross-species genomics

Last updated 23 days ago

microarraygeneticsgeneexpression

3.30 score 6 scripts 359 downloads

chopsticks - The 'snp.matrix' and 'X.snp.matrix' Classes

Implements classes and methods for large-scale SNP association studies

Last updated 23 days ago

microarraysnpsandgeneticvariabilitysnpgeneticvariability

3.30 score 5 scripts 250 downloads

RbcBook1 - Support for Springer monograph on Bioconductor

Tools for building the book.

Last updated 23 days ago

software

3.30 score 1 scripts 298 downloads

BioMVCClass - Model-View-Controller (MVC) Classes That Use Biobase

Creates classes used in model-view-controller (MVC) design

Last updated 23 days ago

visualizationinfrastructuregraphandnetwork

3.30 score 357 downloads

diffGeneAnalysis - Performs differential gene expression Analysis

Analyze microarray data

Last updated 23 days ago

microarraydifferentialexpression

3.30 score 1 scripts 236 downloads

idiogram - idiogram

A package for plotting genomic data by chromosomal location

Last updated 23 days ago

visualization

3.30 score 6 scripts 316 downloads

arrayQuality - Assessing array quality on spotted arrays

Functions for performing print-run and array level quality assessment.

Last updated 23 days ago

microarraytwochannelqualitycontrolvisualization

3.30 score 9 scripts 500 downloads

iCheck - QC Pipeline and Data Analysis Tools for High-Dimensional Illumina mRNA Expression Data

QC pipeline and data analysis tools for high-dimensional Illumina mRNA expression data.

Last updated 23 days ago

geneexpressiondifferentialexpressionmicroarraypreprocessingdnamethylationonechanneltwochannelqualitycontrol

3.00 score 1 scripts 196 downloads

Mulcom - Calculates Mulcom test

Identification of differentially expressed genes and false discovery rate (FDR) calculation by Multiple Comparison test.

Last updated 23 days ago

statisticalmethodmultiplecomparisonmicroarraydifferentialexpressiongeneexpression

3.00 score 248 downloads

bgx - Bayesian Gene eXpression

Bayesian integrated analysis of Affymetrix GeneChips

Last updated 23 days ago

microarraydifferentialexpression

2.60 score 1 scripts 438 downloads

roastgsa - Rotation based gene set analysis

This package implements a variety of functions useful for gene set analysis using rotations to approximate the null distribution. It contributes implementations of seven test statistic scores that can be used with different goals and interpretations. Several functions are available to complement the statistical results with graphical representations.

Last updated 23 days ago

microarraypreprocessingnormalizationgeneexpressionsurvivaltranscriptionsequencingtranscriptomicsbayesianclusteringregressionrnaseqmicrornaarraymrnamicroarrayfunctionalgenomicssystemsbiologyimmunooncologydifferentialexpressiongenesetenrichmentbatcheffectmultiplecomparisonqualitycontroltimecoursemetabolomicsproteomicsepigeneticscheminformaticsexonarrayonechanneltwochannelproprietaryplatformscellbiologybiomedicalinformaticsalternativesplicingdifferentialsplicingdataimportpathways

2.30 score 120 downloads

CTDquerier - Package for CTDbase data query, visualization and downstream analysis

Package to retrieve and visualize data from the Comparative Toxicogenomics Database (http://ctdbase.org/). The downloaded data is formatted as DataFrames for further downstream analyses.

Last updated 23 days ago

softwarebiomedicalinformaticsinfrastructuredataimportdatarepresentationgenesetenrichmentnetworkenrichmentpathwaysnetworkgokegg

2.30 score 2 scripts 167 downloads

TransView - Read density map construction and accession. Visualization of ChIPSeq and RNASeq data sets

This package provides efficient tools to generate, access and display read densities of sequencing-based data sets such as those from RNA-Seq and ChIP-Seq.

Last updated 23 days ago

immunooncologydnamethylationgeneexpressiontranscriptionmicroarraysequencingchipseqrnaseqmethylseqdataimportvisualizationclusteringmultiplecomparison

2.30 score 247 downloads

stepNorm - Stepwise normalization functions for cDNA microarrays

Stepwise normalization functions for cDNA microarray data.

Last updated 23 days ago

microarraytwochannelpreprocessing

2.30 score 2 scripts 259 downloads

GraphAT - Graph Theoretic Association Tests

Functions and data used in Balasubramanian, et al. (2004)

Last updated 23 days ago

networkgraphandnetwork

2.30 score 4 scripts 286 downloads