systemPipeR - systemPipeR: workflow management and report generation environment

systemPipeR is a multipurpose data analysis workflow environment that unifies R with command-line tools. It enables scientists to analyze many types of large- or small-scale data on local or distributed computer systems with a high level of reproducibility, scalability and portability. At its core is a command-line interface (CLI) that adopts the Common Workflow Language (CWL). This design allows users to choose for each analysis step the optimal R or command-line software. It supports both end-to-end and partial execution of workflows with built-in restart functionalities. Efficient management of complex analysis tasks is accomplished by a flexible workflow control container class. Handling of large numbers of input samples and experimental designs is facilitated by consistent sample annotation mechanisms. As a multi-purpose workflow toolkit, systemPipeR enables users to run existing workflows, customize them or design entirely new ones while taking advantage of widely adopted data structures within the Bioconductor ecosystem. Another important core functionality is the generation of reproducible scientific analysis and technical reports. For result interpretation, systemPipeR offers a wide range of plotting functionality, while an associated Shiny App offers many useful functionalities for interactive result exploration. The vignettes linked from this page include (1) a general introduction, (2) a description of technical details, and (3) a collection of workflow templates.

Last updated 5 days ago

bioconductor-package

4.17 score 99 dependencies 3 dependents

IsoBayes - IsoBayes: Single Isoform protein inference Method via Bayesian Analyses

IsoBayes is a Bayesian method to perform inference on single protein isoforms. Our approach infers the presence/absence of protein isoforms, and also estimates their abundance; additionally, it provides a measure of the uncertainty of these estimates, via: i) the posterior probability that a protein isoform is present in the sample; ii) a posterior credible interval of its abundance. IsoBayes inputs liquid cromatography mass spectrometry (MS) data, and can work with both PSM counts, and intensities. When available, trascript isoform abundances (i.e., TPMs) are also incorporated: TPMs are used to formulate an informative prior for the respective protein isoform relative abundance. We further identify isoforms where the relative abundance of proteins and transcripts significantly differ. We use a two-layer latent variable approach to model two sources of uncertainty typical of MS data: i) peptides may be erroneously detected (even when absent); ii) many peptides are compatible with multiple protein isoforms. In the first layer, we sample the presence/absence of each peptide based on its estimated probability of being mistakenly detected, also known as PEP (i.e., posterior error probability). In the second layer, for peptides that were estimated as being present, we allocate their abundance across the protein isoforms they map to. These two steps allow us to recover the presence and abundance of each protein isoform.

Last updated 9 days ago

bioconductor-package

1.80 score 64 dependencies

omicsViewer - Interactive and explorative visualization of SummarizedExperssionSet or ExpressionSet using omicsViewer

omicsViewer visualizes ExpressionSet (or SummarizedExperiment) in an interactive way. The omicsViewer has a separate back- and front-end. In the back-end, users need to prepare an ExpressionSet that contains all the necessary information for the downstream data interpretation. Some extra requirements on the headers of phenotype data or feature data are imposed so that the provided information can be clearly recognized by the front-end, at the same time, keep a minimum modification on the existing ExpressionSet object. The pure dependency on R/Bioconductor guarantees maximum flexibility in the statistical analysis in the back-end. Once the ExpressionSet is prepared, it can be visualized using the front-end, implemented by shiny and plotly. Both features and samples could be selected from (data) tables or graphs (scatter plot/heatmap). Different types of analyses, such as enrichment analysis (using Bioconductor package fgsea or fisher's exact test) and STRING network analysis, will be performed on the fly and the results are visualized simultaneously. When a subset of samples and a phenotype variable is selected, a significance test on means (t-test or ranked based test; when phenotype variable is quantitative) or test of independence (chi-square or fisher’s exact test; when phenotype data is categorical) will be performed to test the association between the phenotype of interest with the selected samples. Additionally, other analyses can be easily added as extra shiny modules. Therefore, omicsViewer will greatly facilitate data exploration, many different hypotheses can be explored in a short time without the need for knowledge of R. In addition, the resulting data could be easily shared using a shiny server. Otherwise, a standalone version of omicsViewer together with designated omics data could be easily created by integrating it with portable R, which can be shared with collaborators or submitted as supplementary data together with a manuscript.

Last updated 23 days ago

bioconductor-package

1.24 score 191 dependencies

RnBeads - RnBeads

RnBeads facilitates comprehensive analysis of various types of DNA methylation data at the genome scale.

Last updated 1 months ago

bioconductor-package

6.32 score 156 dependencies 1 dependents

ViSEAGO - ViSEAGO: a Bioconductor package for clustering biological functions using Gene Ontology and semantic similarity

The main objective of ViSEAGO package is to carry out a data mining of biological functions and establish links between genes involved in the study. We developed ViSEAGO in R to facilitate functional Gene Ontology (GO) analysis of complex experimental design with multiple comparisons of interest. It allows to study large-scale datasets together and visualize GO profiles to capture biological knowledge. The acronym stands for three major concepts of the analysis: Visualization, Semantic similarity and Enrichment Analysis of Gene Ontology. It provides access to the last current GO annotations, which are retrieved from one of NCBI EntrezGene, Ensembl or Uniprot databases for several species. Using available R packages and novel developments, ViSEAGO extends classical functional GO analysis to focus on functional coherence by aggregating closely related biological themes while studying multiple datasets at once. It provides both a synthetic and detailed view using interactive functionalities respecting the GO graph structure and ensuring functional coherence supplied by semantic similarity. ViSEAGO has been successfully applied on several datasets from different species with a variety of biological questions. Results can be easily shared between bioinformaticians and biologists, enhancing reporting capabilities while maintaining reproducibility.

Last updated 1 months ago

bioconductor-package

0.82 score 159 dependencies

RNAdecay - Maximum Likelihood Decay Modeling of RNA Degradation Data

RNA degradation is monitored through measurement of RNA abundance after inhibiting RNA synthesis. This package has functions and example scripts to facilitate (1) data normalization, (2) data modeling using constant decay rate or time-dependent decay rate models, (3) the evaluation of treatment or genotype effects, and (4) plotting of the data and models. Data Normalization: functions and scripts make easy the normalization to the initial (T0) RNA abundance, as well as a method to correct for artificial inflation of Reads per Million (RPM) abundance in global assessments as the total size of the RNA pool decreases. Modeling: Normalized data is then modeled using maximum likelihood to fit parameters. For making treatment or genotype comparisons (up to four), the modeling step models all possible treatment effects on each gene by repeating the modeling with constraints on the model parameters (i.e., the decay rate of treatments A and B are modeled once with them being equal and again allowing them to both vary independently). Model Selection: The AICc value is calculated for each model, and the model with the lowest AICc is chosen. Modeling results of selected models are then compiled into a single data frame. Graphical Plotting: functions are provided to easily visualize decay data model, or half-life distributions using ggplot2 package functions.

Last updated 2 months ago

bioconductor-package

1.16 score 37 dependencies

RNAseqCovarImpute - Impute Covariate Data in RNA Sequencing Studies

The RNAseqCovarImpute package makes linear model analysis for RNA sequencing read counts compatible with multiple imputation (MI) of missing covariates. A major problem with implementing MI in RNA sequencing studies is that the outcome data must be included in the imputation prediction models to avoid bias. This is difficult in omics studies with high-dimensional data. The first method we developed in the RNAseqCovarImpute package surmounts the problem of high-dimensional outcome data by binning genes into smaller groups to analyze pseudo-independently. This method implements covariate MI in gene expression studies by 1) randomly binning genes into smaller groups, 2) creating M imputed datasets separately within each bin, where the imputation predictor matrix includes all covariates and the log counts per million (CPM) for the genes within each bin, 3) estimating gene expression changes using `limma::voom` followed by `limma::lmFit` functions, separately on each M imputed dataset within each gene bin, 4) un-binning the gene sets and stacking the M sets of model results before applying the `limma::squeezeVar` function to apply a variance shrinking Bayesian procedure to each M set of model results, 5) pooling the results with Rubins’ rules to produce combined coefficients, standard errors, and P-values, and 6) adjusting P-values for multiplicity to account for false discovery rate (FDR). A faster method uses principal component analysis (PCA) to avoid binning genes while still retaining outcome information in the MI models. Binning genes into smaller groups requires that the MI and limma-voom analysis is run many times (typically hundreds). The more computationally efficient MI PCA method implements covariate MI in gene expression studies by 1) performing PCA on the log CPM values for all genes using the Bioconductor `PCAtools` package, 2) creating M imputed datasets where the imputation predictor matrix includes all covariates and the optimum number of PCs to retain (e.g., based on Horn’s parallel analysis or the number of PCs that account for >80% explained variation), 3) conducting the standard limma-voom pipeline with the `voom` followed by `lmFit` followed by `eBayes` functions on each M imputed dataset, 4) pooling the results with Rubins’ rules to produce combined coefficients, standard errors, and P-values, and 5) adjusting P-values for multiplicity to account for false discovery rate (FDR).

Last updated 2 months ago

bioconductor-package

1.00 score 73 dependencies

Moonlight2R - Identify oncogenes and tumor suppressor genes from omics data

The understanding of cancer mechanism requires the identification of genes playing a role in the development of the pathology and the characterization of their role (notably oncogenes and tumor suppressors). We present an updated version of the R/bioconductor package called MoonlightR, namely Moonlight2R, which returns a list of candidate driver genes for specific cancer types on the basis of omics data integration. The Moonlight framework contains a primary layer where gene expression data and information about biological processes are integrated to predict genes called oncogenic mediators, divided into putative tumor suppressors and putative oncogenes. This is done through functional enrichment analyses, gene regulatory networks and upstream regulator analyses to score the importance of well-known biological processes with respect to the studied cancer type. By evaluating the effect of the oncogenic mediators on biological processes or through random forests, the primary layer predicts two putative roles for the oncogenic mediators: i) tumor suppressor genes (TSGs) and ii) oncogenes (OCGs). As gene expression data alone is not enough to explain the deregulation of the genes, a second layer of evidence is needed. We have automated the integration of a secondary mutational layer through new functionalities in Moonlight2R. These functionalities analyze mutations in the cancer cohort and classifies these into driver and passenger mutations using the driver mutation prediction tool, CScape-somatic. Those oncogenic mediators with at least one driver mutation are retained as the driver genes. As a consequence, this methodology does not only identify genes playing a dual role (e.g. TSG in one cancer type and OCG in another) but also helps in elucidating the biological processes underlying their specific roles. In particular, Moonlight2R can be used to discover OCGs and TSGs in the same cancer type. This may for instance help in answering the question whether some genes change role between early stages (I, II) and late stages (III, IV). In the future, this analysis could be useful to determine the causes of different resistances to chemotherapeutic treatments.

Last updated 2 months ago

bioconductor-package

1.45 score 213 dependencies

RESOLVE - RESOLVE: An R package for the efficient analysis of mutational signatures from cancer genomes

Cancer is a genetic disease caused by somatic mutations in genes controlling key biological functions such as cellular growth and division. Such mutations may arise both through cell-intrinsic and exogenous processes, generating characteristic mutational patterns over the genome named mutational signatures. The study of mutational signatures have become a standard component of modern genomics studies, since it can reveal which (environmental and endogenous) mutagenic processes are active in a tumor, and may highlight markers for therapeutic response. Mutational signatures computational analysis presents many pitfalls. First, the task of determining the number of signatures is very complex and depends on heuristics. Second, several signatures have no clear etiology, casting doubt on them being computational artifacts rather than due to mutagenic processes. Last, approaches for signatures assignment are greatly influenced by the set of signatures used for the analysis. To overcome these limitations, we developed RESOLVE (Robust EStimation Of mutationaL signatures Via rEgularization), a framework that allows the efficient extraction and assignment of mutational signatures. RESOLVE implements a novel algorithm that enables (i) the efficient extraction, (ii) exposure estimation, and (iii) confidence assessment during the computational inference of mutational signatures.

Last updated 2 months ago

bioconductor-package

0.71 score 125 dependencies

destiny - Creates diffusion maps

Create and plot diffusion maps.

Last updated 2 months ago

bioconductor-package

3.31 score 109 dependencies

TargetDecoy - Diagnostic Plots to Evaluate the Target Decoy Approach

A first step in the data analysis of Mass Spectrometry (MS) based proteomics data is to identify peptides and proteins. With this respect the huge number of experimental mass spectra typically have to be assigned to theoretical peptides derived from a sequence database. Search engines are used for this purpose. These tools compare each of the observed spectra to all candidate theoretical spectra derived from the sequence data base and calculate a score for each comparison. The observed spectrum is then assigned to the theoretical peptide with the best score, which is also referred to as the peptide to spectrum match (PSM). It is of course crucial for the downstream analysis to evaluate the quality of these matches. Therefore False Discovery Rate (FDR) control is used to return a reliable list PSMs. The FDR, however, requires a good characterisation of the score distribution of PSMs that are matched to the wrong peptide (bad target hits). In proteomics, the target decoy approach (TDA) is typically used for this purpose. The TDA method matches the spectra to a database of real (targets) and nonsense peptides (decoys). A popular approach to generate these decoys is to reverse the target database. Hence, all the PSMs that match to a decoy are known to be bad hits and the distribution of their scores are used to estimate the distribution of the bad scoring target PSMs. A crucial assumption of the TDA is that the decoy PSM hits have similar properties as bad target hits so that the decoy PSM scores are a good simulation of the target PSM scores. Users, however, typically do not evaluate these assumptions. To this end we developed TargetDecoy to generate diagnostic plots to evaluate the quality of the target decoy method.

Last updated 2 months ago

bioconductor-package

0.71 score 103 dependencies

DeepPINCS - Protein Interactions and Networks with Compounds based on Sequences using Deep Learning

The identification of novel compound-protein interaction (CPI) is important in drug discovery. Revealing unknown compound-protein interactions is useful to design a new drug for a target protein by screening candidate compounds. The accurate CPI prediction assists in effective drug discovery process. To identify potential CPI effectively, prediction methods based on machine learning and deep learning have been developed. Data for sequences are provided as discrete symbolic data. In the data, compounds are represented as SMILES (simplified molecular-input line-entry system) strings and proteins are sequences in which the characters are amino acids. The outcome is defined as a variable that indicates how strong two molecules interact with each other or whether there is an interaction between them. In this package, a deep-learning based model that takes only sequence information of both compounds and proteins as input and the outcome as output is used to predict CPI. The model is implemented by using compound and protein encoders with useful features. The CPI model also supports other modeling tasks, including protein-protein interaction (PPI), chemical-chemical interaction (CCI), or single compounds and proteins. Although the model is designed for proteins, DNA and RNA can be used if they are represented as sequences.

Last updated 2 months ago

bioconductor-package

1.31 score 137 dependencies 2 dependents

PDATK - Pancreatic Ductal Adenocarcinoma Tool-Kit

Pancreatic ductal adenocarcinoma (PDA) has a relatively poor prognosis and is one of the most lethal cancers. Molecular classification of gene expression profiles holds the potential to identify meaningful subtypes which can inform therapeutic strategy in the clinical setting. The Pancreatic Cancer Adenocarcinoma Tool-Kit (PDATK) provides an S4 class-based interface for performing unsupervised subtype discovery, cross-cohort meta-clustering, gene-expression-based classification, and subsequent survival analysis to identify prognostically useful subtypes in pancreatic cancer and beyond. Two novel methods, Consensus Subtypes in Pancreatic Cancer (CSPC) and Pancreatic Cancer Overall Survival Predictor (PCOSP) are included for consensus-based meta-clustering and overall-survival prediction, respectively. Additionally, four published subtype classifiers and three published prognostic gene signatures are included to allow users to easily recreate published results, apply existing classifiers to new data, and benchmark the relative performance of new methods. The use of existing Bioconductor classes as input to all PDATK classes and methods enables integration with existing Bioconductor datasets, including the 21 pancreatic cancer patient cohorts available in the MetaGxPancreas data package. PDATK has been used to replicate results from Sandhu et al (2019) [https://doi.org/10.1200/cci.18.00102] and an additional paper is in the works using CSPC to validate subtypes from the included published classifiers, both of which use the data available in MetaGxPancreas. The inclusion of subtype centroids and prognostic gene signatures from these and other publications will enable researchers and clinicians to classify novel patient gene expression data, allowing the direct clinical application of the classifiers included in PDATK. Overall, PDATK provides a rich set of tools to identify and validate useful prognostic and molecular subtypes based on gene-expression data, benchmark new classifiers against existing ones, and apply discovered classifiers on novel patient data to inform clinical decision making.

Last updated 2 months ago

bioconductor-package

0.71 score 251 dependencies

densvis - Density-Preserving Data Visualization via Non-Linear Dimensionality Reduction

Implements the density-preserving modification to t-SNE and UMAP described by Narayan et al. (2020) <doi:10.1101/2020.05.12.077776>. The non-linear dimensionality reduction techniques t-SNE and UMAP enable users to summarise complex high-dimensional sequencing data such as single cell RNAseq using lower dimensional representations. These lower dimensional representations enable the visualisation of discrete transcriptional states, as well as continuous trajectory (for example, in early development). However, these methods focus on the local neighbourhood structure of the data. In some cases, this results in misleading visualisations, where the density of cells in the low-dimensional embedding does not represent the transcriptional heterogeneity of data in the original high-dimensional space. den-SNE and densMAP aim to enable more accurate visual interpretation of high-dimensional datasets by producing lower-dimensional embeddings that accurately represent the heterogeneity of the original high-dimensional space, enabling the identification of homogeneous and heterogeneous cell states. This accuracy is accomplished by including in the optimisation process a term which considers the local density of points in the original high-dimensional space. This can help to create visualisations that are more representative of heterogeneity in the original high-dimensional space.

Last updated 2 months ago

bioconductor-package

1.31 score 19 dependencies

systemPipeShiny - systemPipeShiny: An Interactive Framework for Workflow Management and Visualization

systemPipeShiny (SPS) extends the widely used systemPipeR (SPR) workflow environment with a versatile graphical user interface provided by a Shiny App. This allows non-R users, such as experimentalists, to run many systemPipeR’s workflow designs, control, and visualization functionalities interactively without requiring knowledge of R. Most importantly, SPS has been designed as a general purpose framework for interacting with other R packages in an intuitive manner. Like most Shiny Apps, SPS can be used on both local computers as well as centralized server-based deployments that can be accessed remotely as a public web service for using SPR’s functionalities with community and/or private data. The framework can integrate many core packages from the R/Bioconductor ecosystem. Examples of SPS’ current functionalities include: (a) interactive creation of experimental designs and metadata using an easy to use tabular editor or file uploader; (b) visualization of workflow topologies combined with auto-generation of R Markdown preview for interactively designed workflows; (d) access to a wide range of data processing routines; (e) and an extendable set of visualization functionalities. Complex visual results can be managed on a 'Canvas Workbench’ allowing users to organize and to compare plots in an efficient manner combined with a session snapshot feature to continue work at a later time. The present suite of pre-configured visualization examples. The modular design of SPR makes it easy to design custom functions without any knowledge of Shiny, as well as extending the environment in the future with contributions from the community.

Last updated 2 months ago

bioconductor-package

1.24 score 114 dependencies

GSEAmining - Make Biological Sense of Gene Set Enrichment Analysis Outputs

Gene Set Enrichment Analysis is a very powerful and interesting computational method that allows an easy correlation between differential expressed genes and biological processes. Unfortunately, although it was designed to help researchers to interpret gene expression data it can generate huge amounts of results whose biological meaning can be difficult to interpret. Many available tools rely on the hierarchically structured Gene Ontology (GO) classification to reduce reundandcy in the results. However, due to the popularity of GSEA many more gene set collections, such as those in the Molecular Signatures Database are emerging. Since these collections are not organized as those in GO, their usage for GSEA do not always give a straightforward answer or, in other words, getting all the meaninful information can be challenging with the currently available tools. For these reasons, GSEAmining was born to be an easy tool to create reproducible reports to help researchers make biological sense of GSEA outputs. Given the results of GSEA, GSEAmining clusters the different gene sets collections based on the presence of the same genes in the leadind edge (core) subset. Leading edge subsets are those genes that contribute most to the enrichment score of each collection of genes or gene sets. For this reason, gene sets that participate in similar biological processes should share genes in common and in turn cluster together. After that, GSEAmining is able to identify and represent for each cluster: - The most enriched terms in the names of gene sets (as wordclouds) - The most enriched genes in the leading edge subsets (as bar plots). In each case, positive and negative enrichments are shown in different colors so it is easy to distinguish biological processes or genes that may be of interest in that particular study.

Last updated 2 months ago

bioconductor-package

0.71 score 51 dependencies

optimalFlow - optimalFlow

Optimal-transport techniques applied to supervised flow cytometry gating.

Last updated 2 months ago

bioconductor-package

0.49 score 86 dependencies

SCANVIS - SCANVIS - a tool for SCoring, ANnotating and VISualizing splice junctions

SCANVIS is a set of annotation-dependent tools for analyzing splice junctions and their read support as predetermined by an alignment tool of choice (for example, STAR aligner). SCANVIS assesses each junction's relative read support (RRS) by relating to the context of local split reads aligning to annotated transcripts. SCANVIS also annotates each splice junction by indicating whether the junction is supported by annotation or not, and if not, what type of junction it is (e.g. exon skipping, alternative 5' or 3' events, Novel Exons). Unannotated junctions are also futher annotated by indicating whether it induces a frame shift or not. SCANVIS includes a visualization function to generate static sashimi-style plots depicting relative read support and number of split reads using arc thickness and arc heights, making it easy for users to spot well-supported junctions. These plots also clearly delineate unannotated junctions from annotated ones using designated color schemes, and users can also highlight splice junctions of choice. Variants and/or a read profile are also incoroporated into the plot if the user supplies variants in bed format and/or the BAM file. One further feature of the visualization function is that users can submit multiple samples of a certain disease or cohort to generate a single plot - this occurs via a "merge" function wherein junction details over multiple samples are merged to generate a single sashimi plot, which is useful when contrasting cohorots (eg. disease vs control).

Last updated 2 months ago

bioconductor-package

0.49 score 50 dependencies

nanotatoR -

Last updated 2 months ago

ssrch - a simple search engine

Demonstrate tokenization and a search gadget for collections of CSV files.

Last updated 2 months ago

bioconductor-package

0.49 score 41 dependencies

pathwayPCA - Integrative Pathway Analysis with Modern PCA Methodology and Gene Selection

pathwayPCA is an integrative analysis tool that implements the principal component analysis (PCA) based pathway analysis approaches described in Chen et al. (2008), Chen et al. (2010), and Chen (2011). pathwayPCA allows users to: (1) Test pathway association with binary, continuous, or survival phenotypes. (2) Extract relevant genes in the pathways using the SuperPCA and AES-PCA approaches. (3) Compute principal components (PCs) based on the selected genes. These estimated latent variables represent pathway activities for individual subjects, which can then be used to perform integrative pathway analysis, such as multi-omics analysis. (4) Extract relevant genes that drive pathway significance as well as data corresponding to these relevant genes for additional in-depth analysis. (5) Perform analyses with enhanced computational efficiency with parallel computing and enhanced data safety with S4-class data objects. (6) Analyze studies with complex experimental designs, with multiple covariates, and with interaction effects, e.g., testing whether pathway association with clinical phenotype is different between male and female subjects. Citations: Chen et al. (2008) <https://doi.org/10.1093/bioinformatics/btn458>; Chen et al. (2010) <https://doi.org/10.1002/gepi.20532>; and Chen (2011) <https://doi.org/10.2202/1544-6115.1697>.

Last updated 2 months ago

bioconductor-package

0.91 score 4 dependencies

hypeR - An R Package For Geneset Enrichment Workflows

An R Package for Geneset Enrichment Workflows.

Last updated 2 months ago

bioconductor-package

1.58 score 93 dependencies

mixOmics - Omics Data Integration Project

Multivariate methods are well suited to large omics data sets where the number of variables (e.g. genes, proteins, metabolites) is much larger than the number of samples (patients, cells, mice). They have the appealing properties of reducing the dimension of the data by using instrumental variables (components), which are defined as combinations of all variables. Those components are then used to produce useful graphical outputs that enable better understanding of the relationships and correlation structures between the different data sets that are integrated. mixOmics offers a wide range of multivariate methods for the exploration and integration of biological datasets with a particular focus on variable selection. The package proposes several sparse multivariate models we have developed to identify the key variables that are highly correlated, and/or explain the biological outcome of interest. The data that can be analysed with mixOmics may come from high throughput sequencing technologies, such as omics data (transcriptomics, metabolomics, proteomics, metagenomics etc) but also beyond the realm of omics (e.g. spectral imaging). The methods implemented in mixOmics can also handle missing values without having to delete entire rows with missing data. A non exhaustive list of methods include variants of generalised Canonical Correlation Analysis, sparse Partial Least Squares and sparse Discriminant Analysis. Recently we implemented integrative methods to combine multiple data sets: N-integration with variants of Generalised Canonical Correlation Analysis and P-integration with variants of multi-group Partial Least Squares.

Last updated 2 months ago

bioconductor-package

3.43 score 56 dependencies 19 dependents

LRBaseDbi - DBI to construct LRBase-related package

Interface to construct LRBase package (LRBase.XXX.eg.db).

Last updated 2 months ago

bioconductor-package

0.82 score 38 dependencies

RandomWalkRestartMH -

Last updated 2 months ago

RSeqAn - R SeqAn

Headers and some wrapper functions from the SeqAn C++ library for ease of usage in R.

Last updated 2 months ago

bioconductor-package

0.91 score 1 dependencies 1 dependents

MAGeCKFlute - Integrative Analysis Pipeline for Pooled CRISPR Functional Genetic Screens

CRISPR (clustered regularly interspaced short palindrome repeats) coupled with nuclease Cas9 (CRISPR/Cas9) screens represent a promising technology to systematically evaluate gene functions. Data analysis for CRISPR/Cas9 screens is a critical process that includes identifying screen hits and exploring biological functions for these hits in downstream analysis. We have previously developed two algorithms, MAGeCK and MAGeCK-VISPR, to analyze CRISPR/Cas9 screen data in various scenarios. These two algorithms allow users to perform quality control, read count generation and normalization, and calculate beta score to evaluate gene selection performance. In downstream analysis, the biological functional analysis is required for understanding biological functions of these identified genes with different screening purposes. Here, We developed MAGeCKFlute for supporting downstream analysis. MAGeCKFlute provides several strategies to remove potential biases within sgRNA-level read counts and gene-level beta scores. The downstream analysis with the package includes identifying essential, non-essential, and target-associated genes, and performing biological functional category analysis, pathway enrichment analysis and protein complex enrichment analysis of these genes. The package also visualizes genes in multiple ways to benefit users exploring screening data. Collectively, MAGeCKFlute enables accurate identification of essential, non-essential, and targeted genes, as well as their related biological functions. This vignette explains the use of the package and demonstrates typical workflows.

Last updated 2 months ago

bioconductor-package

1.00 score 140 dependencies 1 dependents

tenXplore -

Last updated 2 months ago

BASiCS - Bayesian Analysis of Single-Cell Sequencing data

Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model to perform statistical analyses of single-cell RNA sequencing datasets in the context of supervised experiments (where the groups of cells of interest are known a priori, e.g. experimental conditions or cell types). BASiCS performs built-in data normalisation (global scaling) and technical noise quantification (based on spike-in genes). BASiCS provides an intuitive detection criterion for highly (or lowly) variable genes within a single group of cells. Additionally, BASiCS can compare gene expression patterns between two or more pre-specified groups of cells. Unlike traditional differential expression tools, BASiCS quantifies changes in expression that lie beyond comparisons of means, also allowing the study of changes in cell-to-cell heterogeneity. The latter can be quantified via a biological over-dispersion parameter that measures the excess of variability that is observed with respect to Poisson sampling noise, after normalisation and technical noise removal. Due to the strong mean/over-dispersion confounding that is typically observed for scRNA-seq datasets, BASiCS also tests for changes in residual over-dispersion, defined by residual values with respect to a global mean/over-dispersion trend.

Last updated 2 months ago

bioconductor-package

1.64 score 134 dependencies 1 dependents

DMCHMM - Differentially Methylated CpG using Hidden Markov Model

A pipeline for identifying differentially methylated CpG sites using Hidden Markov Model in bisulfite sequencing data. DNA methylation studies have enabled researchers to understand methylation patterns and their regulatory roles in biological processes and disease. However, only a limited number of statistical approaches have been developed to provide formal quantitative analysis. Specifically, a few available methods do identify differentially methylated CpG (DMC) sites or regions (DMR), but they suffer from limitations that arise mostly due to challenges inherent in bisulfite sequencing data. These challenges include: (1) that read-depths vary considerably among genomic positions and are often low; (2) both methylation and autocorrelation patterns change as regions change; and (3) CpG sites are distributed unevenly. Furthermore, there are several methodological limitations: almost none of these tools is capable of comparing multiple groups and/or working with missing values, and only a few allow continuous or multiple covariates. The last of these is of great interest among researchers, as the goal is often to find which regions of the genome are associated with several exposures and traits. To tackle these issues, we have developed an efficient DMC identification method based on Hidden Markov Models (HMMs) called “DMCHMM” which is a three-step approach (model selection, prediction, testing) aiming to address the aforementioned drawbacks.

Last updated 2 months ago

bioconductor-package

0.71 score 58 dependencies

microbiome - Microbiome Analytics

Utilities for microbiome analysis.

Last updated 2 months ago

bioconductor-package

1.95 score 84 dependencies 3 dependents

Rhdf5lib - hdf5 library as an R package

Provides C and C++ hdf5 libraries.

Last updated 2 months ago

bioconductor-package

8.94 score 0 dependencies 321 dependents

msgbsR -

Last updated 2 months ago

goSTAG - A tool to use GO Subtrees to Tag and Annotate Genes within a set

Gene lists derived from the results of genomic analyses are rich in biological information. For instance, differentially expressed genes (DEGs) from a microarray or RNA-Seq analysis are related functionally in terms of their response to a treatment or condition. Gene lists can vary in size, up to several thousand genes, depending on the robustness of the perturbations or how widely different the conditions are biologically. Having a way to associate biological relatedness between hundreds and thousands of genes systematically is impractical by manually curating the annotation and function of each gene. Over-representation analysis (ORA) of genes was developed to identify biological themes. Given a Gene Ontology (GO) and an annotation of genes that indicate the categories each one fits into, significance of the over-representation of the genes within the ontological categories is determined by a Fisher's exact test or modeling according to a hypergeometric distribution. Comparing a small number of enriched biological categories for a few samples is manageable using Venn diagrams or other means for assessing overlaps. However, with hundreds of enriched categories and many samples, the comparisons are laborious. Furthermore, if there are enriched categories that are shared between samples, trying to represent a common theme across them is highly subjective. goSTAG uses GO subtrees to tag and annotate genes within a set. goSTAG visualizes the similarities between the over-representation of DEGs by clustering the p-values from the enrichment statistical tests and labels clusters with the GO term that has the most paths to the root within the subtree generated from all the GO terms in the cluster.

Last updated 2 months ago

bioconductor-package

0.82 score 63 dependencies

ChIPexoQual - ChIPexoQual

Package with a quality control pipeline for ChIP-exo/nexus data.

Last updated 2 months ago

bioconductor-package

0.91 score 133 dependencies

mimager - mimager: The Microarray Imager

Easily visualize and inspect microarrays for spatial artifacts.

Last updated 2 months ago

bioconductor-package

0.91 score 66 dependencies

MoonlightR - Identify oncogenes and tumor suppressor genes from omics data

Motivation: The understanding of cancer mechanism requires the identification of genes playing a role in the development of the pathology and the characterization of their role (notably oncogenes and tumor suppressors). Results: We present an R/bioconductor package called MoonlightR which returns a list of candidate driver genes for specific cancer types on the basis of TCGA expression data. The method first infers gene regulatory networks and then carries out a functional enrichment analysis (FEA) (implementing an upstream regulator analysis, URA) to score the importance of well-known biological processes with respect to the studied cancer type. Eventually, by means of random forests, MoonlightR predicts two specific roles for the candidate driver genes: i) tumor suppressor genes (TSGs) and ii) oncogenes (OCGs). As a consequence, this methodology does not only identify genes playing a dual role (e.g. TSG in one cancer type and OCG in another) but also helps in elucidating the biological processes underlying their specific roles. In particular, MoonlightR can be used to discover OCGs and TSGs in the same cancer type. This may help in answering the question whether some genes change role between early stages (I, II) and late stages (III, IV) in breast cancer. In the future, this analysis could be useful to determine the causes of different resistances to chemotherapeutic treatments.

Last updated 2 months ago

bioconductor-package

1.38 score 183 dependencies

MAST - Model-based Analysis of Single Cell Transcriptomics

Methods and models for handling zero-inflated single cell assay data.

Last updated 2 months ago

bioconductor-package

2.09 score 63 dependencies 5 dependents

Pi -

Last updated 2 months ago

ASpli - Analysis of Alternative Splicing Using RNA-Seq

Integrative pipeline for the analysis of alternative splicing using RNAseq.

Last updated 2 months ago

bioconductor-package

2.18 score 164 dependencies 1 dependents

MGFR - Marker Gene Finder in RNA-seq data

The package is designed to detect marker genes from RNA-seq data.

Last updated 2 months ago

bioconductor-package

1.08 score 65 dependencies 1 dependents

CONFESS - Cell OrderiNg by FluorEScence Signal

Single Cell Fluidigm Spot Detector.

Last updated 2 months ago

bioconductor-package

0.91 score 141 dependencies

sscu - Strength of Selected Codon Usage

The package calculates the indexes for selective stength in codon usage in bacteria species. (1) The package can calculate the strength of selected codon usage bias (sscu, also named as s_index) based on Paul Sharp's method. The method take into account of background mutation rate, and focus only on four pairs of codons with universal translational advantages in all bacterial species. Thus the sscu index is comparable among different species. (2) The package can detect the strength of translational accuracy selection by Akashi's test. The test tabulating all codons into four categories with the feature as conserved/variable amino acids and optimal/non-optimal codons. (3) Optimal codon lists (selected codons) can be calculated by either op_highly function (by using the highly expressed genes compared with all genes to identify optimal codons), or op_corre_CodonW/op_corre_NCprime function (by correlative method developed by Hershberg & Petrov). Users will have a list of optimal codons for further analysis, such as input to the Akashi's test. (4) The detailed codon usage information, such as RSCU value, number of optimal codons in the highly/all gene set, as well as the genomic gc3 value, can be calculate by the optimal_codon_statistics and genomic_gc3 function. (5) Furthermore, we added one test function low_frequency_op in the package. The function try to find the low frequency optimal codons, among all the optimal codons identified by the op_highly function.

Last updated 2 months ago

bioconductor-package

0.82 score 28 dependencies

SwathXtend -

Last updated 2 months ago

SC3 - Single-Cell Consensus Clustering

A tool for unsupervised clustering and analysis of single cell RNA-Seq data.

Last updated 2 months ago

bioconductor-package

1.45 score 98 dependencies 1 dependents

transcriptR - An Integrative Tool for ChIP- And RNA-Seq Based Primary Transcripts Detection and Quantification

The differences in the RNA types being sequenced have an impact on the resulting sequencing profiles. mRNA-seq data is enriched with reads derived from exons, while GRO-, nucRNA- and chrRNA-seq demonstrate a substantial broader coverage of both exonic and intronic regions. The presence of intronic reads in GRO-seq type of data makes it possible to use it to computationally identify and quantify all de novo continuous regions of transcription distributed across the genome. This type of data, however, is more challenging to interpret and less common practice compared to mRNA-seq. One of the challenges for primary transcript detection concerns the simultaneous transcription of closely spaced genes, which needs to be properly divided into individually transcribed units. The R package transcriptR combines RNA-seq data with ChIP-seq data of histone modifications that mark active Transcription Start Sites (TSSs), such as, H3K4me3 or H3K9/14Ac to overcome this challenge. The advantage of this approach over the use of, for example, gene annotations is that this approach is data driven and therefore able to deal also with novel and case specific events. Furthermore, the integration of ChIP- and RNA-seq data allows the identification all known and novel active transcription start sites within a given sample.

Last updated 2 months ago

bioconductor-package

1.00 score 142 dependencies

Chicago - CHiCAGO: Capture Hi-C Analysis of Genomic Organization

A pipeline for analysing Capture Hi-C data.

Last updated 2 months ago

bioconductor-package

1.00 score 67 dependencies

SNPhood - SNPhood: Investigate, quantify and visualise the epigenomic neighbourhood of SNPs using NGS data

To date, thousands of single nucleotide polymorphisms (SNPs) have been found to be associated with complex traits and diseases. However, the vast majority of these disease-associated SNPs lie in the non-coding part of the genome, and are likely to affect regulatory elements, such as enhancers and promoters, rather than function of a protein. Thus, to understand the molecular mechanisms underlying genetic traits and diseases, it becomes increasingly important to study the effect of a SNP on nearby molecular traits such as chromatin environment or transcription factor (TF) binding. Towards this aim, we developed SNPhood, a user-friendly *Bioconductor* R package to investigate and visualize the local neighborhood of a set of SNPs of interest for NGS data such as chromatin marks or transcription factor binding sites from ChIP-Seq or RNA- Seq experiments. SNPhood comprises a set of easy-to-use functions to extract, normalize and summarize reads for a genomic region, perform various data quality checks, normalize read counts using additional input files, and to cluster and visualize the regions according to the binding pattern. The regions around each SNP can be binned in a user-defined fashion to allow for analysis of very broad patterns as well as a detailed investigation of specific binding shapes. Furthermore, SNPhood supports the integration with genotype information to investigate and visualize genotype-specific binding patterns. Finally, SNPhood can be employed for determining, investigating, and visualizing allele-specific binding patterns around the SNPs of interest.

Last updated 2 months ago

bioconductor-package

0.71 score 102 dependencies

GeneBreak - Gene Break Detection

Recurrent breakpoint gene detection on copy number aberration profiles.

Last updated 2 months ago

bioconductor-package

0.82 score 51 dependencies

erma -

Last updated 2 months ago

kebabs - Kernel-Based Analysis of Biological Sequences

The package provides functionality for kernel-based analysis of DNA, RNA, and amino acid sequences via SVM-based methods. As core functionality, kebabs implements following sequence kernels: spectrum kernel, mismatch kernel, gappy pair kernel, and motif kernel. Apart from an efficient implementation of standard position-independent functionality, the kernels are extended in a novel way to take the position of patterns into account for the similarity measure. Because of the flexibility of the kernel formulation, other kernels like the weighted degree kernel or the shifted weighted degree kernel with constant weighting of positions are included as special cases. An annotation-specific variant of the kernels uses annotation information placed along the sequence together with the patterns in the sequence. The package allows for the generation of a kernel matrix or an explicit feature representation in dense or sparse format for all available kernels which can be used with methods implemented in other R packages. With focus on SVM-based methods, kebabs provides a framework which simplifies the usage of existing SVM implementations in kernlab, e1071, and LiblineaR. Binary and multi-class classification as well as regression tasks can be used in a unified way without having to deal with the different functions, parameters, and formats of the selected SVM. As support for choosing hyperparameters, the package provides cross validation - including grouped cross validation, grid search and model selection functions. For easier biological interpretation of the results, the package computes feature weights for all SVMs and prediction profiles which show the contribution of individual sequence positions to the prediction result and indicate the relevance of sequence sections for the learning result and the underlying biological functions.

Last updated 2 months ago

bioconductor-package

1.69 score 28 dependencies 3 dependents

groHMM - GRO-seq Analysis Pipeline

A pipeline for the analysis of GRO-seq data.

Last updated 2 months ago

bioconductor-package

1.16 score 50 dependencies

pepStat - Statistical analysis of peptide microarrays

Statistical analysis of peptide microarrays

Last updated 2 months ago

bioconductor-package

0.71 score 54 dependencies

ssviz - A small RNA-seq visualizer and analysis toolkit

Small RNA sequencing viewer

Last updated 2 months ago

bioconductor-package

0.91 score 61 dependencies

flowClean - flowClean

A quality control tool for flow cytometry data based on compositional data analysis.

Last updated 2 months ago

bioconductor-package

1.00 score 16 dependencies

ChIPQC - Quality metrics for ChIPseq data

Quality metrics for ChIPseq data.

Last updated 2 months ago

bioconductor-package

1.00 score 158 dependencies

BEAT - BEAT - BS-Seq Epimutation Analysis Toolkit

Model-based analysis of single-cell methylation data

Last updated 2 months ago

bioconductor-package

0.91 score 62 dependencies

GeneOverlap - Test and visualize gene overlaps

Test two sets of gene lists and visualize the results.

Last updated 2 months ago

bioconductor-package

1.31 score 6 dependencies 1 dependents

omicade4 - Multiple co-inertia analysis of omics datasets

This package performes multiple co-inertia analysis of omics datasets.

Last updated 2 months ago

bioconductor-package

2.86 score 42 dependencies 1 dependents

qusage - qusage: Quantitative Set Analysis for Gene Expression

This package is an implementation the Quantitative Set Analysis for Gene Expression (QuSAGE) method described in (Yaari G. et al, Nucl Acids Res, 2013). This is a novel Gene Set Enrichment-type test, which is designed to provide a faster, more accurate, and easier to understand test for gene expression studies. qusage accounts for inter-gene correlations using the Variance Inflation Factor technique proposed by Wu et al. (Nucleic Acids Res, 2012). In addition, rather than simply evaluating the deviation from a null hypothesis with a single number (a P value), qusage quantifies gene set activity with a complete probability density function (PDF). From this PDF, P values and confidence intervals can be easily extracted. Preserving the PDF also allows for post-hoc analysis (e.g., pair-wise comparisons of gene set activity) while maintaining statistical traceability. Finally, while qusage is compatible with individual gene statistics from existing methods (e.g., LIMMA), a Welch-based method is implemented that is shown to improve specificity. The QuSAGE package also includes a mixed effects model implementation, as described in (Turner JA et al, BMC Bioinformatics, 2015), and a meta-analysis framework as described in (Meng H, et al. PLoS Comput Biol. 2019). For questions, contact Chris Bolen ([email protected]) or Steven Kleinstein ([email protected])

Last updated 2 months ago

bioconductor-package

1.95 score 11 dependencies 1 dependents

SigFuge - SigFuge

Algorithm for testing significance of clustering in RNA-seq data.

Last updated 2 months ago

bioconductor-package

0.91 score 49 dependencies

ROntoTools - R Onto-Tools suite

Suite of tools for functional analysis.

Last updated 2 months ago

bioconductor-package

2.50 score 27 dependencies 2 dependents

SPEM - S-system parameter estimation method

This package can optimize the parameter in S-system models given time series data

Last updated 2 months ago

bioconductor-package

2.89 score 4 dependencies 1 dependents

RNASeqPower - Sample size for RNAseq studies

RNA-seq, sample size

Last updated 2 months ago

bioconductor-package

3.52 score 0 dependencies

deltaGseg - deltaGseg

Identifying distinct subpopulations through multiscale time series analysis

Last updated 2 months ago

bioconductor-package

0.82 score 48 dependencies

iBMQ - integrated Bayesian Modeling of eQTL data

integrated Bayesian Modeling of eQTL data

Last updated 2 months ago

bioconductor-package

0.82 score 30 dependencies

bumphunter - Bump Hunter

Tools for finding bumps in genomic data

Last updated 2 months ago

bioconductor-package

4.99 score 76 dependencies 51 dependents

illuminaio - Parsing Illumina Microarray Output Files

Tools for parsing Illumina's microarray output files, including IDAT.

Last updated 2 months ago

bioconductor-package

4.63 score 4 dependencies 44 dependents

VariantTools - Tools for Exploratory Analysis of Variant Calls

Explore, diagnose, and compare variant calls using filters.

Last updated 2 months ago

bioconductor-package

1.31 score 70 dependencies 1 dependents

HTSeqGenie - A NGS analysis pipeline.

Libraries to perform NGS analysis.

Last updated 2 months ago

bioconductor-package

1.08 score 85 dependencies

CNORode - ODE add-on to CellNOptR

Logic based ordinary differential equation (ODE) add-on to CellNOptR.

Last updated 2 months ago

bioconductor-package

1.75 score 63 dependencies 1 dependents

agilp - Agilent expression array processing package

More about what it does (maybe more than one line)

Last updated 2 months ago

bioconductor-package

1.58 score 0 dependencies

hpar - Human Protein Atlas in R

The hpar package provides a simple R interface to and data from the Human Protein Atlas project.

Last updated 2 months ago

bioconductor-package

2.22 score 60 dependencies 1 dependents

gwascat - representing and modeling data in the EMBL-EBI GWAS catalog

Represent and model data in the EMBL-EBI GWAS catalog.

Last updated 2 months ago

bioconductor-package

2.09 score 99 dependencies 2 dependents

easyRNASeq -

Last updated 2 months ago

BiocGenerics - S4 generic functions used in Bioconductor

The package defines many S4 generic functions used in Bioconductor.

Last updated 2 months ago

bioconductor-package

14.57 score 0 dependencies 2182 dependents

minfi - Analyze Illumina Infinium DNA methylation arrays

Tools to analyze & visualize Illumina Infinium methylation arrays.

Last updated 2 months ago

bioconductor-package

4.44 score 129 dependencies 34 dependents

VariantAnnotation - Annotation of Genetic Variants

Annotate variants, compute amino acid coding changes, predict coding outcomes.

Last updated 2 months ago

bioconductor-package

7.20 score 69 dependencies 152 dependents

nucleR - Nucleosome positioning package for R

Nucleosome positioning for Tiling Arrays and NGS experiments.

Last updated 2 months ago

bioconductor-package

1.16 score 80 dependencies

ReadqPCR - Read qPCR data

The package provides functions to read raw RT-qPCR data of different platforms.

Last updated 2 months ago

bioconductor-package

2.83 score 2 dependencies 1 dependents

genomes - Genome sequencing project metadata

Download genome and assembly reports from NCBI

Last updated 2 months ago

bioconductor-package

0.71 score 26 dependencies

PICS - Probabilistic inference of ChIP-seq

Probabilistic inference of ChIP-Seq using an empirical Bayes mixture model approach.

Last updated 2 months ago

bioconductor-package

5.60 score 42 dependencies 1 dependents

SamSPECTRAL -

Last updated 2 months ago

hyperdraw - Visualizing Hypergaphs

Functions for visualizing hypergraphs.

Last updated 2 months ago

bioconductor-package

1.38 score 4 dependencies 1 dependents

frmaTools - Frozen RMA Tools

Tools for advanced use of the frma package.

Last updated 2 months ago

bioconductor-package

0.71 score 8 dependencies

frma - Frozen RMA and Barcode

Preprocessing and analysis for single microarrays and microarray batches.

Last updated 2 months ago

bioconductor-package

3.80 score 57 dependencies 1 dependents

chipseq - chipseq: A package for analyzing chipseq data

Tools for helping process short read data for chipseq experiments.

Last updated 2 months ago

bioconductor-package

2.97 score 54 dependencies 5 dependents

HilbertVisGUI -

Last updated 2 months ago

HilbertVis - Hilbert curve visualization

Functions to visualize long vectors of integer data by means of Hilbert curves

Last updated 2 months ago

bioconductor-package

1.16 score 1 dependencies 1 dependents

BicARE - Biclustering Analysis and Results Exploration

Biclustering Analysis and Results Exploration.

Last updated 2 months ago

bioconductor-package

1.51 score 49 dependencies 2 dependents

ITALICS - ITALICS

A Method to normalize of Affymetrix GeneChip Human Mapping 100K and 500K set

Last updated 2 months ago

bioconductor-package

2.00 score 61 dependencies

RBioinf - RBioinf

Functions and datasets and examples to accompany the monograph R For Bioinformatics.

Last updated 2 months ago

bioconductor-package

0.71 score 2 dependencies

GEOmetadb -

Last updated 2 months ago

bgx - Bayesian Gene eXpression

Bayesian integrated analysis of Affymetrix GeneChips

Last updated 2 months ago

bioconductor-package

1.24 score 25 dependencies

flowViz - Visualization for flow cytometry

Provides visualization tools for flow cytometry data.

Last updated 2 months ago

bioconductor-package

4.38 score 23 dependencies 12 dependents

RbcBook1 - Support for Springer monograph on Bioconductor

tools for building book

Last updated 2 months ago

bioconductor-package

0.82 score 4 dependencies

pathRender - Render molecular pathways

build graphs from pathway databases, render them by Rgraphviz.

Last updated 2 months ago

bioconductor-package

1.08 score 42 dependencies

diffGeneAnalysis - Performs differential gene expression Analysis

Analyze microarray data

Last updated 2 months ago

bioconductor-package

1.58 score 1 dependencies

Category - Category Analysis

A collection of tools for performing category (gene set enrichment) analysis.

Last updated 2 months ago

bioconductor-package

4.59 score 51 dependencies 22 dependents

idiogram - idiogram

A package for plotting genomic data by chromosomal location

Last updated 2 months ago

bioconductor-package

0.82 score 42 dependencies

bioDist - Different distance measures

A collection of software tools for calculating distance measures.

Last updated 2 months ago

bioconductor-package

3.16 score 3 dependencies 2 dependents

MVCClass - Model-View-Controller (MVC) Classes

Creates classes used in model-view-controller (MVC) design

Last updated 2 months ago

bioconductor-package

1.16 score 0 dependencies 1 dependents

OLINgui - Graphical user interface for OLIN

Graphical user interface for the OLIN package

Last updated 2 months ago

bioconductor-package

1.24 score 9 dependencies

GraphAT - Graph Theoretic Association Tests

Functions and data used in Balasubramanian, et al. (2004)

Last updated 2 months ago

bioconductor-package

1.08 score 12 dependencies

convert - Convert Microarray Data Objects

Define coerce methods for microarray data objects.

Last updated 2 months ago

bioconductor-package

1.24 score 5 dependencies 1 dependents

altcdfenvs - alternative CDF environments (aka probeset mappings)

Convenience data structures and functions to handle cdfenvs

Last updated 2 months ago

bioconductor-package

2.05 score 26 dependencies 1 dependents

PROcess - Ciphergen SELDI-TOF Processing

A package for processing protein mass spectrometry data.

Last updated 2 months ago

bioconductor-package

2.97 score 4 dependencies

Icens - NPMLE for Censored and Truncated Data

Many functions for computing the NPMLE for censored and truncated data.

Last updated 2 months ago

bioconductor-package

2.35 score 3 dependencies 7 dependents

impute - impute: Imputation for microarray data

Imputation for microarray data (currently KNN only)

Last updated 2 months ago

bioconductor-package

6.77 score 0 dependencies 130 dependents

gcrma - Background Adjustment Using Sequence Information

Background adjustment using sequence information

Last updated 2 months ago

bioconductor-package

6.44 score 23 dependencies 12 dependents

widgetTools - Creates an interactive tcltk widget

This packages contains tools to support the construction of tcltk widgets

Last updated 2 months ago

bioconductor-package

2.35 score 0 dependencies 8 dependents

graph - graph: A package to handle graph data structures

A package that implements some simple graph handling capabilities.

Last updated 2 months ago

bioconductor-package

9.42 score 1 dependencies 339 dependents

preprocessCore - A collection of pre-processing functions

A library of core preprocessing routines.

Last updated 2 months ago

bioconductor-package

1 stars 8.94 score 0 dependencies 222 dependents

tkWidgets - R based tk widgets

Widgets to provide user interfaces. tcltk should have been installed for the widgets to run.

Last updated 2 months ago

bioconductor-package

2.05 score 2 dependencies 6 dependents

ROC - utilities for ROC, with microarray focus

Provide utilities for ROC, with microarray focus.

Last updated 2 months ago

bioconductor-package

2.53 score 5 dependencies 10 dependents

ctc - Cluster and Tree Conversion.

Tools for export and import classification trees and clusters to other programs

Last updated 2 months ago

bioconductor-package

2.09 score 1 dependencies 2 dependents

geneplotter - Graphics related functions for Bioconductor

Functions for plotting genomic data

Last updated 2 months ago

bioconductor-package

3.76 score 43 dependencies 12 dependents

genefilter - genefilter: methods for filtering genes from high-throughput experiments

Some basic functions for filtering genes.

Last updated 2 months ago

bioconductor-package

8.38 score 46 dependencies 155 dependents

annotate - Annotation for microarrays

Using R enviroments for annotation.

Last updated 2 months ago

bioconductor-package

8.34 score 40 dependencies 253 dependents

DynDoc - Dynamic document tools

A set of functions to create and interact with dynamic documents and vignettes.

Last updated 2 months ago

bioconductor-package

2.14 score 0 dependencies 7 dependents

Biobase - Biobase: Base functions for Bioconductor

Functions that are needed by many other packages or which replace R functions.

Last updated 2 months ago

bioconductor-package

13.84 score 1 dependencies 1744 dependents

enhancerHomologSearch -

Last updated 2 years ago

RadioGx -

Last updated 4 years ago