An introduction to the TRONCO R package

The TRONCO (TRanslational ONCOlogy) package collects algorithms to infer progression models via the approach of Suppes-Bayes Causal Network, both from an ensemble of tumors (cross-sectional samples) and within an individual patient (multi-region or single-cell samples). The package provides parallel implementation of algorithms that process binary matrices where each row represents a tumor sample and each column a single-nucleotide or a structural variant driving the progression; a 0/1 value models the absence/presence of that alteration in the sample. The tool can import data from plain, MAF or GISTIC format files, and can fetch it from the cBioPortal for cancer genomics. Functions for data manipulation and visualization are provided, as well as functions to import/export such data to other bioinformatics tools for, e.g, clustering or detection of mutually exclusive alterations. Inferred models can be visualized and tested for their confidence via bootstrap and cross-validation. TRONCO is used for the implementation of the Pipeline for Cancer Inference.

In this vignette, we will give an overview of the package by presenting some of the functions that could be most commonly used to arrange a data-analysis pipeline, along with their parameters to customize TRONCO’s functioning. Advanced example case studies are available at the tool webpage

Changelog

  • [2.31.5] Update vignette.
  • [2.31.4] Fix error on the usate of order function.
  • [2.8.1] Minor fix on documentation.
  • [2.7.7] RNA Seq validation. Random restart on Hill Climbing added to CAPRI algorithm. Minor fixes to algorithms and error model.
  • [2.7.3] Development version. Assignment to .GlobalEnv removed.
  • [2.6.1] Current stable version.
  • [2.5.3] New algorithms: Edmonds, Gabow, Chow-Liu and Prim. New scores: PMI, CPMI, MI.
  • [2.4.3] Bugfix.
  • [2.4.2] Implements a noise model and finalizes a series of algorithms reconstructing Suppes-Bayes Causal Network as maximum spanning trees.
  • [2.4] New statistics available for model confidence via cross-validation routines. New algorithms based on Minimum Spanning Tree extraction.
  • [2.0] Released in summer 2015 on our GitHUB, replaced the version in autumn 2015. This version is parallel, includes also the CAPRI algorithm, supports common GISTIC and MAF input formats, supports TCGA samples editing and queries to the cBio portal. This version has new plotting capabilities, and a general from-scratch design. It is not compatible with previous releases.
  • [1.0] released in mid 2014, includes CAPRESE algorithm. It is now outdated and no more maintained;