NEWS
GSVA 2.00
USER VISIBLE CHANGES
- Added a missing data policy for the GSVA and the ssGSEA methods by which missing values in the input expression data either propagate throughout calculations (use="everything"), prompt an error (use="all.obs"), or are discarded from calculations (use="na.rm").
- The readGMT() function now returns also a GeneSetCollection object, and can automatically identify the type of feature/gene identifier used in the GMT file, adding it to the metadata of the return object.
- Mapping between different types of feature/gene identifiers between gene sets and expression data has been improved.
- Added a new default for the kcdf parameter in the GSVA method, kcdf='auto', which tries to select the best value for this parameter, according to the input data.
- Added a new sparse regime of the GSVA algorithm, specifically tailored for expression data stored in sparse matrices, such as single-cell data, to address some of the shortcomings of the original (classical) algorithm with this type of data.
- Added two new methods, gsvaRanks() and gsvaScores(), to run the GSVA algorithm in two steps that provide additional flexibility to score gene sets and extract enrichment information.
- Added a new method gsvaEnrichment() to be used after gsvaRanks(), which allows one to extract enrichment information for a particular gene set and sample/column, including the possibility to produce an enrichment plot using base R or ggplot2 graphics.
- Package messages are now displayed using the cli package, changing and removing some of them, as well as adding other ones, to improve the information reported to the user.
BUG FIXES
- Removed API legacy convenience features, which keep causing problems, as per request of CRAN maintainers.
- Fixed some bugs in the shiny app.
GSVA 1.52
USER VISIBLE CHANGES
- Moved old API from deprecated to defunct; see vignette and help pages for examples on how to use the new API.
- Documentation fixes.
- Gene/feature filtering is not based anymore on floating point arithmetic, but on comparing minimum and maximum values. In the case of expression data stored in dgCMatrix objects, this criterion applies to non-zero values only.
- Parameter objects have nicer show methods.
- Resulting enrichment score matrices are now always dense.
- Added a first version of the support to SpatialExperiment objects. At the moment, only GSVA scores are calculated without using spatial coordinates information.
- When expression data is stored in an input sparse matrix of class dgCMatrix, now it is handled as such a sparse matrix as much as possible to reduce memory consumption.
- Added geneSets() and geneSetSizes() methods that allow one to retrieve, respectively, the filtered collection of gene sets and their sizes from either the parameter object or the resulting object output by the gsva() function. This allows the user to more easily give this information to analysis pipelines that exploit it, such as limma-trend; see the vignette for an example.
- Added functions readGMT() and deduplicateGeneSets() to read GMT files, handling the case when the file contains gene sets with duplicated names; see the corresponding help pages for more information.
- The underlying code of the ssGSEA method has been optimized after discussion on https://github.com/rcastelo/GSVA/issues/71 and now it runs one order of magnitude faster and consumes one order of magnitude less memory.
BUG FIXES
- Bugfix on accessing the assay names of a SingleCellExperiment object from the 'gsva()' function.
- Bugfix on a rare combination of input parameter for gsvaParam().
GSVA 1.50
USER VISIBLE CHANGES
- The API has changed. The main function remains under the same name 'gsva()', but the way in which is called is different. From this release, it has three parameters only: the first is a parameter object whose class depends on the method to be used, the second is a flag to set verbosity and the third controls the parallelization of the calculations. The old way of using 'gsva()' has been deprecated, which means that during this release, the user may still use the old API, but will get a deprecation warning message. In the next release, the old way of using 'gsva()' will become defunct and prompt an error. Please check the help page of 'gsva()' for details.
BUG FIXES
- Bugfix for https://github.com/rcastelo/GSVA/issues/88 to correctly deal with a GeneSetCollection object as input gene sets, when the input expression data is a SingleCellExperiment object.
- Bugfix for https://github.com/rcastelo/GSVA/issues/90 to enable working with long vectors in the calls to C code by the GSVA algorithm.
GSVA 1.46
BUG FIXES
- Bugfix for https://github.com/rcastelo/GSVA/issues/61 to enable using the ssgsea method with one single column (sample) in the input data container.
- Bugfix when input is a SummarizedExperiment and assays contain a data.frame instead of a matrix.
GSVA 1.44
BUG FIXES
- Bugfix for https://github.com/rcastelo/GSVA/issues/54 to force filtering genes with constant expression behaving the same regardless of the delayed or non-delayed nature of the data container.
GSVA 1.40
USER VISIBLE CHANGES
- The vignette has been rewritten in R Markdown to produce an HTML vignette page and make it shorter and faster to produce.
- Development of a shiny app available through the function 'igsva()'.
BUG FIXES
- Replaced fastmatch::fmatch() by IRanges::match,CharacterList-method after disscussion at https://github.com/rcastelo/GSVA/issues/39 to avoid the row names of an input expression matrix being altered by fastmatch::fmatch() adding an attribute.
- Fixed wrong call to .mapGeneSetsToFeatures() when gene sets are given in a GeneSetCollection object.
GSVA 1.36
USER VISIBLE CHANGES
- Improved management of parallel calculations by using the BiocParallel package. This facilitates calculating GSVA scores from gene expression data matrices with thousands of samples. See arguments 'parallel.sz' and 'BPPARAM' in the manual page of the 'gsva()' function.
- Improved implementation of the SSGSEA method ('method="ssgsea" in the call to 'gsva()') that makes calculations with this option about one order of magnitude faster. Improvement thanks to Alexey Sergushichev (https://github.com/rcastelo/GSVA/pull/15).
- The function 'gsva()' now issues a warning when one or more gene sets are singletons, i.e., they are formed by just one feature/gene.
GSVA 1.34
BUG FIXES
- Bugfix to handle when parallel::detectCores() returns NA instead of an integer number of cores, which may happen when running GSVA in a docker container. Bug reporting and pull request fix thanks to Aaron (https://github.com/rcastelo/GSVA/pull/10).
- Bugfix to handle when arguments 'method="ssgsea"' and 'tau=0'. Bug reporting thanks to Lena Morill (https://github.com/rcastelo/GSVA/issues/4).
GSVA 1.28
USER VISIBLE CHANGES
- Arguments 'rnaseq', 'kernel', 'no.bootstraps' and 'bootstrap.percent' have become defunct.
- A Bioconductor sticker has been created and it is available at https://github.com/Bioconductor/BiocStickers/tree/master/GSVA
GSVA 1.26
USER VISIBLE CHANGES
- Updated implementation of the option 'abs.ranking=TRUE' to use the original Kuiper statistic.
- Arguments 'rnaseq' and 'kernel' have been deprecated and replaced by a new argument 'kcdf'.
- Arguments 'no.bootstraps' and 'bootstrap.percent' have been deprecated.
- The return value with the default argument 'method="gsva"' has been simplified and it is not a list object anymore. Now the 'gsva()' function return always a matrix or an 'ExpressionSet' object, when the input expression data is also an 'ExpressionSet' object.
- The 'gsva()' function can now be used through a shiny app that runs through the function 'igsva()'.
GSVA 1.24
BUG FIXES
- Bugfixes on the parallel execution of gsva() with bootstrap calculations.
GSVA 1.14
USER VISIBLE CHANGES
- added an argument 'ssgsea.norm' to the 'gsva()' function to enable disabling the default score normalization of the original SSGSEA method by Barbie et al. (2009).
BUG FIXES
- Better error handling of the situation when no gene identifiers match between gene sets and expression data.
GSVA 1.4
USER VISIBLE CHANGES
- removed the system-requirement dependency from the GNU Scientific Library
- added two additional gene-set expression summarization methods: single-sample GSEA from Barbie et al. (Nature, 2009) and a combined Z-score method similar to the one used by Lee et al. (PLos Comp Biol, 2008) via a new 'method' argument in the 'gsva()' function
- added handling of RNA-seq expression data matrices by the GSVA method with a new 'rnaseq' argument in the 'gsva()' function
- added a method with signature(expr="matrix", gset.idx.list="GeneSetCollection", annotation="character") which did not exist before. Now gsva() accepts the following pairs of data structures storing expression data and gene sets: ExpressionSet-GeneSetCollection, ExpressionSet-list, matrix-GeneSetCollection and matrix-list
BUG FIXES
- matching of gene IDs from ExpressionSet objects to GeneSetCollection objects now also works with Entrez-based gene IDs in ExpressionSet objects (e.g., when annotation(eset) == "org.Hs.eg.db") by using GSEABase >= 1.17.4
GSVA 0.9
USER VISIBLE CHANGES
- first version of the package
(start date: 18 Feburary, 2011)