Package 'glmSparseNet' reference manual

Title:	Network Centrality Metrics for Elastic-Net Regularized Models
Description:	glmSparseNet is an R-package that generalizes sparse regression models when the features (e.g. genes) have a graph structure (e.g. protein-protein interactions), by including network-based regularizers. glmSparseNet uses the glmnet R-package, by including centrality measures of the network as penalty weights in the regularization. The current version implements regularization based on node degree, i.e. the strength and/or number of its associated edges, either by promoting hubs in the solution or orphan genes in the solution. All the glmnet distribution families are supported, namely "gaussian", "poisson", "binomial", "multinomial", "cox", and "mgaussian".
Authors:	André Veríssimo [aut, cre] , Susana Vinga [aut], Eunice Carrasquinha [ctb], Marta Lopes [ctb]
Maintainer:	André Veríssimo <[email protected]>
License:	GPL-3
Version:	1.25.0
Built:	2025-03-29 05:36:32 UTC
Source:	https://github.com/bioc/glmSparseNet

Change base dir for '.runCache

Description

Change base dir for '.runCache

Usage

.baseDir(path = NULL)
.baseDir(path = NULL)

Arguments

path

to base directory where cache is saved

Value

the new path

Examples

glmSparseNet:::.baseDir("/tmp/cache")
glmSparseNet:::.baseDir("/tmp/cache")

Common call to biomaRt to avoid repetitive code

Description

Common call to biomaRt to avoid repetitive code

Usage

.biomartLoad(attributes, filters, values, useCache, verbose)
.biomartLoad(attributes, filters, values, useCache, verbose)

Arguments

`attributes`	Attributes you want to retrieve. A possible list of attributes can be retrieved using the function biomaRt::listAttributes.
`filters`	Filters (one or more) that should be used in the query. A possible list of filters can be retrieved using the function biomaRt::listFilters.
`values`	Values of the filter, e.g. vector of affy IDs. If multiple filters are specified then the argument should be a list of vectors of which the position of each vector corresponds to the position of the filters in the filters argument
`useCache`	Boolean indicating if biomaRt cache should be used
`verbose`	When using biomaRt in webservice mode and setting verbose to TRUE, the XML query to the webservice will be printed.

Value

data.frame with attributes as columns and values translated to them

Examples

glmSparseNet:::.biomartLoad(
    attributes = c("external_gene_name", "ensembl_gene_id"),
    filters = "external_gene_name",
    values = c("MOB1A", "RFLNB", "SPIC", "TP53"),
    useCache = TRUE,
    verbose = FALSE
)
glmSparseNet:::.biomartLoad(
    attributes = c("external_gene_name", "ensembl_gene_id"),
    filters = "external_gene_name",
    values = c("MOB1A", "RFLNB", "SPIC", "TP53"),
    useCache = TRUE,
    verbose = FALSE
)

Build digest of function from the actual code

Description

Build digest of function from the actual code

Usage

.buildFunctionDigest(fun)
.buildFunctionDigest(fun)

Arguments

fun

function call name

Value

a digest

Examples

glmSparseNet:::.buildFunctionDigest(sum)
glmSparseNet:::.buildFunctionDigest(c)
glmSparseNet:::.buildFunctionDigest(sum)
glmSparseNet:::.buildFunctionDigest(c)

Change cache.compression for run_cache

Description

Change cache.compression for run_cache

Usage

.cacheCompression(compression = NULL)
.cacheCompression(compression = NULL)

Arguments

compression

see compression parameter in save function

Value

the new compression

Examples

glmSparseNet:::.cacheCompression("bzip2")
glmSparseNet:::.cacheCompression("bzip2")

Calculate penalty based on data

Description

Internal method to calculate the network using data-dependant methods

Usage

.calcPenalty(xdata, penaltyType, options = networkOptions())
.calcPenalty(xdata, penaltyType, options = networkOptions())

Arguments

`xdata`	input data
`penaltyType`	which method to use
`options`	options to be used

Value

vector with penalty weights

Examples

xdata <- matrix(rnorm(1000), ncol = 200)
glmSparseNet:::.calcPenalty(xdata, "none")
glmSparseNet:::.calcPenalty(
    xdata, "correlation",
    networkOptions(cutoff = .6)
)
glmSparseNet:::.calcPenalty(xdata, "correlation")
glmSparseNet:::.calcPenalty(
    xdata, "covariance",
    networkOptions(cutoff = .6)
)
glmSparseNet:::.calcPenalty(xdata, "covariance")
xdata <- matrix(rnorm(1000), ncol = 200)
glmSparseNet:::.calcPenalty(xdata, "none")
glmSparseNet:::.calcPenalty(
    xdata, "correlation",
    networkOptions(cutoff = .6)
)
glmSparseNet:::.calcPenalty(xdata, "correlation")
glmSparseNet:::.calcPenalty(
    xdata, "covariance",
    networkOptions(cutoff = .6)
)
glmSparseNet:::.calcPenalty(xdata, "covariance")

Calculate/load result and save if necessary

Description

This is where the actual work is done

Usage

.calculateResult(path, compression, forceRecalc, showMessage, fun, ...)
.calculateResult(path, compression, forceRecalc, showMessage, fun, ...)

Arguments

`path`	path to save cache
`compression`	compression used in save
`forceRecalc`	force to recalculate cache
`showMessage`	boolean to show messages
`fun`	function to be called
`...`	arguments to said function ,

Value

result of fun(...)

Examples

glmSparseNet:::.calculateResult(
    file.path(tempdir(), "calculate_result.Rdata"),
    "gzip",
    FALSE,
    TRUE,
    sum,
    1, 2, 3
)
glmSparseNet:::.calculateResult(
    file.path(tempdir(), "calculate_result.Rdata"),
    "gzip",
    FALSE,
    TRUE,
    sum,
    1, 2, 3
)

Calculate combined score for STRINGdb interactions

Description

Please note that all the interactions have duplicates as it's a two way interaction (score(ProteinA-Protein) == score(ProteinB, PorteinA))

Usage

.combinedScore(allInteractions, scoreThreshold, removeText)
.combinedScore(allInteractions, scoreThreshold, removeText)

Arguments

`allInteractions`	table with score of all interactions
`scoreThreshold`	threshold to keep interactions
`removeText`	remove text-based interactions

Details

To better understand how the score is calculated, please see: https://string-db.org/help/faq/#how-are-the-scores-computed

Value

table with combined score

Create directories for cache

Description

Create directories for cache

Usage

.createDirectoryForCache(baseDir, parentPath)
.createDirectoryForCache(baseDir, parentPath)

Arguments

`baseDir`	tentative base dir to create.
`parentPath`	first 4 characters of digest that will become parent directory for the actual cache file (this reduces number of files per folder)

Value

a list of updated baseDir and parentDir

Examples

glmSparseNet:::.createDirectoryForCache(tempdir(), "abcd")

glmSparseNet:::.createDirectoryForCache(
    file.path(getwd(), "run-cache"), "abcd"
)

glmSparseNet:::.createDirectoryForCache(tempdir(), "abcd")

glmSparseNet:::.createDirectoryForCache(
    file.path(getwd(), "run-cache"), "abcd"
)

Workaround for bug with curl when fetching specific ensembl mirror

Description

Should be solved in issue #39, will test to remove it.

Usage

.curlWorkaround(expr)
.curlWorkaround(expr)

Arguments

expr

expression

Value

result of expression

Examples


glmSparseNet:::.curlWorkaround({
    biomaRt::useEnsembl(
        biomart = "genes",
        dataset = "hsapiens_gene_ensembl"
    )
})

glmSparseNet:::.curlWorkaround({
    biomaRt::useEnsembl(
        biomart = "genes",
        dataset = "hsapiens_gene_ensembl"
    )
})

Generic function to calculate degree based on data

Description

The assumption to use this function is that the network represented by a matrix is symetric and without any connection the node and itself.

Usage

.degreeGeneric(
  fun = stats::cor,
  funPrefix = "operator",
  xdata,
  cutoff = 0,
  considerUnweighted = FALSE,
  chunks = 1000,
  forceRecalcDegree = FALSE,
  forceRecalcNetwork = FALSE,
  nCores = 1,
  ...
)
.degreeGeneric(
  fun = stats::cor,
  funPrefix = "operator",
  xdata,
  cutoff = 0,
  considerUnweighted = FALSE,
  chunks = 1000,
  forceRecalcDegree = FALSE,
  forceRecalcNetwork = FALSE,
  nCores = 1,
  ...
)

Arguments

`fun`	function that will calculate the edge weight between 2 nodes
`funPrefix`	used to store low-level information on network as it can become to large to be stored in memory
`xdata`	calculate correlation matrix on each column
`cutoff`	positive value that determines a cutoff value
`considerUnweighted`	consider all edges as 1 if they are greater than 0
`chunks`	calculate function at batches of this value (default is 1000)
`forceRecalcDegree`	force recalculation of penalty weights (but not the network), instead of going to cache
`forceRecalcNetwork`	force recalculation of network and penalty weights, instead of going to cache
`nCores`	number of cores to be used
`...`	extra parameters for fun

Value

a vector of the degrees

Default digest method

Description

Sets a default caching algorithm to use with .runCache

Usage

.digestCache(val)
.digestCache(val)

Arguments

val

object to calculate hash over

Value

a hash of the sha256

Examples

glmSparseNet:::.digestCache(c(1, 2, 3, 4, 5))
glmSparseNet:::.digestCache("some example")
glmSparseNet:::.digestCache(c(1, 2, 3, 4, 5))
glmSparseNet:::.digestCache("some example")

Calculate the upper triu of the matrix

Description

Calculate the upper triu of the matrix

Usage

.networkGenericParallel(
  fun,
  funPrefix,
  xdata,
  buildOutput = "matrix",
  nCores = 1,
  forceRecalcNetwork = FALSE,
  showMessage = FALSE,
  ...
)
.networkGenericParallel(
  fun,
  funPrefix,
  xdata,
  buildOutput = "matrix",
  nCores = 1,
  forceRecalcNetwork = FALSE,
  showMessage = FALSE,
  ...
)

Arguments

`fun`	function that will calculate the edge weight between 2 nodes
`funPrefix`	used to store low-level information on network as it can become to large to be stored in memory
`xdata`	base data to calculate network
`buildOutput`	if output returns a 'matrix', 'vector' of the upper triu without the diagonal or NULL with any other argument
`nCores`	number of cores to be used
`forceRecalcNetwork`	force recalculation, instead of going to cache
`showMessage`	shows cache operation messages
`...`	extra parameters for fun

Value

depends on buildOutput parameter

Worker to calculate edge weight for each pair of ixI node and following

Description

Note that it assumes it does not calculate for index below and equal to ixI

Usage

.networkWorker(fun, xdata, ixI, ...)
.networkWorker(fun, xdata, ixI, ...)

Arguments

`fun`	function to be used, can be cor, cov or any other defined function
`xdata`	original data to calculate the function over
`ixI`	starting index, this can be used to save ony upper triu
`...`	extra parameters for fun

Value

a vector with size ncol(xdata) - ixI

Run function and save cache

Description

This method saves the function that's being called

Usage

.runCache(
  fun,
  ...,
  seed = NULL,
  baseDir = NULL,
  cachePrefix = "generic_cache",
  cacheDigest = list(),
  showMessage = NULL,
  forceRecalc = FALSE,
  addToHash = NULL
)

## S4 method for signature 'function'
.runCache(
  fun,
  ...,
  seed = NULL,
  baseDir = NULL,
  cachePrefix = "generic_cache",
  cacheDigest = list(),
  showMessage = NULL,
  forceRecalc = FALSE,
  addToHash = NULL
)
.runCache(
  fun,
  ...,
  seed = NULL,
  baseDir = NULL,
  cachePrefix = "generic_cache",
  cacheDigest = list(),
  showMessage = NULL,
  forceRecalc = FALSE,
  addToHash = NULL
)

## S4 method for signature 'function'
.runCache(
  fun,
  ...,
  seed = NULL,
  baseDir = NULL,
  cachePrefix = "generic_cache",
  cacheDigest = list(),
  showMessage = NULL,
  forceRecalc = FALSE,
  addToHash = NULL
)

Arguments

`fun`	function call name
`...`	parameters for function call
`seed`	when function call is random, this allows to set seed beforehand
`baseDir`	directory where data is stored
`cachePrefix`	prefix for file name to be generated from parameters (...)
`cacheDigest`	cache of the digest for one or more of the parameters
`showMessage`	show message that data is being retrieved from cache
`forceRecalc`	force the recalculation of the values
`addToHash`	something to add to the filename generation

Value

the result of fun(...)

Functions

.runCache(`function`): accepts function as first argument and save cache

Examples

# [optional] save cache in a temporary directory
#
glmSparseNet:::.baseDir(tempdir())
glmSparseNet:::.runCache(c, 1, 2, 3, 4)
#
# next three should use the same cache
#  note, the middle call should be a little faster as digest is not
#  calculated
#   for the first argument
glmSparseNet:::.runCache(c, 1, 2, 3, 4)
glmSparseNet:::.runCache(c, a = 1, 2, c = 3, 4)

# Using a local folder
# glmSparseNet:::.runCache(c, 1, 2, 3, 4, baseDir = "runcache")

# [optional] save cache in a temporary directory
#
glmSparseNet:::.baseDir(tempdir())
glmSparseNet:::.runCache(c, 1, 2, 3, 4)
#
# next three should use the same cache
#  note, the middle call should be a little faster as digest is not
#  calculated
#   for the first argument
glmSparseNet:::.runCache(c, 1, 2, 3, 4)
glmSparseNet:::.runCache(c, a = 1, 2, c = 3, 4)

# Using a local folder
# glmSparseNet:::.runCache(c, 1, 2, 3, 4, baseDir = "runcache")

Saving the cache

Description

Saving the cache

Usage

.saveRunCache(result, path, compression, showMessage)
.saveRunCache(result, path, compression, showMessage)

Arguments

`result`	main result to save
`path`	path to the file to save
`compression`	compression method to be used
`showMessage`	TRUE to show messages, FALSE otherwise

Value

result of save operation

Examples

glmSparseNet:::.saveRunCache(
    35, file.path(tempdir(), "save_run_cache.Rdata"), FALSE, TRUE
)
glmSparseNet:::.saveRunCache(
    35, file.path(tempdir(), "save_run_cache.Rdata"), FALSE, TRUE
)

Show messages option in .runCache

Description

Show messages option in .runCache

Usage

.showMessage(showMessage = NULL)
.showMessage(showMessage = NULL)

Arguments

showMessage

boolean indicating to show messages or not

Value

the show.message option

Examples

glmSparseNet:::.showMessage(FALSE)
glmSparseNet:::.showMessage(FALSE)

Temporary directory for runCache

Description

Temporary directory for runCache

Usage

.tempdirCache()
.tempdirCache()

Value

a path to a temporary directory used by runCache

Write a file in run-cache directory to explain the origin

Description

Write a file in run-cache directory to explain the origin

Usage

.writeReadme(baseDir)
.writeReadme(baseDir)

Arguments

baseDir

directory where to build this file

Value

the path to the file it has written

Examples


glmSparseNet:::.writeReadme(tempdir())
glmSparseNet:::.writeReadme(tempdir())

Create balanced folds for cross validation using stratified sampling

Description

Create balanced folds for cross validation using stratified sampling

Usage

balancedCvFolds(..., nfolds = 10)

# deprecated, please use balancedCvFolds()
balanced.cv.folds(..., nfolds = 10)
balancedCvFolds(..., nfolds = 10)

# deprecated, please use balancedCvFolds()
balanced.cv.folds(..., nfolds = 10)

Arguments

`...`	vectors representing data
`nfolds`	number of folds to be created

Value

list with given input, nfolds and result. The result is a list matching the input with foldid attributed to each position.

Examples

balancedCvFolds(seq(10), seq(11, 15), nfolds = 2)

# will give a warning
balancedCvFolds(seq(10), seq(11, 13), nfolds = 10)

balancedCvFolds(seq(100), seq(101, 133), nfolds = 10)
balancedCvFolds(seq(10), seq(11, 15), nfolds = 2)

# will give a warning
balancedCvFolds(seq(10), seq(11, 13), nfolds = 10)

balancedCvFolds(seq(100), seq(101, 133), nfolds = 10)

Auxiliary function to generate suitable lambda parameters

Description

Auxiliary function to generate suitable lambda parameters

Usage

buildLambda(
  lambdaLargest = NULL,
  xdata = NULL,
  ydata = NULL,
  family = NULL,
  ordersOfMagnitudeSmaller = 3,
  lambdaPerOrderMagnitude = 150,
  lambda.largest = deprecated(),
  orders.of.magnitude.smaller = deprecated(),
  lambda.per.order.magnitude = deprecated()
)
buildLambda(
  lambdaLargest = NULL,
  xdata = NULL,
  ydata = NULL,
  family = NULL,
  ordersOfMagnitudeSmaller = 3,
  lambdaPerOrderMagnitude = 150,
  lambda.largest = deprecated(),
  orders.of.magnitude.smaller = deprecated(),
  lambda.per.order.magnitude = deprecated()
)

Arguments

`lambdaLargest`	numeric value for largest number of lambda to consider (usually with a target of 1 selected variable)
`xdata`	X parameter for glmnet function
`ydata`	Y parameter for glmnet function
`family`	family parameter to glmnet function
`ordersOfMagnitudeSmaller`	minimum value for lambda (lambda.largest / 10^orders.of.magnitude.smaller)
`lambdaPerOrderMagnitude`	how many lambdas to create for each order of magnitude
`lambda.largest`
`orders.of.magnitude.smaller`
`lambda.per.order.magnitude`

Value

a numeric vector with suitable lambdas

Examples

buildLambda(5.4)
buildLambda(5.4)

Build gene network from peptide ids

Description

This can reduce the dimension of the original network, as there may not be a mapping between peptide and gene id

Usage

buildStringNetwork(
  stringTbl,
  useNames = c("protein", "ensembl", "external"),
  string.tbl = deprecated(),
  use.names = deprecated()
)
buildStringNetwork(
  stringTbl,
  useNames = c("protein", "ensembl", "external"),
  string.tbl = deprecated(),
  use.names = deprecated()
)

Arguments

`stringTbl`	`data.frame` or `tibble` with colnames and rownames as ensembl peptide id (same order).
`useNames`	`character(1)` that defaults to use protein names _('protein'), other options are 'ensembl' for ensembl gene id or 'external' for external gene names.
`string.tbl`
`use.names`

Value

a new matrix with gene ids instead of peptide ids. The size of matrix can be different as there may not be a mapping or a peptide mapping can have multiple genes.

Examples


interactions <- stringDBhomoSapiens(scoreThreshold = 100)
string_network <- buildStringNetwork(interactions)

# number of edges
sum(string_network != 0)

interactions <- stringDBhomoSapiens(scoreThreshold = 100)
string_network <- buildStringNetwork(interactions)

# number of edges
sum(string_network != 0)

Calculate cross validating GLM model with network-based regularization

Description

network parameter accepts:

Usage

cv.glmDegree(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

cv.glmHub(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

cv.glmOrphan(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

cv.glmSparseNet(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)
cv.glmDegree(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

cv.glmHub(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

cv.glmOrphan(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

cv.glmSparseNet(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

Arguments

`xdata`	input data, can be a matrix or MultiAssayExperiment.
`ydata`	response data compatible with glmnet.
`network`	type of network, see below.
`options`	options to calculate network.
`experiment`	name of experiment to use as input in MultiAssayExperiment object (only if xdata is an object of this class).
`network.options`
`experiment.name`
`...`	parameters that `glmnet::cv.glmnet()` accepts.

Details

string to calculate network based on data (correlation, covariance)
matrix representing the network
vector with already calculated penalty weights (can also be used directly glmnet)

Value

an object just as cv.glmnet

Functions

cv.glmDegree(): penalizes nodes with small degree (inversion penalization h(x) = 1 / x).
cv.glmHub(): penalizes nodes with small degree (normalized heuristic that promotes nodes with many edges).
cv.glmOrphan(): penalizes nodes with high degree (normalized heuristic that promotes nodes with few edges).

Examples

# Degree penalization

xdata <- matrix(rnorm(100), ncol = 5)
cv.glmDegree(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    nfolds = 5,
    options = networkOptions(minDegree = .2)
)
# Hub penalization

xdata <- matrix(rnorm(100), ncol = 5)
cv.glmHub(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    nfolds = 5,
    options = networkOptions(minDegree = .2)
)
# Orphan penalization

xdata <- matrix(rnorm(100), ncol = 5)
cv.glmOrphan(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    nfolds = 5,
    options = networkOptions(minDegree = .2)
)

# Gaussian model
xdata <- matrix(rnorm(500), ncol = 5)
cv.glmSparseNet(
    xdata, rnorm(nrow(xdata)), "correlation",
    family = "gaussian"
)
cv.glmSparseNet(
    xdata, rnorm(nrow(xdata)), "covariance",
    family = "gaussian"
)


#
#
# Using MultiAssayExperiment with survival model
library(MultiAssayExperiment)
data("miniACC", package = "MultiAssayExperiment")

xdata <- miniACC

#
# build valid data with days of last follow up or to event
event.ix <- which(!is.na(xdata$days_to_death))
cens.ix <- which(!is.na(xdata$days_to_last_followup))
xdata$surv_event_time <- array(NA, nrow(colData(xdata)))
xdata$surv_event_time[event.ix] <- xdata$days_to_death[event.ix]
xdata$surv_event_time[cens.ix] <- xdata$days_to_last_followup[cens.ix]

#
# Keep only valid individuals
valid.ix <- as.vector(!is.na(xdata$surv_event_time) &
    !is.na(xdata$vital_status) &
    xdata$surv_event_time > 0)
xdata.valid <- xdata[, rownames(colData(xdata))[valid.ix]]
ydata.valid <- colData(xdata.valid)[, c("surv_event_time", "vital_status")]
colnames(ydata.valid) <- c("time", "status")

#
cv.glmSparseNet(
    xdata.valid,
    ydata.valid,
    nfolds     = 5,
    family     = "cox",
    network    = "correlation",
    experiment = "RNASeq2GeneNorm"
)

# Degree penalization

xdata <- matrix(rnorm(100), ncol = 5)
cv.glmDegree(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    nfolds = 5,
    options = networkOptions(minDegree = .2)
)
# Hub penalization

xdata <- matrix(rnorm(100), ncol = 5)
cv.glmHub(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    nfolds = 5,
    options = networkOptions(minDegree = .2)
)
# Orphan penalization

xdata <- matrix(rnorm(100), ncol = 5)
cv.glmOrphan(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    nfolds = 5,
    options = networkOptions(minDegree = .2)
)

# Gaussian model
xdata <- matrix(rnorm(500), ncol = 5)
cv.glmSparseNet(
    xdata, rnorm(nrow(xdata)), "correlation",
    family = "gaussian"
)
cv.glmSparseNet(
    xdata, rnorm(nrow(xdata)), "covariance",
    family = "gaussian"
)


#
#
# Using MultiAssayExperiment with survival model
library(MultiAssayExperiment)
data("miniACC", package = "MultiAssayExperiment")

xdata <- miniACC

#
# build valid data with days of last follow up or to event
event.ix <- which(!is.na(xdata$days_to_death))
cens.ix <- which(!is.na(xdata$days_to_last_followup))
xdata$surv_event_time <- array(NA, nrow(colData(xdata)))
xdata$surv_event_time[event.ix] <- xdata$days_to_death[event.ix]
xdata$surv_event_time[cens.ix] <- xdata$days_to_last_followup[cens.ix]

#
# Keep only valid individuals
valid.ix <- as.vector(!is.na(xdata$surv_event_time) &
    !is.na(xdata$vital_status) &
    xdata$surv_event_time > 0)
xdata.valid <- xdata[, rownames(colData(xdata))[valid.ix]]
ydata.valid <- colData(xdata.valid)[, c("surv_event_time", "vital_status")]
colnames(ydata.valid) <- c("time", "status")

#
cv.glmSparseNet(
    xdata.valid,
    ydata.valid,
    nfolds     = 5,
    family     = "cox",
    network    = "correlation",
    experiment = "RNASeq2GeneNorm"
)

Calculate the degree of the correlation network based on xdata

Description

Calculate the degree of the correlation network based on xdata

Usage

degreeCor(
  xdata,
  cutoff = 0,
  considerUnweighted = FALSE,
  forceRecalcDegree = FALSE,
  forceRecalcNetwork = FALSE,
  nCores = 1,
  ...,
  consider.unweighted = deprecated(),
  force.recalc.degree = deprecated(),
  force.recalc.network = deprecated(),
  n.cores = deprecated()
)
degreeCor(
  xdata,
  cutoff = 0,
  considerUnweighted = FALSE,
  forceRecalcDegree = FALSE,
  forceRecalcNetwork = FALSE,
  nCores = 1,
  ...,
  consider.unweighted = deprecated(),
  force.recalc.degree = deprecated(),
  force.recalc.network = deprecated(),
  n.cores = deprecated()
)

Arguments

`xdata`	calculate correlation matrix on each column.
`cutoff`	positive value that determines a cutoff value.
`considerUnweighted`	consider all edges as 1 if they are greater than 0.
`forceRecalcDegree`	force recalculation of penalty weights (but not the network), instead of going to cache.
`forceRecalcNetwork`	force recalculation of network and penalty weights, instead of going to cache.
`nCores`	number of cores to be used.
`...`	extra parameters for cor function.
`consider.unweighted`
`force.recalc.degree`
`force.recalc.network`
`n.cores`

Value

a vector of the degrees.

Examples

n.col <- 6
xdata <- matrix(rnorm(n.col * 4), ncol = n.col)
degreeCor(xdata)
degreeCor(xdata, cutoff = .5)
degreeCor(xdata, cutoff = .5, considerUnweighted = TRUE)
n.col <- 6
xdata <- matrix(rnorm(n.col * 4), ncol = n.col)
degreeCor(xdata)
degreeCor(xdata, cutoff = .5)
degreeCor(xdata, cutoff = .5, considerUnweighted = TRUE)

Calculate the degree of the covariance network based on xdata

Description

Calculate the degree of the covariance network based on xdata

Usage

degreeCov(
  xdata,
  cutoff = 0,
  considerUnweighted = FALSE,
  forceRecalcDegree = FALSE,
  forceRecalcNetwork = FALSE,
  nCores = 1,
  ...,
  consider.unweighted = deprecated(),
  force.recalc.degree = deprecated(),
  force.recalc.network = deprecated(),
  n.cores = deprecated()
)
degreeCov(
  xdata,
  cutoff = 0,
  considerUnweighted = FALSE,
  forceRecalcDegree = FALSE,
  forceRecalcNetwork = FALSE,
  nCores = 1,
  ...,
  consider.unweighted = deprecated(),
  force.recalc.degree = deprecated(),
  force.recalc.network = deprecated(),
  n.cores = deprecated()
)

Arguments

`xdata`	calculate correlation matrix on each column.
`cutoff`	positive value that determines a cutoff value.
`considerUnweighted`	consider all edges as 1 if they are greater than 0.
`forceRecalcDegree`	force recalculation of penalty weights (but not the network), instead of going to cache.
`forceRecalcNetwork`	force recalculation of network and penalty weights, instead of going to cache.
`nCores`	number of cores to be used.
`...`	extra parameters for cov function.
`consider.unweighted`
`force.recalc.degree`
`force.recalc.network`
`n.cores`

Value

a vector of the degrees

Examples

n.col <- 6
xdata <- matrix(rnorm(n.col * 4), ncol = n.col)
degreeCov(xdata)
degreeCov(xdata, cutoff = .5)
degreeCov(xdata, cutoff = .5, considerUnweighted = TRUE)
n.col <- 6
xdata <- matrix(rnorm(n.col * 4), ncol = n.col)
degreeCov(xdata)
degreeCov(xdata, cutoff = .5)
degreeCov(xdata, cutoff = .5, considerUnweighted = TRUE)

Download files to local temporary path

Description

In case of new call it uses the temporary cache instead of downloading again.

Usage

downloadFileLocal(urlStr, oD = tempdir())
downloadFileLocal(urlStr, oD = tempdir())

Arguments

`urlStr`	url of file to download
`oD`	temporary directory to store file

Details

Inspired by STRINGdb Bioconductor package, but using curl as file may be too big to handle.

Value

path to file

Examples

glmSparseNet:::downloadFileLocal(
    "https://string-db.org/api/tsv-no-header/version"
)
glmSparseNet:::downloadFileLocal(
    "https://string-db.org/api/tsv-no-header/version"
)

Retrieve ensembl gene names from biomaRt

Description

Retrieve ensembl gene names from biomaRt

Usage

ensemblGeneNames(
  geneId,
  useCache = TRUE,
  verbose = FALSE,
  gene.id = deprecated(),
  use.cache = deprecated()
)
ensemblGeneNames(
  geneId,
  useCache = TRUE,
  verbose = FALSE,
  gene.id = deprecated(),
  use.cache = deprecated()
)

Arguments

`geneId`	character vector with gene names
`useCache`	Boolean indicating if biomaRt cache should be used
`verbose`	When using biomaRt in webservice mode and setting verbose to TRUE, the XML query to the webservice will be printed.
`gene.id`
`use.cache`

Value

a dataframe with external gene names, ensembl_id

Examples

ensemblGeneNames(c("MOB1A", "RFLNB", "SPIC", "TP53"))
ensemblGeneNames(c("MOB1A", "RFLNB", "SPIC", "TP53"))

Retrieve gene names from biomaRt

Description

Retrieve gene names from biomaRt

Usage

geneNames(
  ensemblGenes,
  useCache = TRUE,
  verbose = FALSE,
  ensembl.genes = deprecated(),
  use.cache = deprecated()
)
geneNames(
  ensemblGenes,
  useCache = TRUE,
  verbose = FALSE,
  ensembl.genes = deprecated(),
  use.cache = deprecated()
)

Arguments

`ensemblGenes`	character vector with gene names in ensembl_id format
`useCache`	Boolean indicating if biomaRt cache should be used
`verbose`	When using biomaRt in webservice mode and setting verbose to TRUE, the XML query to the webservice will be printed.
`ensembl.genes`
`use.cache`

Value

a dataframe with external gene names, ensembl_id

Examples

geneNames(c("ENSG00000114978", "ENSG00000166211", "ENSG00000183688"))
geneNames(c("ENSG00000114978", "ENSG00000166211", "ENSG00000183688"))

Calculate GLM model with network-based regularization

Description

network parameter accepts:

string to calculate network based on data (correlation, covariance)
matrix representing the network
vector with already calculated penalty weights (can also be used directly with glmnet)

Usage

glmSparseNet(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

glmDegree(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

glmHub(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

glmOrphan(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)
glmSparseNet(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

glmDegree(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

glmHub(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

glmOrphan(
  xdata,
  ydata,
  network,
  options = networkOptions(),
  experiment = NULL,
  network.options = deprecated(),
  experiment.name = deprecated(),
  ...
)

Arguments

`xdata`	input data, can be a matrix or MultiAssayExperiment.
`ydata`	response data compatible with glmnet.
`network`	type of network, see below.
`options`	options to calculate network.
`experiment`	name of experiment to use as input in MultiAssayExperiment object (only if xdata is an object of this class).
`network.options`
`experiment.name`
`...`	parameters that `glmnet::glmnet()` accepts.

Value

an object just as glmnet

Functions

glmDegree(): penalizes nodes with small degree (inversion penalization h(x) = 1 / x).
glmHub(): Penalizes nodes with small degree (normalized heuristic that promotes nodes with many edges).
glmOrphan(): Penalizes nodes with high degree (normalized heuristic that promotes nodes with few edges).

Examples

xdata <- matrix(rnorm(100), ncol = 20)
glmSparseNet(xdata, rnorm(nrow(xdata)), "correlation", family = "gaussian")
glmSparseNet(xdata, rnorm(nrow(xdata)), "covariance", family = "gaussian")


#
#
# Using MultiAssayExperiment
# load data
library(MultiAssayExperiment)
data("miniACC", package = "MultiAssayExperiment")

xdata <- miniACC
# TODO aking out x individuals missing values
# build valid data with days of last follow up or to event
event.ix <- which(!is.na(xdata$days_to_death))
cens.ix <- which(!is.na(xdata$days_to_last_followup))

xdata$surv_event_time <- array(NA, nrow(colData(xdata)))
xdata$surv_event_time[event.ix] <- xdata$days_to_death[event.ix]
xdata$surv_event_time[cens.ix] <- xdata$days_to_last_followup[cens.ix]

# Keep only valid individuals
valid.ix <- as.vector(!is.na(xdata$surv_event_time) &
    !is.na(xdata$vital_status) &
    xdata$surv_event_time > 0)
xdata.valid <- xdata[, rownames(colData(xdata))[valid.ix]]
ydata.valid <- colData(xdata.valid)[, c("surv_event_time", "vital_status")]
colnames(ydata.valid) <- c("time", "status")

glmSparseNet(
    xdata.valid,
    ydata.valid,
    family = "cox",
    network = "correlation",
    experiment = "RNASeq2GeneNorm"
)

# Degree penalization

xdata <- matrix(rnorm(100), ncol = 5)
glmDegree(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    options = networkOptions(minDegree = .2)
)
xdata <- matrix(rnorm(100), ncol = 5)
glmHub(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    options = networkOptions(minDegree = .2)
)
# Orphan penalization

xdata <- matrix(rnorm(100), ncol = 5)
glmOrphan(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    options = networkOptions(minDegree = .2)
)
xdata <- matrix(rnorm(100), ncol = 20)
glmSparseNet(xdata, rnorm(nrow(xdata)), "correlation", family = "gaussian")
glmSparseNet(xdata, rnorm(nrow(xdata)), "covariance", family = "gaussian")


#
#
# Using MultiAssayExperiment
# load data
library(MultiAssayExperiment)
data("miniACC", package = "MultiAssayExperiment")

xdata <- miniACC
# TODO aking out x individuals missing values
# build valid data with days of last follow up or to event
event.ix <- which(!is.na(xdata$days_to_death))
cens.ix <- which(!is.na(xdata$days_to_last_followup))

xdata$surv_event_time <- array(NA, nrow(colData(xdata)))
xdata$surv_event_time[event.ix] <- xdata$days_to_death[event.ix]
xdata$surv_event_time[cens.ix] <- xdata$days_to_last_followup[cens.ix]

# Keep only valid individuals
valid.ix <- as.vector(!is.na(xdata$surv_event_time) &
    !is.na(xdata$vital_status) &
    xdata$surv_event_time > 0)
xdata.valid <- xdata[, rownames(colData(xdata))[valid.ix]]
ydata.valid <- colData(xdata.valid)[, c("surv_event_time", "vital_status")]
colnames(ydata.valid) <- c("time", "status")

glmSparseNet(
    xdata.valid,
    ydata.valid,
    family = "cox",
    network = "correlation",
    experiment = "RNASeq2GeneNorm"
)

# Degree penalization

xdata <- matrix(rnorm(100), ncol = 5)
glmDegree(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    options = networkOptions(minDegree = .2)
)
xdata <- matrix(rnorm(100), ncol = 5)
glmHub(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    options = networkOptions(minDegree = .2)
)
# Orphan penalization

xdata <- matrix(rnorm(100), ncol = 5)
glmOrphan(
    xdata,
    rnorm(nrow(xdata)),
    "correlation",
    family = "gaussian",
    options = networkOptions(minDegree = .2)
)

Retrieve hallmarks of cancer count for genes

Description

The API has been removed and this function is no longer available.

Usage

hallmarks(
  genes,
  metric = "count",
  hierarchy = "full",
  generate.plot = TRUE,
  show.message = FALSE
)
hallmarks(
  genes,
  metric = "count",
  hierarchy = "full",
  generate.plot = TRUE,
  show.message = FALSE
)

Arguments

`genes`	gene names
`metric`	see below
`hierarchy`	see below
`generate.plot`	flag to indicate if return object has a ggplot2 object
`show.message`	flag to indicate if run_cache method shows messages

Value

data.frame with choosen metric and hierarchy It also returns a vector with genes that do not have any hallmarks.

See http://chat.lionproject.net/api for more details on the metric and hallmarks parameters

To standardize the colors in the gradient you can use scale_fill_gradientn(limits=c(0,1), colours=topo.colors(3)) to limit between 0 and 1 for cprob and -1 and 1 for npmi

Heuristic function to use in high dimensions

Description

Heuristic function to use in high dimensions

Usage

heuristicScale(
  x,
  subExp10 = -1,
  expMult = -1,
  subExp = -1,
  sub.exp10 = deprecated(),
  exp.mult = deprecated(),
  sub.exp = deprecated()
)
heuristicScale(
  x,
  subExp10 = -1,
  expMult = -1,
  subExp = -1,
  sub.exp10 = deprecated(),
  exp.mult = deprecated(),
  sub.exp = deprecated()
)

Arguments

`x`	vector of values to scale
`subExp10`	value to subtract to base 10 exponential, for example: `10^0 - subExp10 = 1 - subExp10`
`expMult`	parameter to multiply exponential, i.e. to have a negative exponential or positive
`subExp`	value to subtract for exponentional, for example if x = 0, `exp(0) - sub.exp = 1 - sub.exp`
`sub.exp10`
`exp.mult`
`sub.exp`

Value

a vector of scaled values

Examples

heuristicScale(rnorm(1:10))
heuristicScale(rnorm(1:10))

Heuristic function to penalize nodes with low degree

Description

Heuristic function to penalize nodes with low degree

Usage

hubHeuristic(x)
hubHeuristic(x)

Arguments

`x`	single value of vector

Value

transformed

Examples

hubHeuristic(rnorm(1:10))
hubHeuristic(rnorm(1:10))

Custom pallete of colors

Description

Custom pallete of colors

Usage

myColors(ix = NULL)

# deprecated, please use myColors()
my.colors(ix = NULL)
myColors(ix = NULL)

# deprecated, please use myColors()
my.colors(ix = NULL)

Arguments

`ix`	index for a color

Value

a color

Examples

myColors()
myColors(5)
myColors()
myColors(5)

Custom pallete of symbols in plots

Description

Custom pallete of symbols in plots

Usage

mySymbols(ix = NULL)

# deprecated, please use mySymbols()
my.symbols(ix = NULL)
mySymbols(ix = NULL)

# deprecated, please use mySymbols()
my.symbols(ix = NULL)

Arguments

`ix`	index for symbol

Value

a symbol

Examples

mySymbols()
mySymbols(2)
mySymbols()
mySymbols(2)

Calculates the correlation network

Description

Calculates the correlation network

Usage

networkCorParallel(
  xdata,
  buildOutput = "matrix",
  nCores = 1,
  forceRecalcNetwork = FALSE,
  showMessage = FALSE,
  ...,
  build.output = deprecated(),
  n.cores = deprecated(),
  force.recalc.network = deprecated(),
  show.message = deprecated()
)
networkCorParallel(
  xdata,
  buildOutput = "matrix",
  nCores = 1,
  forceRecalcNetwork = FALSE,
  showMessage = FALSE,
  ...,
  build.output = deprecated(),
  n.cores = deprecated(),
  force.recalc.network = deprecated(),
  show.message = deprecated()
)

Arguments

`xdata`	base data to calculate network
`buildOutput`	if output returns a 'matrix', 'vector' of the upper triu without the diagonal or NULL with any other argument
`nCores`	number of cores to be used
`forceRecalcNetwork`	force recalculation, instead of going to cache
`showMessage`	shows cache operation messages
`...`	extra parameters for fun
`build.output`	lifecycle::badge("deprecated") without the diagonal or NULL with any other argument
`n.cores`	lifecycle::badge("deprecated")
`force.recalc.network`	lifecycle::badge("deprecated")
`show.message`	lifecycle::badge("deprecated")

Value

depends on build.output parameter

Examples

n_col <- 6
xdata <- matrix(rnorm(n_col * 4), ncol = n_col)
networkCorParallel(xdata)
n_col <- 6
xdata <- matrix(rnorm(n_col * 4), ncol = n_col)
networkCorParallel(xdata)

Calculates the covariance network

Description

Calculates the covariance network

Usage

networkCovParallel(
  xdata,
  buildOutput = "matrix",
  nCores = 1,
  forceRecalcNetwork = FALSE,
  showMessage = FALSE,
  ...,
  build.output = deprecated(),
  n.cores = deprecated(),
  force.recalc.network = deprecated(),
  show.message = deprecated()
)
networkCovParallel(
  xdata,
  buildOutput = "matrix",
  nCores = 1,
  forceRecalcNetwork = FALSE,
  showMessage = FALSE,
  ...,
  build.output = deprecated(),
  n.cores = deprecated(),
  force.recalc.network = deprecated(),
  show.message = deprecated()
)

Arguments

`xdata`	base data to calculate network
`buildOutput`	if output returns a 'matrix', 'vector' of the upper triu without the diagonal or NULL with any other argument
`nCores`	number of cores to be used
`forceRecalcNetwork`	force recalculation, instead of going to cache
`showMessage`	shows cache operation messages
`...`	extra parameters for fun
`build.output`	lifecycle::badge("deprecated") without the diagonal or NULL with any other argument
`n.cores`	lifecycle::badge("deprecated")
`force.recalc.network`	lifecycle::badge("deprecated")
`show.message`	lifecycle::badge("deprecated")

Value

depends on build.output parameter

Examples

n.col <- 6
xdata <- matrix(rnorm(n.col * 4), ncol = n.col)
networkCovParallel(xdata)
n.col <- 6
xdata <- matrix(rnorm(n.col * 4), ncol = n.col)
networkCovParallel(xdata)

Setup network options

Description

Setup network options, such as using weighted or unweighted degree, which centrality measure to use

Usage

networkOptions(
  method = "pearson",
  unweighted = TRUE,
  cutoff = 0,
  centrality = "degree",
  minDegree = 0,
  nCores = 1,
  transFun = function(x) x,
  min.degree = deprecated(),
  n.cores = deprecated(),
  trans.fun = deprecated()
)
networkOptions(
  method = "pearson",
  unweighted = TRUE,
  cutoff = 0,
  centrality = "degree",
  minDegree = 0,
  nCores = 1,
  transFun = function(x) x,
  min.degree = deprecated(),
  n.cores = deprecated(),
  trans.fun = deprecated()
)

Arguments

`method`	in case of correlation and covariance, which method to use.
`unweighted`	calculate degree using unweighted network.
`cutoff`	cuttoff value in network edges to trim the network.
`centrality`	centrality measure to use, currently only supports degree.
`minDegree`	minimum value that individual penalty weight can take.
`nCores`	number of cores to use, default to 1.
`transFun`	See details below.
`min.degree`
`n.cores`
`trans.fun`	The `transFun` argument takes a function definition that will apply a transformation to the penalty vector calculated from the degree. This transformation allows to change how the penalty is applied.

Value

a list of options

Examples

networkOptions(unweighted = FALSE)
networkOptions(unweighted = FALSE)

Heuristic function to penalize nodes with high degree

Description

Heuristic function to penalize nodes with high degree

Usage

orphanHeuristic(x)
orphanHeuristic(x)

Arguments

`x`	single value of vector

Value

transformed

Examples

orphanHeuristic(rnorm(1:10))
orphanHeuristic(rnorm(1:10))

Retrieve ensembl gene ids from proteins

Description

Retrieve ensembl gene ids from proteins

Usage

protein2EnsemblGeneNames(
  ensemblProteins,
  useCache = TRUE,
  verbose = FALSE,
  ensembl.proteins = deprecated(),
  use.cache = deprecated()
)
protein2EnsemblGeneNames(
  ensemblProteins,
  useCache = TRUE,
  verbose = FALSE,
  ensembl.proteins = deprecated(),
  use.cache = deprecated()
)

Arguments

`ensemblProteins`	character vector with gene names in ensembl_peptide_id format
`useCache`	Boolean indicating if biomaRt cache should be used
`verbose`	When using biomaRt in webservice mode and setting verbose to TRUE, the XML query to the webservice will be printed.
`ensembl.proteins`
`use.cache`

Value

a dataframe with external gene names, ensembl_peptide_id

Examples

protein2EnsemblGeneNames(c(
    "ENSP00000235382",
    "ENSP00000233944",
    "ENSP00000216911"
))
protein2EnsemblGeneNames(c(
    "ENSP00000235382",
    "ENSP00000233944",
    "ENSP00000216911"
))

Separate data in High and Low risk groups (based on Cox model)

Description

Draws multiple kaplan meyer survival curves (or just 1) and calculates logrank test

Usage

separate2GroupsCox(
  chosenBetas,
  xdata,
  ydata,
  probs = c(0.5, 0.5),
  noPlot = FALSE,
  plotTitle = "SurvivalCurves",
  xlim = NULL,
  ylim = NULL,
  expandYZero = FALSE,
  legendOutside = FALSE,
  stopWhenOverlap = TRUE,
  ...,
  chosen.btas = deprecated(),
  no.plot = deprecated(),
  plot.title = deprecated(),
  expand.yzero = deprecated(),
  legend.outside = deprecated(),
  stop.when.overlap = deprecated()
)
separate2GroupsCox(
  chosenBetas,
  xdata,
  ydata,
  probs = c(0.5, 0.5),
  noPlot = FALSE,
  plotTitle = "SurvivalCurves",
  xlim = NULL,
  ylim = NULL,
  expandYZero = FALSE,
  legendOutside = FALSE,
  stopWhenOverlap = TRUE,
  ...,
  chosen.btas = deprecated(),
  no.plot = deprecated(),
  plot.title = deprecated(),
  expand.yzero = deprecated(),
  legend.outside = deprecated(),
  stop.when.overlap = deprecated()
)

Arguments

`chosenBetas`	list of testing coefficients to calculate prognostic indexes, for example `list(Age = some_vector)`.
`xdata`	n x m matrix with n observations and m variables.
`ydata`	Survival object.
`probs`	How to separate high and low risk patients `⁠50%-50%⁠` is the default, but for top and bottom `⁠40%⁠` -> `c(.4,.6)`.
`noPlot`	Only calculate p-value and do not generate survival curve plot.
`plotTitle`	Name of file if.
`xlim`	Optional argument to limit the x-axis view.
`ylim`	Optional argument to limit the y-axis view.
`expandYZero`	expand to y = 0.
`legendOutside`	If TRUE legend will be outside plot, otherwise inside.
`stopWhenOverlap`	when probs vector allows for overlapping of samples in both groups, then stop.
`...`	additional parameters to survminer::ggsurvplot
`chosen.btas`
`no.plot`
`plot.title`
`expand.yzero`
`legend.outside`
`stop.when.overlap`	Otherwise it will calculate with duplicate samples, i.e. simply adding them to xdata and ydata (in a different group).

Value

object with logrank test and kaplan-meier survival plot

A list with plot, p-value and kaplan-meier object. The plot was drawn from survminer::ggsurvplot with only the palette, data and fit arguments being defined and keeping all other defaults that can be customized as additional parameters to this function.

Examples

xdata <- survival::ovarian[, c("age", "resid.ds")]
ydata <- data.frame(
    time = survival::ovarian$futime,
    status = survival::ovarian$fustat
)
separate2GroupsCox(c(age = 1, 0), xdata, ydata)
separate2GroupsCox(c(age = 1, 0.5), xdata, ydata)
separate2GroupsCox(
    c(age = 1), c(1, 0, 1, 0, 1, 0),
    data.frame(time = runif(6), status = rbinom(6, 1, .5))
)
separate2GroupsCox(list(
    aa = c(age = 1, 0.5),
    bb = c(age = 0, 1.5)
), xdata, ydata)
xdata <- survival::ovarian[, c("age", "resid.ds")]
ydata <- data.frame(
    time = survival::ovarian$futime,
    status = survival::ovarian$fustat
)
separate2GroupsCox(c(age = 1, 0), xdata, ydata)
separate2GroupsCox(c(age = 1, 0.5), xdata, ydata)
separate2GroupsCox(
    c(age = 1), c(1, 0, 1, 0, 1, 0),
    data.frame(time = runif(6), status = rbinom(6, 1, .5))
)
separate2GroupsCox(list(
    aa = c(age = 1, 0.5),
    bb = c(age = 0, 1.5)
), xdata, ydata)

Cache of protein-protein network, as it takes some time to retrieve and process this will facilitate the vignette building

Description

It was filtered with combined_scores and individual scores below 700 without text-based scores

Usage

data('string.network.700.cache', package = 'glmSparseNet')
data('string.network.700.cache', package = 'glmSparseNet')

Format

An object of class dgCMatrix with 11033 rows and 11033 columns.

References

https://string-db.org/

Download protein-protein interactions from STRING DB

Description

Download protein-protein interactions from STRING DB

Usage

stringDBhomoSapiens(
  version = "11.0",
  scoreThreshold = 0,
  removeText = TRUE,
  score_threshold = deprecated(),
  remove.text = deprecated()
)
stringDBhomoSapiens(
  version = "11.0",
  scoreThreshold = 0,
  removeText = TRUE,
  score_threshold = deprecated(),
  remove.text = deprecated()
)

Arguments

`version`	version of the database to use
`scoreThreshold`	remove scores below threshold
`removeText`	remove text mining-based scores
`score_threshold`
`remove.text`

Value

a data.frame with rows representing an interaction between two proteins, and columns the count of scores above the given score_threshold

Examples


stringDBhomoSapiens(scoreThreshold = 800)

stringDBhomoSapiens(scoreThreshold = 800)

Package 'glmSparseNet'

Help Index

Change base dir for '.runCache

Description

Usage

Arguments

Value

Examples

Common call to biomaRt to avoid repetitive code

Description

Usage

Arguments

Value

See Also

Examples

Build digest of function from the actual code

Description

Usage

Arguments

Value

Examples

Change cache.compression for run_cache

Description

Usage

Arguments

Value

Examples

Calculate penalty based on data

Description

Usage

Arguments

Value

Examples

Calculate/load result and save if necessary

Description

Usage

Arguments

Value

Examples

Calculate combined score for STRINGdb interactions

Description

Usage

Arguments

Details

Value

Create directories for cache

Description

Usage

Arguments

Value

Examples

Workaround for bug with curl when fetching specific ensembl mirror

Description

Usage

Arguments

Value

Examples

Generic function to calculate degree based on data

Description

Usage

Arguments

Value

Default digest method

Description

Usage

Arguments

Value

Examples

Calculate the upper triu of the matrix

Description

Usage

Arguments

Value

Worker to calculate edge weight for each pair of ixI node and following

Description

Usage

Arguments

Value

Run function and save cache

Description