Package 'singIST' reference manual

Title:	Comparative single-cell transcriptomics between disease models and a human condition
Description:	Provides toolkits to implement a full singIST analysis with pseudobulked Seurat objects of disease models and human data.
Authors:	Aitor Moruno-Cuenca [aut, cre] (ORCID: <https://orcid.org/0009-0009-8133-2552>), Dr. Sergio Picart-Armada [rev] (ORCID: <https://orcid.org/0000-0002-6426-8204>), Dr. Alexandre Perera-Lluna [ths] (ORCID: <https://orcid.org/0000-0001-6427-851X>), Dr. Francesc Fernández-Albert [ths] (ORCID: <https://orcid.org/0000-0001-5561-0701>)
Maintainer:	Aitor Moruno-Cuenca <[email protected]>
License:	MIT + file LICENSE
Version:	1.1.0
Built:	2026-07-16 05:16:16 UTC
Source:	https://git.bioconductor.org/packages/singIST

Ensure all celltype–sample combinations are present in the pseudobulk matrix

Description

Ensure all celltype–sample combinations are present in the pseudobulk matrix

Usage

add_missing_psb_rows(mat, celltypes, sample_ids)

Arguments

mat

A numeric matrix with rownames in the form “celltype_sample”.

celltypes

Character vector of the celltypes you intend to include (in the exact order of object@superpathway_info@celltypes).

sample_ids

Character vector of all sample identifiers (in the order of rownames(object@pseudobulk_lognorm) split by “_”).

Value

A numeric matrix with length(celltypes) * length(sample_ids) rows, in the canonical paste(celltypes, sample_ids, sep = "_") order, where newly added rows are filled with NA_real_.
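
The padding described above can be sketched in base R. This is an illustrative re-implementation under stated assumptions, not the package's source: the helper name pad_psb_rows is hypothetical, and the row order (cell type varying slowest, sample fastest) is an assumption about what "canonical order" means here.

```r
# Hypothetical helper for illustration only; not part of singIST.
pad_psb_rows <- function(mat, celltypes, sample_ids) {
  # Assumed canonical order: all samples of the first cell type, then the next.
  wanted <- paste(rep(celltypes, each = length(sample_ids)),
                  rep(sample_ids, times = length(celltypes)),
                  sep = "_")
  # Start from an all-NA matrix with the full set of row names.
  out <- matrix(NA_real_, nrow = length(wanted), ncol = ncol(mat),
                dimnames = list(wanted, colnames(mat)))
  # Copy over the rows that already exist in the input.
  present <- intersect(wanted, rownames(mat))
  out[present, ] <- mat[present, , drop = FALSE]
  out
}

# Toy input with two of the four celltype-sample combinations present.
mat <- matrix(c(1, 2, 3, 4), nrow = 2, byrow = TRUE,
              dimnames = list(c("Tcell_s1", "Bcell_s2"),
                              c("geneA", "geneB")))
padded <- pad_psb_rows(mat, celltypes = c("Tcell", "Bcell"),
                       sample_ids = c("s1", "s2"))
```

In this toy case, padded has four rows, and the rows for the missing combinations (Tcell_s2 and Bcell_s1) contain only NA_real_.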

K‐fold × Repeated Cross‐Validation for asmbPLS-DA

Description

Implements stratified K‐fold cross‐validation with repetitions, mirroring the structure of asmbPLSDA.cv.loo but using K folds (k) repeated ncv times instead of leave-one-out (LOO).
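
To make the idea of stratified, repeated K-fold splitting concrete, the following base R sketch builds k * ncv train/validation index sets from a vector of class labels. The helper make_stratified_splits, its seed argument, and the train/validation element names are hypothetical and chosen for illustration; the function's actual internals may differ.

```r
# Hypothetical helper for illustration only; not part of singIST.
make_stratified_splits <- function(labels, k = 4, ncv = 10, seed = 1) {
  set.seed(seed)
  splits <- vector("list", k * ncv)
  idx <- 0
  for (r in seq_len(ncv)) {
    fold_id <- integer(length(labels))
    # Within each class, shuffle the samples and deal them out across folds
    # so that every fold keeps roughly the original class proportions.
    for (cl in unique(labels)) {
      members <- which(labels == cl)
      shuffled <- members[sample.int(length(members))]
      fold_id[shuffled] <- rep_len(seq_len(k), length(members))
    }
    # Each fold serves once as the validation set within this repetition.
    for (f in seq_len(k)) {
      idx <- idx + 1
      splits[[idx]] <- list(train = which(fold_id != f),
                            validation = which(fold_id == f))
    }
  }
  splits
}

labels <- rep(c("disease", "control"), times = c(8, 8))
splits <- make_stratified_splits(labels, k = 4, ncv = 10)
length(splits)
```

With 8 samples per class and k = 4, every validation fold in this toy example holds 2 samples of each class.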

Usage

asmbPLSDA.cv.kcv(
  X.matrix,
  Y.matrix,
  PLS_term = 2,
  X.dim,
  quantile.comb.table,
  k = 4,
  ncv = 10,
  outcome.type = c("binary", "multiclass"),
  Method = NULL,
  measure = "B_accuracy",
  parallel = FALSE,
  expected.measure.increase = 0.005,
  center = TRUE,
  scale = TRUE,
  maxiter = 100
)

Arguments

X.matrix

Predictor matrix (n×p)

Y.matrix

Response one‐hot matrix (n×q)

PLS_term

Integer: maximum number of PLS components

X.dim

Vector: feature counts per block

quantile.comb.table

Matrix (C×length(X.dim)): quantile combinations

k

Integer: number of cross-validation folds (K)

ncv

Integer: number of times the K-fold cross-validation is repeated

outcome.type

"binary" or "multiclass"

Method

Prediction method

measure

Performance measure used to select hyperparameters: one of "B_accuracy", "accuracy", "precision", "recall", "F1"

parallel

Logical: TRUE to parallelize per-fold

expected.measure.increase

Numeric: minimum gain in the chosen measure required to add another PLS component

center

Logical: center predictors

scale

Logical: scale predictors

maxiter

Integer: max iterations for asmbPLSDA.fit
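
For readers unfamiliar with the options of the measure argument, the sketch below computes the five quantities for a binary outcome from vectors of true and predicted labels. It assumes that "B_accuracy" denotes balanced accuracy (the mean of sensitivity and specificity); the helper binary_measures and its positive argument are hypothetical, not part of the package.

```r
# Hypothetical helper for illustration only; not part of singIST.
binary_measures <- function(truth, predicted, positive = 1) {
  # Confusion-matrix counts with respect to the positive class.
  tp <- sum(truth == positive & predicted == positive)
  tn <- sum(truth != positive & predicted != positive)
  fp <- sum(truth != positive & predicted == positive)
  fn <- sum(truth == positive & predicted != positive)
  recall <- tp / (tp + fn)            # sensitivity
  specificity <- tn / (tn + fp)
  precision <- tp / (tp + fp)
  c(B_accuracy = (recall + specificity) / 2,
    accuracy = (tp + tn) / length(truth),
    precision = precision,
    recall = recall,
    F1 = 2 * precision * recall / (precision + recall))
}

truth <- c(1, 1, 1, 1, 0, 0)
predicted <- c(1, 1, 1, 0, 0, 1)
m <- binary_measures(truth, predicted)
```

Balanced accuracy weights both classes equally, which is why it is often preferred over plain accuracy when class sizes differ, as in this toy example with four positives and two negatives.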

Value

A list with:

quantile_table_CV

Matrix (PLS_term × (blocks + metrics)) of optimal quantiles and CV metrics

optimal_nPLS

Integer: selected number of PLS components

splits

List of length (k*ncv) of train/validation splits
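
One plausible reading of how optimal_nPLS relates to expected.measure.increase is sketched below: starting from one component, another component is kept only while the cross-validated measure improves by more than the threshold. This is an assumption about the selection rule, and select_n_pls and cv_measure are hypothetical names used for illustration.

```r
# Hypothetical helper for illustration only; not part of singIST.
# cv_measure[i] is the cross-validated measure obtained with i PLS components.
select_n_pls <- function(cv_measure, expected.measure.increase = 0.005) {
  n_pls <- 1
  # Keep adding components while the gain exceeds the required increase.
  while (n_pls < length(cv_measure) &&
         cv_measure[n_pls + 1] - cv_measure[n_pls] > expected.measure.increase) {
    n_pls <- n_pls + 1
  }
  n_pls
}

# The second component adds 0.06, the third only about 0.001.
chosen <- select_n_pls(c(0.80, 0.86, 0.861))
```

Under this rule the toy example selects two components, because the third one does not clear the 0.005 threshold.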

Examples

# Load the example superpathway input shipped with the package
file <- system.file("extdata", "example_superpathway_input.rda",
                    package = "singIST")
load(file)
data <- example_superpathway_input
# Build the block predictor matrix, response matrix and block dimensions
matrices <- matrixToBlock(data)
X.matrix <- matrices$block_predictor
Y.matrix <- matrices$matrix_response
X.dim <- matrices$block_dim
quantile.comb.table <- data$hyperparameters_info$quantile_comb_table
# Append one extra quantile combination as a new row
quantile.comb.table <- rbind(quantile.comb.table, c(0.1, 0.2))
outcome.type <- data$hyperparameters_info$outcome_type
# Run the repeated K-fold cross-validation (k and ncv keep their defaults)
asmbPLSDA.cv.kcv(X.matrix, Y.matrix, PLS_term = 1,
                 X.dim = X.dim, quantile.comb.table = quantile.comb.table,
                 Method = NULL, measure = "B_accuracy", parallel = TRUE,
                 outcome.type = outcome.type,
                 expected.measure.increase = 0.005, center = TRUE,
                 scale = TRUE, maxiter = 100)

Leave-one-out Cross-validation

Description

Leave-one-out Cross-validation

Usage

asmbPLSDA.cv.loo(
  X.matrix,
  Y.matrix,
  PLS_term = 1,
  X.dim,
  quantile.comb.table,
  outcome.type = c("binary", "multiclass"),
  Method = NULL,
  measure = "B_accuracy",
  parallel = FALSE,
  expected.measure.increase = 0.005,
  center = TRUE,
  scale = TRUE,
  maxiter = 100
)

Arguments

X.matrix

Predictor block matrix from matrixToBlock

Y.matrix

Response matrix from matrixToBlock

PLS_term

Integer: the number of PLS components to use, taken from the hyperparameters list

X.dim

A list with the observed gene set size for each cell type from matrixToBlock

quantile.comb.table

A matrix with the quantile combination table, taken from the hyperparameters list object

outcome.type

A character string, either "binary" or "multiclass", taken from the hyperparameters list object

Method