Package: BERT 1.1.0

Yannis Schumann

BERT: Hierarchical Batch-Effect Adjustment with Trees

Provides efficient batch-effect adjustment of data with missing values. BERT orders all batch effect correction to a tree of pairwise computations. BERT allows parallelization over sub-trees.

Authors:Yannis Schumann [aut, cre], Simon Schlumbohm [aut]

BERT_1.1.0.tar.gz
BERT_1.1.0.zip(r-4.5)BERT_1.1.0.zip(r-4.4)BERT_1.1.0.zip(r-4.3)
BERT_1.1.0.tgz(r-4.4-any)BERT_1.1.0.tgz(r-4.3-any)
BERT_1.1.0.tar.gz(r-4.5-noble)BERT_1.1.0.tar.gz(r-4.4-noble)
BERT_1.1.0.tgz(r-4.4-emscripten)BERT_1.1.0.tgz(r-4.3-emscripten)
BERT.pdf |BERT.html
BERT/json (API)
NEWS

# Install 'BERT' in R:
install.packages('BERT', repos = c('https://bioc.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/hsu-hpc/bert/issues

On BioConductor:BERT-1.1.0(bioc 3.20)BERT-1.0.0(bioc 3.19)

bioconductor-package

5 exports 1.00 score 93 dependencies

Last updated 2 months agofrom:17da2e9759

Exports:BERTcompute_aswcount_existinggenerate_data_covariablesgenerate_dataset

Dependencies:abindannotateAnnotationDbiaskpassBHBiobaseBiocGenericsBiocParallelBiostringsbitbit64blobcachemcliclustercodetoolscomprehenrcpp11crayoncurlDBIDelayedArraydplyredgeRfansifastmapforeachformatRfutile.loggerfutile.optionsgenefiltergenericsGenomeInfoDbGenomeInfoDbDataGenomicRangesgluehmshttrinvgammaIRangesiteratorsjanitorjsonliteKEGGRESTlambda.rlatticelifecyclelimmalocfitlogginglubridatemagrittrMatrixMatrixGenericsmatrixStatsmemoisemgcvmimenlmeopensslpillarpkgconfigplogrpngpurrrR6RcpprlangRSQLiteS4ArraysS4VectorssnakecasesnowSparseArraystatmodstringistringrSummarizedExperimentsurvivalsvasystibbletidyrtidyselecttimechangeUCSC.utilsutf8vctrswithrXMLxtableXVectorzlibbioc

BERT-Vignette

Rendered fromBERT-Vignette.Rmdusingknitr::rmarkdownon Jun 22 2024.

Last update: 2023-11-28
Started: 2023-05-30

Readme and manuals

Help Manual

Help pageTopics
Adjust two batches to each other.adjust_node
Adjust a hierarchy level sequentially.adjustment_step
Adjust data using the BERT algorithm.BERT
Chunks data into n segments with (close-to) equivalent number of batches and stores them in temporary RDS fileschunk_data
Compute the average silhouette width (ASW) for the dataset with respect to both label and batch.compute_asw
Count the number of numeric features in this dataset. Columns labeled "Batch", "Sample" or "Label" will be ignored.count_existing
Format the data as expected by BERT.format_DF
Generate dataset with batch-effects and 2 classes with a specified imbalance.generate_data_covariables
Generate dataset with batch-effects and biological labels using a simple LS modelgenerate_dataset
Check, which features contain enough numeric data to be adjusted (at least 2 numeric values)get_adjustable_features
Check, which features contain enough numeric data to be adjusted (at least 2 numeric values per batch and covariate level)get_adjustable_features_with_mod
Identifies the adjustable features using only the references. Similar to the function in adjust_features.R but with different argumentsidentify_adjustableFeatures_refs
Identifies the references to use for this specific batch effect adjustmentidentify_references
Ordinal encoding of a vector.ordinal_encode
Adjusts all chunks of data (in parallel) as far as possible.parallel_bert
A method to remove batch effects estimated from a subset (references) per batch only. Source code is heavily based on limma::removeBatchEffects by Gordon Smyth and Carolyn de GraafremoveBatchEffectRefs
Replaces missing values (NaN) by NA, this appears to be fasterreplace_missing
Strip column labelled Cov_1 from dataframe.strip_Covariable
Verifies that the input to BERT is valid.validate_bert_input
Validate the user input to the function generate_dataset. Raises an error if and only if the input is malformatted.validate_input_generate_dataset
Verify that the Reference column of the data contains only zeros and ones (if it is present at all)verify_references