Package: structToolbox 1.19.0

Gavin Rhys Lloyd

structToolbox: Data processing & analysis tools for Metabolomics and other omics

An extensive set of data (pre-)processing and analysis methods and tools for metabolomics and other omics, with a strong emphasis on statistics and machine learning. This toolbox allows the user to build extensive and standardised workflows for data analysis. The methods and tools have been implemented using class-based templates provided by the struct (Statistics in R Using Class-based Templates) package. The toolbox includes pre-processing methods (e.g. signal drift and batch correction, normalisation, missing value imputation and scaling), univariate (e.g. ttest, various forms of ANOVA, Kruskal–Wallis test and more) and multivariate statistical methods (e.g. PCA and PLS, including cross-validation and permutation testing) as well as machine learning methods (e.g. Support Vector Machines). The STATistics Ontology (STATO) has been integrated and implemented to provide standardised definitions for the different methods, inputs and outputs.

Authors: Gavin Rhys Lloyd [aut, cre], Ralf Johannes Maria Weber [aut]

structToolbox_1.19.0.tar.gz
structToolbox_1.19.0.zip (r-4.5), structToolbox_1.19.0.zip (r-4.4), structToolbox_1.19.0.zip (r-4.3)
structToolbox_1.19.0.tgz (r-4.4-any), structToolbox_1.19.0.tgz (r-4.3-any)
structToolbox_1.19.0.tar.gz (r-4.5-noble), structToolbox_1.19.0.tar.gz (r-4.4-noble)
structToolbox_1.19.0.tgz (r-4.4-emscripten), structToolbox_1.19.0.tgz (r-4.3-emscripten)
structToolbox.pdf | structToolbox.html
structToolbox/json (API)
NEWS

# Install 'structToolbox' in R:
install.packages('structToolbox', repos = c('https://bioc.r-universe.dev', 'https://cloud.r-project.org'))
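
Once the package is installed, the workflow idea from the package summary can be tried with a minimal sketch such as the one below. It assumes that attaching structToolbox also makes `iris_DatasetExperiment()` from the struct package available, that models can be chained with `+`, and that the `number_components` and `factor_name` parameters behave as shown; the choice of dataset and of two components is purely illustrative.

```r
# Minimal sketch: chain two exported models into a workflow and apply it.
library(structToolbox)

# Example data (assumed to come from the 'struct' package): iris measurements
# with Species stored as sample metadata.
D <- iris_DatasetExperiment()

# Build a two-step sequence: mean centring followed by PCA.
M <- mean_centre() + PCA(number_components = 2)

# Train and apply every step of the sequence to the same data.
M <- model_apply(M, D)

# Create a scores chart and draw it for the PCA step (second model in the sequence).
C <- pca_scores_plot(factor_name = "Species")
g <- chart_plot(C, M[2])
print(g)
```

Each step of the sequence stays accessible by index, so `M[2]$scores` holds the PCA scores as a `DatasetExperiment` after the call to `model_apply()`.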

Bug tracker: https://github.com/computational-metabolomics/structtoolbox/issues

Pkgdown site: https://computational-metabolomics.github.io

On Bioconductor: structToolbox-1.19.0 (bioc 3.21), structToolbox-1.18.0 (bioc 3.20)

workflowstep, metabolomics, bioconductor-package, dims, lc-ms, machine-learning, multivariate-analysis, statistics, univariate

6.11 score 9 stars 12 scripts 210 downloads 2 mentions 118 exports 70 dependencies

Last updated 2 months ago from: 4622d2fd8f. Checks: OK: 7. Indexed: yes.

Target           Result  Date
Doc / Vignettes  OK      Nov 30 2024
R-4.5-win        OK      Nov 30 2024
R-4.5-linux      OK      Nov 30 2024
R-4.4-win        OK      Nov 30 2024
R-4.4-mac        OK      Nov 30 2024
R-4.3-win        OK      Nov 30 2024
R-4.3-mac        OK      Nov 30 2024

Exports: ANOVA, as_data_frame, AUC, autoscale, balanced_accuracy, blank_filter, blank_filter_hist, bootstrap, calculate, chart_plot, classical_lsq, compare_dist, confounders_clsq, confounders_lsq_barchart, confounders_lsq_boxplot, constant_sum_norm, corr_coef, DatasetExperiment_boxplot, DatasetExperiment_dist, DatasetExperiment_factor_boxplot, DatasetExperiment_heatmap, DFA, dfa_scores_plot, dratio_filter, equal_split, feature_boxplot, feature_profile, feature_profile_array, filter_by_name, filter_na_count, filter_smeta, fisher_exact, fold_change, fold_change_int, fold_change_plot, forward_selection_by_rank, fs_line, glog_opt_plot, glog_transform, grid_search_1d, gs_line, HCA, hca_dendrogram, HSD, HSDEM, kfold_xval, kfoldxcv_grid, kfoldxcv_metric, knn_impute, kw_p_hist, kw_rank_sum, linear_model, log_transform, mean_centre, mean_of_medians, mixed_effect, model_apply, model_predict, model_reverse, model_train, MTBLS79_DatasetExperiment, mv_boxplot, mv_feature_filter, mv_feature_filter_hist, mv_histogram, mv_sample_filter, mv_sample_filter_hist, nroot_transform, ontology_cache, OPLSDA, OPLSR, pairs_filter, pareto_scale, PCA, pca_biplot, pca_correlation_plot, pca_dstat_plot, pca_loadings_plot, pca_scores_plot, pca_scree_plot, permutation_test, permutation_test_plot, permute_sample_order, pls_regcoeff_plot, pls_scores_plot, pls_vip_plot, PLSDA, plsda_feature_importance_plot, plsda_predicted_plot, plsda_roc_plot, plsda_scores_plot, PLSR, plsr_cook_dist, plsr_prediction_plot, plsr_qq_plot, plsr_residual_hist, pqn_norm, pqn_norm_hist, prop_na, r_squared, resample, resample_chart, rsd_filter, rsd_filter_hist, run, sb_corr, scatter_chart, split_data, stratified_split, SVM, svm_plot_2d, tic_chart, tSNE, tSNE_scatter, ttest, vec_norm, wilcox_p_hist, wilcox_test

Dependencies: abind, askpass, Biobase, BiocGenerics, cli, colorspace, crayon, curl, DelayedArray, evaluate, fansi, farver, generics, GenomeInfoDb, GenomeInfoDbData, GenomicRanges, ggplot2, ggthemes, glue, gridExtra, gtable, highr, httr, httr2, IRanges, isoband, jsonlite, knitr, labeling, lattice, lifecycle, magrittr, MASS, Matrix, MatrixGenerics, matrixStats, mgcv, mime, munsell, nlme, ontologyIndex, openssl, pillar, pkgconfig, purrr, R6, rappdirs, RColorBrewer, rlang, rols, S4Arrays, S4Vectors, scales, sp, SparseArray, stringi, stringr, struct, SummarizedExperiment, sys, tibble, UCSC.utils, utf8, vctrs, viridisLite, withr, xfun, XVector, yaml, zlibbioc

Data analysis of metabolomics and other omics datasets using the structToolbox

Rendered from data_analysis_omics_using_the_structtoolbox.Rmd using knitr::rmarkdown on Nov 30 2024.

Last update: 2022-02-28
Started: 2020-04-22

Readme and manuals

Help Manual

Help page | Topics
Analysis of Variance | ANOVA
Convert to data.frame | as_data_frame as_data_frame,filter_na_count-method as_data_frame,ttest-method as_data_frame,wilcox_test-method
Area under ROC curve | AUC
Autoscaling | autoscale
Balanced Accuracy | balanced_accuracy
Blank filter | blank_filter
Histogram of blank filter fold changes | blank_filter_hist
Bootstrap resampling | bootstrap
Calculate metric | calculate calculate,AUC-method calculate,balanced_accuracy-method calculate,r_squared-method
chart_plot method | chart_plot chart_plot,blank_filter_hist,blank_filter-method chart_plot,compare_dist,DatasetExperiment-method chart_plot,confounders_lsq_barchart,confounders_clsq-method chart_plot,confounders_lsq_boxplot,confounders_clsq-method chart_plot,DatasetExperiment_boxplot,DatasetExperiment-method chart_plot,DatasetExperiment_dist,DatasetExperiment-method chart_plot,DatasetExperiment_factor_boxplot,DatasetExperiment-method chart_plot,DatasetExperiment_heatmap,DatasetExperiment-method chart_plot,dfa_scores_plot,DFA-method chart_plot,feature_boxplot,DatasetExperiment-method chart_plot,feature_profile,DatasetExperiment-method chart_plot,feature_profile,sb_corr-method chart_plot,feature_profile_array,DatasetExperiment-method chart_plot,fold_change_plot,fold_change-method chart_plot,fs_line,forward_selection_by_rank-method chart_plot,glog_opt_plot,glog_transform-method chart_plot,gs_line,grid_search_1d-method chart_plot,hca_dendrogram,HCA-method chart_plot,kfoldxcv_grid,kfold_xval-method chart_plot,kfoldxcv_metric,kfold_xval-method chart_plot,kw_p_hist,kw_rank_sum-method chart_plot,mv_boxplot,DatasetExperiment-method chart_plot,mv_feature_filter_hist,mv_feature_filter-method chart_plot,mv_histogram,DatasetExperiment-method chart_plot,mv_sample_filter_hist,mv_sample_filter-method chart_plot,pca_biplot,PCA-method chart_plot,pca_correlation_plot,PCA-method chart_plot,pca_dstat_plot,PCA-method chart_plot,pca_loadings_plot,PCA-method chart_plot,pca_scores_plot,PCA-method chart_plot,pca_scree_plot,PCA-method chart_plot,permutation_test_plot,permutation_test-method chart_plot,plsda_feature_importance_plot,PLSDA-method chart_plot,plsda_predicted_plot,PLSDA-method chart_plot,plsda_roc_plot,PLSDA-method chart_plot,plsr_cook_dist,PLSR-method chart_plot,plsr_prediction_plot,PLSR-method chart_plot,plsr_qq_plot,PLSR-method chart_plot,plsr_residual_hist,PLSR-method chart_plot,pls_regcoeff_plot,PLSR-method chart_plot,pls_scores_plot,PLSR-method chart_plot,pls_vip_plot,PLSR-method chart_plot,pqn_norm_hist,pqn_norm-method chart_plot,resample_chart,resample-method chart_plot,rsd_filter_hist,rsd_filter-method chart_plot,scatter_chart,DatasetExperiment-method chart_plot,svm_plot_2d,SVM-method chart_plot,tic_chart,DatasetExperiment-method chart_plot,tSNE_scatter,tSNE-method chart_plot,wilcox_p_hist,wilcox_test-method
Univariate Classical Least Squares Regression | classical_lsq
Compare distributions | compare_dist
Check for confounding factors | confounders_clsq
Confounding factor relative change barchart | confounders_lsq_barchart
Confounding factor relative change boxplot | confounders_lsq_boxplot
Normalisation to constant sum | constant_sum_norm
Correlation coefficient | corr_coef
Feature distribution histogram | DatasetExperiment_boxplot
Feature distribution histogram | DatasetExperiment_dist
Factor boxplot | DatasetExperiment_factor_boxplot
DatasetExperiment heatmap | DatasetExperiment_heatmap
Discriminant Factor Analysis | DFA
DFA scores plot | dfa_scores_plot
Dispersion ratio filter | dratio_filter
Equal group sized sampling | equal_split
Feature boxplot | feature_boxplot
Feature profile | feature_profile
Feature profile | feature_profile_array
Filter by name | filter_by_name
Minimum number of measured values filter | filter_na_count
Filter by sample meta data | filter_smeta
Fisher Exact Test | fisher_exact
Fold change | fold_change
Fold change for interactions between factors | fold_change_int
Fold change plot | fold_change_plot
Forward selection by rank | forward_selection_by_rank
Forward selection line plot | fs_line
Glog optimisation | glog_opt_plot
Generalised logarithmic transform | glog_transform
One dimensional grid search | grid_search_1d
Grid search line plot | gs_line
Hierarchical Cluster Analysis | HCA
HCA dendrogram | hca_dendrogram
Tukey's Honest Significant Difference | HSD
Tukey's Honest Significant Difference using estimated marginal means | HSDEM
k-fold cross-validation | kfold_xval
k-fold cross validation plot | kfoldxcv_grid
kfoldxcv metric plot | kfoldxcv_metric
kNN missing value imputation | knn_impute
Histogram of p values | kw_p_hist
Kruskal-Wallis rank sum test | kw_rank_sum
Linear model | linear_model
logarithm transform | log_transform
Mean centre | mean_centre
Mean of medians | mean_of_medians
Mixed effects model | mixed_effect
Apply method | model_apply model_apply,ANOVA,DatasetExperiment-method model_apply,classical_lsq,DatasetExperiment-method model_apply,confounders_clsq,DatasetExperiment-method model_apply,constant_sum_norm,DatasetExperiment-method model_apply,corr_coef,DatasetExperiment-method model_apply,equal_split,DatasetExperiment-method model_apply,filter_smeta,DatasetExperiment-method model_apply,fisher_exact,DatasetExperiment-method model_apply,fold_change,DatasetExperiment-method model_apply,fold_change_int,DatasetExperiment-method model_apply,HCA,DatasetExperiment-method model_apply,HSD,DatasetExperiment-method model_apply,HSDEM,DatasetExperiment-method model_apply,knn_impute,DatasetExperiment-method model_apply,kw_rank_sum,DatasetExperiment-method model_apply,log_transform,DatasetExperiment-method model_apply,mean_of_medians,DatasetExperiment-method model_apply,mixed_effect,DatasetExperiment-method model_apply,nroot_transform,DatasetExperiment-method model_apply,pairs_filter,DatasetExperiment-method model_apply,prop_na,DatasetExperiment-method model_apply,rsd_filter,DatasetExperiment-method model_apply,sb_corr,DatasetExperiment-method model_apply,split_data,DatasetExperiment-method model_apply,stratified_split,DatasetExperiment-method model_apply,tSNE,DatasetExperiment-method model_apply,ttest,DatasetExperiment-method model_apply,vec_norm,DatasetExperiment-method model_apply,wilcox_test,DatasetExperiment-method
Model prediction | model_predict model_predict,autoscale,DatasetExperiment-method model_predict,blank_filter,DatasetExperiment-method model_predict,constant_sum_norm,DatasetExperiment-method model_predict,DFA,DatasetExperiment-method model_predict,dratio_filter,DatasetExperiment-method model_predict,filter_by_name,DatasetExperiment-method model_predict,filter_na_count,DatasetExperiment-method model_predict,filter_smeta,DatasetExperiment-method model_predict,glog_transform,DatasetExperiment-method model_predict,linear_model,DatasetExperiment-method model_predict,mean_centre,DatasetExperiment-method model_predict,mv_feature_filter,DatasetExperiment-method model_predict,mv_sample_filter,DatasetExperiment-method model_predict,OPLSDA,DatasetExperiment-method model_predict,OPLSR,DatasetExperiment-method model_predict,pareto_scale,DatasetExperiment-method model_predict,PCA,DatasetExperiment-method model_predict,PLSDA,DatasetExperiment-method model_predict,PLSR,DatasetExperiment-method model_predict,pqn_norm,DatasetExperiment-method model_predict,SVM,DatasetExperiment-method model_predict,vec_norm,DatasetExperiment-method
Reverse preprocessing | model_reverse model_reverse,autoscale,DatasetExperiment-method model_reverse,mean_centre,DatasetExperiment-method
Train a model | model_train model_train,autoscale,DatasetExperiment-method model_train,blank_filter,DatasetExperiment-method model_train,constant_sum_norm,DatasetExperiment-method model_train,DFA,DatasetExperiment-method model_train,dratio_filter,DatasetExperiment-method model_train,filter_by_name,DatasetExperiment-method model_train,filter_na_count,DatasetExperiment-method model_train,filter_smeta,DatasetExperiment-method model_train,glog_transform,DatasetExperiment-method model_train,linear_model,DatasetExperiment-method model_train,mean_centre,DatasetExperiment-method model_train,mv_feature_filter,DatasetExperiment-method model_train,mv_sample_filter,DatasetExperiment-method model_train,OPLSDA,DatasetExperiment-method model_train,OPLSR,DatasetExperiment-method model_train,pareto_scale,DatasetExperiment-method model_train,PCA,DatasetExperiment-method model_train,PLSDA,DatasetExperiment-method model_train,PLSR,DatasetExperiment-method model_train,pqn_norm,DatasetExperiment-method model_train,SVM,DatasetExperiment-method model_train,vec_norm,DatasetExperiment-method
MTBLS79: Direct infusion mass spectrometry metabolomics dataset: a benchmark for data processing and quality control | MTBLS79_DatasetExperiment
Missing value boxplots | mv_boxplot
Filter features by missing values | mv_feature_filter
Histogram of missing values per feature | mv_feature_filter_hist
Missing value histogram | mv_histogram
Missing value sample filter | mv_sample_filter
Histogram of missing values per sample | mv_sample_filter_hist
nth root transform | nroot_transform
ontology cache | ontology_cache
Orthogonal Partial Least Squares regression | OPLSDA
Orthogonal Partial Least Squares regression | OPLSR
Pairs filter | pairs_filter
Pareto scaling | pareto_scale
Principal Component Analysis (PCA) | PCA
PCA biplot | pca_biplot
PCA correlation plot | pca_correlation_plot
d-statistic plot | pca_dstat_plot
PCA loadings plot | pca_loadings_plot
PCA scores plot | pca_scores_plot
Scree plot | pca_scree_plot
Permutation test | permutation_test
permutation_test_plot class | permutation_test_plot
Permute Sample Order | permute_sample_order
pls_regcoeff_plot class | pls_regcoeff_plot
PLSDA scores plot | plsda_scores_plot pls_scores_plot pls_scores_plot,
PLSDA VIP plot | pls_vip_plot
Partial least squares discriminant analysis | PLSDA
PLSDA feature importance summary plot | plsda_feature_importance_plot
PLSDA predicted plot | plsda_predicted_plot
PLSDA ROC plot | plsda_roc_plot
Partial least squares regression | PLSR
Cook's distance barchart | plsr_cook_dist
PLSR prediction plot | plsr_prediction_plot
PLSR QQ plot | plsr_qq_plot
PLSR residuals histogram | plsr_residual_hist
Probabilistic Quotient Normalisation (PQN) | pqn_norm
PQN coefficient histogram | pqn_norm_hist
Fisher's exact test for missing values | prop_na
Coefficient of determination (R-squared) | r_squared
Data resampling | resample
resample_chart class | resample_chart
RSD filter | rsd_filter
RSD histogram | rsd_filter_hist
Runs an iterator, applying the chosen model multiple times. | run run,bootstrap,DatasetExperiment,metric-method run,forward_selection_by_rank,DatasetExperiment,metric-method run,grid_search_1d,DatasetExperiment,metric-method run,kfold_xval,DatasetExperiment,metric-method run,permutation_test,DatasetExperiment,metric-method run,permute_sample_order,DatasetExperiment,metric-method run,resample,DatasetExperiment,metric-method
Signal/batch correction for mass spectrometry data | sb_corr
Group scatter chart | scatter_chart
Split data | split_data
Stratified sampling | stratified_split
Support Vector Machine Classifier | SVM
SVM scatter plot | svm_plot_2d
Total Ion Count chart. | tic_chart
tSNE | tSNE
Feature boxplot | tSNE_scatter
t-test | ttest
Vector normalisation | vec_norm
Histogram of p values | wilcox_p_hist
wilcoxon signed rank test | wilcox_test
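
The help index above lists separate `model_train` and `model_predict` methods for preprocessing objects such as `mean_centre`. A minimal sketch of how the two calls might be combined is shown below. It assumes that `iris_DatasetExperiment()` and `predicted()` are available from the struct package once structToolbox is attached, and it trains and predicts on the same data only to keep the example short.

```r
# Minimal sketch: explicit train and predict steps for a single model.
library(structToolbox)

# Example data, assumed to be provided by the 'struct' package.
D <- iris_DatasetExperiment()

# Create the model object, then estimate the column means from the data.
M <- mean_centre()
M <- model_train(M, D)

# Apply the trained model; here the same data is reused for illustration.
M <- model_predict(M, D)

# Retrieve the model's main output: the mean-centred data.
centred <- predicted(M)

# Column means of the centred data should be numerically close to zero.
print(round(colMeans(centred$data), 10))
```

Keeping training and prediction separate lets the means estimated from one set of samples be applied to another, which is the usual reason to prefer these two calls over `model_apply()`.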