Package: tidybulk 1.17.0

Stefano Mangiola

tidybulk: Brings transcriptomics to the tidyverse

This is a collection of utility functions that allow to perform exploration of and calculations to RNA sequencing data, in a modular, pipe-friendly and tidy fashion.

Authors:Stefano Mangiola [aut, cre], Maria Doyle [ctb]

tidybulk_1.17.0.tar.gz
tidybulk_1.17.0.zip(r-4.5)tidybulk_1.17.0.zip(r-4.4)tidybulk_1.17.0.zip(r-4.3)
tidybulk_1.17.0.tgz(r-4.4-any)tidybulk_1.17.0.tgz(r-4.3-any)
tidybulk_1.17.0.tar.gz(r-4.5-noble)tidybulk_1.17.0.tar.gz(r-4.4-noble)
tidybulk_1.17.0.tgz(r-4.4-emscripten)tidybulk_1.17.0.tgz(r-4.3-emscripten)
tidybulk.pdf |tidybulk.html
tidybulk/json (API)

# Install 'tidybulk' in R:
install.packages('tidybulk', repos = c('https://bioc.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/stemangiola/tidybulk/issues

Datasets:

On BioConductor:tidybulk-1.17.0(bioc 3.20)tidybulk-1.16.0(bioc 3.19)

bioconductor-package

52 exports 2.09 score 99 dependencies 1 dependents 2 mentions

Last updated 2 months agofrom:28767d5bf4

Exports:%>%adjust_abundanceaggregate_duplicatesarrangeas_matrixas_SummarizedExperimentas_tibblebind_colscluster_elementsdeconvolve_cellularitydescribe_transcriptdistinctdoensembl_to_symbolfill_missing_abundancefilterfull_joinget_bibliographygroup_byidentify_abundantimpute_missing_abundanceinner_joinkeep_abundantkeep_variableleft_joinlog10_reverse_translogit_transmutatenestpivot_samplepivot_transcriptquantile_normalise_abundancereduce_dimensionsremove_redundancyrenameright_joinrotate_dimensionsrowwisescale_abundanceselectsummarisesymbol_to_entreztest_differential_abundancetest_differential_cellularitytest_gene_enrichmenttest_gene_overrepresentationtest_gene_ranktest_stratification_cellularitytibbletidybulktidybulk_SAM_BAMunnest

Dependencies:abindaskpassbase64encBiobaseBiocGenericsbitbit64bslibcachemclicliprcolorspacecpp11crayoncrosstalkcurldata.tableDelayedArraydigestdplyrevaluatefansifarverfastmapfontawesomefsgenericsGenomeInfoDbGenomeInfoDbDataGenomicRangesggplot2gluegtablehighrhmshtmltoolshtmlwidgetshttrIRangesisobandjquerylibjsonliteknitrlabelinglaterlatticelazyevallifecyclemagrittrMASSMatrixMatrixGenericsmatrixStatsmemoisemgcvmimemunsellnlmeopensslpillarpkgconfigplotlypreprocessCoreprettyunitsprogresspromisespurrrR6rappdirsRColorBrewerRcppreadrrlangrmarkdownS4ArraysS4VectorssassscalesSparseArraystringistringrSummarizedExperimentsystibbletidyrtidyselecttinytexttservicetzdbUCSC.utilsutf8vctrsviridisLitevroomwithrxfunXVectoryamlzlibbioc

Overview of the tidybulk package

Rendered fromintroduction.Rmdusingknitr::knitron Jun 14 2024.

Last update: 2023-11-17
Started: 2019-09-11

Readme and manuals

Help Manual

Help pageTopics
Adjust transcript abundance for unwanted variationadjust_abundance adjust_abundance,RangedSummarizedExperiment-method adjust_abundance,spec_tbl_df-method adjust_abundance,SummarizedExperiment-method adjust_abundance,tbl_df-method adjust_abundance,tidybulk-method
Aggregates multiple counts from the same samples (e.g., from isoforms), concatenates other character columns, and averages other numeric columnsaggregate_duplicates aggregate_duplicates,RangedSummarizedExperiment-method aggregate_duplicates,spec_tbl_df-method aggregate_duplicates,SummarizedExperiment-method aggregate_duplicates,tbl_df-method aggregate_duplicates,tidybulk-method
Arrange rows by column valuesarrange
Get matrix from tibbleas_matrix
as_SummarizedExperimentas_SummarizedExperiment as_SummarizedExperiment,spec_tbl_df-method as_SummarizedExperiment,tbl_df-method as_SummarizedExperiment,tidybulk-method
Left join datasetsbind_cols left_join
Efficiently bind multiple data frames by row and columnbind_rows
Needed for vignette breast_tcga_mini_SEbreast_tcga_mini_SE
Get clusters of elements (e.g., samples or transcripts)cluster_elements cluster_elements,RangedSummarizedExperiment-method cluster_elements,spec_tbl_df-method cluster_elements,SummarizedExperiment-method cluster_elements,tbl_df-method cluster_elements,tidybulk-method
Counts with ensembl annotationcounts_ensembl
Get cell type proportions from samplesdeconvolve_cellularity deconvolve_cellularity,RangedSummarizedExperiment-method deconvolve_cellularity,spec_tbl_df-method deconvolve_cellularity,SummarizedExperiment-method deconvolve_cellularity,tbl_df-method deconvolve_cellularity,tidybulk-method
Get DESCRIPTION from gene SYMBOL for Human and Mouse.describe_transcript_SE describe_transcript describe_transcript,RangedSummarizedExperiment-method describe_transcript,spec_tbl_df-method describe_transcript,SummarizedExperiment-method describe_transcript,tbl_df-method describe_transcript,tidybulk-method
distinctdistinct
Data setensembl_symbol_mapping
Add transcript symbol column from ensembl id for human and mouse dataensembl_to_symbol ensembl_to_symbol,spec_tbl_df-method ensembl_to_symbol,tbl_df-method ensembl_to_symbol,tidybulk-method
Fill transcript abundance if missing from sample-transcript pairsfill_missing_abundance fill_missing_abundance,spec_tbl_df-method fill_missing_abundance,tbl_df-method fill_missing_abundance,tidybulk-method
Subset rows using column valuesfilter
flybaseIDsflybaseIDs
Produces the bibliography list of your workflowget_bibliography get_bibliography,RangedSummarizedExperiment-method get_bibliography,spec_tbl_df-method get_bibliography,SummarizedExperiment-method get_bibliography,tbl-method get_bibliography,tbl_df-method get_bibliography,tidybulk-method
Group by one or more variablesgroup_by
find abundant transcriptsidentify_abundant identify_abundant,RangedSummarizedExperiment-method identify_abundant,spec_tbl_df-method identify_abundant,SummarizedExperiment-method identify_abundant,tbl_df-method identify_abundant,tidybulk-method
impute transcript abundance if missing from sample-transcript pairsimpute_missing_abundance impute_missing_abundance,RangedSummarizedExperiment-method impute_missing_abundance,spec_tbl_df-method impute_missing_abundance,SummarizedExperiment-method impute_missing_abundance,tbl_df-method impute_missing_abundance,tidybulk-method
Inner join datasetsfull_join inner_join right_join
Keep abundant transcriptskeep_abundant keep_abundant,RangedSummarizedExperiment-method keep_abundant,spec_tbl_df-method keep_abundant,SummarizedExperiment-method keep_abundant,tbl_df-method keep_abundant,tidybulk-method
Keep variable transcriptskeep_variable keep_variable,RangedSummarizedExperiment-method keep_variable,spec_tbl_df-method keep_variable,SummarizedExperiment-method keep_variable,tbl_df-method keep_variable,tidybulk-method
log10_reverse_translog10_reverse_trans
logit scalelogit_trans
Create, modify, and delete columnsmutate
Extract sample-wise informationpivot_sample pivot_sample,RangedSummarizedExperiment-method pivot_sample,spec_tbl_df-method pivot_sample,SummarizedExperiment-method pivot_sample,tbl_df-method pivot_sample,tidybulk-method
Extract transcript-wise informationpivot_transcript pivot_transcript,RangedSummarizedExperiment-method pivot_transcript,spec_tbl_df-method pivot_transcript,SummarizedExperiment-method pivot_transcript,tbl_df-method pivot_transcript,tidybulk-method
Normalise by quantiles the counts of transcripts/genesquantile_normalise_abundance quantile_normalise_abundance,RangedSummarizedExperiment-method quantile_normalise_abundance,spec_tbl_df-method quantile_normalise_abundance,SummarizedExperiment-method quantile_normalise_abundance,tbl_df-method quantile_normalise_abundance,tidybulk-method
Dimension reduction of the transcript abundance datareduce_dimensions reduce_dimensions,RangedSummarizedExperiment-method reduce_dimensions,spec_tbl_df-method reduce_dimensions,SummarizedExperiment-method reduce_dimensions,tbl_df-method reduce_dimensions,tidybulk-method
Drop redundant elements (e.g., samples) for which feature (e.g., transcript/gene) abundances are correlatedremove_redundancy remove_redundancy,RangedSummarizedExperiment-method remove_redundancy,spec_tbl_df-method remove_redundancy,SummarizedExperiment-method remove_redundancy,tbl_df-method remove_redundancy,tidybulk-method
Rename columnsrename
Rotate two dimensions (e.g., principal components) of an arbitrary anglerotate_dimensions rotate_dimensions,RangedSummarizedExperiment-method rotate_dimensions,spec_tbl_df-method rotate_dimensions,SummarizedExperiment-method rotate_dimensions,tbl_df-method rotate_dimensions,tidybulk-method
Group input by rowsrowwise
Scale the counts of transcripts/genesscale_abundance scale_abundance,RangedSummarizedExperiment-method scale_abundance,spec_tbl_df-method scale_abundance,SummarizedExperiment-method scale_abundance,tbl_df-method scale_abundance,tidybulk-method
SummarizedExperimentse
SummarizedExperiment mini for vignettese_mini
Summarise each group to fewer rowssummarise
Get ENTREZ id from gene SYMBOLsymbol_to_entrez
Perform differential transcription testing using edgeR quasi-likelihood (QLT), edgeR likelihood-ratio (LR), limma-voom, limma-voom-with-quality-weights or DESeq2test_differential_abundance test_differential_abundance,RangedSummarizedExperiment-method test_differential_abundance,spec_tbl_df-method test_differential_abundance,SummarizedExperiment-method test_differential_abundance,tbl_df-method test_differential_abundance,tidybulk-method
Add differential tissue composition information to a tbltest_differential_cellularity test_differential_cellularity,RangedSummarizedExperiment-method test_differential_cellularity,spec_tbl_df-method test_differential_cellularity,SummarizedExperiment-method test_differential_cellularity,tbl_df-method test_differential_cellularity,tidybulk-method
analyse gene enrichment with EGSEAtest_gene_enrichment test_gene_enrichment,RangedSummarizedExperiment-method test_gene_enrichment,spec_tbl_df-method test_gene_enrichment,SummarizedExperiment-method test_gene_enrichment,tbl_df-method test_gene_enrichment,tidybulk-method
analyse gene over-representation with GSEAtest_gene_overrepresentation test_gene_overrepresentation,RangedSummarizedExperiment-method test_gene_overrepresentation,spec_tbl_df-method test_gene_overrepresentation,SummarizedExperiment-method test_gene_overrepresentation,tbl_df-method test_gene_overrepresentation,tidybulk-method
analyse gene rank with GSEAtest_gene_rank test_gene_rank,RangedSummarizedExperiment-method test_gene_rank,spec_tbl_df-method test_gene_rank,SummarizedExperiment-method test_gene_rank,tbl_df-method test_gene_rank,tidybulk-method
Test of stratification of biological replicates based on tissue composition, one cell-type at the time, using Kaplan-meier curves.test_stratification_cellularity test_stratification_cellularity,RangedSummarizedExperiment-method test_stratification_cellularity,spec_tbl_df-method test_stratification_cellularity,SummarizedExperiment-method test_stratification_cellularity,tbl_df-method test_stratification_cellularity,tidybulk-method
Creates an annotated `tidybulk` tibble from a `tbl` or `SummarizedExperiment` objecttidybulk tidybulk,RangedSummarizedExperiment-method tidybulk,spec_tbl_df-method tidybulk,SummarizedExperiment-method tidybulk,tbl_df-method
Creates a `tt` object from a list of file names of BAM/SAMtidybulk_SAM_BAM tidybulk_SAM_BAM,character,character-method
Needed for tests tximeta_summarizeToGene_object, It is SummarizedExperiment from tximetatximeta_summarizeToGene_object
unnestnest unnest
Needed for vignette vignette_manuscript_signature_boxplotvignette_manuscript_signature_boxplot
Needed for vignette vignette_manuscript_signature_tsnevignette_manuscript_signature_tsne
Needed for vignette vignette_manuscript_signature_tsne2vignette_manuscript_signature_tsne2
Cibersort referenceX_cibersort