Package: seqArchR 1.11.0

Sarvesh Nikumbh

seqArchR: Identify Different Architectures of Sequence Elements

seqArchR enables unsupervised discovery of _de novo_ clusters with characteristic sequence architectures characterized by position-specific motifs or composition of stretches of nucleotides, e.g., CG-richness. seqArchR does _not_ require any specifications w.r.t. the number of clusters, the length of any individual motifs, or the distance between motifs if and when they occur in pairs/groups; it directly detects them from the data. seqArchR uses non-negative matrix factorization (NMF) as its backbone, and employs a chunking-based iterative procedure that enables processing of large sequence collections efficiently. Wrapper functions are provided for visualizing cluster architectures as sequence logos.

Authors:Sarvesh Nikumbh [aut, cre, cph]

seqArchR_1.11.0.tar.gz
seqArchR_1.11.0.zip(r-4.5)seqArchR_1.11.0.zip(r-4.4)seqArchR_1.11.0.zip(r-4.3)
seqArchR_1.11.0.tgz(r-4.4-any)seqArchR_1.11.0.tgz(r-4.3-any)
seqArchR_1.11.0.tar.gz(r-4.5-noble)seqArchR_1.11.0.tar.gz(r-4.4-noble)
seqArchR_1.11.0.tgz(r-4.4-emscripten)seqArchR_1.11.0.tgz(r-4.3-emscripten)
seqArchR.pdf |seqArchR.html
seqArchR/json (API)
NEWS

# Install 'seqArchR' in R:
install.packages('seqArchR', repos = c('https://bioc.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/snikumbh/seqarchr/issues

On BioConductor:seqArchR-1.11.0(bioc 3.21)seqArchR-1.10.0(bioc 3.20)

motifdiscoverygeneregulationmathematicalbiologysystemsbiologytranscriptomicsgeneticsclusteringdimensionreductionfeatureextractiondnaseqnmfnonnegative-matrix-factorizationpromoter-sequence-architecturesscikit-learnsequence-analysissequence-architecturesunsupervised-machine-learning

4.48 score 1 stars 1 packages 8 scripts 174 downloads 18 exports 82 dependencies

Last updated 23 days agofrom:2981ab5be2. Checks:OK: 1 ERROR: 6. Indexed: yes.

TargetResultDate
Doc / VignettesOKNov 21 2024
R-4.5-winERRORNov 21 2024
R-4.5-linuxERRORNov 21 2024
R-4.4-winERRORNov 21 2024
R-4.4-macERRORNov 21 2024
R-4.3-winERRORNov 21 2024
R-4.3-macERRORNov 21 2024

Exports:collate_clusterscollate_seqArchR_resultget_clBasVecget_clBasVec_kget_clBasVec_mget_one_hot_encoded_seqsget_seqClLabget_seqs_clust_listmake_PWMsplot_arch_for_clustersplot_ggseqlogo_of_seqsprepare_data_from_FASTAseqArchRseqs_strset_configviz_bas_vecviz_pwmviz_seqs_acgt_mat

Dependencies:askpassBHBiocGenericsBiocParallelBiostringsclasscliclustercodetoolscolorspacecpp11crayoncurlcvToolsDEoptimRdiptestfansifarverflexmixformatRfpcfutile.loggerfutile.optionsgenericsGenomeInfoDbGenomeInfoDbDataggplot2ggseqlogogluegtableherehttrIRangesisobandjsonlitekernlablabelinglambda.rlatticelifecyclemagrittrMASSMatrixmatrixStatsmclustmgcvmimemodeltoolsmunsellnlmennetopensslpillarpkgconfigplyrpngprabclusprettyunitsR6rappdirsRColorBrewerRcppRcppTOMLreshape2reticulaterlangrobustbaserprojrootS4VectorsscalessnowstringistringrsystibbleUCSC.utilsutf8vctrsviridisLitewithrXVectorzlibbioc

Example usage of seqArchR on simulated DNA sequences

Rendered fromseqArchR.Rmdusingknitr::rmarkdownon Nov 21 2024.

Last update: 2022-04-01
Started: 2021-12-02