Package: GenProSeq 1.17.1

Dongmin Jung

GenProSeq: Generating Protein Sequences with Deep Generative Models

Generative modeling for protein engineering is key to solving fundamental problems in synthetic biology, medicine, and material science. Machine learning has enabled us to generate useful protein sequences on a variety of scales. Generative models are machine learning methods which seek to model the distribution underlying the data, allowing for the generation of novel samples with similar properties to those on which the model was trained. Generative models of proteins can learn biologically meaningful representations helpful for a variety of downstream tasks. Furthermore, they can learn to generate protein sequences that have not been observed before and to assign higher probability to protein sequences that satisfy desired criteria. In this package, common deep generative models for protein sequences, such as variational autoencoder (VAE), generative adversarial networks (GAN), and autoregressive models are available. In the VAE and GAN, the Word2vec is used for embedding. The transformer encoder is applied to protein sequences for the autoregressive model.

Authors:Dongmin Jung [cre, aut]

GenProSeq_1.17.1.tar.gz
GenProSeq_1.17.1.zip(r-4.7)GenProSeq_1.17.1.zip(r-4.6)GenProSeq_1.17.1.zip(r-4.5)
GenProSeq_1.17.1.tgz(r-4.6-any)GenProSeq_1.17.1.tgz(r-4.5-any)
GenProSeq_1.17.1.tar.gz(r-4.7-any)GenProSeq_1.17.1.tar.gz(r-4.6-any)
GenProSeq_1.17.1.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
GenProSeq/json (API)
NEWS

# Install 'GenProSeq' in R:
install.packages('GenProSeq', repos = c('https://bioc.r-universe.dev', 'https://cloud.r-project.org'))
Uses libs:
  • openjdk– OpenJDK Java runtime, using Hotspot JIT
Datasets:

On BioConductor:GenProSeq-1.17.0(bioc 3.24)GenProSeq-1.16.0(bioc 3.23)

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

softwareproteomicsopenjdk

4.18 score 3 scripts 314 downloads 11 exports 138 dependencies

Last updated from:e67e085a30. Checks:8 NOTE, 2 OK. Indexed: yes.

TargetResultTimeFilesSyslog
bioc-checksNOTE204
linux-devel-x86_64NOTE315
source / vignettesOK295
linux-release-x86_64NOTE240
macos-release-arm64NOTE152
macos-oldrel-arm64NOTE124
windows-develNOTE710
windows-releaseNOTE210
windows-oldrelNOTE247
wasm-releaseOK189

Exports:fit_ARTfit_GANfit_VAEgen_ARTgen_GANgen_VAElayer_embedding_token_positionlayer_transformer_encoderprot_seq_checkprot2vecvec2prot

Dependencies:askpassbackportsbase64encBHbitbit64bslibcachemCatEncodersclicliprconfigcpp11crayoncurldata.tabledata.treeDeepPINCSDiagrammeRdigestdplyrdttenglishevaluatefarverfastmapfastmatchfingerprintfloatfontawesomefsgenericsglueherehighrhmshtmltoolshtmlwidgetshttrhunspelligraphISOcodesiteratorsitertoolsjquerylibjsonlitekerasknitrkoRpuskoRpus.lang.enlabelinglatticelexiconlgrlifecyclemagrittrmatlabMatrixMatrixExtramclustmemoisemgsubmimemlapiNLPopensslpillarpkgconfigpngprettyunitsprocessxprogressPRROCpspurrrqdapRegexquantedaR6rappdirsrcdkrcdklibsRColorBrewerRcppRcppArmadilloRcppProgressRcppTOMLreadrreticulateRhpcBLASctlrJavarlangrmarkdownrprojrootrsparserstudioapirvestsassscalesselectrslamSnowballCstopwordsstringdiststringistringrsyllysylly.ensyssyuzhettensorflowtext2vectextcleantextshapetextstemtfautographtfrunstibbletidyrtidyselecttinytextmtokenizersttgseatzdbutf8vctrsviridisLitevisNetworkvroomwebchemwhiskerwithrword2vecxfunxml2yamlzeallotzoo

Generating Protein Sequences with Deep Generative Models

Rendered fromGenProSeq.Rmdusingknitr::rmarkdownon May 29 2026.

Last update: 2024-02-03
Started: 2021-09-20