This is an example of how to create an interactive genomic visualization using the shiny.gosling package within a Shiny app. It leverages the shiny package for creating the user interface and interactivity.
We create a Shiny app that visualizes genomic data using the shiny.gosling package. It generates an interactive visualization with tracks displaying DNA base counts and annotations, allowing users to explore genomic data related to the SARS-CoV-2 virus.
Below, we use the track_data() function to fetch data from the specified URL. The data represents base counts for the SARS-CoV-2 virus genome, organized into rows and columns. It includes attributes like base, position, count, and categories (A, T, G, C).
“Multivec” is a term used in genomics to refer to a specific type of data format used for representing and visualizing multi-dimensional numerical data across genomic coordinates. It’s commonly used for representing data like ChIP-seq, ATAC-seq, Hi-C, and other genomic experiments where signals or measurements are collected at various genomic positions.
Multivec data is essentially a matrix where rows correspond to different genomic positions or regions, and columns correspond to different samples or experiments. Each entry in the matrix represents a value associated with a specific genomic position and sample. The genomic positions along the rows of the matrix are usually represented as chromosomal coordinates (chromosome name and base pair position). This allows the data to be aligned with the genome, enabling accurate visualization and analysis. There are different tools and file formats that support multivec data, allowing researchers to work with and visualize this type of data. The bigWig and bedGraph formats are commonly used for representing multivec data. Visualization tools and libraries like the UCSC Genome Browser, IGV (Integrative Genomics Viewer), and libraries like “shiny.gosling” can render multivec data visualizations.
Here are some resources and links where you can learn more about multivec data and how it’s used in genomics research:
UCSC Genome Browser:
The UCSC Genome Browser is a widely used tool for visualizing genomic data, including multivec data. Tutorial on visualizing multivec data in the UCSC Genome Browser
IGV (Integrative Genomics Viewer):
IGV is another popular genome visualization tool that supports multivec data. Tutorial on loading and visualizing multivec data in IGV
BedGraph and BigWig Formats:
These are common file formats used for representing multivec data. Explanation of the BedGraph format Explanation of the BigWig format
Here, we define two tracks (track1 and track2) that will be displayed in the visualization. track1 displays the count of DNA bases using a bar mark, and track2 displays text annotations for certain conditions.
track1 <- add_single_track(
mark = "bar",
y = visual_channel_y(
field = "count", type = "quantitative", axis = "none"
)
)
track2 <- add_single_track(
dataTransform = track_data_transform(
type = "filter",
field = "count",
oneOf = list(0),
not = TRUE
),
mark = "text",
x = visual_channel_x(
field = "start", type = "genomic"
),
xe = visual_channel_x(
field = "end", type = "genomic"
),
size = 24,
color = "white",
visibility = list(list(
operation = "less-than",
measure = "width",
threshold = "|xe-x|",
transitionPadding = 30,
target = "mark"
),
list(
operation = "LT",
measure = "zoomLevel",
threshold = 40,
target = "track"
))
)
Now, lets define visual channels for track1. track1_x specifies the genomic position on the x-axis, track1_color assigns colors based on DNA bases, and track1_text specifies text annotations based on DNA bases.
track1_x <- visual_channel_x(
field = "position", type = "genomic"
)
track1_color <- visual_channel_color(
field = "base",
type = "nominal",
domain = c("A", "T", "G", "C"),
legend = TRUE
)
track1_text <- visual_channel_text(
field = "base", type = "nominal"
)
track1_style <- default_track_styles(
inlineLegend = TRUE
)
This code chunk combines the previously defined tracks (track1 and track2) into a single track (track3) and specifies various properties such as title, alignment, data, visual channels, and style.
Lets create a view (view1) that contains the combined track (track3). It specifies properties like multi-view mode, x-axis domain, alignment, and linking.
Next, we arrange views using the arrange_views function. It sets the title, subtitle, assembly information, layout, spacing, and includes the previously defined view1.
Finally, we define the Shiny user interface (UI) using the fluidPage function. It includes the goslingOutput function to create a placeholder for the visualization. We also define the Shiny server logic. It uses the renderGosling function to render the interactive visualization using the combined_view defined earlier.
sessionInfo()
#> R version 4.4.2 (2024-10-31)
#> Platform: x86_64-pc-linux-gnu
#> Running under: Ubuntu 24.04.1 LTS
#>
#> Matrix products: default
#> BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
#> LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.26.so; LAPACK version 3.12.0
#>
#> locale:
#> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
#> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
#> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
#> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
#> [9] LC_ADDRESS=C LC_TELEPHONE=C
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
#>
#> time zone: Etc/UTC
#> tzcode source: system (glibc)
#>
#> attached base packages:
#> [1] stats4 stats graphics grDevices utils datasets methods
#> [8] base
#>
#> other attached packages:
#> [1] sessioninfo_1.2.2 ggbio_1.55.0
#> [3] ggplot2_3.5.1 StructuralVariantAnnotation_1.23.0
#> [5] VariantAnnotation_1.53.0 Rsamtools_2.23.0
#> [7] Biostrings_2.75.1 XVector_0.47.0
#> [9] SummarizedExperiment_1.37.0 Biobase_2.67.0
#> [11] MatrixGenerics_1.19.0 matrixStats_1.4.1
#> [13] rtracklayer_1.67.0 GenomicRanges_1.59.1
#> [15] GenomeInfoDb_1.43.1 IRanges_2.41.1
#> [17] S4Vectors_0.45.2 BiocGenerics_0.53.3
#> [19] generics_0.1.3 shiny_1.9.1
#> [21] shiny.gosling_1.3.0 rmarkdown_2.29
#>
#> loaded via a namespace (and not attached):
#> [1] RColorBrewer_1.1-3 sys_3.4.3 rstudioapi_0.17.1
#> [4] jsonlite_1.8.9 magrittr_2.0.3 GenomicFeatures_1.59.1
#> [7] fs_1.6.5 BiocIO_1.17.0 zlibbioc_1.52.0
#> [10] vctrs_0.6.5 memoise_2.0.1 RCurl_1.98-1.16
#> [13] base64enc_0.1-3 progress_1.2.3 htmltools_0.5.8.1
#> [16] S4Arrays_1.7.1 curl_6.0.1 SparseArray_1.7.2
#> [19] Formula_1.2-5 sass_0.4.9 bslib_0.8.0
#> [22] fontawesome_0.5.3 htmlwidgets_1.6.4 httr2_1.0.6
#> [25] plyr_1.8.9 cachem_1.1.0 buildtools_1.0.0
#> [28] GenomicAlignments_1.43.0 shiny.react_0.4.0 mime_0.12
#> [31] lifecycle_1.0.4 pkgconfig_2.0.3 Matrix_1.7-1
#> [34] R6_2.5.1 fastmap_1.2.0 GenomeInfoDbData_1.2.13
#> [37] digest_0.6.37 colorspace_2.1-1 GGally_2.2.1
#> [40] AnnotationDbi_1.69.0 OrganismDbi_1.49.0 Hmisc_5.2-0
#> [43] RSQLite_2.3.8 filelock_1.0.3 fansi_1.0.6
#> [46] httr_1.4.7 abind_1.4-8 compiler_4.4.2
#> [49] bit64_4.5.2 withr_3.0.2 htmlTable_2.4.3
#> [52] backports_1.5.0 BiocParallel_1.41.0 DBI_1.2.3
#> [55] ggstats_0.7.0 biomaRt_2.63.0 rappdirs_0.3.3
#> [58] DelayedArray_0.33.2 rjson_0.2.23 tools_4.4.2
#> [61] foreign_0.8-87 httpuv_1.6.15 nnet_7.3-19
#> [64] glue_1.8.0 restfulr_0.0.15 promises_1.3.0
#> [67] grid_4.4.2 checkmate_2.3.2 cluster_2.1.6
#> [70] reshape2_1.4.4 gtable_0.3.6 BSgenome_1.75.0
#> [73] tidyr_1.3.1 ensembldb_2.31.0 hms_1.1.3
#> [76] data.table_1.16.2 xml2_1.3.6 utf8_1.2.4
#> [79] pillar_1.9.0 stringr_1.5.1 later_1.3.2
#> [82] dplyr_1.1.4 BiocFileCache_2.15.0 lattice_0.22-6
#> [85] bit_4.5.0 biovizBase_1.55.0 RBGL_1.83.0
#> [88] tidyselect_1.2.1 maketools_1.3.1 knitr_1.49
#> [91] gridExtra_2.3 ProtGenerics_1.39.0 xfun_0.49
#> [94] stringi_1.8.4 UCSC.utils_1.3.0 lazyeval_0.2.2
#> [97] yaml_2.3.10 evaluate_1.0.1 codetools_0.2-20
#> [100] tibble_3.2.1 graph_1.85.0 BiocManager_1.30.25
#> [103] cli_3.6.3 rpart_4.1.23 xtable_1.8-4
#> [106] munsell_0.5.1 jquerylib_0.1.4 dichromat_2.0-0.1
#> [109] Rcpp_1.0.13-1 dbplyr_2.5.0 png_0.1-8
#> [112] XML_3.99-0.17 parallel_4.4.2 assertthat_0.2.1
#> [115] blob_1.2.4 prettyunits_1.2.0 AnnotationFilter_1.31.0
#> [118] bitops_1.0-9 txdbmaker_1.3.0 pwalign_1.3.0
#> [121] scales_1.3.0 purrr_1.0.2 crayon_1.5.3
#> [124] rlang_1.1.4 KEGGREST_1.47.0