The Global Alliance for Genomics and Health (GA4GH) was formed to help accelerate the potential of genomic medicine to advance human health. It brings together over 400 leading institutions working in healthcare, research, disease advocacy, life science, and information technology. The Data Working Group of the GA4GH developed data model schemas and application program interfaces (APIs) for genomic data. These APIs are specifically designed to allow sharing of genomics data in a standardized manner and without having to exchange complete experiments. They developed a reference implementation for these APIs providing a web server for hosting genomic data.
We developed the GA4GHclient package for retrieving and integrating genomic data from GA4GH-based databases. GA4GHshiny is a companion package providing graphical user interface based on web technologies for easly interacting with GA4GH-based databases such as Thousand Genomes and BRCA Exchange. The web interface integrates with Beacon Network providing a list of databases which have the user selected genomic variant (The Global Alliance for Genomics and Health 2016). It was developed using Shiny and related packages.
The package has only one function called app
, which runs
the web application. It can be done within RStudio however the Beacon
Network integration may not work. To solve this problem click in “Open
in Browser” button at the top of the window. For example, the code below
runs application connected at http://1kgenomes.ga4gh.org/.
library(org.Hs.eg.db)
library(TxDb.Hsapiens.UCSC.hg19.knownGene)
library(GA4GHshiny)
app(host = "http://1kgenomes.ga4gh.org/",
serverName = "Hosting Thousand Genomes Project",
orgDb = "org.Hs.eg.db",
txDb = "TxDb.Hsapiens.UCSC.hg19.knownGene")
The host
argument defines which GA4GH-based data server
to connect. This is the only required argument. There are some database
endpoints available.
The value of serverName
argument will show at the top
bar of the interface. If not defined it will show the value of
host
.
The orgDb
and the txDb
package names are
necessary for searching by gene symbol and by genomic feature such as
exons and transcripts. The TxDb package version should be the same of
the reference genome used by database.
GA4GHshiny application can be executed as web site
through Shiny
Server. Create an app.R
file inside ShinyServer
application directory adding the text below. For example
/srv/shiny-server/1kgenomes/app.R
.
library(org.Hs.eg.db)
library(TxDb.Hsapiens.UCSC.hg19.knownGene)
library(GA4GHshiny)
app(host = "http://1kgenomes.ga4gh.org/",
serverName = "Hosting Thousand Genomes Project",
orgDb = "org.Hs.eg.db",
txDb = "TxDb.Hsapiens.UCSC.hg19.knownGene")
The web application will be available at http://localhost:3838/1kgenomes/.
## R version 4.4.2 (2024-10-31)
## Platform: x86_64-pc-linux-gnu
## Running under: Ubuntu 24.04.1 LTS
##
## Matrix products: default
## BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
## LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.26.so; LAPACK version 3.12.0
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## time zone: Etc/UTC
## tzcode source: system (glibc)
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] BiocStyle_2.35.0
##
## loaded via a namespace (and not attached):
## [1] digest_0.6.37 R6_2.5.1 fastmap_1.2.0
## [4] xfun_0.49 maketools_1.3.1 cachem_1.1.0
## [7] knitr_1.49 htmltools_0.5.8.1 rmarkdown_2.29
## [10] buildtools_1.0.0 lifecycle_1.0.4 cli_3.6.3
## [13] sass_0.4.9 jquerylib_0.1.4 compiler_4.4.2
## [16] sys_3.4.3 tools_4.4.2 evaluate_1.0.1
## [19] bslib_0.8.0 yaml_2.3.10 BiocManager_1.30.25
## [22] jsonlite_1.8.9 rlang_1.1.4