Here we present a vignette for the R package tanggle, and provide an overview of its functions and their usage. Tanggle extends the ggtree R package (Yu et al. 2017) to allow for the visualization of several types of phylogenetic networks using the ggplot2 (Wickham 2016) syntax. More specifically, tanggle contains functions to allow the user to effectively plot: (1) split (i.e. implicit) networks (unrooted, undirected) and (2) explicit networks (rooted, directed) with reticulations. It offers an alternative to the plot functions already available in ape (Paradis and Schliep 2018) and phangorn (Schliep 2011).
Function name | Brief description |
---|---|
geom_splitnet |
Adds a splitnet layer to a ggplot, to combine visualising data and the network |
ggevonet |
Plots an explicit network from a phylo object |
ggsplitnet |
Plots an implicit network from a phylo object |
minimize_overlap |
Reduces the number of reticulation lines crossing over in the plot |
node_depth_evonet |
Returns the depths or heights of nodes and tips in the phylogenetic network |
Install the package from Bioconductor directly:
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("tanggle")
Or install the development version of the package from Github.
if (!requireNamespace("remotes", quietly=TRUE))
install.packages("remotes")
remotes::install_github("KlausVigo/tanggle")
If you need to install ggtree from github:
And load all the libraries:
Split networks are data-display objects which allow for the
definition of 2 (or more) options for non-compatible splits. Split
networks are most often used to visualize consensus networks (Holland et al. 2004) or neighbor-nets (Bryant and Moulton 2004). This can be done
either by using the consensusNet
or
neighbor-net
functions in phangorn (Schliep 2011) or by importing nexus files from
SplitsTree (Huson and Bryant 2006).
tanggle accepts three forms of input data for split networks. The following input options all generate a networx object for plotting.
Nexus file created with SplitsTree (Huson
and Bryant 2006) and read with the
read.nexus.network
function in phangorn (Schliep 2011).
Read in a split network in nexus format:
fdir <- system.file("extdata/trees", package = "phangorn")
Nnet <- phangorn::read.nexus.networx(file.path(fdir,"woodmouse.nxs"))
read.nexus
read.tree
A consensus split network is then
computed using the function consensusNet
in
phangorn (Schliep 2011).read.phyDat
in phangorn (Schliep 2011) or the function
read.dna
in ape (Paradis
and Schliep 2018). Distances matrices are then computed for
specific models of evolution using the function dist.ml
in
phangorn (Schliep 2011) or
dist.dna
in ape (Paradis
and Schliep 2018). From the distance matrix, a split network is
reconstructed using the function neighborNet
in
phangorn (Schliep 2011).
Optional: branch lengths may be estimated
using the function splitsNetworks
in phangorn
(Schliep 2011).We can plot the network with the default options:
When we can set the limits for the x and y axis so that the labels are readable.
You can rename tip labels. Here we changed the names to species from 1 to 15:
We can include the tip labels with geom_tiplab2
, and
customize some of the options. For example, here the tip labels are in
blue and both in bold and italics, and we show the internal nodes in
green:
ggsplitnet(Nnet) + geom_tiplab2(col = "blue", font = 4, hjust = -0.15) +
geom_nodepoint(col = "green", size = 0.25)
Nodes can also be annotated with geom_point
.
The function ggevonet
plots explicit networks
(phylogenetic trees with reticulations). A recent addition to
ape (Paradis and Schliep 2018)
made it possible to read in trees in extended newick format (Cardona, Rosselló, and Valiente 2008).
Read in an explicit network (example from Fig. 2 in Cardona et al. 2008):
Plot an explicit network:
This vignette illustrates all the functions in the R package tanggle, and provides some examples on how to plot both explicit and implicit networks. The split network plots should take most of the functions compatible with unrooted trees in ggtree. The layout options for explicit network plots are rectangular or slanted.
sessionInfo()
#> R version 4.4.2 (2024-10-31)
#> Platform: x86_64-pc-linux-gnu
#> Running under: Ubuntu 24.04.1 LTS
#>
#> Matrix products: default
#> BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
#> LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.26.so; LAPACK version 3.12.0
#>
#> locale:
#> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
#> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
#> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
#> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
#> [9] LC_ADDRESS=C LC_TELEPHONE=C
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
#>
#> time zone: Etc/UTC
#> tzcode source: system (glibc)
#>
#> attached base packages:
#> [1] stats graphics grDevices utils datasets methods base
#>
#> other attached packages:
#> [1] phangorn_2.12.1 ape_5.8-1 tanggle_1.13.0 ggtree_3.15.0
#> [5] ggplot2_3.5.1 BiocStyle_2.35.0
#>
#> loaded via a namespace (and not attached):
#> [1] yulab.utils_0.1.9 sass_0.4.9 generics_0.1.3
#> [4] tidyr_1.3.1 ggplotify_0.1.2 lattice_0.22-6
#> [7] digest_0.6.37 magrittr_2.0.3 evaluate_1.0.3
#> [10] grid_4.4.2 fastmap_1.2.0 Matrix_1.7-1
#> [13] jsonlite_1.8.9 BiocManager_1.30.25 purrr_1.0.2
#> [16] aplot_0.2.4 scales_1.3.0 codetools_0.2-20
#> [19] lazyeval_0.2.2 jquerylib_0.1.4 cli_3.6.3
#> [22] rlang_1.1.4 munsell_0.5.1 tidytree_0.4.6
#> [25] withr_3.0.2 cachem_1.1.0 yaml_2.3.10
#> [28] tools_4.4.2 parallel_4.4.2 dplyr_1.1.4
#> [31] colorspace_2.1-1 fastmatch_1.1-6 buildtools_1.0.0
#> [34] vctrs_0.6.5 R6_2.5.1 gridGraphics_0.5-1
#> [37] lifecycle_1.0.4 fs_1.6.5 ggfun_0.1.8
#> [40] treeio_1.31.0 pkgconfig_2.0.3 pillar_1.10.1
#> [43] bslib_0.8.0 gtable_0.3.6 glue_1.8.0
#> [46] Rcpp_1.0.14 xfun_0.50 tibble_3.2.1
#> [49] tidyselect_1.2.1 sys_3.4.3 knitr_1.49
#> [52] farver_2.1.2 igraph_2.1.3 htmltools_0.5.8.1
#> [55] nlme_3.1-166 patchwork_1.3.0 labeling_0.4.3
#> [58] rmarkdown_2.29 maketools_1.3.1 compiler_4.4.2
#> [61] quadprog_1.5-8