Title: | Functional analysis of metabolic and transcriptomic data |
---|---|
Description: | Famat is made to collect data about lists of genes and metabolites provided by user, and to visualize it through a Shiny app. Information collected is: - Pathways containing some of the user's genes and metabolites (obtained using a pathway enrichment analysis). - Direct interactions between user's elements inside pathways. - Information about elements (their identifiers and descriptions). - Go terms enrichment analysis performed on user's genes. The Shiny app is composed of: - information about genes, metabolites, and direct interactions between them inside pathways. - an heatmap showing which elements from the list are in pathways (pathways are structured in hierarchies). - hierarchies of enriched go terms using Molecular Function and Biological Process. |
Authors: | Mathieu Charles [aut, cre] |
Maintainer: | Mathieu Charles <[email protected]> |
License: | GPL-3 |
Version: | 1.17.0 |
Built: | 2024-11-19 03:59:49 UTC |
Source: | https://github.com/bioc/famat |
Complete and prepare data obtained with interactions
function, to
use it in the Shiny interface. GO terms enrichment analysis is performed
using clusterProfiler.
compl_data(listparam)
compl_data(listparam)
listparam |
Output from |
A list containing :
heatmap |
Dataframe heatmap-like, with in abscissa elements of pathways ("X" is written if an element is present in a pathway), and with in ordinate hierarchies of pathways |
meta_list |
User's metabolites given in |
allResBP |
Results of Go BP terms enrichment analysis performed by clusterProfileR (20 best) |
go_genelist |
Dataframe containing enriched GO terms per genes of user's list |
allResMF |
Results of Go MF terms enrichment analysis performed by clusterProfileR (20 best) |
types |
Dataframe containing categories of pathways per pathways |
genetype |
List of genes categories, containing genes of user's list |
metatab |
Dataframe of metabolites and informations about them, as names and chebi ids |
genetab |
Dataframe of genes and informations about them, as gene symbols and names |
intetab |
Dataframe of direct interactions between elements of user's list, and informations about them as elements in the interaction, how they interact, and which pathways are concerned |
gomf_tab |
Dataframe of Go MF terms hierarchies containing our enriched GO terms, plus description of these GO terms and genes of user's list concerned by enriched GO terms |
gobp_tab |
Dataframe of Go BP terms hierarchies containing our enriched GO terms, plus description of these GO terms and genes of user's list concerned by enriched GO terms |
gene_list |
User's genes given in |
gomflist |
List containing GO MF terms hierarchies, with indices in the joliMF dataframe and genes concerned by the hierarchie |
gobplist |
List containing GO BP terms hierarchies, with indices in the joliMF dataframe and genes concerned by the hierarchy |
hierabrite |
List of pathways categories, containing pathways concerned by a category and their indices in "trait" dataframe |
hierapath |
List of hierarchies of pathways, containing indices of pathways in "trait" dataframe and elements contained in the hierarchy. |
save_cluster_elem |
Vector of clustered elements |
centrality |
Matrix trait-like with values of centralities (number of direct interactions between an element of user's list and other elements of the pathway) instead of "X" |
inter_values |
Matrix trait-like with values representing direct interactions (3/2/1, respectively for genes/metabolites implicated in a direct interaction, and for elements not implicated in a direct interaction) instead of "X" |
gene_notin |
Dataframe of genes which aren't in pathways and informations about them, as gene symbols and names |
sub |
Matrix trait-like with pathway and element informations instead of "X" |
This list is used by rshiny
function.
Emilie Secherre [email protected]
Guangchuang Yu, Li-Gen Wang, Yanyan Han and Qing-Yu He. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: A Journal of Integrative Biology 2012, 16(5):284-287
## load example data data(interactions_result) compl_data_result=compl_data(interactions_result)
## load example data data(interactions_result) compl_data_result=compl_data(interactions_result)
compl_data
function
Enriched and structured informations about genes, metabolites, their interactions, pathways and enriched GO terms.
data("compl_data_result")
data("compl_data_result")
List of 22 items.
Dataframe heatmap-like, with in abscissa elements of pathways ("X" is written if an element is present in a pathway), and with in ordinate hierarchies of pathways. A data frame with 16 rows and 9 variables.
Hierarchies of pathways obtained by pathways enrichment analysis. Pathways are given by their name.
Identifiers of pathways in hierarchies.
Metabolites ratio, so the part of user's metabolites in the total number of metabolites in the pathway.
Genes ratio, so the part of user's genes in the total number of genes in the pathway.
Just an empty column to separate heatmap data from pathways data (NA).
The value "1" shows that the element SLC6A12 is in the pathway on the row, "0" shows it is not in this pathway (0–1).
The value "1" shows that the element Betaine is in the pathway on the row, "0" shows it is not in this pathway (0–1).
The value "1" shows that the element ATP is in the pathway on the row, "0" shows it is not in this pathway (0–1).
The value "1" shows that the interaction Betaine / SLC6A12 is in the pathway on the row, "0" shows it is not in this pathway (0–1).
Vector containing user's metabolites (C00002, C00719)
Results of Go BP terms enrichment analysis performed by clusterProfileR (20 best). A data frame with 20 rows and 9 variables.
Identifiers of enriched GO terms.
Names of enriched GO terms.
Number of user's genes concerned by the enriched GO term, by total number of user's genes.
Number of genes concerned by the enriched GO term, by the total number of annotated genes in the database.
Pvalue of the go term enrichment analysis (0.001392161–0.011536208).
Adjusted pvalue of the go term enrichment analysis (0.01713291–0.02670275).
Qvalue of the go term enrichment analysis (0.001803464–0.002810816).
Entrez Gene identifiers of all genes concerned by the enriched GO term (30, 6539).
Number of genes concerned by the enriched GO term described by an Entrez Gene identifier(1).
Dataframe containing enriched GO terms per genes of user's list. A data frame with 13 rows and 2 variables.
Gene symbol of the gene concerned by an enriched GO term (SLC6A12, ACAA1).
Identifier of the GO term concerning the gene (GO:0005328, GO:0015293, GO:0003333, GO:0015812, GO:0015171, GO:0042165, GO:0008028, GO:0006635, GO:0006625, GO:0033540, GO:0036109, GO:0008206, GO:0000038)
Results of Go MF terms enrichment analysis performed by clusterProfileR (20 best) A data frame with 20 rows and 9 variables.
Identifiers of enriched GO terms.
Names of enriched GO terms.
Number of user's genes concerned by the enriched GO term, by total number of user's genes.
Number of genes concerned by the enriched GO term, by the total number of annotated genes in the database.
Pvalue of the go term enrichment analysis (0.001392161–0.011536208).
Adjusted pvalue of the go term enrichment analysis (0.01713291–0.02670275).
Qvalue of the go term enrichment analysis (0.001803464–0.002810816).
Entrez Gene identifiers of all genes concerned by the enriched GO term (30, 6539).
Number of genes concerned by the enriched GO term described by an Entrez Gene identifier(1).
Dataframe containing categories of pathways per pathways. A data frame with 12 rows and 2 variables.
Identifier of pathways from pathways enrichment analysis.
Name of the pathway category concerning a pathway.
List of genes categories, containing genes of user's list. A list of 3 items.
Dataframe of metabolites and informations about them, as names and chebi ids. A data frame with 2 rows and 2 variables.
Name of user's metabolites (ATP, Betaine).
Chebi identifier of user's metabolites (CHEBI:15422, CHEBI:17750).
Dataframe of genes and informations about them, as gene symbols and names. A data frame with 1 rows and 2 variables.
Gene symbols of user's genes contained in pathways (SLC6A12).
Name of user's genes contained in pathways (solute carrier family 6 member 12).
Dataframe of direct interactions between elements of user's list, and informations about them as elements in the interaction, how they interact, and which pathways are concerned A data frame with 1 row and 8 variables.
Summary of elements concerned by the interaction (Betaine / SLC6A12)
First element of the direct interaction (Betaine)
Description of how the two elements interact (Control(In: ACTIVATION of BiochemicalReaction), controls-transport-of-chemical).
Second element of the direct interaction (SLC6A12)
Value "1" means that a gene of the interaction is concerned by an enriched GO term, "0" means no element is concerned by an enriched GO term (1).
Pathways containing the direct interaction ("R-HSA-112310, R-HSA-112315, R-HSA-112316, R-HSA-382551, R-HSA-425366, R-HSA-425393, R-HSA-425407, R-HSA-888590, R-HSA-352230, R-HSA-442660, R-HSA-888593")
Interaction type, can be gene/gene, metabolite/metabolite, or gene/metabolite (g/m)
Categories of pathways containing the direct interaction (Neuronal System, Transport of small molecules)
Dataframe of Go MF terms hierarchies containing our enriched GO terms, plus description of these GO terms and genes of user's list concerned by enriched GO terms. A data frame with 93 rows and 3 variables.
Hierarchies of enriched GO terms.
Names of GO terms.
Genes concerned by GO terms.
Dataframe of Go BP terms hierarchies containing our enriched GO terms, plus description of these GO terms and genes of user's list concerned by enriched GO terms. A data frame with 107 rows and 3 variables.
Hierarchies of enriched GO terms.
Names of GO terms.
Genes concerned by GO terms.
Vector containing user's genes (ACAA1, SLC6A12)
List containing GO MF terms hierarchies, with indices in the joliMF dataframe and genes concerned by the hierarchie. A list of 3 items.
List containing GO BP terms hierarchies, with indices in the joliMF dataframe and genes concerned by the hierarchy. A list of 5 items.
List of pathways categories, containing pathways concerned by a category and their indices in "trait" dataframe. A list of 3 items.
List of hierarchies of pathways, containing indices of pathways in "trait" dataframe and elements contained in the hierarchy. A list of 3 items.
Vector of clustered elements
Matrix heatmap-like with values of centralities (number of direct interactions between an element of user's list and other elements of the pathway) instead of "X". Other cells contain the value "0" (0–65). An integer matrix with 16 rows and 9 columns.
Matrix heatmap-like with values representing direct interactions (3/2/1, respectively for genes/metabolites implicated in a direct interaction, and for elements not implicated in a direct interaction) instead of "X". Other cells contain the value "0" (0–3). An integer matrix with 16 rows and 9 columns.
Dataframe of genes which aren't in pathways and informations about them, as gene symbols and names. A data frame with 1 row and 2 variables.
Gene symbols of genes (ACAA1).
Names of genes (acetyl-CoA acyltransferase 1).
Matrix heatmap-like with pathway and element informations instead of "X". Cells with no informations contain only "". A character matrix with 16 rows and 9 columns.
compl_data
function
Example of a list of genes that can be provided by an user.
data("genes")
data("genes")
A vector with 2 observations (ACAA1, SLC6A12).
Sample of data from a study on chickens, under heat-stress condition.
Gather informations about direct interactions between genes and metabolites inside a pathway, and about pathways themselves. These informations are direct interactions between these two elements and number of relations between an element from the list provided by the user and other elements of the pathway (centrality). Direct interactions extraction was performed using BioPax, KGML and GPML files parsed with PaxtoolsR, graphite and author's parsers.
interactions(listk, listr, listw)
interactions(listk, listr, listw)
listk |
Output from |
listr |
Output from |
listw |
Output from |
A list containing :
size |
Dataframe containing pathways, genes and metabolites in pathways (from the list or not), and number of elements in pathways |
pathtot |
Dataframe containing pathways names and ids from pathway enrichment analysis on Reactome, Kegg and Wikipathways pathways |
tagged |
Dataframe containing direct interactions between elements from the user's list per pathways |
keggchebiname |
Dataframe containing all human metabolites ids (kegg and chebi) and names |
central |
List of pathways, each pathway containing the number of direct interactions between an element of user's list and other elements in the pathway |
no_path |
Dataframe containing direct interactions between elements from the user's list, but not per pathways |
genes |
User's genes given in |
meta |
User's metabolites given in |
This list is used by compl_data
function.
Emilie Secherre [email protected]
Luna, A., Babur, O., Aksoy, A. B, Demir, E., Sander, C. (2015).“PaxtoolsR: Pathway Analysis in R Using Pathway Commons.” Bioinformatics.
Sales G, Calura E, Cavalieri D, Romualdi C (2012). “graphite - a Bioconductor package to convert pathway topology to gene network.” BMC Bioinformatics. https://bmcbioinformatics.biomedcentral.com/articles/10 . 1186/1471-2105-13-20.
## load example data data(listk) data(listr) data(listw) interactions_result=interactions(listk,listr,listw)
## load example data data(listk) data(listr) data(listw) interactions_result=interactions(listk,listr,listw)
interactions
function
List containing informations about interactions between genes and metabolites, centrality and pathways. Direct interactions extraction was performed using BioPax, KGML and GPML files parsed with PaxtoolsR, graphite and author's parsers.
data("interactions_result")
data("interactions_result")
List of 8 items.
Description on which elements (from user's list or not) are contained in pathways from pathway enrichment analysis. A data frame with 286 rows and 9 variables.
Pathways obtained throught pathways enrichment analysis on KEgg, Reactome and Wikipathways pathways.
Number of user's genes contained in the pathway (0–2).
User's genes contained in the pathway (ACAA1, SLC6A12, ACAA1 # SLC6A12).
Total number of genes contained in the pathway (0–2075).
All the genes contained in the pathway.
Number of user's metabolites contained in the pathway (0–2).
User's metabolites contained in the pathway (Betaine, ATP, Betaine # ATP, ATP # Betaine).
Total number of metabolites contained in the pathway (0–915).
All the metabolites contained in the pathway.
All results of pathways enrichment analysis performed on Kegg, Reactome and Wikipathways pathways. A data frame with 286 rows and 2 variables.
Name of pathways resulting in genes pathway enrichment analysis performed on Kegg, Reactome and Wikipathways.
Identifiers of pathways resulting in genes pathway enrichment analysis performed on Kegg, Reactome and Wikipathways.
Description of all direct interactions between user's elements in pathways. A data frame with 11 rows and 6 variables.
First element of the direct interaction (Betaine, SLC6A12)
Description of how the two elements interact (Control(In: ACTIVATION of BiochemicalReaction), controls-transport-of-chemical).
Second element of the direct interaction (Betaine, SLC6A12)
Pathway containing the direct interaction (R-HSA-112310, R-HSA-112315, R-HSA-112316, R-HSA-382551, R-HSA-425366, R-HSA-425393, R-HSA-425407, R-HSA-888590, R-HSA-352230, R-HSA-442660, R-HSA-888593)
Summary of elements concerned by the interaction (Betaine / SLC6A12, SLC6A12 / Betaine)
Interaction type, can be gene/gene, metabolite/metabolite, or gene/metabolite (g/m)
Dataframe containing all human metabolites ids (kegg and chebi) and names. A data frame with 16075 rows and 3 variables.
Kegg_compound identifiers of all human metabolites.
Chebi identifiers of all human metabolites.
Names of all human metabolites.
List of pathways, each pathway containing the number of direct interactions between an element of user's list and other elements in the pathway. A list of 138 items.
Dataframe containing direct interactions between elements from the user's list, but not per pathways. A data frame with 1 rows and 6 variables.
First element of the direct interaction (Betaine, SLC6A12)
Description of how the two elements interact (Control(In: ACTIVATION of BiochemicalReaction), controls-transport-of-chemical).
Second element of the direct interaction (Betaine, SLC6A12)
Pathways containing the direct interaction ("R-HSA-112310, R-HSA-112315, R-HSA-112316, R-HSA-382551, R-HSA-425366, R-HSA-425393, R-HSA-425407, R-HSA-888590, R-HSA-352230, R-HSA-442660, R-HSA-888593")
Summary of elements concerned by the interaction (Betaine / SLC6A12)
Interaction type, can be gene/gene, metabolite/metabolite, or gene/metabolite (g/m)
Vector containing user's genes (ACAA1, SLC6A12)
Vector containing user's metabolites (C00002, C00719)
interactions
function.
Luna, A., Babur, O., Aksoy, A. B, Demir, E., Sander, C. (2015).“PaxtoolsR: Pathway Analysis in R Using Pathway Commons.” Bioinformatics.
Sales G, Calura E, Cavalieri D, Romualdi C (2012). “graphite - a Bioconductor package to convert pathway topology to gene network.” BMC Bioinformatics. https://bmcbioinformatics.biomedcentral.com/articles/10 . 1186/1471-2105-13-20.
Results of pathways enrichment analysis on the list of genes and metabolites, using KEGG pathways. Pathways enrichment analysis is performed using MPINet for metabolites and gprofiler2 for genes.
data("listk")
data("listk")
A list of 4 items.
Pathway enrichment analysis results for metabolites. A data frame with 7 rows and 2 variables.
Name of pathways resulting in metabolites pathway enrichment analysis.
Identifiers of pathways resulting in metabolites pathway enrichment analysis.
Pathway enrichment analysis results for genes. A data frame with 11 rows and 2 variables.
Name of pathways resulting in genes pathway enrichment analysis.
Identifiers of pathways resulting in genes pathway enrichment analysis.
Vector containing user's genes (ACAA1, SLC6A12)
Vector containing user's metabolites (C00002, C00719)
path_enrich
function.
Yanjun Xu, Chunquan Li and Xia Li (2013). MPINet: The package can implement the network-based metabolite pathway identification of pathways.. R package version 1.0. https://CRAN.R-project.org/package=MPINet
Liis Kolberg and Uku Raudvere (2020). gprofiler2: Interface to the 'g:Profiler' Toolset. R package version 0.2.0. https://CRAN.R-project.org/package=gprofiler2
Results of pathways enrichment analysis on the list of genes and metabolites, using Reactome pathways. Pathways enrichment analysis is performed using MPINet for metabolites and gprofiler2 for genes.
data("listr")
data("listr")
A list of 4 items.
Pathway enrichment analysis results for metabolites. A data frame with 278 rows and 2 variables.
Name of pathways resulting in metabolites pathway enrichment analysis.
Identifiers of pathways resulting in metabolites pathway enrichment analysis.
Pathway enrichment analysis results for genes. A data frame with 27 rows and 2 variables.
Name of pathways resulting in genes pathway enrichment analysis.
Identifiers of pathways resulting in genes pathway enrichment analysis.
Vector containing user's genes (ACAA1, SLC6A12)
Vector containing user's metabolites (C00002, C00719)
path_enrich
function.
Yanjun Xu, Chunquan Li and Xia Li (2013). MPINet: The package can implement the network-based metabolite pathway identification of pathways.. R package version 1.0. https://CRAN.R-project.org/package=MPINet
Liis Kolberg and Uku Raudvere (2020). gprofiler2: Interface to the 'g:Profiler' Toolset. R package version 0.2.0. https://CRAN.R-project.org/package=gprofiler2
Results of pathways enrichment analysis on the list of genes and metabolites, using Wikipathways pathways. Pathways enrichment analysis is performed using MPINet for metabolites and gprofiler2 for genes.
data("listw")
data("listw")
A list of 4 items.
Pathway enrichment analysis results for metabolites. A data frame with 48 rows and 2 variables.
Name of pathways resulting in metabolites pathway enrichment analysis.
Identifiers of pathways resulting in metabolites pathway enrichment analysis.
Pathway enrichment analysis results for genes. A data frame with 8 rows and 2 variables.
Name of pathways resulting in genes pathway enrichment analysis.
Identifiers of pathways resulting in genes pathway enrichment analysis.
Vector containing user's genes (ACAA1, SLC6A12)
Vector containing user's metabolites (C00002, C00719)
path_enrich
function.
Yanjun Xu, Chunquan Li and Xia Li (2013). MPINet: The package can implement the network-based metabolite pathway identification of pathways.. R package version 1.0. https://CRAN.R-project.org/package=MPINet
Liis Kolberg and Uku Raudvere (2020). gprofiler2: Interface to the 'g:Profiler' Toolset. R package version 0.2.0. https://CRAN.R-project.org/package=gprofiler2
Example of a list of metabolites that can be provided by an user.
data("meta")
data("meta")
A vector with 2 observations (C00002, C00719).
Sample of data from a study on chickens, under heat-stress condition.
MPINetData
of the systemThe variables in the environment variable MPINetData
of the system.
An environment variable
Yanjun Xu <[email protected]>, Chunquan Li <[email protected]> and Xia Li <[email protected]>
Perform a pathway enrichment analysis using a list of genes and a list of metabolites. Pathways enrichment analysis is performed using MPINet for metabolites and gprofiler2 for genes.
path_enrich(source, metabo, genes)
path_enrich(source, metabo, genes)
source |
Pathways database used, either Kegg ("KEGG"), Reactome ("REAC") or Wikipathays ("WP") |
metabo |
Dataframe with three columns : the first column contain the list of metabolites, the second some quantitative data about the metabolites, the last one words "DOWN" or "UP" depending on the metabolites concentration behavior in a certain condition. Last two columns can contain only/some NAs. All metabolites ids are KEGG Compound ids. |
genes |
Dataframe with three columns : the first column contain the list of genes, the second some quantitative data about the genes, the last one words "DOWN" or "UP" depending on the genes expression behavior in a certain condition. Last two columns can contain only/some NAs. All genes ids are gene symbol. |
A list containing :
resmeta |
Results of metabolites pathway enrichment analysis |
resgene |
Results of genes pathway enrichment analysis |
genes |
Vector containing genes |
metabo |
Vector containing metabolites |
This list is used by interactions
function.
Emilie Secherre [email protected]
Yanjun Xu, Chunquan Li and Xia Li (2013). MPINet: The package can implement the network-based metabolite pathway identification of pathways.. R package version 1.0. https://CRAN.R-project.org/package=MPINet
Liis Kolberg and Uku Raudvere (2020). gprofiler2: Interface to the 'g:Profiler' Toolset. R package version 0.2.0. https://CRAN.R-project.org/package=gprofiler2
## load example data data(genes) data(meta) ## perform pathway enrichment analysis on Reactome pathways listr=path_enrich("REAC", meta, genes)
## load example data data(genes) data(meta) ## perform pathway enrichment analysis on Reactome pathways listr=path_enrich("REAC", meta, genes)
Visualize and filter all functionnal informations gathered by famat using a Shiny interface.
rshiny(listdata)
rshiny(listdata)
listdata |
Output from |
Shiny interface
Emilie Secherre [email protected]
Winston Chang, Joe Cheng, JJ Allaire, Yihui Xie and Jonathan McPherson (2020). shiny: Web Application Framework for R. R package version 1.5.0. https://CRAN.R-project.org/package=shiny
## load example data data(compl_data_result) ## Not run: rshiny(compl_data_result)
## load example data data(compl_data_result) ## Not run: rshiny(compl_data_result)