Title: | Interaction Based Homogeneity for Evaluating Gene Lists |
---|---|
Description: | This package contains methods for calculating Interaction Based Homogeneity to evaluate fitness of gene lists to an interaction network which is useful for evaluation of clustering results and gene list analysis. BioGRID interactions are used in the calculation. The user can also provide their own interactions. |
Authors: | Kircicegi Korkmaz, Volkan Atalay, Rengul Cetin Atalay. |
Maintainer: | Kircicegi Korkmaz <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.55.0 |
Built: | 2024-11-29 08:10:56 UTC |
Source: | https://github.com/bioc/ibh |
This package contains methods for evaluating Interaction Based Homogeneity for lists of genes. Given a gene list of n genes, we first form an adjacency matrix A whose rows and columns are genes in the list where
if genes i and j have an interaction in the network and
otherwise. The Interaction Based Homogeneity for a gene list
of size n is then calculated as:
Package: | ibh |
Type: | Package |
Version: | 1.0.0 |
Date: | 2011-01-19 |
License: | GPL (version 2 or newer) |
LazyLoad: | yes |
The user can provide his own interaction list or can use predefined gene lists which are created based on the BioGRID Interactions. Both gene lists and result of clustering methods such as kmeans or hclust can be used as inputs. Entrez identifiers, unique ids (systematic names) or official names can be used as gene/protein identifiers.
Kircicegi Korkmaz <[email protected]>, Volkan Atalay <[email protected]>, Rengul Cetin-Atalay <[email protected]>
Maintainer: Kircicegi Korkmaz <[email protected]>
Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M. Biogrid: A General Repository for Interaction Datasets. Nucleic Acids Res. Jan1; 34:D535-9
require(simpIntLists) data(ArabidopsisBioGRIDInteractionEntrezId) listofGeneList <- list(list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404), list( 832018, 839226, 839226, 838824)); ibhForMultipleGeneLists(ArabidopsisBioGRIDInteractionEntrezId, listofGeneList) require(simpIntLists) listofGeneList <- list(list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404), list( 832018, 839226, 839226, 838824)); ibhForMultipleGeneListsBioGRID(listofGeneList, organism="arabidopsis", idType = "EntrezId"); listofGeneList <- list(list("YJR151C", "YBL032W", "YAL040C", "YBL072C", "YCL050C", "YCR009C"), list("YDR063W", "YDR074W", "YDR080W", "YDR247W", "YGR183C", "YHL033C"), list("YOL068C" , "YOL015W" , "YOL009C" , "YOL004W" , "YOR065W" )); ibhForMultipleGeneListsBioGRID(listofGeneList, organism="yeast", idType = "UniqueId"); require(yeastCC) require(stats) data(yeastCC) subset <- exprs(yeastCC)[1:50,] d <- dist(subset,method="euclidean") k <- kmeans(d, 3); ibhClusterEvalBioGRID(k$cluster, rownames(subset), organism="yeast", idType="UniqueId")
require(simpIntLists) data(ArabidopsisBioGRIDInteractionEntrezId) listofGeneList <- list(list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404), list( 832018, 839226, 839226, 838824)); ibhForMultipleGeneLists(ArabidopsisBioGRIDInteractionEntrezId, listofGeneList) require(simpIntLists) listofGeneList <- list(list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404), list( 832018, 839226, 839226, 838824)); ibhForMultipleGeneListsBioGRID(listofGeneList, organism="arabidopsis", idType = "EntrezId"); listofGeneList <- list(list("YJR151C", "YBL032W", "YAL040C", "YBL072C", "YCL050C", "YCR009C"), list("YDR063W", "YDR074W", "YDR080W", "YDR247W", "YGR183C", "YHL033C"), list("YOL068C" , "YOL015W" , "YOL009C" , "YOL004W" , "YOR065W" )); ibhForMultipleGeneListsBioGRID(listofGeneList, organism="yeast", idType = "UniqueId"); require(yeastCC) require(stats) data(yeastCC) subset <- exprs(yeastCC)[1:50,] d <- dist(subset,method="euclidean") k <- kmeans(d, 3); ibhClusterEvalBioGRID(k$cluster, rownames(subset), organism="yeast", idType="UniqueId")
Find the index of an entry in an interaction list, used mostly for internal purposes
findEntry(interactionList, name)
findEntry(interactionList, name)
interactionList |
list of interactions |
name |
name to be searched in the interaction list |
an integer that is the index of name in the interaction list
require(simpIntLists) data(ArabidopsisBioGRIDInteractionEntrezId) findEntry(ArabidopsisBioGRIDInteractionEntrezId, 832179)
require(simpIntLists) data(ArabidopsisBioGRIDInteractionEntrezId) findEntry(ArabidopsisBioGRIDInteractionEntrezId, 832179)
This function calculates interaction based homogeneity of the given gene list according to the interaction list
ibh(interactionList, geneList)
ibh(interactionList, geneList)
interactionList |
List containing the interactions. For each gene/protein, the is an entry in the list with "name" containing name of the gen/protein and "interactors" containing the list of genes/proteins interacting with it. |
geneList |
List of genes/proteins for which interaction based homogeneity is evaluated. |
Interaction based homogeneity value as float
Kircicegi Korkmaz
require(simpIntLists) data(ArabidopsisBioGRIDInteractionEntrezId) geneList <- list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404); ibh(ArabidopsisBioGRIDInteractionEntrezId,geneList);
require(simpIntLists) data(ArabidopsisBioGRIDInteractionEntrezId) geneList <- list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404); ibh(ArabidopsisBioGRIDInteractionEntrezId,geneList);
This function calculated interactios based homogeneity for a gene list according to the BioGRID Interactions for seven organisms: Arabidopsis thaliana(arabidopsis), Caenerhabditis elegans(c.elegans), Drosophila melanogaster (fruitFly), Homo sapiens (human), Mus musculus (mouse), Saccharomyces cerevisae (yeast), Schizosaccharomyces pombe (s.pombe). Unique ids(systematic names), official names or Entrez ids can be used as identifier type.
ibhBioGRID(geneList, organism, idType = "EntrezId")
ibhBioGRID(geneList, organism, idType = "EntrezId")
geneList |
list of genes/proteins for which interaction based homogeneity is evaluated |
organism |
Organism name. Can be one of 'arabidopsis', 'c.elegans', 'fruitFly', 'human', 'mouse', 'yeast', 's.pombe'. |
idType |
Type of identifier used. Can be one of 'EntrezId', 'Official' and 'UniqueId' |
Float representing interaction based homogeneity for each list
Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M. Biogrid: A General Repository for Interaction Datasets. Nucleic Acids Res. Jan1; 34:D535-9
ibh
,ibhForMultipleGeneListsBioGRID
require(simpIntLists) geneList <- list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404); ibhBioGRID(geneList, organism="arabidopsis", idType = "EntrezId"); geneList <- list("YJR151C", "YBL032W", "YAL040C", "YBL072C", "YCL050C", "YCR009C"); ibhBioGRID(geneList, organism="yeast", idType = "UniqueId");
require(simpIntLists) geneList <- list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404); ibhBioGRID(geneList, organism="arabidopsis", idType = "EntrezId"); geneList <- list("YJR151C", "YBL032W", "YAL040C", "YBL072C", "YCL050C", "YCR009C"); ibhBioGRID(geneList, organism="yeast", idType = "UniqueId");
This function calculated interaction based homogeneity for a clustering result.
ibhClusterEval(cluster, allGenesList, interactionList)
ibhClusterEval(cluster, allGenesList, interactionList)
cluster |
result of clustering |
allGenesList |
list of genes in the same order of clustering object |
interactionList |
list containing the interactions. For each gene/protein, the is an entry in the list with "name" containing name of the gen/protein and "interactors" containing the list of genes/proteins interacting with it. |
A vector of floats representing interaction based homogeneity for each cluste
require(yeastCC) require(stats) data(yeastCC) require(simpIntLists) data(YeastBioGRIDInteractionUniqueId) subset <- exprs(yeastCC)[1:50,] d <- dist(subset,method="euclidean") k <- kmeans(d, 3); ibhClusterEval(k$cluster, rownames(subset), YeastBioGRIDInteractionUniqueId)
require(yeastCC) require(stats) data(yeastCC) require(simpIntLists) data(YeastBioGRIDInteractionUniqueId) subset <- exprs(yeastCC)[1:50,] d <- dist(subset,method="euclidean") k <- kmeans(d, 3); ibhClusterEval(k$cluster, rownames(subset), YeastBioGRIDInteractionUniqueId)
This function calculated interaction based homogeneity for a clustering result to the BioGRID Interactions for seven organisms: Arabidopsis thaliana(arabidopsis), Caenerhabditis elegans(c.elegans), Drosophila melanogaster (fruitFly), Homo sapiens (human), Mus musculus (mouse), Saccharomyces cerevisae (yeast), Schizosaccharomyces pombe (s.pombe). Unique ids(systematic names), official names or Entrez ids can be used as identifier type.
ibhClusterEvalBioGRID(cluster, allGenesList, organism, idType = "EntrezId")
ibhClusterEvalBioGRID(cluster, allGenesList, organism, idType = "EntrezId")
cluster |
result of clustering |
allGenesList |
list of genes in the same order of clustering object |
organism |
organism name, can be one of 'arabidopsis', 'c.elegans', 'fruitFly', 'human', 'mouse', 'yeast', 's.pombe'. |
idType |
type of identifier, can be one of 'EntrezId', 'Official' and 'UniqueId'. |
A vector of floats representing interaction based homogeneity for each cluste
Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M. Biogrid: A General Repository for Interaction Datasets. Nucleic Acids Res. Jan1; 34:D535-9
require(yeastCC) require(stats) require(simpIntLists) data(yeastCC) subset <- exprs(yeastCC)[1:50,] d <- dist(subset,method="euclidean") k <- kmeans(d, 3); ibhClusterEvalBioGRID(k$cluster, rownames(subset), organism="yeast", idType="UniqueId")
require(yeastCC) require(stats) require(simpIntLists) data(yeastCC) subset <- exprs(yeastCC)[1:50,] d <- dist(subset,method="euclidean") k <- kmeans(d, 3); ibhClusterEvalBioGRID(k$cluster, rownames(subset), organism="yeast", idType="UniqueId")
This function calculated interactios based homogeneity for multiple gene lists according to the interaction list
ibhForMultipleGeneLists(interactionList, listofGeneList)
ibhForMultipleGeneLists(interactionList, listofGeneList)
interactionList |
List containing the interactions. For each gene/protein, the is an entry in the list with "name" containing name of the gen/protein and "interactors" containing the list of genes/proteins interacting with it. |
listofGeneList |
List of list of genes/proteins for which interaction based homogeneity is evaluated. |
A vector of floats representing interaction based homogeneity for each list
Kircicegi Korkmaz
require(simpIntLists) data(ArabidopsisBioGRIDInteractionEntrezId) listofGeneList <- list(list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404), list( 832018, 839226, 839226, 838824)); ibhForMultipleGeneLists( ArabidopsisBioGRIDInteractionEntrezId, listofGeneList)
require(simpIntLists) data(ArabidopsisBioGRIDInteractionEntrezId) listofGeneList <- list(list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404), list( 832018, 839226, 839226, 838824)); ibhForMultipleGeneLists( ArabidopsisBioGRIDInteractionEntrezId, listofGeneList)
This function calculated interactios based homogeneity for multiple gene lists according to the BioGRID Interactions for seven organisms: Arabidopsis thaliana(arabidopsis), Caenerhabditis elegans(c.elegans), Drosophila melanogaster (fruitFly), Homo sapiens (human), Mus musculus (mouse), Saccharomyces cerevisae (yeast), Schizosaccharomyces pombe (s.pombe). Unique ids(systematic names), official names or Entrez ids can be used as identifier type.
ibhForMultipleGeneListsBioGRID(listofGeneList, organism, idType = "EntrezId")
ibhForMultipleGeneListsBioGRID(listofGeneList, organism, idType = "EntrezId")
listofGeneList |
List of list of genes/proteins for which interaction based homogeneity is evaluated. |
organism |
Organism name. Can be one of 'arabidopsis', 'c.elegans', 'fruitFly', 'human', 'mouse', 'yeast', 's.pombe'. |
idType |
Type of identifier. Can be one of 'EntrezId', 'Official' and 'UniqueId'. |
A vector of floats representing interaction based homogeneity for each list
Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M. Biogrid: A General Repository for Interaction Datasets. Nucleic Acids Res. Jan1; 34:D535-9
require(simpIntLists) listofGeneList <- list(list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404), list( 832018, 839226, 839226, 838824)); ibhForMultipleGeneListsBioGRID(listofGeneList, organism="arabidopsis", idType = "EntrezId"); listofGeneList <- list(list("YJR151C", "YBL032W", "YAL040C", "YBL072C", "YCL050C", "YCR009C"), list("YDR063W", "YDR074W", "YDR080W", "YDR247W", "YGR183C", "YHL033C"), list("YOL068C" , "YOL015W" , "YOL009C" , "YOL004W" , "YOR065W" )); ibhForMultipleGeneListsBioGRID(listofGeneList, organism="yeast", idType = "UniqueId");
require(simpIntLists) listofGeneList <- list(list(839226,817241, 824340, 832179, 818561, 831145, 838782, 826404), list( 832018, 839226, 839226, 838824)); ibhForMultipleGeneListsBioGRID(listofGeneList, organism="arabidopsis", idType = "EntrezId"); listofGeneList <- list(list("YJR151C", "YBL032W", "YAL040C", "YBL072C", "YCL050C", "YCR009C"), list("YDR063W", "YDR074W", "YDR080W", "YDR247W", "YGR183C", "YHL033C"), list("YOL068C" , "YOL015W" , "YOL009C" , "YOL004W" , "YOR065W" )); ibhForMultipleGeneListsBioGRID(listofGeneList, organism="yeast", idType = "UniqueId");
This function reads the directed interactions from a csv file and creates the interaction list. The csv file must contain two names: first gene/protein name, second the interactor.
readDirectedInteractionsFromCsv(fileName, sepValue, headerValue)
readDirectedInteractionsFromCsv(fileName, sepValue, headerValue)
fileName |
name of the CSV file containing te interactions |
sepValue |
the same as "sep" in read.csv function,it is the value of the field separator character. |
headerValue |
whether the CSV file has a header or not, TRUE if the file has a header row, FALSE otherwise |
A list containing the interactions. For each gene/protein, the is an entry in the list with "name" containing name of the gen/protein and "interactors" containing the list of genes/proteins interacting with it.
Kircicegi Korkmaz
##-interactionList <- readDirectedInteractionsFromCsv("Arabidopsis_BioGRID-.1.72.entrezid.csv", " ", FALSE);
##-interactionList <- readDirectedInteractionsFromCsv("Arabidopsis_BioGRID-.1.72.entrezid.csv", " ", FALSE);
This function reads the undirected interactions from a csv file and creates the interaction list. The csv file must contain two names: first gene/protein name, second the interactor.
readUndirectedInteractionsFromCsv(fileName, sepValue, headerValue)
readUndirectedInteractionsFromCsv(fileName, sepValue, headerValue)
fileName |
name of the CSV file containing te interactions |
sepValue |
the same as "sep" in read.csv function,it is the value of the field separator character. |
headerValue |
whether the CSV file has a header or not, TRUE if the file has a header row, FALSE otherwise |
A list containing the interactions. For each gene/protein, the is an entry in the list with "name" containing name of the gen/protein and "interactors" containing the list of genes/proteins interacting with it.
Kircicegi Korkmaz
##-interactionList <- readUndirectedInteractionsFromCsv("Arabidopsis_BioGRID-3.1.72.entrezid.csv", " ", FALSE);
##-interactionList <- readUndirectedInteractionsFromCsv("Arabidopsis_BioGRID-3.1.72.entrezid.csv", " ", FALSE);