Title: | Estimate protein complex membership using AP-MS protein data |
---|---|
Description: | Functions to estimate a bipartite graph of protein complex membership using AP-MS data. |
Authors: | Denise Scholtens <[email protected]> |
Maintainer: | Denise Scholtens <[email protected]> |
License: | LGPL |
Version: | 2.73.0 |
Built: | 2024-12-17 03:11:35 UTC |
Source: | https://github.com/bioc/apComplex |
List of complex estimates after filtering baits prone to systematic bias
data(gavin06FilteredEstimates) data(krogan06FilteredEstimates)
data(gavin06FilteredEstimates) data(krogan06FilteredEstimates)
gavin06FilteredEstimates
contains results from an analysis of the AP-MS
data published by Gavin et al. (2006).
These estimates were constructed using findComplexes
with a sensitivity
parameter of .70, specificity of .999, and commonFrac=0.5
krogan06FilteredEstimates
contains results from an analysis of the AP-MS
data published by Krogan et al. (2006).
These estimates were constructed using findComplexes
with a sensitivity
parameter of .70, specificity of .999, and commonFrac=1/3.
Both sets of estimates are reported as lists of length three, corresponding to MBME, SBMH, and UnRBB complex estimate types (see Scholtens et al., 2005). Each of the three elements contains a list of character vectors of estimated complex members.
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
Gavin, et al. (2006)
Krogan, et al. (2006)
gavinBP2006
,
kroganBPMat2006
,
findComplexes
data(gavin06FilteredEstimates) lapply(gavin06FilteredEstimates,FUN=length) data(krogan06FilteredEstimates) lapply(krogan06FilteredEstimates,FUN=length)
data(gavin06FilteredEstimates) lapply(gavin06FilteredEstimates,FUN=length) data(krogan06FilteredEstimates) lapply(krogan06FilteredEstimates,FUN=length)
A matrix and corresponding graph of AP-MS purifications from a small hypothetical experiment.
data(apEX) data(apEXG)
data(apEX) data(apEXG)
apEX
is a matrix of hypothetical AP-MS data. The 4 rows correspond to bait proteins and the 8 columns correspond to
proteins found as hits in the hypothetical experiment and are named accordingly.
The first 4 column names are the same as the 4 row names; bait proteins
can also be found as hits by other baits, hence their inclusion as columns in
apEX
. An entry of "1" in the ith row and jth column of apEX
indicates that bait protein i found protein j as a hit. All other entries are
"0". There are a total of 9 "1" entries in the matrix, corresponding to 9 comemberships detected in the experiment.
apEXG
is a graphNEL
object in which 8 nodes
represent proteins and 9 directed edges represent comemberships detected in
the purification. Each directed edge originates at the bait and ends at
the hit. Each edge in apEXG
corresponds to an entry of "1" in
the apEX
matrix.
These data are used in the apComplex
vignette.
data(apEX) apEX data(apEXG) adj(apEXG,"P1")
data(apEX) apEX data(apEXG) adj(apEXG,"P1")
Given an adjacency matrix of bait-hit AP-MS protein data, this function finds the maximal BH-complete subgraphs and reports them as an affiliation matrix.
bhmaxSubgraph(adjMat,VBs=NULL,VPs=NULL,unrecip=1)
bhmaxSubgraph(adjMat,VBs=NULL,VPs=NULL,unrecip=1)
adjMat |
|
VBs |
|
VPs |
|
unrecip |
By default set to 1 so that unreciprocated bait-bait edges are treated as present. If set to 0, unreciprocated bait-bait edges will be treated as absent. |
A BH-complete subgraph with n bait nodes and m hit-only nodes for AP-MS data is defined as a subgraph for which all n*(n-1)+nm directed edges exist. A maximal BH-complete subgraph is a BH-complete subgraph which is not contained in any other BH-complete subgraph.
If VBs
and/or VPs
are not specified, then by default VBs
will be assigned the set of baits that detect at least one prey and VPs
the set of prey that are detected by at least one bait.
By default, unreciprocated bait-bait observations will be
treated as present. If unrecip
is set to 0, they will be treated as
absent. If the sensitivity of the AP-MS technology is believed to be less
than the specificity, then it is suggested that unrecip
=1.
This function calls maxCliques
from the RBGL package.
A list of length one named 'maxCliques' which is itself a list of character vectors containing the names of the elements in the cliques.
Denise Scholtens
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
data(apEX) PCMG0 <- bhmaxSubgraph(apEX) PCMG1 <- mergeComplexes(PCMG0,apEX,sensitivity=.7,specificity=.75)
data(apEX) PCMG0 <- bhmaxSubgraph(apEX) PCMG1 <- mergeComplexes(PCMG0,apEX,sensitivity=.7,specificity=.75)
Performs all steps in the local modeling algorithm described by Scholtens and Gentleman (2004) and Scholtens, Vidal, and Gentleman (submitted), beginning with an adjacency matrix recording bait-hit AP-MS data.
findComplexes(adjMat,VBs=NULL,VPs=NULL,simMat=NULL,sensitivity=.75,specificity=.995,Beta=0,commonFrac=2/3,wsVal = 2e7)
findComplexes(adjMat,VBs=NULL,VPs=NULL,simMat=NULL,sensitivity=.75,specificity=.995,Beta=0,commonFrac=2/3,wsVal = 2e7)
adjMat |
Adjacency matrix of bait-hit data from an AP-MS experiment. Rows correspond to baits and columns to hits. |
VBs |
|
VPs |
|
simMat |
An optional square matrix with entries between 0 and 1. Rows and columns correspond to the proteins in the experiment, and should be reported in the same order as the columns of |
sensitivity |
Believed sensitivity of AP-MS technology. |
specificity |
Believed specificity of AP-MS technology. |
Beta |
Optional additional parameter for the weight to give data
in |
commonFrac |
This is the fraction of baits that need to be overlapping for a complex combination to be considered. |
wsVal |
A numeric. This is the value assigned as the work-space in the call to fisher.test |
findComplexes
performs all steps in the complex estimation algorithm using the apComplex package functions bhmaxSubgraph
, LCdelta
, and mergeComplexes
. These steps can also be performed separately by the user.
If VBs
and/or VPs
are not specified, then by default VBs
will be assigned the set of baits that detect at least one prey and VPs
the set of prey that are detected by at least one bait.
By default commonFrac
is set relatively high at 2/3. This means
that some potentially reasonable complex combinations could be missed. For
smaller data sets, users may consider decreasing the fraction. For larger
data sets, this may cause a large increase in computation time.
A list of character vectors containing the names of the proteins in the estimated complexes.
Denise Scholtens
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
data(apEX) PCMG2 <- findComplexes(apEX,sensitivity=.7,specificity=.75)
data(apEX) PCMG2 <- findComplexes(apEX,sensitivity=.7,specificity=.75)
TAP data published by Gavin, et al. (2006).
data(gavinBP2006)
data(gavinBP2006)
gavinBP2006
is a matrix of the TAP data published by Gavin, et al. (2006). The 1752 rows correspond to bait proteins and the 2551 columns correspond to
proteins found as hits in the TAP experiment and are named accordingly.
The first 1752 column names are the same as the 1752 row names; bait proteins
can also be found as hits by other baits, hence their inclusion as columns in
gavinBP2006
. An entry of "1" in the ith row and jth column of
gavinBP2006
indicates that bait protein i found protein j as a hit. All other entries are
"0". There are a total of 19105 "1" entries in the matrix, corresponding to
the 19105 comemberships detected in the experiment.
These data are available in the IntAct repository under Gavin et al - 2006.
Gavin, et al. Functional Organization of the Yeast Proteome by Systematic Analysis of the Yeast Complex. Nature 440, 631-636 (2002).
yTAP
,MBMEcTAP
,SBMHcTAP
,UnRBBcTAP
,HMSPCI
data(gavinBP2006) which(gavinBP2006[1,]==1)
data(gavinBP2006) which(gavinBP2006[1,]==1)
HMS-PCI data published by Ho, et al. (2002).
data(HMSPCI) data(HMSPCIgraph)
data(HMSPCI) data(HMSPCIgraph)
HMSPCI
is a matrix of the HMS-PCI data published by Ho, et al. (2002).
The 493 rows correspond to bait proteins and the 1578 columns correspond to
proteins found as hits in the HMSPCI experiment and are named accordingly.
The first 493 column names are the same as the 493 row names; bait proteins
can also be found as hits by other baits, hence their inclusion as columns in
HMSPCI
. An entry of "1" in the ith row and jth column of HMSPCI
indicates that bait protein i found protein j as a hit. All other entries are
"0". There are a total of 3687 "1" entries in the matrix, corresponding to
the 3687 comemberships detected in the experiment.
HMSPCIgraph
is a graphNEL
object in which 1578 nodes
represent proteins and 3687 directed edges represent comemberships detected in
the purification. Each directed edge originates at the bait and ends at
the hit. Each edge in HMSPCIgraph
corresponds to an entry of "1" in
the HMSPCI
matrix.
These data are available at http://www.mdsp.com/yeast.
Ho, et al. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature 415, 180-183 (2002).
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
MBMEcHMSPCI
,SBMHcHMSPCI
,UnRBBcHMSPCI
,TAP
data(HMSPCI) which(HMSPCI["YAL015C",]==1) data(HMSPCIgraph) adj(HMSPCIgraph,"YAL015C")
data(HMSPCI) which(HMSPCI["YAL015C",]==1) data(HMSPCIgraph) adj(HMSPCIgraph,"YAL015C")
Affiliation matrices with rows corresponding to proteins and columns corresponding to complexes.
data(MBMEcHMSPCI) data(SBMHcHMSPCI) data(UnRBBcHMSPCI)
data(MBMEcHMSPCI) data(SBMHcHMSPCI) data(UnRBBcHMSPCI)
These are the results from an analysis of the HMS-PCI data (Ho et al., 2002)
described by Scholtens and Gentleman (2004) and Scholtens, Vidal, and
Gentleman (submitted). These
estimates were constructed using findComplexes
with a sensitivity
parameter of .75, specificity of .99, and Beta=-0.2 for externally derived
similarity measure based on Gene Ontology cellular component annotation (see
Scholtens and Gentleman (2004)).
MBMEcHMSPCI
contains 242 multi-bait-multi-edge complex estimates.
SBMHcHMSPCI
contains 437 single-bait-multi-hit complex estimates.
UnRBBcHMSPCI
contains 329 unreciprocated bait-bait complex
estimates. These data are also available at
http://www.bioconductor.org/Docs/Papers/2003/apComplex.
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
Ho, et al. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature 415, 180-183 (2002).
HMSPCI
,HMSPCIgraph
,findComplexes
data(MBMEcHMSPCI) MBMEcHMSPCI[1:4,1:4] which(MBMEcHMSPCI[,"MBME1"]==1)
data(MBMEcHMSPCI) MBMEcHMSPCI[1:4,1:4] which(MBMEcHMSPCI[,"MBME1"]==1)
AP-MS data published by Krogan et al. (2004)
data(Krogan)
data(Krogan)
Krogan
is a matrix of the AP-MS data published by Krogan et al (2004).
The 153 rows correspond to bait proteins and the 485 columns correspond to
proteins found as hits in the AP-MS experiment and are named accordingly.
The first 153 column names are the same as the 153 row names; bait proteins
can also be found as hits by other baits, hence their inclusion as columns in
Krogan
. An entry of "1" in the ith row and jth column of Krogan
indicates that bait protein i found protein j as a hit. All other entries are
"0". There are a total of 1132 "1" entries in the matrix, corresponding to
the 1132 comemberships detected in the experiment.
These data are available at http://www.molecule.org/cgi/content/full/13/2/225/DC1/.
Krogan, et al. High-Definition Macromolecular Composition of Yeast RNA-Processing Complexes; Molecular Cell, Vol 13, 225-239, 30 January 2004.
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
data(Krogan)
data(Krogan)
Affiliation matrices with rows corresponding to proteins and columns corresponding to complexes.
data(MBMEcKrogan)
data(MBMEcKrogan)
These are the results from an analysis of the AP-MS data (Krogan et al., 2004).
These
estimates were constructed using findComplexes
with a sensitivity
parameter of .75, specificity of .99, and Beta=-0.2 for externally derived
similarity measure based on Gene Ontology cellular component annotation (see
Scholtens and Gentleman (2004)).
MBMEcHMSPCI
contains 82 multi-bait-multi-edge complex estimates.
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
High-Definition Macromolecular Composition of Yeast RNA-Processing Complexes; Molecular Cell, Vol 13, 225-239, 30 January 2004
data(MBMEcKrogan) MBMEcKrogan[1:4,1:4]
data(MBMEcKrogan) MBMEcKrogan[1:4,1:4]
TAP data published by Krogan, et al. (2006).
data(kroganBPMat2006)
data(kroganBPMat2006)
kroganBPMat2006
is a matrix of the TAP data published by Krogan, et al. (2006). The 2264 rows correspond to bait proteins and the 5361 columns correspond to
proteins found as hits in the TAP experiment and are named accordingly.
The first 2264 column names are the same as the 2264 row names; bait proteins
can also be found as hits by other baits, hence their inclusion as columns in
kroganBPMat2006
. An entry of "1" in the ith row and jth column of
kroganBPMat2006
indicates that bait protein i found protein j as a hit. All other entries are
"0". There are a total of "1" entries in the matrix, corresponding to
the 63360 comemberships detected in the experiment.
These data were obtained from the Primary Source -
N Krogan et al. Global Landscape of Protein Complexes in the Yeast, S. Cerevisiae. Nature, 440:667-643, 2006.
yTAP
,MBMEcTAP
,SBMHcTAP
,UnRBBcTAP
,HMSPCI
data(kroganBPMat2006) which(kroganBPMat2006[1,]==1)
data(kroganBPMat2006) which(kroganBPMat2006[1,]==1)
Computes the change in the P=LxC measure for AP-MS protein data when two protein complex estimates are combined into one complex.
These functions are used to evaluate changes in the penalized likelihood when two complexes are combined. They are not meant to be directly used.
The numeric value of the change in P=LxC when two complexes are combined.
Denise Scholtens
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
bhmaxSubgraph
,mergeComplexes
,findComplexes
data(apEX) PCMG0 <- bhmaxSubgraph(apEX) PCMG1 <- mergeComplexes(PCMG0,apEX,sensitivity=.7,specificity=.75)
data(apEX) PCMG0 <- bhmaxSubgraph(apEX) PCMG1 <- mergeComplexes(PCMG0,apEX,sensitivity=.7,specificity=.75)
Repeatedly applies the function LCdelta
to make combinations of columns in the affiliation matrix representing the protein complex membership graph (PCMG) for AP-MS data.
mergeComplexes(bhmax,adjMat,VBs=NULL,VPs=NULL,simMat=NULL,sensitivity=.75,specificity=.995,Beta=0,commonFrac=2/3,wsVal = 2e7)
mergeComplexes(bhmax,adjMat,VBs=NULL,VPs=NULL,simMat=NULL,sensitivity=.75,specificity=.995,Beta=0,commonFrac=2/3,wsVal = 2e7)
bhmax |
Initial complex estimates coming from bhmaxSubgraph |
adjMat |
Adjacency matrix of bait-hit data from an AP-MS experiment. Rows correspond to baits and columns to hits. |
VBs |
|
VPs |
|
simMat |
An optional square matrix with entries between 0 and 1. Rows and columns correspond to the proteins in the experiment, and should be reported in the same order as the columns of |
sensitivity |
Believed sensitivity of AP-MS technology. |
specificity |
Believed specificity of AP-MS technology. |
Beta |
Optional additional parameter for the weight to give data
in |
commonFrac |
This is the fraction of baits that need to be overlapping for a complex combination to be considered. |
wsVal |
A numeric. This is the value assigned to the work-space in the call to fisher.test. |
The local modeling algorithm for AP-MS data described by Scholtens and
Gentleman (2004) and Scholtens, Vidal, and Gentleman (2005) uses a
two-component measure of protein complex estimate quality, namely P=LxC.
Columns in cMat
represent individual complex estimates. The algorithm
works by starting with a maximal BH-complete subgraph estimate of cMat
,
and then improves the estimate by combining complexes such that P=LxC
increases.
By default commonFrac
is set relatively high at 2/3. This means
that some potentially reasonable complex combinations could be missed. For
smaller data sets, users may consider decreasing the fraction. For larger
data sets, this may cause a large increase in computation time.
A list of character vectors containing the names of the proteins in the estimated complexes.
Denise Scholtens
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
data(apEX) PCMG0 <- bhmaxSubgraph(apEX) PCMG1 <- mergeComplexes(PCMG0,apEX,sensitivity=.7,specificity=.75)
data(apEX) PCMG0 <- bhmaxSubgraph(apEX) PCMG1 <- mergeComplexes(PCMG0,apEX,sensitivity=.7,specificity=.75)
plotComplex renders complex estimates from the apComplex algorithm using Rgraphviz.
plotComplex(complexMembers,g,VBs,VPs,geneName=FALSE,baitColor="yellow",preyColor="white",recipLineColor="red",unrecipBBLineColor="blue",unrecipBPLineColor="gray",y="neato")
plotComplex(complexMembers,g,VBs,VPs,geneName=FALSE,baitColor="yellow",preyColor="white",recipLineColor="red",unrecipBBLineColor="blue",unrecipBPLineColor="gray",y="neato")
complexMembers |
A character vector of proteins composing a complex estimate. |
g |
An object of class graph, the full bait-prey graph of AP-MS data used in analysis. complexMembers must be a subset of the node names of g. |
VBs |
A vector of viable baits used in the AP-MS experiment. |
VPs |
A vector of viable prey used in the AP-MS experiment. |
geneName |
A logical indicating whether or not nodes should be plotted with common gene names as labels rather than systematic names. |
baitColor |
Color of bait nodes. |
preyColor |
Color of prey nodes. |
recipLineColor |
Color of edges connecting baits which both detected each other as prey |
.
unrecipBBLineColor |
Color of edges connecting baits in which one bait finds the other as prey but not vice versa. |
unrecipBPLineColor |
Color of edges extending from baits to proteins that were only used as prey, hence reciprocity is not possible. |
y |
Layout of plot |
This is a simple function for plotting complex estimates resulting from the apComplex algorithm. Giving the upcoming changes in Rgraphviz, it will likely be changed substantially.
A plotted graph of the complex estimate subgraph.
Denise Scholtens
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
data(apEX) data(apEXG) PCMG2 <- findComplexes(apEX,sensitivity=.7,specificity=.75) PCMG2sorted <- sortComplexes(PCMG2,apEX) VBs <- rownames(apEX) VPs <- setdiff(colnames(apEX),VBs) plotComplex(PCMG2sorted$MBME[[1]],g=apEXG,VBs=VBs, VPs=VPs)
data(apEX) data(apEXG) PCMG2 <- findComplexes(apEX,sensitivity=.7,specificity=.75) PCMG2sorted <- sortComplexes(PCMG2,apEX) VBs <- rownames(apEX) VPs <- setdiff(colnames(apEX),VBs) plotComplex(PCMG2sorted$MBME[[1]],g=apEXG,VBs=VBs, VPs=VPs)
Sorts complexes recorded in PCMG list into three separate lists containing MBME, SBMH, and UnRBB complexes.
sortComplexes(PCMG, adjMat)
sortComplexes(PCMG, adjMat)
PCMG |
Current PCMG estimate |
adjMat |
Adjacency matrix of bait-hit data from an AP-MS experiment. Rows correspond to baits and columns to hits. |
MBME complexes contain multiple bait proteins and multiple edges. SBMH complexes contain one bait and a collection of hit-only proteins. UnRBB complexes contain only two baits (no hit-only proteins) that are connected by an unreciprocated edge.
A list of lists representing the MBME, SBMH, and UnRBB complex estimates.
Denise Scholtens
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
data(apEX) PCMG2 <- findComplexes(apEX,sensitivity=.7,specificity=.75) sortComplexes(PCMG2,apEX)
data(apEX) PCMG2 <- findComplexes(apEX,sensitivity=.7,specificity=.75) sortComplexes(PCMG2,apEX)
TAP data published by Gavin, et al. (2002).
data(TAP) data(TAPgraph)
data(TAP) data(TAPgraph)
TAP
is a matrix of the TAP data published by Gavin, et al. (2002). The 455 rows correspond to bait proteins and the 1364 columns correspond to
proteins found as hits in the TAP experiment and are named accordingly.
The first 455 column names are the same as the 455 row names; bait proteins
can also be found as hits by other baits, hence their inclusion as columns in
TAP
. An entry of "1" in the ith row and jth column of TAP
indicates that bait protein i found protein j as a hit. All other entries are
"0". There are a total of 3420 "1" entries in the matrix, corresponding to
the 3420 comemberships detected in the experiment.
TAPgraph
is a graphNEL
object in which 1364 nodes
represent proteins and 3420 directed edges represent comemberships detected in
the purification. Each directed edge originates at the bait and ends at
the hit. Each edge in TAPgraph
corresponds to an entry of "1" in
the TAP
matrix.
These data are available in Supplementary Table S1 of Gavin et al. at http://www.nature.com.
Gavin, et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141-147 (2002).
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
yTAP
,MBMEcTAP
,SBMHcTAP
,UnRBBcTAP
,HMSPCI
data(TAP) which(TAP["Abd1",]==1) data(TAPgraph) adj(TAPgraph,"Abd1")
data(TAP) which(TAP["Abd1",]==1) data(TAPgraph) adj(TAPgraph,"Abd1")
Affiliation matrices with rows corresponding to proteins and columns corresponding to complexes.
data(MBMEcTAP) data(SBMHcTAP) data(UnRBBcTAP)
data(MBMEcTAP) data(SBMHcTAP) data(UnRBBcTAP)
These are the results from an analysis of the TAP data (Gavin et al., 2002) by
Scholtens and Gentleman (2004) and Scholtens, Vidal, and Gentleman
(submitted). These estimates were constructed using findComplexes
with a sensitivity
parameter of .75, specificity of .995, and Beta=-0.2 for externally derived
similarity measure based on Gene Ontology cellular component annotation (see
Scholtens and Gentleman (2004)).
MBMEcTAP
contains 260 multi-bait-multi-edge complex estimates.
SBMHcTAP
contains 325 single-bait-multi-hit complex estimates.
UnRBBcTAP
contains 123 unreciprocated bait-bait complex
estimates. These data are also available at
http://www.bioconductor.org/Docs/Papers/2003/apComplex.
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
Gavin, et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141-147 (2002).
TAP
, TAPgraph
,
yTAP
, findComplexes
data(MBMEcTAP) MBMEcTAP[1:10,1:3] which(MBMEcTAP[,"MBME1"]==1)
data(MBMEcTAP) MBMEcTAP[1:10,1:3] which(MBMEcTAP[,"MBME1"]==1)
Affiliation matrices with rows corresponding to proteins and columns corresponding to complexes.
data(yNameTAP)
data(yNameTAP)
These are the results from an analysis of the TAP data (Gavin et al., 2002) by
Scholtens and Gentleman (2004) and Scholtens, Vidal, and Gentleman
(submitted). These estimates were constructed using findComplexes
with a sensitivity
parameter of .75, specificity of .995, and Beta=-0.2 for externally derived
similarity measure based on Gene Ontology cellular component annotation (see
Scholtens and Gentleman (2004)).
yNameTAP
contains 260 multi-bait-multi-edge complex estimates.
Scholtens D and Gentleman R. Making sense of high-throughput protein-protein interaction data. Statistical Applications in Genetics and Molecular Biology 3, Article 39 (2004).
Scholtens D, Vidal M, and Gentleman R. Local modeling of global interactome networks. Bioinformatics 21, 3548-3557 (2005).
Gavin, et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141-147 (2002).
TAP
, TAPgraph
,
yTAP
, findComplexes
data(yNameTAP) yNameTAP[1:10,1:3]
data(yNameTAP) yNameTAP[1:10,1:3]
A list of the 232 'yTAP' complexes published by Gavin, et al. (2002) using their TAP data.
data(yTAP)
data(yTAP)
Each element of the yTAP list contains a vector of the proteins reported as part of the complex, as well as the functional annotation category published by Gavin, et al. (2002).
These data are available in Supplementary Table S3 of Gavin, et al. (2002) at http://www.nature.com and at http://yeast.cellzome.com.
Gavin, et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141-147 (2002).
TAP
,TAPgraph
,MBMEcTAP
,SBMHcTAP
,UnRBBcTAP
,HMSPCI
data(yTAP) yTAP[1]
data(yTAP) yTAP[1]