Title: | Protein Profiling |
---|---|
Description: | Significance assessment for distance measures of time-course protein profiles |
Authors: | Julian Gehring |
Maintainer: | Julian Gehring <[email protected]> |
License: | GPL-3 |
Version: | 1.47.0 |
Built: | 2025-01-20 06:11:48 UTC |
Source: | https://github.com/bioc/proteinProfiles |
Significance assessment for distance measures of time-course protein profiles
The package is published under the GPL-3 license.
Julian Gehring
Maintainer: Julian Gehring <[email protected]>
filterFeatures, grepAnnotation, profileDistance, plotProfileDistance
Find protein group of interest based on pattern matching against the annotation data.
grepAnnotation(anno, pattern, column, ...)
grepAnnotation(anno, pattern, column, ...)
anno |
Data frame with annotation to analyze. The row names have to match rownames in the matrix containing the protein abundance data. Different columns correspond to different annotation columns, e.g. from different sources. |
pattern |
Character string representing a (sub)string or regular expression matched against the annotation. For details, see ‘grep’. |
column |
Column of the annotation data ‘pattern’ is matched against. |
... |
Optional arguments, passed to the ‘grep’ function. |
Protein identifiers (rownames of ‘anno’) specifying the protein group of interest in the data set.
grep, filterRatios, proteinProfiles
data(ips_sample) index_28S <- grepAnnotation(annotation, pattern="^28S", column="Protein.Name") index_ribosome <- grepAnnotation(annotation, "Ribosome", "KEGG")
data(ips_sample) index_28S <- grepAnnotation(annotation, pattern="^28S", column="Protein.Name") index_ribosome <- grepAnnotation(annotation, "Ribosome", "KEGG")
Filter out proteins exceeding a certain fraction of missing data points.
filterFeatures(values, maxNAfraction, verbose=FALSE, plot=FALSE, ...)
filterFeatures(values, maxNAfraction, verbose=FALSE, plot=FALSE, ...)
values |
Numeric matrix containing the data to analyze. |
maxNAfraction |
Numeric threshold specifying the maximum fraction of data points that can be missing (‘NA’) to still keep the protein. |
verbose |
Logical indicating whether to print the number of proteins before and after filtering (default: FALSE). |
plot |
Logical indicationg whether to plot a diagnostic plot showing the distribution of the fraction of missing data points associated with the proteins (default: FALSE). |
... |
Arguments passed to the '‘plot’' method. |
Numeric matrix with the same structure as ‘values’, with elements not matching the filter criteria removed.
data(ips_sample) ratios_filtered <- filterFeatures(ratios, 0.2, verbose=TRUE)
data(ips_sample) ratios_filtered <- filterFeatures(ratios, 0.2, verbose=TRUE)
IPS sample data, separated in protein ratios and annotation.
data(ips_sample)
data(ips_sample)
Matrix with protein profiles. Rows correspond to proteins, columns to samples.
Data frame with annotation columns, with row names matching those of ‘ratios.’
data(ips_sample) str(ratios) str(annotation)
data(ips_sample) str(ratios) str(annotation)
Compute distance between a protein group of interest and assess its significance by comparing it to the distances of randomly selected groups of proteins.
profileDistance(values, index, nSample=1000, seed) plotProfileDistance(z, ...)
profileDistance(values, index, nSample=1000, seed) plotProfileDistance(z, ...)
values |
Numeric matrix containing the protein data to analyze. |
index |
Protein identifiers for the protein group of interest in the data set ‘values’, as returned from ‘grepAnnotation’ or an equivalent function. |
nSample |
Integer specifying the number of randomly selected groups of proteins with the same number of elements as specified by ‘index’, serving as a reference for the significance assessment (default: 1000). |
seed |
Random seed, relevant for the permutation step. For details, see the ‘seed’ function. |
z |
Return object from ‘profileDistance.’ |
... |
Optional arguments passed to the ‘plot’ function. |
List with elements:
Numeric with mean distance $d_{0}$
of profiles for the proteins of interest.
Vector of numerics with mean distances $d_{1}$
of randomly selected groups of proteins.
Numeric with p-value $p$
.
data(ips_sample) index_28S <- grepAnnotation(annotation, pattern="^28S", column="Protein.Name") z <- profileDistance(ratios, index_28S) z$p.value plotProfileDistance(z)
data(ips_sample) index_28S <- grepAnnotation(annotation, pattern="^28S", column="Protein.Name") z <- profileDistance(ratios, index_28S) z$p.value plotProfileDistance(z)