Title: | Compensates for the bias introduced by median normalization in |
---|---|
Description: | It uses the overlap between enriched and non-enriched datasets to compensate for the bias introduced in global phosphorylation after applying median normalization. |
Authors: | Sohrab Saraei [aut, cre], Tomi Suomi [ctb], Otto Kauko [ctb], Laura Elo [ths] |
Maintainer: | Sohrab Saraei <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.31.0 |
Built: | 2024-11-18 03:53:40 UTC |
Source: | https://github.com/bioc/phosphonormalizer |
A dataset containing sequences, modifications and abundances of about 4000 peptides over 5 samples with 3 technical replicates each.
enriched.rd
enriched.rd
A data frame with 4099 rows and 17 variables, all samples are median normalized:
The sequence of the peptide
The modification and its location
Sample: Control 2 Technical Replicate: 1
Sample: Control 2 Technical Replicate: 2
Sample: Control 2 Technical Replicate: 3
Sample: Control 1 Technical Replicate: 1
Sample: Control 1 Technical Replicate: 2
Sample: Control 1 Technical Replicate: 3
Sample: CIP2A Technical Replicate: 1
Sample: CIP2A Technical Replicate: 2
Sample: CIP2A Technical Replicate: 3
Sample: RAS Technical Replicate: 1
Sample: RAS Technical Replicate: 2
Sample: RAS Technical Replicate: 3
Sample: OA Technical Replicate: 1
Sample: OA Technical Replicate: 2
Sample: OA Technical Replicate: 3
...
Example Non-enriched dataset
http://www.nature.com/articles/srep13099
A dataset containing sequences, modifications and abundances of about 17000 peptides measured over 5 samples with 3 technical replicates each.
non.enriched.rd
non.enriched.rd
A data frame with 16982 rows and 17 variables, all samples are median normalized:
The sequence of the peptide
The modification and its location
Sample: Control 2 Technical Replicate: 1
Sample: Control 2 Technical Replicate: 2
Sample: Control 2 Technical Replicate: 3
Sample: Control 1 Technical Replicate: 1
Sample: Control 1 Technical Replicate: 2
Sample: Control 1 Technical Replicate: 3
Sample: CIP2A Technical Replicate: 1
Sample: CIP2A Technical Replicate: 2
Sample: CIP2A Technical Replicate: 3
Sample: RAS Technical Replicate: 1
Sample: RAS Technical Replicate: 2
Sample: RAS Technical Replicate: 3
Sample: OA Technical Replicate: 1
Sample: OA Technical Replicate: 2
Sample: OA Technical Replicate: 3
...
Example Non-enriched dataset
http://www.nature.com/articles/srep13099
This function compensates for the bias introduced in global phosphorylation in the sample after using median normalization.
normalizePhospho(enriched, non.enriched, phospho = NULL, samplesCols, modseqCols, techRep, plot.fc=NULL)
normalizePhospho(enriched, non.enriched, phospho = NULL, samplesCols, modseqCols, techRep, plot.fc=NULL)
enriched |
The enriched data with the type data.frame or MSnSet, which should contain the sequence, modification of the sequence with their phosphorylation site and their abundanecs across samples |
non.enriched |
The non-enriched data with the type data.frame or MSnSet, which should contain the sequence, modification of the sequence with their phosphorylation site and their abundanecs across samples |
phospho |
a string that shows the term that represents phosphorylation in the modification column of the data. If it is not assigned, "Phospho" will be used as the default value |
samplesCols |
A data.frame with two columns, with the column names enriched and non.enriched, of type numeric or integer, which must contain the column number of samples that hold the abundances |
modseqCols |
A data.frame with two columns, with the names enriched and non.enriched, of type numeric or integer, which must contain the column number of samples that hold the sequence and modifications of the peptides |
techRep |
a factor that holds information about columns order and the technical replicates of the samples |
plot.fc |
This parameter if set plots the fold change distribution before and after pairwise normalization. controls and samples should be set as named vectors in a list (look at the example) |
It is shown that global median normalization can introduce bias in the fold change of global phosphorylation between samples. It is suggested that by taking the non-enriched data into consideration, this bias could be compensated (Kauko et al. 2015).
A data.frame with the normalized values and their sequence and modification.
Sohrab Saraei, Tomi Suomi, Otto Kauko,Laura L. Elo
Maintainer: Sohrab Saraei <[email protected]>
http://www.nature.com/articles/srep13099
#Specifying the column numbers of abundances in the original data.frame, #from both enriched and non-enriched runs samplesCols <- data.frame(enriched=3:17, non.enriched=3:17) #Specifying the column numbers of sequence and modification in the original data.frame, #from both enriched and non-enriched runs modseqCols <- data.frame(enriched = 1:2, non.enriched = 1:2) #The samples and their technical replicates techRep <- factor(x = c(1,1,1,2,2,2,3,3,3,4,4,4,5,5,5)) #Call the function norm <- normalizePhospho(enriched = enriched.rd, non.enriched = non.enriched.rd, samplesCols = samplesCols, modseqCols = modseqCols, techRep = techRep, plot.fc = list(control = c(1,2), samples = c(3,4,5))) head(norm)
#Specifying the column numbers of abundances in the original data.frame, #from both enriched and non-enriched runs samplesCols <- data.frame(enriched=3:17, non.enriched=3:17) #Specifying the column numbers of sequence and modification in the original data.frame, #from both enriched and non-enriched runs modseqCols <- data.frame(enriched = 1:2, non.enriched = 1:2) #The samples and their technical replicates techRep <- factor(x = c(1,1,1,2,2,2,3,3,3,4,4,4,5,5,5)) #Call the function norm <- normalizePhospho(enriched = enriched.rd, non.enriched = non.enriched.rd, samplesCols = samplesCols, modseqCols = modseqCols, techRep = techRep, plot.fc = list(control = c(1,2), samples = c(3,4,5))) head(norm)