Package 'phosphonormalizer' reference manual

Title:	Compensates for the bias introduced by median normalization in
Description:	It uses the overlap between enriched and non-enriched datasets to compensate for the bias introduced in global phosphorylation after applying median normalization.
Authors:	Sohrab Saraei [aut, cre], Tomi Suomi [ctb], Otto Kauko [ctb], Laura Elo [ths]
Maintainer:	Sohrab Saraei <[email protected]>
License:	GPL (>= 2)
Version:	1.31.0
Built:	2025-02-16 03:15:53 UTC
Source:	https://github.com/bioc/phosphonormalizer

Enriched dataset

Description

A dataset containing sequences, modifications and abundances of about 4000 peptides over 5 samples with 3 technical replicates each.

Usage

enriched.rd
enriched.rd

Format

A data frame with 4099 rows and 17 variables, all samples are median normalized:

Sequence: The sequence of the peptide
Modification: The modification and its location
gcNorm.ctrl2.1: Sample: Control 2 Technical Replicate: 1
gcNorm.ctrl2.2: Sample: Control 2 Technical Replicate: 2
gcNorm.ctrl2.3: Sample: Control 2 Technical Replicate: 3
gcNorm.ctrl1.1: Sample: Control 1 Technical Replicate: 1
gcNorm.ctrl1.2: Sample: Control 1 Technical Replicate: 2
gcNorm.ctrl1.3: Sample: Control 1 Technical Replicate: 3
gcNorm.CIP2A.1: Sample: CIP2A Technical Replicate: 1
gcNorm.CIP2A.2: Sample: CIP2A Technical Replicate: 2
gcNorm.CIP2A.3: Sample: CIP2A Technical Replicate: 3
gcNorm.RAS.1: Sample: RAS Technical Replicate: 1
gcNorm.RAS.2: Sample: RAS Technical Replicate: 2
gcNorm.RAS.3: Sample: RAS Technical Replicate: 3
gcNorm.OA.1: Sample: OA Technical Replicate: 1
gcNorm.OA.2: Sample: OA Technical Replicate: 2
gcNorm.OA.3: Sample: OA Technical Replicate: 3

...

Value

Example Non-enriched dataset

Source

http://www.nature.com/articles/srep13099

Non-enriched dataset

Description

A dataset containing sequences, modifications and abundances of about 17000 peptides measured over 5 samples with 3 technical replicates each.

Usage

non.enriched.rd
non.enriched.rd

Format

A data frame with 16982 rows and 17 variables, all samples are median normalized:

Sequence: The sequence of the peptide
Modification: The modification and its location
gcNorm.ctrl2.1: Sample: Control 2 Technical Replicate: 1
gcNorm.ctrl2.2: Sample: Control 2 Technical Replicate: 2
gcNorm.ctrl2.3: Sample: Control 2 Technical Replicate: 3
gcNorm.ctrl1.1: Sample: Control 1 Technical Replicate: 1
gcNorm.ctrl1.2: Sample: Control 1 Technical Replicate: 2
gcNorm.ctrl1.3: Sample: Control 1 Technical Replicate: 3
gcNorm.CIP2A.1: Sample: CIP2A Technical Replicate: 1
gcNorm.CIP2A.2: Sample: CIP2A Technical Replicate: 2
gcNorm.CIP2A.3: Sample: CIP2A Technical Replicate: 3
gcNorm.RAS.1: Sample: RAS Technical Replicate: 1
gcNorm.RAS.2: Sample: RAS Technical Replicate: 2
gcNorm.RAS.3: Sample: RAS Technical Replicate: 3
gcNorm.OA.1: Sample: OA Technical Replicate: 1
gcNorm.OA.2: Sample: OA Technical Replicate: 2
gcNorm.OA.3: Sample: OA Technical Replicate: 3

...

Value

Example Non-enriched dataset

Source

http://www.nature.com/articles/srep13099

Pairwise Normalization of MS-based phosphoproteomic data

Description

This function compensates for the bias introduced in global phosphorylation in the sample after using median normalization.

Usage

normalizePhospho(enriched, non.enriched, phospho = NULL, 
samplesCols, modseqCols, techRep, plot.fc=NULL)
normalizePhospho(enriched, non.enriched, phospho = NULL, 
samplesCols, modseqCols, techRep, plot.fc=NULL)

Arguments

`enriched`	The enriched data with the type data.frame or MSnSet, which should contain the sequence, modification of the sequence with their phosphorylation site and their abundanecs across samples
`non.enriched`	The non-enriched data with the type data.frame or MSnSet, which should contain the sequence, modification of the sequence with their phosphorylation site and their abundanecs across samples
`phospho`	a string that shows the term that represents phosphorylation in the modification column of the data. If it is not assigned, "Phospho" will be used as the default value
`samplesCols`	A data.frame with two columns, with the column names enriched and non.enriched, of type numeric or integer, which must contain the column number of samples that hold the abundances
`modseqCols`	A data.frame with two columns, with the names enriched and non.enriched, of type numeric or integer, which must contain the column number of samples that hold the sequence and modifications of the peptides
`techRep`	a factor that holds information about columns order and the technical replicates of the samples
`plot.fc`	This parameter if set plots the fold change distribution before and after pairwise normalization. controls and samples should be set as named vectors in a list (look at the example)

Details

It is shown that global median normalization can introduce bias in the fold change of global phosphorylation between samples. It is suggested that by taking the non-enriched data into consideration, this bias could be compensated (Kauko et al. 2015).

Value

A data.frame with the normalized values and their sequence and modification.

Author(s)

Sohrab Saraei, Tomi Suomi, Otto Kauko,Laura L. Elo

Maintainer: Sohrab Saraei <[email protected]>

References

http://www.nature.com/articles/srep13099

Examples

    #Specifying the column numbers of abundances in the original data.frame, 
    #from both enriched and non-enriched runs
    samplesCols <- data.frame(enriched=3:17, non.enriched=3:17)
    #Specifying the column numbers of sequence and modification in the original data.frame, 
    #from both enriched and non-enriched runs
    modseqCols <- data.frame(enriched = 1:2, non.enriched = 1:2)
    #The samples and their technical replicates
    techRep <- factor(x = c(1,1,1,2,2,2,3,3,3,4,4,4,5,5,5))
    #Call the function
    norm <- normalizePhospho(enriched = enriched.rd, non.enriched = non.enriched.rd, 
            samplesCols = samplesCols, modseqCols = modseqCols, techRep = techRep, 
			plot.fc = list(control = c(1,2), samples = c(3,4,5)))
    head(norm)
#Specifying the column numbers of abundances in the original data.frame, 
    #from both enriched and non-enriched runs
    samplesCols <- data.frame(enriched=3:17, non.enriched=3:17)
    #Specifying the column numbers of sequence and modification in the original data.frame, 
    #from both enriched and non-enriched runs
    modseqCols <- data.frame(enriched = 1:2, non.enriched = 1:2)
    #The samples and their technical replicates
    techRep <- factor(x = c(1,1,1,2,2,2,3,3,3,4,4,4,5,5,5))
    #Call the function
    norm <- normalizePhospho(enriched = enriched.rd, non.enriched = non.enriched.rd, 
            samplesCols = samplesCols, modseqCols = modseqCols, techRep = techRep, 
			plot.fc = list(control = c(1,2), samples = c(3,4,5)))
    head(norm)

Package 'phosphonormalizer'

Help Index

Enriched dataset

Description

Usage

Format

Value

Source

Non-enriched dataset

Description

Usage

Format

Value

Source

Pairwise Normalization of MS-based phosphoproteomic data

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples