Package 'MAGeCKFlute'

Title:	Integrative Analysis Pipeline for Pooled CRISPR Functional Genetic Screens
Description:	CRISPR (clustered regularly interspaced short palindrome repeats) coupled with nuclease Cas9 (CRISPR/Cas9) screens represent a promising technology to systematically evaluate gene functions. Data analysis for CRISPR/Cas9 screens is a critical process that includes identifying screen hits and exploring biological functions for these hits in downstream analysis. We have previously developed two algorithms, MAGeCK and MAGeCK-VISPR, to analyze CRISPR/Cas9 screen data in various scenarios. These two algorithms allow users to perform quality control, read count generation and normalization, and calculate beta score to evaluate gene selection performance. In downstream analysis, the biological functional analysis is required for understanding biological functions of these identified genes with different screening purposes. Here, We developed MAGeCKFlute for supporting downstream analysis. MAGeCKFlute provides several strategies to remove potential biases within sgRNA-level read counts and gene-level beta scores. The downstream analysis with the package includes identifying essential, non-essential, and target-associated genes, and performing biological functional category analysis, pathway enrichment analysis and protein complex enrichment analysis of these genes. The package also visualizes genes in multiple ways to benefit users exploring screening data. Collectively, MAGeCKFlute enables accurate identification of essential, non-essential, and targeted genes, as well as their related biological functions. This vignette explains the use of the package and demonstrates typical workflows.
Authors:	Binbin Wang, Wubing Zhang, Feizhen Wu, Wei Li & X. Shirley Liu
Maintainer:	Wubing Zhang <[email protected]>
License:	GPL (>=3)
Version:	2.11.0
Built:	2024-10-31 00:48:08 UTC
Source:	https://github.com/bioc/MAGeCKFlute

Help Index

Kegg pathway view and arrange grobs on page
Bar plot
Batch effect removal
Visualize the estimate cell cycle compared to control.
Quantile of normal distribution.
Density plot
Density plot
Gene set enrichment analysis
Do enrichment analysis using hypergeometric test
Enrichment analysis using over-representation test
Enrichment analysis for Positive and Negative selection genes
Enrichment analysis
Simplify the enrichment results based on Jaccard index
Visualize enriched pathways and genes in those pathways
View enriched terms
Enrichment analysis for selected treatment related genes
Downstream analysis based on MAGeCK-MLE result
Downstream analysis based on MAGeCK-RRA result
Map values to colors
Retrieve gene annotations from the NCBI, HNSC, and Uniprot databases.
Get the kegg code of specific mammalia organism.
Retreive reference orthologs annotation.
Extract pathway annotation from GMT file.
Cluster and view cluster tree
Draw heatmap
Identical bar plot
Incorporate Depmap screen into analysis
Load processed Depmap data
View mapping ratio
MAplot of gene beta scores
Blank figure
normalize.loess
Normalize gene beta scores
Omit common essential genes based on depmap data
Rank plot
Read gene beta scores from MAGeCK-MLE results
ReadGMT
Read gene summary file in MAGeCK-RRA results
Read sgRNA summary in MAGeCK-RRA results
Compute the similarity between customized CRISPR screen with Depmap screens
Update genesets from source database
Scatter plot
Select signatures from candidate list (according to the consistence in most samples).
View sgRNA rank.
Scatter plot showing dots in 9 quadrants
Gene ID conversion
Violin plot
Volcano View
Write GMT file

Kegg pathway view and arrange grobs on page

Description

Kegg pathway view and arrange grobs on page.

Usage

arrangePathview(
  genelist,
  pathways = c(),
  top = 4,
  ncol = 2,
  title = NULL,
  sub = NULL,
  organism = "hsa",
  output = ".",
  path.archive = ".",
  kegg.native = TRUE,
  verbose = TRUE
)
arrangePathview(
  genelist,
  pathways = c(),
  top = 4,
  ncol = 2,
  title = NULL,
  sub = NULL,
  organism = "hsa",
  output = ".",
  path.archive = ".",
  kegg.native = TRUE,
  verbose = TRUE
)

Arguments

`genelist`	a data frame with columns of ENTREZID, Control and Treatment. The columns of Control and Treatment represent gene score in Control and Treatment sample.
`pathways`	character vector, the KEGG pathway ID(s), usually 5 digit, may also include the 3 letter KEGG species code.
`top`	integer, specifying how many top enriched pathways to be visualized.
`ncol`	integer, specifying how many column of figures to be arranged in each page.
`title`	optional string, or grob.
`sub`	optional string, or grob.
`organism`	character, either the kegg code, scientific name or the common name of the target species. This applies to both pathway and gene.data or cpd.data. When KEGG ortholog pathway is considered, species="ko". Default species="hsa", it is equivalent to use either "Homo sapiens" (scientific name) or "human" (common name).
`output`	Path to save plot to.
`path.archive`	character, the directory of KEGG pathway data file (.xml) and image file (.png). Users may supply their own data files in the same format and naming convention of KEGG's (species code + pathway id, e.g. hsa04110.xml, hsa04110.png etc) in this directory. Default kegg.dir="." (current working directory).
`kegg.native`	logical, whether to render pathway graph as native KEGG graph (.png) or using graphviz layout engine (.pdf). Default kegg.native=TRUE.
`verbose`	Boolean

Value

plot on the current device

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
colnames(dd)[2:3] = c("Control", "Treatment")
# arrangePathview(dd, c("hsa00534"), title=NULL, sub=NULL, organism="hsa")

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
colnames(dd)[2:3] = c("Control", "Treatment")
# arrangePathview(dd, c("hsa00534"), title=NULL, sub=NULL, organism="hsa")

Bar plot

Description

Bar plot

Usage

BarView(
  df,
  x = "x",
  y = "y",
  fill = "#FC6665",
  bar.width = 0.8,
  position = "dodge",
  dodge.width = 0.8,
  main = NA,
  xlab = NULL,
  ylab = NA,
  ...
)
BarView(
  df,
  x = "x",
  y = "y",
  fill = "#FC6665",
  bar.width = 0.8,
  position = "dodge",
  dodge.width = 0.8,
  main = NA,
  xlab = NULL,
  ylab = NA,
  ...
)

Arguments

`df`	A data frame.
`x`	A character, specifying the x-axis.
`y`	A character, specifying the y-axis.
`fill`	A character, specifying the fill color.
`bar.width`	A numeric, specifying the width of bar.
`position`	"dodge" (default), "stack", "fill".
`dodge.width`	A numeric, set the width in position_dodge.
`main`	A charater, specifying the figure title.
`xlab`	A character, specifying the title of x-axis.
`ylab`	A character, specifying the title of y-axis.
`...`	Other parameters in geom_bar

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

mdata = data.frame(group=letters[1:5], count=sample(1:100,5))
BarView(mdata, x = "group", y = "count")
mdata = data.frame(group=letters[1:5], count=sample(1:100,5))
BarView(mdata, x = "group", y = "count")

Batch effect removal

Description

Batch effect removal

Usage

BatchRemove(
  mat,
  batchMat,
  log2trans = FALSE,
  pca = TRUE,
  positive = FALSE,
  cluster = FALSE,
  outdir = NULL
)
BatchRemove(
  mat,
  batchMat,
  log2trans = FALSE,
  pca = TRUE,
  positive = FALSE,
  cluster = FALSE,
  outdir = NULL
)

Arguments

`mat`	A data frame, each row is a gene, and each column is a sample.
`batchMat`	A data frame, the first column should be 'Samples'(matched colnames of mat) and the second column is 'Batch'. The remaining columns could be Covariates.
`log2trans`	Boolean, specifying whether do logarithmic transformation before batch removal.
`pca`	Boolean, specifying whether return pca plot.
`positive`	Boolean, specifying whether all values should be positive.
`cluster`	Boolean, specifying whether perform hierarchical clustering.
`outdir`	Output directory for hierarchical cluster tree.

Value

A list contrains two objects, including data and p.

Author(s)

Wubing Zhang

Examples

edata = matrix(c(rnorm(2000, 5), rnorm(2000, 8)), 1000)
colnames(edata) = paste0("s", 1:4)
batchMat = data.frame(sample = colnames(edata), batch = rep(1:2, each = 2))
edata1 = BatchRemove(edata, batchMat)
print(edata1$p)

edata = matrix(c(rnorm(2000, 5), rnorm(2000, 8)), 1000)
colnames(edata) = paste0("s", 1:4)
batchMat = data.frame(sample = colnames(edata), batch = rep(1:2, each = 2))
edata1 = BatchRemove(edata, batchMat)
print(edata1$p)

Visualize the estimate cell cycle compared to control.

Description

Estimate cell cycle time in different samples by linear fitting of beta scores.

Usage

ConsistencyView(
  dat,
  ctrlname,
  treatname,
  main = NULL,
  filename = NULL,
  width = 5,
  height = 4,
  ...
)
ConsistencyView(
  dat,
  ctrlname,
  treatname,
  main = NULL,
  filename = NULL,
  width = 5,
  height = 4,
  ...
)

Arguments

`dat`	A data frame.
`ctrlname`	A character, specifying the names of control samples.
`treatname`	A character, specifying the names of treatment samples.
`main`	A character, specifying title.
`filename`	A character, specifying a file name to create on disk. Set filename to be "NULL", if don't want to save the figure.
`width`	Numeric, specifying width of figure.
`height`	Numeric, specifying height of figure.
`...`	Other available parameters in ggsave.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
ConsistencyView(dd, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1")

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
ConsistencyView(dd, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1")

Quantile of normal distribution.

Description

Compute cutoff from a normal-distributed vector.

Usage

CutoffCalling(d, scale = 2)
CutoffCalling(d, scale = 2)

Arguments

`d`	A numeric vector.
`scale`	Boolean or numeric, specifying how many standard deviation will be used as cutoff.

Value

A numeric value.

Examples

CutoffCalling(rnorm(10000))
CutoffCalling(rnorm(10000))

Density plot

Description

Plot the distribution of score differences between treatment and control.

Usage

DensityDiffView(
  dat,
  ctrlname = "Control",
  treatname = "Treatment",
  main = NULL,
  filename = NULL,
  width = 5,
  height = 4,
  ...
)
DensityDiffView(
  dat,
  ctrlname = "Control",
  treatname = "Treatment",
  main = NULL,
  filename = NULL,
  width = 5,
  height = 4,
  ...
)

Arguments

`dat`	A data frame.
`ctrlname`	A character, specifying the control samples.
`treatname`	A character, specifying the treatment samples.
`main`	A character, specifying title.
`filename`	A character, specifying a file name to create on disk. Set filename to be "NULL", if don't want to save the figure.
`width`	Numeric, specifying width of figure.
`height`	Numeric, specifying height of figure.
`...`	Other parameters in ggsave.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
# Density plot of beta score deviation between control and treatment
DensityDiffView(dd, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1")


file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
# Density plot of beta score deviation between control and treatment
DensityDiffView(dd, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1")

Density plot

Description

Plot the distribution of numeric vectors with the same length.

Usage

DensityView(
  dat,
  samples = NULL,
  main = NULL,
  xlab = "Score",
  filename = NULL,
  width = 5,
  height = 4,
  ...
)
DensityView(
  dat,
  samples = NULL,
  main = NULL,
  xlab = "Score",
  filename = NULL,
  width = 5,
  height = 4,
  ...
)

Arguments

`dat`	A data frame.
`samples`	A character vector, specifying columns in `dat` for plotting.
`main`	A character, specifying title.
`xlab`	A character, specifying title of x-axis.
`filename`	A character, specifying a file name to create on disk. Set filename to be "NULL", if don't want to save the figure.
`width`	Numeric, specifying width of figure.
`height`	Numeric, specifying height of figure.
`...`	Other available parameters in ggsave.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
DensityView(dd, samples=c("Pmel1_Ctrl", "Pmel1"))
#or
DensityView(dd[,-1])

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
DensityView(dd, samples=c("Pmel1_Ctrl", "Pmel1"))
#or
DensityView(dd[,-1])

Gene set enrichment analysis

Description

A universal gene set enrichment analysis tools

Usage

enrich.GSE(
  geneList,
  keytype = "Symbol",
  type = "GOBP",
  organism = "hsa",
  pvalueCutoff = 1,
  limit = c(2, 100),
  gmtpath = NULL,
  by = "fgsea",
  verbose = TRUE,
  ...
)
enrich.GSE(
  geneList,
  keytype = "Symbol",
  type = "GOBP",
  organism = "hsa",
  pvalueCutoff = 1,
  limit = c(2, 100),
  gmtpath = NULL,
  by = "fgsea",
  verbose = TRUE,
  ...
)

Arguments

`geneList`	A order ranked numeric vector with geneid as names
`keytype`	"Entrez", "Ensembl", or "Symbol"
`type`	Molecular signatures for testing, available datasets include Pathway (KEGG, REACTOME, C2_CP), GO (GOBP, GOCC, GOMF), MSIGDB (C1, C2 (C2_CP (C2_CP_PID, C2_CP_BIOCARTA), C2_CGP), C3 (C3_MIR, C3_TFT), C4, C6, C7, HALLMARK) and Complex (CORUM). Any combination of them are also accessible (e.g. 'GOBP+GOMF+KEGG+REACTOME')
`organism`	'hsa' or 'mmu'
`pvalueCutoff`	FDR cutoff
`limit`	A two-length vector, specifying the minimal and maximal size of gene sets for enrichent analysis
`gmtpath`	The path to customized gmt file
`by`	One of 'fgsea' or 'DOSE'
`verbose`	Boolean
`...`	Other parameter

Value

An enrichResult instance

Author(s)

Wubing Zhang

Examples

data(geneList, package = "DOSE")
## Not run: 
    enrichRes = enrich.GSE(geneList, keytype = "entrez")
    head(slot(enrichRes, "result"))

## End(Not run)
data(geneList, package = "DOSE")
## Not run: 
    enrichRes = enrich.GSE(geneList, keytype = "entrez")
    head(slot(enrichRes, "result"))

## End(Not run)

Do enrichment analysis using hypergeometric test

Description

Do enrichment analysis using hypergeometric test

Usage

enrich.HGT(
  geneList,
  keytype = "Symbol",
  type = "GOBP",
  organism = "hsa",
  pvalueCutoff = 1,
  limit = c(2, 100),
  universe = NULL,
  gmtpath = NULL,
  verbose = TRUE,
  ...
)
enrich.HGT(
  geneList,
  keytype = "Symbol",
  type = "GOBP",
  organism = "hsa",
  pvalueCutoff = 1,
  limit = c(2, 100),
  universe = NULL,
  gmtpath = NULL,
  verbose = TRUE,
  ...
)

Arguments

`geneList`	A numeric vector with gene as names
`keytype`	"Entrez", "Ensembl", or "Symbol"
`type`	Molecular signatures for testing, available datasets include Pathway (KEGG, REACTOME, C2_CP), GO (GOBP, GOCC, GOMF), MSIGDB (C1, C2 (C2_CP (C2_CP_PID, C2_CP_BIOCARTA), C2_CGP), C3 (C3_MIR, C3_TFT), C4, C6, C7, HALLMARK) and Complex (CORUM). Any combination of them are also accessible (e.g. 'GOBP+GOMF+KEGG+REACTOME')
`organism`	'hsa' or 'mmu'
`pvalueCutoff`	FDR cutoff
`limit`	A two-length vector, specifying the minimal and maximal size of gene sets for enrichent analysis
`universe`	A character vector, specifying the backgound genelist, default is whole genome
`gmtpath`	The path to customized gmt file
`verbose`	Boolean
`...`	Other parameter

Value

An enrichResult instance.

Author(s)

Wubing Zhang

Examples

data(geneList, package = "DOSE")
genes <- geneList[1:300]
enrichRes <- enrich.HGT(genes, type = "KEGG", keytype = "entrez")
head(slot(enrichRes, "result"))

data(geneList, package = "DOSE")
genes <- geneList[1:300]
enrichRes <- enrich.HGT(genes, type = "KEGG", keytype = "entrez")
head(slot(enrichRes, "result"))

Enrichment analysis using over-representation test

Description

Enrichment analysis using over-representation test

Usage

enrich.ORT(
  geneList,
  keytype = "Symbol",
  type = "GOBP",
  organism = "hsa",
  pvalueCutoff = 1,
  limit = c(2, 100),
  universe = NULL,
  gmtpath = NULL,
  verbose = TRUE,
  ...
)
enrich.ORT(
  geneList,
  keytype = "Symbol",
  type = "GOBP",
  organism = "hsa",
  pvalueCutoff = 1,
  limit = c(2, 100),
  universe = NULL,
  gmtpath = NULL,
  verbose = TRUE,
  ...
)

Arguments

`geneList`	A numeric vector with gene as names.
`keytype`	"Entrez" or "Symbol".
`type`	Molecular signatures for testing, available datasets include Pathway (KEGG, REACTOME, C2_CP), GO (GOBP, GOCC, GOMF), MSIGDB (C1, C2 (C2_CP (C2_CP_PID, C2_CP_BIOCARTA), C2_CGP), C3 (C3_MIR, C3_TFT), C4, C6, C7, HALLMARK) and Complex (CORUM). Any combination of them are also accessible (e.g. 'GOBP+GOMF+KEGG+REACTOME').
`organism`	'hsa' or 'mmu'.
`pvalueCutoff`	FDR cutoff.
`limit`	A two-length vector, specifying the minimal and maximal size of gene sets for enrichent analysis.
`universe`	A character vector, specifying the backgound genelist, default is whole genome.
`gmtpath`	The path to customized gmt file.
`verbose`	Boolean
`...`	Other parameter

Value

An enrichedResult instance.

Author(s)

Wubing Zhang

Examples

data(geneList, package = "DOSE")
genes <- geneList[1:100]
enrichedRes <- enrich.ORT(genes, keytype = "entrez")
head(slot(enrichedRes, "result"))

data(geneList, package = "DOSE")
genes <- geneList[1:100]
enrichedRes <- enrich.ORT(genes, keytype = "entrez")
head(slot(enrichedRes, "result"))

Enrichment analysis for Positive and Negative selection genes

Description

Do enrichment analysis for selected genes, in which positive selection and negative selection are termed as Positive and Negative

Usage

EnrichAB(
  data,
  enrich_method = "HGT",
  top = 10,
  limit = c(2, 100),
  filename = NULL,
  out.dir = ".",
  width = 6.5,
  height = 4,
  verbose = TRUE,
  ...
)
EnrichAB(
  data,
  enrich_method = "HGT",
  top = 10,
  limit = c(2, 100),
  filename = NULL,
  out.dir = ".",
  width = 6.5,
  height = 4,
  verbose = TRUE,
  ...
)

Arguments

`data`	A data frame.
`enrich_method`	One of "ORT" (Over-Representing Test) and "HGT" (HyperGemetric test).
`top`	An integer, specifying the number of pathways to show.
`limit`	A two-length vector, specifying the min and max size of pathways for enrichent analysis.
`filename`	Suffix of output file name.
`out.dir`	Path to save plot to (combined with filename).
`width`	As in ggsave.
`height`	As in ggsave.
`verbose`	Boolean
`...`	Other available parameters in ggsave.

Value

A list containing enrichment results for each group genes. This list contains eight items, which contain subitems of gridPlot and enrichRes.

Author(s)

Wubing Zhang

Enrichment analysis

Description

Enrichment analysis

Usage

EnrichAnalyzer(
  geneList,
  keytype = "Symbol",
  type = "Pathway+GOBP",
  method = "HGT",
  organism = "hsa",
  pvalueCutoff = 1,
  limit = c(2, 100),
  universe = NULL,
  filter = FALSE,
  gmtpath = NULL,
  verbose = TRUE
)
EnrichAnalyzer(
  geneList,
  keytype = "Symbol",
  type = "Pathway+GOBP",
  method = "HGT",
  organism = "hsa",
  pvalueCutoff = 1,
  limit = c(2, 100),
  universe = NULL,
  filter = FALSE,
  gmtpath = NULL,
  verbose = TRUE
)

Arguments

`geneList`	A numeric vector with gene as names.
`keytype`	"Entrez" or "Symbol".
`type`	Molecular signatures for testing, available datasets include Pathway (KEGG, REACTOME, C2_CP), GO (GOBP, GOCC, GOMF), MSIGDB (C1, C2 (C2_CP (C2_CP_PID, C2_CP_BIOCARTA), C2_CGP), C3 (C3_MIR, C3_TFT), C4, C6, C7, HALLMARK) and Complex (CORUM). Any combination of them are also accessible (e.g. 'GOBP+GOMF+KEGG+REACTOME').
`method`	One of "ORT"(Over-Representing Test), "GSEA"(Gene Set Enrichment Analysis), and "HGT"(HyperGemetric test).
`organism`	'hsa' or 'mmu'.
`pvalueCutoff`	FDR cutoff.
`limit`	A two-length vector (default: c(2, 200)), specifying the minimal and maximal size of gene sets for enrichent analysis.
`universe`	A character vector, specifying the backgound genelist, default is whole genome.
`filter`	Boolean, specifying whether filter out redundancies from the enrichment results.
`gmtpath`	The path to customized gmt file.
`verbose`	Boolean

Value

enrichRes is an enrichResult instance.

Author(s)

Wubing Zhang

Examples

data(geneList, package = "DOSE")
## Not run: 
  keggA = EnrichAnalyzer(geneList[1:500], keytype = "entrez")
  head(keggA@result)

## End(Not run)
data(geneList, package = "DOSE")
## Not run: 
  keggA = EnrichAnalyzer(geneList[1:500], keytype = "entrez")
  head(keggA@result)

## End(Not run)

Simplify the enrichment results based on Jaccard index

Description

Simplify the enrichment results based on Jaccard index

Usage

EnrichedFilter(enrichment = enrichment, cutoff = 0.8)
EnrichedFilter(enrichment = enrichment, cutoff = 0.8)

Arguments

`enrichment`	A data frame of enrichment result or an enrichResult object.
`cutoff`	A numeric, specifying the cutoff of Jaccard index between two pathways.

Value

A data frame.

Author(s)

Yihan Xiao

Examples

data(geneList, package = "DOSE")
## Not run: 
  enrichRes <- enrich.HGT(geneList, keytype = "entrez")
  EnrichedFilter(enrichRes)

## End(Not run)
data(geneList, package = "DOSE")
## Not run: 
  enrichRes <- enrich.HGT(geneList, keytype = "entrez")
  EnrichedFilter(enrichRes)

## End(Not run)

Visualize enriched pathways and genes in those pathways

Description

Visualize enriched pathways and genes in those pathways

Usage

EnrichedGeneView(
  enrichment,
  geneList,
  rank_by = "p.adjust",
  top = 5,
  bottom = 0,
  keytype = "Symbol",
  gene_cutoff = c(-log2(1.5), log2(1.5)),
  custom_gene = NULL,
  charLength = 40,
  filename = NULL,
  width = 7,
  height = 5,
  ...
)
EnrichedGeneView(
  enrichment,
  geneList,
  rank_by = "p.adjust",
  top = 5,
  bottom = 0,
  keytype = "Symbol",
  gene_cutoff = c(-log2(1.5), log2(1.5)),
  custom_gene = NULL,
  charLength = 40,
  filename = NULL,
  width = 7,
  height = 5,
  ...
)

Arguments

`enrichment`	A data frame of enrichment result or an `enrichResult` object.
`geneList`	A numeric geneList used in enrichment anlaysis.
`rank_by`	"p.adjust" or "NES", specifying the indices for ranking pathways.
`top`	An integer, specifying the number of positively enriched terms to show.
`bottom`	An integer, specifying the number of negatively enriched terms to show.
`keytype`	"Entrez" or "Symbol".
`gene_cutoff`	A two-length numeric vector, specifying cutoff for genes to show.
`custom_gene`	A character vector (gene names), customizing genes to show.
`charLength`	Integer, specifying max length of enriched term name to show as coordinate lab.
`filename`	Figure file name to create on disk. Default filename="NULL", which means no output.
`width`	As in ggsave.
`height`	As in ggsave.
`...`	Other available parameters in ggsave.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

data(geneList, package = "DOSE")
## Not run: 
  enrichRes <- enrich.GSE(geneList, keytype = "Entrez")
  EnrichedGeneView(enrichment=slot(enrichRes, "result"), geneList, keytype = "Entrez")

## End(Not run)
data(geneList, package = "DOSE")
## Not run: 
  enrichRes <- enrich.GSE(geneList, keytype = "Entrez")
  EnrichedGeneView(enrichment=slot(enrichRes, "result"), geneList, keytype = "Entrez")

## End(Not run)

View enriched terms

Description

Grid plot for enriched terms

Usage

EnrichedView(
  enrichment,
  rank_by = "pvalue",
  mode = 1,
  subset = NULL,
  top = 0,
  bottom = 0,
  x = "LogFDR",
  charLength = 40,
  filename = NULL,
  width = 7,
  height = 4,
  ...
)
EnrichedView(
  enrichment,
  rank_by = "pvalue",
  mode = 1,
  subset = NULL,
  top = 0,
  bottom = 0,
  x = "LogFDR",
  charLength = 40,
  filename = NULL,
  width = 7,
  height = 4,
  ...
)

Arguments

`enrichment`	A data frame of enrichment result, with columns of ID, Description, p.adjust and NES.
`rank_by`	"pvalue" or "NES", specifying the indices for ranking pathways.
`mode`	1 or 2.
`subset`	A vector of pathway ids.
`top`	An integer, specifying the number of upregulated terms to show.
`bottom`	An integer, specifying the number of downregulated terms to show.
`x`	Character, "NES", "LogP", or "LogFDR", indicating the variable on the x-axis.
`charLength`	Integer, specifying max length of enriched term name to show as coordinate lab.
`filename`	Figure file name to create on disk. Default filename="NULL".
`width`	As in ggsave.
`height`	As in ggsave.
`...`	Other available parameters in ggsave.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

data(geneList, package = "DOSE")
## Not run: 
    enrichRes = enrich.GSE(geneList, organism="hsa")
    EnrichedView(enrichRes, top = 5, bottom = 5)

## End(Not run)
data(geneList, package = "DOSE")
## Not run: 
    enrichRes = enrich.GSE(geneList, organism="hsa")
    EnrichedView(enrichRes, top = 5, bottom = 5)

## End(Not run)

Enrichment analysis for selected treatment related genes

Description

Do enrichment analysis for selected treatment related genes in 9-squares

Usage

EnrichSquare(
  beta,
  id = "GeneID",
  keytype = "Entrez",
  x = "Control",
  y = "Treatment",
  enrich_method = "ORT",
  top = 5,
  limit = c(2, 100),
  filename = NULL,
  out.dir = ".",
  width = 6.5,
  height = 4,
  verbose = TRUE,
  ...
)
EnrichSquare(
  beta,
  id = "GeneID",
  keytype = "Entrez",
  x = "Control",
  y = "Treatment",
  enrich_method = "ORT",
  top = 5,
  limit = c(2, 100),
  filename = NULL,
  out.dir = ".",
  width = 6.5,
  height = 4,
  verbose = TRUE,
  ...
)

Arguments

`beta`	Data frame, with columns of "GeneID", "group", and "Diff".
`id`	A character, indicating the gene column in the data.
`keytype`	A character, "Symbol" or "Entrez".
`x`	A character, indicating the x-axis in the 9-square scatter plot.
`y`	A character, indicating the y-axis in the 9-square scatter plot.
`enrich_method`	One of "ORT"(Over-Representing Test) and "HGT"(HyperGemetric test).
`top`	An integer, specifying the number of pathways to show.
`limit`	A two-length vector, specifying the min and max size of pathways for enrichent analysis.
`filename`	Suffix of output file name. NULL(default) means no output.
`out.dir`	Path to save plot to (combined with filename).
`width`	As in ggsave.
`height`	As in ggsave.
`verbose`	Boolean.
`...`	Other available parameters in ggsave.

Value

A list containing enrichment results for each group genes. Each item in the returned list has two sub items:

`gridPlot`	an object created by `ggplot`, which can be assigned and further customized.
`enrichRes`	a enrichResult instance.

Author(s)

Wubing Zhang

Downstream analysis based on MAGeCK-MLE result

Description

Integrative analysis pipeline using the gene summary table in MAGeCK MLE results

Usage

FluteMLE(
  gene_summary,
  treatname,
  ctrlname = "Depmap",
  keytype = "Symbol",
  organism = "hsa",
  incorporateDepmap = FALSE,
  cell_lines = NA,
  lineages = "All",
  norm_method = "cell_cycle",
  posControl = NULL,
  omitEssential = TRUE,
  top = 10,
  toplabels = NA,
  scale_cutoff = 2,
  limit = c(0, 200),
  enrich_method = "ORT",
  proj = NA,
  width = 10,
  height = 7,
  outdir = ".",
  pathview.top = 4,
  verbose = TRUE
)
FluteMLE(
  gene_summary,
  treatname,
  ctrlname = "Depmap",
  keytype = "Symbol",
  organism = "hsa",
  incorporateDepmap = FALSE,
  cell_lines = NA,
  lineages = "All",
  norm_method = "cell_cycle",
  posControl = NULL,
  omitEssential = TRUE,
  top = 10,
  toplabels = NA,
  scale_cutoff = 2,
  limit = c(0, 200),
  enrich_method = "ORT",
  proj = NA,
  width = 10,
  height = 7,
  outdir = ".",
  pathview.top = 4,
  verbose = TRUE
)

Arguments

`gene_summary`	A data frame or a file path to gene summary file generated by MAGeCK-MLE.
`treatname`	A character vector, specifying the names of treatment samples.
`ctrlname`	A character vector, specifying the names of control samples. If there is no controls in your CRISPR screen, you can specify "Depmap" as ctrlname and set 'incorporateDepmap=TRUE'.
`keytype`	"Entrez" or "Symbol".
`organism`	"hsa" or "mmu".
`incorporateDepmap`	Boolean, indicating whether incorporate Depmap data into analysis.
`cell_lines`	A character vector, specifying the cell lines in Depmap to be considered.
`lineages`	A character vector, specifying the lineages in Depmap to be considered.
`norm_method`	One of "none", "cell_cycle" (default) or "loess".
`posControl`	A character vector, specifying a list of positive control gene symbols.
`omitEssential`	Boolean, indicating whether omit common essential genes from the downstream analysis.
`top`	An integer, specifying the number of top selected genes to be labeled in rank figure and the number of top pathways to be shown.
`toplabels`	A character vector, specifying interested genes to be labeled in rank figure.
`scale_cutoff`	Boolean or numeric, specifying how many standard deviation will be used as cutoff.
`limit`	A two-length vector, specifying the minimal and maximal size of gene sets for enrichent analysis.
`enrich_method`	One of "ORT"(Over-Representing Test) and "HGT"(HyperGemetric test).
`proj`	A character, indicating the prefix of output file name, which can't contain special characters.
`width`	The width of summary pdf in inches.
`height`	The height of summary pdf in inches.
`outdir`	Output directory on disk.
`pathview.top`	Integer, specifying the number of pathways for pathview visualization.
`verbose`	Boolean

Details

MAGeCK-MLE can be used to analyze screen data from multi-conditioned experiments. MAGeCK-MLE also normalizes the data across multiple samples, making them comparable to each other. The most important ouput of MAGeCK MLE is 'gene_summary' file, which includes the beta scores of multiple conditions and the associated statistics. The 'beta score' for each gene describes how the gene is selected: a positive beta score indicates a positive selection, and a negative beta score indicates a negative selection.

The downstream analysis includes identifying essential, non-essential, and target-associated genes, and performing biological functional category analysis and pathway enrichment analysis of these genes. The function also visualizes genes in the context of pathways to benefit users exploring screening data.

Value

All of the pipeline results is output into the out.dir/MAGeCKFlute_proj, which includes a pdf file and many folders. The pdf file 'FluteMLE_proj_norm_method.pdf' is the summary of pipeline results. For each section in this pipeline, figures and useful data are outputed to corresponding subfolders.

QC: Quality control
Selection: Positive selection and negative selection.
Enrichment: Enrichment analysis for positive and negative selection genes.
PathwayView: Pathway view for top enriched pathways.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
## Not run: 
  # functional analysis for MAGeCK MLE results
  FluteMLE(file3, treatname = "Pmel1", ctrlname = "Pmel1_Ctrl", proj = "Pmel1")

## End(Not run)

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
## Not run: 
  # functional analysis for MAGeCK MLE results
  FluteMLE(file3, treatname = "Pmel1", ctrlname = "Pmel1_Ctrl", proj = "Pmel1")

## End(Not run)

Downstream analysis based on MAGeCK-RRA result

Description

Integrative analysis pipeline using the gene summary table in MAGeCK RRA results

Usage

FluteRRA(
  gene_summary,
  sgrna_summary = NULL,
  keytype = "Symbol",
  organism = "hsa",
  incorporateDepmap = FALSE,
  cell_lines = NA,
  lineages = "All",
  omitEssential = TRUE,
  top = 5,
  toplabels = NULL,
  scale_cutoff = 2,
  limit = c(2, 100),
  proj = NA,
  width = 12,
  height = 6,
  outdir = ".",
  verbose = TRUE
)
FluteRRA(
  gene_summary,
  sgrna_summary = NULL,
  keytype = "Symbol",
  organism = "hsa",
  incorporateDepmap = FALSE,
  cell_lines = NA,
  lineages = "All",
  omitEssential = TRUE,
  top = 5,
  toplabels = NULL,
  scale_cutoff = 2,
  limit = c(2, 100),
  proj = NA,
  width = 12,
  height = 6,
  outdir = ".",
  verbose = TRUE
)

Arguments

`gene_summary`	A file path or a data frame of gene summary data.
`sgrna_summary`	A file path or a data frame of sgRNA summary data.
`keytype`	"Entrez" or "Symbol".
`organism`	"hsa" or "mmu".
`incorporateDepmap`	Boolean, indicating whether incorporate Depmap data into analysis.
`cell_lines`	A character vector, specifying the cell lines in Depmap to be considered.
`lineages`	A character vector, specifying the lineages in Depmap to be considered.
`omitEssential`	Boolean, indicating whether omit common essential genes from the downstream analysis.
`top`	An integer, specifying the number of top selected genes to be labeled in rank figure and the number of top pathways to be shown.
`toplabels`	A character vector, specifying interested genes to be labeled in rank figure.
`scale_cutoff`	Boolean or numeric, specifying how many standard deviation will be used as cutoff.
`limit`	A two-length vector, specifying the minimal and maximal size of gene sets for enrichent analysis.
`proj`	A character, indicating the prefix of output file name.
`width`	The width of summary pdf in inches.
`height`	The height of summary pdf in inches.
`outdir`	Output directory on disk.
`verbose`	Boolean

Details

MAGeCK RRA allows for the comparison between two experimental conditions. It can identify genes and sgRNAs are significantly selected between the two conditions. The most important output of MAGeCK RRA is the file 'gene_summary.txt'. MAGeCK RRA will output both the negative score and positive score for each gene. A smaller score indicates higher gene importance. MAGeCK RRA will also output the statistical value for the scores of each gene. Genes that are significantly positively and negatively selected can be identified based on the p-value or FDR.

The downstream analysis of this function includes identifying positive and negative selection genes, and performing biological functional category analysis and pathway enrichment analysis of these genes.

Value

All of the pipeline results is output into the out.dir/proj_Results, which includes a pdf file and a folder named 'RRA'.

Author(s)

Wubing Zhang

Examples

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
file2 = file.path(system.file("extdata", package = "MAGeCKFlute"),
                  "testdata/rra.sgrna_summary.txt")
## Not run: 
    # Run the FluteRRA pipeline
    FluteRRA(file1, file2, proj="Pmel", organism="hsa", incorporateDepmap = FALSE,
    scale_cutoff = 1, outdir = "./")

## End(Not run)

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
file2 = file.path(system.file("extdata", package = "MAGeCKFlute"),
                  "testdata/rra.sgrna_summary.txt")
## Not run: 
    # Run the FluteRRA pipeline
    FluteRRA(file1, file2, proj="Pmel", organism="hsa", incorporateDepmap = FALSE,
    scale_cutoff = 1, outdir = "./")

## End(Not run)

Map values to colors

Description

Map values to colors

Usage

getCols(x, palette = 1)
getCols(x, palette = 1)

Arguments

`x`	A numeric vector.
`palette`	diverge, rainbow, sequential

Value

A vector of colors corresponding to input vector.

Author(s)

Wubing Zhang

Examples

getCols(1:4)
getCols(1:4)

Retrieve gene annotations from the NCBI, HNSC, and Uniprot databases.

Description

Retrieve gene annotations from the NCBI, HNSC, and Uniprot databases.

Usage

getGeneAnn(org = "hsa", update = FALSE)
getGeneAnn(org = "hsa", update = FALSE)

Arguments

`org`	Character, hsa (default), bta, cfa, mmu, ptr, rno, ssc are optional.
`update`	Boolean, indicating whether download current annotation.

Value

A data frame.

Author(s)

Wubing Zhang

Examples

## Not run: 
  ann = getGeneAnn("hsa")
  head(ann)

## End(Not run)

## Not run: 
  ann = getGeneAnn("hsa")
  head(ann)

## End(Not run)

Get the kegg code of specific mammalia organism.

Description

Get the kegg code of specific mammalia organism.

Usage

getOrg(organism)
getOrg(organism)

Arguments

organism

Character, KEGG species code, or the common species name. For all potential values check: data(bods); bods. Default org="hsa", and can also be "human" (case insensitive).

Value

A list containing three elements:

org

species

pkgannotation package name

Author(s)

Wubing Zhang

Examples

ann = getOrg("human")
print(ann$pkg)

ann = getOrg("human")
print(ann$pkg)

Retreive reference orthologs annotation.

Description

Retreive reference orthologs annotation.

Usage

getOrtAnn(fromOrg = "mmu", toOrg = "hsa", update = FALSE)
getOrtAnn(fromOrg = "mmu", toOrg = "hsa", update = FALSE)

Arguments

`fromOrg`	Character, hsa (default), bta, cfa, mmu, ptr, rno, ssc are optional.
`toOrg`	Character, hsa (default), bta, cfa, mmu, ptr, rno, ssc are optional.
`update`	Boolean, indicating whether download recent annotation from NCBI.

Value

A data frame.

Author(s)

Wubing Zhang

Examples

## Not run: 
  ann = getOrtAnn("mmu", "hsa")
  head(ann)

## End(Not run)

## Not run: 
  ann = getOrtAnn("mmu", "hsa")
  head(ann)

## End(Not run)

Extract pathway annotation from GMT file.

Description

Extract pathway annotation from GMT file.

Usage

gsGetter(
  gmtpath = NULL,
  type = "All",
  limit = c(0, Inf),
  organism = "hsa",
  update = FALSE
)
gsGetter(
  gmtpath = NULL,
  type = "All",
  limit = c(0, Inf),
  organism = "hsa",
  update = FALSE
)

Arguments

`gmtpath`	The path to customized gmt file.
`type`	Molecular signatures for testing, available datasets include Pathway (KEGG, REACTOME, C2_CP:PID, C2_CP:BIOCARTA), GO (GOBP, GOCC, GOMF), MSIGDB (C1, C2 (C2_CP (C2_CP:PID, C2_CP:BIOCARTA), C2_CGP), C3 (C3_MIR, C3_TFT), C4 (C4_CGN, C4_CM), C5 (C5_BP, C5_CC, C5_MF), C6, C7, H) and Complex (CORUM). Any combination of them are also accessible (e.g. 'GOBP+GOMF+KEGG+REACTOME').
`limit`	A two-length vector, specifying the minimal and maximal size of gene sets to load.
`organism`	'hsa' or 'mmu'.
`update`	Boolean, indicating whether update the gene sets from source database.

Value

A three-column data frame.

Author(s)

Wubing Zhang

Examples

gene2path = gsGetter(type = "REACTOME+KEGG")
head(gene2path)

gene2path = gsGetter(type = "REACTOME+KEGG")
head(gene2path)

Cluster and view cluster tree

Description

Cluster and view cluster tree

Usage

hclustView(
  d,
  method = "average",
  label_cols = NULL,
  bar_cols = NULL,
  main = NA,
  xlab = NA,
  horiz = TRUE,
  ...
)
hclustView(
  d,
  method = "average",
  label_cols = NULL,
  bar_cols = NULL,
  main = NA,
  xlab = NA,
  horiz = TRUE,
  ...
)

Arguments

`d`	A dissimilarity structure as produced by dist.
`method`	The agglomeration method to be used. This should be (an unambiguous abbreviation of) one of "ward.D", "ward.D2", "single", "complete", "average" (= UPGMA), "mcquitty" (= WPGMA), "median" (= WPGMC) or "centroid" (= UPGMC).
`label_cols`	A vector to be used as label's colors for the dendrogram.
`bar_cols`	Either a vector or a matrix, which will be plotted as a colored bar.
`main`	As in 'plot'.
`xlab`	As in 'plot'.
`horiz`	Logical indicating if the dendrogram should be drawn horizontally or not.
`...`	Arguments to be passed to methods, such as graphical parameters (see par).

Value

Plot figure on open device.

Author(s)

Wubing Zhang

Examples

label_cols = rownames(USArrests)
hclustView(dist(USArrests), label_cols=label_cols, bar_cols=label_cols)

label_cols = rownames(USArrests)
hclustView(dist(USArrests), label_cols=label_cols, bar_cols=label_cols)

Draw heatmap

Description

Draw heatmap

Usage

HeatmapView(
  mat,
  limit = c(-2, 2),
  na_col = "gray70",
  colPal = rev(colorRampPalette(c("#c12603", "white", "#0073B6"), space = "Lab")(199)),
  filename = NA,
  width = NA,
  height = NA,
  ...
)
HeatmapView(
  mat,
  limit = c(-2, 2),
  na_col = "gray70",
  colPal = rev(colorRampPalette(c("#c12603", "white", "#0073B6"), space = "Lab")(199)),
  filename = NA,
  width = NA,
  height = NA,
  ...
)

Arguments

`mat`	Matrix like object, each row is gene and each column is sample.
`limit`	Max value in heatmap
`na_col`	Color for missing values
`colPal`	colorRampPalette.
`filename`	File path where to save the picture.
`width`	Manual option for determining the output file width in inches.
`height`	Manual option for determining the output file height in inches.
`...`	Other parameters in pheatmap.

Value

Invisibly a pheatmap object that is a list with components.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
gg = cor(dd[,2:ncol(dd)])
HeatmapView(gg, display_numbers = TRUE)

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
gg = cor(dd[,2:ncol(dd)])
HeatmapView(gg, display_numbers = TRUE)

Identical bar plot

Description

Identical bar plot

Usage

IdentBarView(
  gg,
  x = "x",
  y = "y",
  fill = c("#CF3C2B", "#394E80"),
  main = NULL,
  xlab = NULL,
  ylab = NULL,
  filename = NULL,
  width = 5,
  height = 4,
  ...
)
IdentBarView(
  gg,
  x = "x",
  y = "y",
  fill = c("#CF3C2B", "#394E80"),
  main = NULL,
  xlab = NULL,
  ylab = NULL,
  filename = NULL,
  width = 5,
  height = 4,
  ...
)

Arguments

`gg`	A data frame.
`x`	A character, indicating column (in countSummary) of x-axis.
`y`	A character, indicating column (in countSummary) of y-axis.
`fill`	A character, indicating fill color of all bars.
`main`	A charater, specifying the figure title.
`xlab`	A character, specifying the title of x-axis.
`ylab`	A character, specifying the title of y-axis.
`filename`	Figure file name to create on disk. Default filename="NULL", which means don't save the figure on disk.
`width`	As in ggsave.
`height`	As in ggsave.
`...`	Other available parameters in ggsave.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file4 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/countsummary.txt")
countsummary = read.delim(file4, check.names = FALSE)
IdentBarView(countsummary, x="Label", y="Reads")

file4 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/countsummary.txt")
countsummary = read.delim(file4, check.names = FALSE)
IdentBarView(countsummary, x="Label", y="Reads")

Incorporate Depmap screen into analysis

Description

Incorporate Depmap screen into analysis

Usage

IncorporateDepmap(
  dd,
  symbol = "id",
  cell_lines = NA,
  lineages = "All",
  na.rm = FALSE
)
IncorporateDepmap(
  dd,
  symbol = "id",
  cell_lines = NA,
  lineages = "All",
  na.rm = FALSE
)

Arguments

`dd`	A data frame.
`symbol`	A character, specifying the column name of gene symbols in the data frame.
`cell_lines`	A character vector, specifying the cell lines for incorporation.
`lineages`	A character vector, specifying the cancer types for incorporation.
`na.rm`	Boolean, indicating whether removing NAs from the results.

Value

A data frame with Depmap column (average CERES scores across selected cell lines) attached.

Author(s)

Wubing Zhang

Examples

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
head(gdata)
## Not run: 
  gdata = IncorporateDepmap(gdata)
  head(gdata)

## End(Not run)
file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
head(gdata)
## Not run: 
  gdata = IncorporateDepmap(gdata)
  head(gdata)

## End(Not run)

Load processed Depmap data

Description

Load processed Depmap data

Usage

LoadDepmap()
LoadDepmap()

Value

A list including two elements, one is the Depmap CRISPR data, and the other is the sample annotation data.

Author(s)

Wubing Zhang

Examples

## Not run: 
  depmapDat = LoadDepmap()

## End(Not run)
## Not run: 
  depmapDat = LoadDepmap()

## End(Not run)

View mapping ratio

Description

View mapping ratio of each sample

Usage

MapRatesView(
  countSummary,
  Label = "Label",
  Reads = "Reads",
  Mapped = "Mapped",
  filename = NULL,
  width = 5,
  height = 4,
  ...
)
MapRatesView(
  countSummary,
  Label = "Label",
  Reads = "Reads",
  Mapped = "Mapped",
  filename = NULL,
  width = 5,
  height = 4,
  ...
)

Arguments

`countSummary`	A data frame, which contains columns of 'Label', 'Reads', and 'Mapped'
`Label`	A character, indicating column (in countSummary) of sample names.
`Reads`	A character, indicating column (in countSummary) of total reads.
`Mapped`	A character, indicating column (in countSummary) of mapped reads.
`filename`	Figure file name to create on disk. Default filename="NULL", which means don't save the figure on disk.
`width`	As in ggsave.
`height`	As in ggsave.
`...`	Other available parameters in ggsave.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file4 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/countsummary.txt")
countsummary = read.delim(file4, check.names = FALSE)
MapRatesView(countsummary)

file4 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/countsummary.txt")
countsummary = read.delim(file4, check.names = FALSE)
MapRatesView(countsummary)

MAplot of gene beta scores

Description

MAplot of gene beta scores in Control vs Treatment

Usage

MAView(
  beta,
  ctrlname = "Control",
  treatname = "Treatment",
  main = NULL,
  show.statistics = TRUE,
  add.smooth = TRUE,
  lty = 1,
  smooth.col = "red",
  plot.method = c("loess", "lm", "glm", "gam"),
  filename = NULL,
  width = 5,
  height = 4,
  ...
)
MAView(
  beta,
  ctrlname = "Control",
  treatname = "Treatment",
  main = NULL,
  show.statistics = TRUE,
  add.smooth = TRUE,
  lty = 1,
  smooth.col = "red",
  plot.method = c("loess", "lm", "glm", "gam"),
  filename = NULL,
  width = 5,
  height = 4,
  ...
)

Arguments

`beta`	Data frame, including `ctrlname` and `treatname` as columns.
`ctrlname`	Character vector, specifying the name of control sample.
`treatname`	Character vector, specifying the name of treatment sample.
`main`	As in plot.
`show.statistics`	Show statistics .
`add.smooth`	Whether add a smooth line to the plot.
`lty`	Line type for smooth line.
`smooth.col`	Color of smooth line.
`plot.method`	A string specifying the method to fit smooth line, which should be one of "loess" (default), "lm", "glm" and "gam".
`filename`	Figure file name to create on disk. Default filename="NULL", which means don't save the figure on disk.
`width`	As in ggsave.
`height`	As in ggsave.
`...`	Other available parameters in function 'ggsave'.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
MAView(dd, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1")
dd2 = NormalizeBeta(dd, method="loess", org = "mmu")
MAView(dd2, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1")

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
MAView(dd, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1")
dd2 = NormalizeBeta(dd, method="loess", org = "mmu")
MAView(dd2, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1")

Blank figure

Description

Blank figure

Usage

noEnrichPlot(main = "No enriched terms")
noEnrichPlot(main = "No enriched terms")

Arguments

main

The title of figure.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

normalize.loess

Description

Loess normalization method.

Usage

normalize.loess(
  mat,
  subset = sample(1:(dim(mat)[1]), min(c(5000, nrow(mat)))),
  epsilon = 10^-2,
  maxit = 1,
  log.it = FALSE,
  verbose = TRUE,
  span = 2/3,
  family.loess = "symmetric",
  ...
)
normalize.loess(
  mat,
  subset = sample(1:(dim(mat)[1]), min(c(5000, nrow(mat)))),
  epsilon = 10^-2,
  maxit = 1,
  log.it = FALSE,
  verbose = TRUE,
  span = 2/3,
  family.loess = "symmetric",
  ...
)

Arguments

`mat`	A matrix with columns containing the values of the chips to normalize.
`subset`	A subset of the data to fit a loess to.
`epsilon`	A tolerance value (supposed to be a small value - used as a stopping criterion).
`maxit`	Maximum number of iterations.
`log.it`	Logical. If `TRUE` it takes the log2 of `mat`.
`verbose`	Logical. If `TRUE` displays current pair of chip being worked on.
`span`	Parameter to be passed the function `loess`
`family.loess`	Parameter to be passed the function `loess`. `"gaussian"` or `"symmetric"` are acceptable values for this parameter.
`...`	Any of the options of normalize.loess you would like to modify (described above).

Value

A matrix similar as mat.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
beta_loess = normalize.loess(dd[,-1])

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
beta_loess = normalize.loess(dd[,-1])

Normalize gene beta scores

Description

Two normalization methods are available. cell_cycle method normalizes gene beta scores based on positive control genes in CRISPR screening. loess method normalizes gene beta scores using loess.

Usage

NormalizeBeta(
  beta,
  id = 1,
  method = "cell_cycle",
  posControl = NULL,
  samples = NULL,
  org = "hsa"
)
NormalizeBeta(
  beta,
  id = 1,
  method = "cell_cycle",
  posControl = NULL,
  samples = NULL,
  org = "hsa"
)

Arguments

`beta`	Data frame.
`id`	An integer specifying the column of gene.
`method`	Character, one of 'cell_cycle'(default) and 'loess'. or character string giving the name of the table column containing the gene names.
`posControl`	A character vector, specifying a list of positive control genes.
`samples`	Character vector, specifying the sample names in beta columns. If NULL (default), take all beta columns as samples.
`org`	"hsa", "mmu", "bta", "cfa", "ptr", "rno", or "ssc" indicating the organism.

Details

In CRISPR screens, cells treated with different conditions (e.g., with or without drug) may have different proliferation rates. So it's necessary to normalize the proliferation rate based on defined positive control genes among samples. After normalization, the beta scores are comparable across samples. loess is another optional normalization method, which is used to normalize array data before.

Value

A data frame with same format as input data beta.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
## Not run: 
  #Cell Cycle normalization
  dd_essential = NormalizeBeta(dd, method="cell_cycle", org = "mmu")
  head(dd_essential)

## End(Not run)
#Optional loess normalization (not recommended)
dd_loess = NormalizeBeta(dd, method="loess")
head(dd_loess)

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
## Not run: 
  #Cell Cycle normalization
  dd_essential = NormalizeBeta(dd, method="cell_cycle", org = "mmu")
  head(dd_essential)

## End(Not run)
#Optional loess normalization (not recommended)
dd_loess = NormalizeBeta(dd, method="loess")
head(dd_loess)

Omit common essential genes based on depmap data

Description

Omit common essential genes based on depmap data

Usage

OmitCommonEssential(
  dd,
  symbol = "id",
  lineages = "All",
  cell_lines = NULL,
  dependency = -0.5
)
OmitCommonEssential(
  dd,
  symbol = "id",
  lineages = "All",
  cell_lines = NULL,
  dependency = -0.5
)

Arguments

`dd`	A data frame.
`symbol`	A character, specifying the column name of gene symbols in the data frame.
`lineages`	A character vector, specifying the lineages for selecting essential genes.
`cell_lines`	A character vector, specifying cell lines for selecting essential genes.
`dependency`	A numeric, specifying the threshold for selecting essential genes.

Value

A data frame.

Author(s)

Wubing Zhang

Examples

## Not run: 
  file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
                    "testdata/rra.gene_summary.txt")
  gdata = ReadRRA(file1)
  dim(gdata)
  rra.omit = OmitCommonEssential(gdata)
  dim(rra.omit)

## End(Not run)

## Not run: 
  file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
                    "testdata/rra.gene_summary.txt")
  gdata = ReadRRA(file1)
  dim(gdata)
  rra.omit = OmitCommonEssential(gdata)
  dim(rra.omit)

## End(Not run)

Rank plot

Description

Draw the score and rank of genes on a scatter plot.

Usage

RankView(
  rankdata,
  genelist = NULL,
  decreasing = TRUE,
  top = 5,
  bottom = 5,
  cutoff = 2,
  main = NULL,
  filename = NULL,
  width = 5,
  height = 4,
  ...
)
RankView(
  rankdata,
  genelist = NULL,
  decreasing = TRUE,
  top = 5,
  bottom = 5,
  cutoff = 2,
  main = NULL,
  filename = NULL,
  width = 5,
  height = 4,
  ...
)

Arguments

`rankdata`	A numeric vector, with gene as names.
`genelist`	A character vector, specifying genes to be labeled.
`decreasing`	Boolean, specifying the order of genes to plot.
`top`	Integer, specifying number of positive genes to be labeled.
`bottom`	Integer, specifying number of negative genes to be labeled.
`cutoff`	One numeric value indicating the fold of standard deviation used as cutoff; two number vector, such as c(-1, 1), specifying the exact cutoff for selecting top genes.
`main`	A character, specifying title.
`filename`	A character, specifying a file name to create on disk. Set filename to be "NULL", if don't want to save the figure.
`width`	Numeric, specifying width of figure.
`height`	Numeric, specifying height of figure.
`...`	Other available parameters in the function 'geom_text_repel'.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
rankdata = gdata$Score
names(rankdata) = gdata$id
RankView(rankdata)

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
rankdata = gdata$Score
names(rankdata) = gdata$id
RankView(rankdata)

Read gene beta scores from MAGeCK-MLE results

Description

Read gene beta scores from MAGeCK-MLE results

Usage

ReadBeta(gene_summary)
ReadBeta(gene_summary)

Arguments

gene_summary

A data frame or a file path to gene summary file generated by MAGeCK-MLE.

Value

A data frame, whose first column is Gene and other columns are comparisons.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
head(dd)

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
head(dd)

ReadGMT

Description

Parse gmt file to a data.frame

Usage

ReadGMT(gmtpath, limit = c(0, Inf))
ReadGMT(gmtpath, limit = c(0, Inf))

Arguments

`gmtpath`	The path to gmt file.
`limit`	A integer vector of length two, specifying the limit of geneset size.

Value

An data.frame, in which the first column is gene, and the second column is pathway name.

Author(s)

Wubing Zhang

Read gene summary file in MAGeCK-RRA results

Description

Read gene summary file in MAGeCK-RRA results

Usage

ReadRRA(gene_summary, score = c("lfc", "rra")[1])
ReadRRA(gene_summary, score = c("lfc", "rra")[1])

Arguments

`gene_summary`	A data frame or a file path to gene summary file generated by MAGeCK-RRA.
`score`	"lfc" (default) or "rra", specifying the score type.

Details

If the score type is equal to lfc, then LFC will be returned. If the score type is rra, the log10 transformed RRA score will be returned.

Value

A data frame including three columns, including "id", "LFC" and "FDR".

Author(s)

Wubing Zhang

Examples

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
head(gdata)

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
head(gdata)

Read sgRNA summary in MAGeCK-RRA results

Description

Read sgRNA summary in MAGeCK-RRA results

Usage

ReadsgRRA(sgRNA_summary)
ReadsgRRA(sgRNA_summary)

Arguments

sgRNA_summary

A file path or a data frame of sgRNA summary data.

Value

A data frame.

Author(s)

Wubing Zhang

Examples

file2 = file.path(system.file("extdata", package = "MAGeCKFlute"),
                  "testdata/rra.sgrna_summary.txt")
sgrra = ReadsgRRA(file2)
head(sgrra)

file2 = file.path(system.file("extdata", package = "MAGeCKFlute"),
                  "testdata/rra.sgrna_summary.txt")
sgrra = ReadsgRRA(file2)
head(sgrra)

Compute the similarity between customized CRISPR screen with Depmap screens

Description

Compute the similarity between customized CRISPR screen with Depmap screens

Usage

ResembleDepmap(
  dd,
  symbol = "id",
  score = "Score",
  lineages = "All",
  method = c("pearson", "spearman", "kendall")[1]
)
ResembleDepmap(
  dd,
  symbol = "id",
  score = "Score",
  lineages = "All",
  method = c("pearson", "spearman", "kendall")[1]
)

Arguments

`dd`	A data frame.
`symbol`	A character, specifying the column name of gene symbols in the data frame.
`score`	A character, specifying the column name of gene essentiality score in the data frame.
`lineages`	A character vector, specifying the lineages used for common essential gene selection.
`method`	A character, indicating which correlation coefficient is to be used for the test. One of "pearson", "kendall", or "spearman".

Value

A data frame with correlation and test p.value.

Author(s)

Wubing Zhang

Examples

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
## Not run: 
  rra.omit = OmitCommonEssential(gdata)
  depmap_similarity = ResembleDepmap(rra.omit)
  head(depmap_similarity)

## End(Not run)
file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
## Not run: 
  rra.omit = OmitCommonEssential(gdata)
  depmap_similarity = ResembleDepmap(rra.omit)
  head(depmap_similarity)

## End(Not run)

Update genesets from source database

Description

Update genesets from source database

Usage

retrieve_gs(type = c("KEGG", "REACTOME", "CORUM", "GO"), organism = "hsa")
retrieve_gs(type = c("KEGG", "REACTOME", "CORUM", "GO"), organism = "hsa")

Arguments

`type`	A vector of databases, such as KEGG, REACTOME, CORUM, GO.
`organism`	'hsa' or 'mmu'.

Value

save data to local library.

Author(s)

Wubing Zhang

Scatter plot

Description

Scatter plot supporting groups.

Usage

ScatterView(
  data,
  x = "x",
  y = "y",
  label = 0,
  model = c("none", "ninesquare", "volcano", "rank")[1],
  x_cut = NULL,
  y_cut = NULL,
  slope = 1,
  intercept = NULL,
  auto_cut = FALSE,
  auto_cut_x = auto_cut,
  auto_cut_y = auto_cut,
  auto_cut_diag = auto_cut,
  groups = NULL,
  group_col = NULL,
  groupnames = NULL,
  label.top = TRUE,
  top = 0,
  toplabels = NULL,
  display_cut = FALSE,
  color = NULL,
  shape = 16,
  size = 1,
  alpha = 0.6,
  main = NULL,
  xlab = x,
  ylab = y,
  legend.position = "none",
  ...
)
ScatterView(
  data,
  x = "x",
  y = "y",
  label = 0,
  model = c("none", "ninesquare", "volcano", "rank")[1],
  x_cut = NULL,
  y_cut = NULL,
  slope = 1,
  intercept = NULL,
  auto_cut = FALSE,
  auto_cut_x = auto_cut,
  auto_cut_y = auto_cut,
  auto_cut_diag = auto_cut,
  groups = NULL,
  group_col = NULL,
  groupnames = NULL,
  label.top = TRUE,
  top = 0,
  toplabels = NULL,
  display_cut = FALSE,
  color = NULL,
  shape = 16,
  size = 1,
  alpha = 0.6,
  main = NULL,
  xlab = x,
  ylab = y,
  legend.position = "none",
  ...
)

Arguments

`data`	Data frame.
`x`	A character, specifying the x-axis.
`y`	A character, specifying the y-axis.
`label`	An integer or a character specifying the column used as the label, default value is 0 (row names).
`model`	One of "none" (default), "ninesquare", "volcano", and "rank".
`x_cut`	An one or two-length numeric vector, specifying the cutoff used for x-axis.
`y_cut`	An one or two-length numeric vector, specifying the cutoff used for y-axis.
`slope`	A numberic value indicating slope of the diagonal cutoff.
`intercept`	A numberic value indicating intercept of the diagonal cutoff.
`auto_cut`	Boolean or numeric, specifying how many standard deviation will be used as cutoff.
`auto_cut_x`	Boolean or numeric, specifying how many standard deviation will be used as cutoff on x-axis.
`auto_cut_y`	Boolean or numeric, specifying how many standard deviation will be used as cutoff on y-axis
`auto_cut_diag`	Boolean or numeric, specifying how many standard deviation will be used as cutoff on diagonal.
`groups`	A character vector specifying groups. Optional groups include "top", "mid", "bottom", "left", "center", "right", "topleft", "topcenter", "topright", "midleft", "midcenter", "midright", "bottomleft", "bottomcenter", "bottomright".
`group_col`	A vector of colors for specified groups.
`groupnames`	A vector of group names to show on the legend.
`label.top`	Boolean, specifying whether label top hits.
`top`	Integer, specifying the number of top terms in the groups to be labeled.
`toplabels`	Character vector, specifying terms to be labeled.
`display_cut`	Boolean, indicating whether display the dashed line of cutoffs.
`color`	A character, specifying the column name of color in the data frame.
`shape`	A character, specifying the column name of shape in the data frame.
`size`	A character, specifying the column name of size in the data frame.
`alpha`	A numeric, specifying the transparency of the dots.
`main`	Title of the figure.
`xlab`	Title of x-axis
`ylab`	Title of y-axis.
`legend.position`	Position of legend, "none", "right", "top", "bottom", or a two-length vector indicating the position.
`...`	Other available parameters in function 'geom_text_repel'.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
ScatterView(dd, x = "Pmel1_Ctrl", y = "Pmel1", label = "Gene",
auto_cut = 1, groups = "topright", top = 5, display_cut = TRUE)
ScatterView(dd, x = "Pmel1_Ctrl", y = "Pmel1", label = "Gene",
auto_cut = 2, model = "ninesquare", top = 5, display_cut = TRUE)

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
ScatterView(dd, x = "Pmel1_Ctrl", y = "Pmel1", label = "Gene",
auto_cut = 1, groups = "topright", top = 5, display_cut = TRUE)
ScatterView(dd, x = "Pmel1_Ctrl", y = "Pmel1", label = "Gene",
auto_cut = 2, model = "ninesquare", top = 5, display_cut = TRUE)

Select signatures from candidate list (according to the consistence in most samples).

Description

Select signatures from candidate list (according to the consistence in most samples).

Usage

Selector(mat, cutoff = 0, type = "<", select = 0.8)
Selector(mat, cutoff = 0, type = "<", select = 0.8)

Arguments

`mat`	A matrix, each row is candidates (genes), each column is samples.
`cutoff`	Numeric, specifying the cutoff to define the signatures.
`type`	Character, ">" or "<".
`select`	Numeric, specifying the proportion of samples in which signature is selected.

Value

An list containing two elements, the first is the selected signature and the second is a ggplot object.

Examples

mat = matrix(rnorm(1000*30), 1000, 30)
rownames(mat) = paste0("Gene", 1:1000)
colnames(mat) = paste0("Sample", 1:30)
hits = Selector(mat, select = 0.68)
print(hits$p)

mat = matrix(rnorm(1000*30), 1000, 30)
rownames(mat) = paste0("Gene", 1:1000)
colnames(mat) = paste0("Sample", 1:30)
hits = Selector(mat, select = 0.68)
print(hits$p)

View sgRNA rank.

Description

View sgRNA rank.

Usage

sgRankView(
  df,
  gene = NULL,
  top = 3,
  bottom = 3,
  neg_ctrl = NULL,
  binwidth = 0.3,
  interval = 0.1,
  bg.col = "gray90",
  filename = NULL,
  width = 5,
  height = 3.5,
  ...
)
sgRankView(
  df,
  gene = NULL,
  top = 3,
  bottom = 3,
  neg_ctrl = NULL,
  binwidth = 0.3,
  interval = 0.1,
  bg.col = "gray90",
  filename = NULL,
  width = 5,
  height = 3.5,
  ...
)

Arguments

`df`	A data frame, which contains columns of 'sgrna', 'Gene', and 'LFC'.
`gene`	Character vector, specifying genes to be plotted.
`top`	Integer, specifying number of top genes to be plotted.
`bottom`	Integer, specifying number of bottom genes to be plotted.
`neg_ctrl`	A vector specifying negative ctrl genes.
`binwidth`	A numeric value specifying the bar width.
`interval`	A numeric value specifying the interval length between each bar.
`bg.col`	A character value specifying the background color.
`filename`	Figure file name to create on disk. Default filename="NULL", which means no output.
`width`	As in ggsave.
`height`	As in ggsave.
`...`	Other available parameters in function 'ggsave'.

Value

An object created by ggplot.

Author(s)

Yihan Xiao

Examples

file2 = file.path(system.file("extdata", package = "MAGeCKFlute"),
                  "testdata/rra.sgrna_summary.txt")
sgrra = ReadsgRRA(file2)
sgRankView(sgrra)

file2 = file.path(system.file("extdata", package = "MAGeCKFlute"),
                  "testdata/rra.sgrna_summary.txt")
sgrra = ReadsgRRA(file2)
sgRankView(sgrra)

Scatter plot showing dots in 9 quadrants

Description

Scatter plot showing dots in 9 quadrants

Usage

SquareView(
  df,
  ctrlname = "Control",
  treatname = "Treatment",
  label = 0,
  label.top = TRUE,
  top = 5,
  genelist = c(),
  x_cut = NULL,
  y_cut = NULL,
  slope = 1,
  intercept = NULL,
  auto_cut = FALSE,
  auto_cut_x = auto_cut,
  auto_cut_y = auto_cut,
  auto_cut_diag = auto_cut,
  groups = c("midleft", "topcenter", "midright", "bottomcenter"),
  groupnames = paste0("Group", 1:length(groups)),
  legend.position = "none",
  main = NULL,
  filename = NULL,
  width = 6,
  height = 4,
  ...
)
SquareView(
  df,
  ctrlname = "Control",
  treatname = "Treatment",
  label = 0,
  label.top = TRUE,
  top = 5,
  genelist = c(),
  x_cut = NULL,
  y_cut = NULL,
  slope = 1,
  intercept = NULL,
  auto_cut = FALSE,
  auto_cut_x = auto_cut,
  auto_cut_y = auto_cut,
  auto_cut_diag = auto_cut,
  groups = c("midleft", "topcenter", "midright", "bottomcenter"),
  groupnames = paste0("Group", 1:length(groups)),
  legend.position = "none",
  main = NULL,
  filename = NULL,
  width = 6,
  height = 4,
  ...
)

Arguments

`df`	A data frame.
`ctrlname`	A character, specifying the names of control samples, of which the average scores will show as the x-axis.
`treatname`	A character, specifying the name of treatment samples, of which the average scores will show as the y-axis.
`label`	An integer or a character specifying the column used as the label, default value is 0 (row names).
`label.top`	Boolean, whether label the top selected genes, default label the top 10 genes in each group.
`top`	Integer, specifying the number of top selected genes to be labeled. Default is 5.
`genelist`	Character vector, specifying genes to be labeled.
`x_cut`	An one or two-length numeric vector, specifying the cutoff used for x-axis.
`y_cut`	An one or two-length numeric vector, specifying the cutoff used for y-axis.
`slope`	A numberic value indicating slope of the diagonal cutoff.
`intercept`	A numberic value indicating intercept of the diagonal cutoff.
`auto_cut`	Boolean (2-fold SD by default) or numeric, specifying how many standard deviation will be used as cutoff.
`auto_cut_x`	Boolean (2-fold SD by default) or numeric, specifying how many standard deviation will be used as cutoff on x-axis.
`auto_cut_y`	Boolean (2-fold SD by default) or numeric, specifying how many standard deviation will be used as cutoff on y-axis
`auto_cut_diag`	Boolean (2-fold SD by default) or numeric, specifying how many standard deviation will be used as cutoff on diagonal.
`groups`	A character vector, specifying which group to be colored. Optional groups include "topleft", "topcenter", "topright", "midleft", "midright", "bottomleft", "bottomcenter", "bottomright".
`groupnames`	A character vector, specifying group names.
`legend.position`	Position of the legend.
`main`	As in 'plot'.
`filename`	Figure file name to create on disk. Default filename="NULL", which means don't save the figure on disk.
`width`	As in ggsave.
`height`	As in ggsave.
`...`	Other available parameters in function 'ggsave'.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
SquareView(dd, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1", label = "Gene")


file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
SquareView(dd, ctrlname = "Pmel1_Ctrl", treatname = "Pmel1", label = "Gene")

Gene ID conversion

Description

Gene ID conversion

Usage

TransGeneID(
  genes,
  fromType = "Symbol",
  toType = "Entrez",
  organism = "hsa",
  fromOrg = organism,
  toOrg = organism,
  ensemblHost = "www.ensembl.org",
  unique = TRUE,
  update = FALSE
)
TransGeneID(
  genes,
  fromType = "Symbol",
  toType = "Entrez",
  organism = "hsa",
  fromOrg = organism,
  toOrg = organism,
  ensemblHost = "www.ensembl.org",
  unique = TRUE,
  update = FALSE
)

Arguments

`genes`	A character vector, input genes to be converted.
`fromType`	The input ID type, one of "entrez", "symbol"(default), "hgnc", "ensembl", "fullname" and "uniprotswissprot"; you can also input other valid attribute names for biomaRt. Look at the code in examples to check valid attributes.
`toType`	The output ID type, similar to 'fromType'.
`organism`	"hsa"(default), "mmu", "bta", "cfa", "ptr", "rno", and "ssc" are optional.
`fromOrg`	"hsa", "mmu", "bta", "cfa", "ptr", "rno", and "ssc" are optional (Only used when transform gene ids between organisms).
`toOrg`	"hsa"(default), "mmu", "bta", "cfa", "ptr", "rno", and "ssc" are optional (Only used when transform gene ids between organisms).
`ensemblHost`	Character, specifying ensembl host, you can use 'listEnsemblArchives()' to show all available Ensembl archives hosts.
`unique`	Boolean, specifying whether do one-to-one mapping.
`update`	Boolean, specifying whether update built-in gene annotation (needs network and takes time).

Value

A character vector, named by unique input gene ids.

Author(s)

Wubing Zhang

Examples

TransGeneID("HLA-A", organism="hsa")
TransGeneID("HLA-A", toType = "uniprot", organism="hsa")
TransGeneID("H2-K1", toType="Symbol", fromOrg = "mmu", toOrg = "hsa")

TransGeneID("HLA-A", organism="hsa")
TransGeneID("HLA-A", toType = "uniprot", organism="hsa")
TransGeneID("H2-K1", toType="Symbol", fromOrg = "mmu", toOrg = "hsa")

Violin plot

Description

Violin plot showing the distribution of numeric vectors with the same length.

Usage

ViolinView(
  dat,
  samples = NULL,
  main = NULL,
  ylab = "Score",
  filename = NULL,
  width = 5,
  height = 4,
  ...
)
ViolinView(
  dat,
  samples = NULL,
  main = NULL,
  ylab = "Score",
  filename = NULL,
  width = 5,
  height = 4,
  ...
)

Arguments

`dat`	A data frame.
`samples`	A character vector, specifying the columns in the `dat` for plotting.
`main`	A character, specifying title.
`ylab`	A character, specifying title of y-axis.
`filename`	A character, specifying a file name to create on disk. Set filename to be "NULL", if don't want to save the figure.
`width`	Numeric, specifying width of figure.
`height`	Numeric, specifying height of figure.
`...`	Other available parameters in function 'ggsave'.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
ViolinView(dd[, -1])


file3 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/mle.gene_summary.txt")
dd = ReadBeta(file3)
ViolinView(dd[, -1])

Volcano View

Description

Volcano plot for differential analysis.

Usage

VolcanoView(
  df,
  x = "logFC",
  y = "adj.P.Val",
  Label = NA,
  top = 5,
  topnames = NULL,
  x_cutoff = log2(1.5),
  y_cutoff = 0.05,
  mycolour = c("gray80", "#e41a1c", "#377eb8"),
  alpha = 0.6,
  force = 0.1,
  main = NULL,
  xlab = "log2FC",
  ylab = "-log10(FDR)",
  filename = NULL,
  width = 4,
  height = 2.5,
  ...
)
VolcanoView(
  df,
  x = "logFC",
  y = "adj.P.Val",
  Label = NA,
  top = 5,
  topnames = NULL,
  x_cutoff = log2(1.5),
  y_cutoff = 0.05,
  mycolour = c("gray80", "#e41a1c", "#377eb8"),
  alpha = 0.6,
  force = 0.1,
  main = NULL,
  xlab = "log2FC",
  ylab = "-log10(FDR)",
  filename = NULL,
  width = 4,
  height = 2.5,
  ...
)

Arguments

`df`	A data frame.
`x`	A character, specifying the x-axis in Volcanno figure, 'logFC' (default).
`y`	A character, specifying the y-axis in Volcanno figure, 'adj.P.Val' (default). log10 transformation will be done automatically.
`Label`	A character, specifying dots to be labeled on the figure.
`top`	An integer, specifying the number of top significant genes to be labeled.
`topnames`	A character vector, indicating positive/negative controls to be labeled.
`x_cutoff`	Numeric, specifying cutoff of the x-axis.
`y_cutoff`	Numeric, specifying cutoff of the y-axis.
`mycolour`	A color vector, specifying colors of non-significant, significantly up and down-regulated genes.
`alpha`	Numeric, parameter in ggplot.
`force`	Numeric, Parameter for geom_text_repel. Force of repulsion between overlapping text labels.
`main`	A character, specifying title.
`xlab`	A character, specifying title of x-axis.
`ylab`	A character, specifying title of y-axis.
`filename`	A character, specifying a file name to create on disk. Set filename to be "NULL", if don't want to save the figure.
`width`	Numeric, specifying width of figure.
`height`	Numeric, specifying height of figure.
`...`	Other available parameters in ggsave.

Value

An object created by ggplot, which can be assigned and further customized.

Author(s)

Wubing Zhang

Examples

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
VolcanoView(gdata, x = "Score", y = "FDR", Label = "id")

file1 = file.path(system.file("extdata", package = "MAGeCKFlute"),
"testdata/rra.gene_summary.txt")
gdata = ReadRRA(file1)
VolcanoView(gdata, x = "Score", y = "FDR", Label = "id")

Write GMT file

Description

write data frame to a gmt file

Usage

writeGMT(gene2path, gmtfile)
writeGMT(gene2path, gmtfile)

Arguments

`gene2path`	A data frame. The columns should be Gene, Pathway ID, and Pathway Name.
`gmtfile`	Path to gmt file.

Value

Output gmt file to local folder.

Author(s)

Wubing Zhang

Examples

gene2path = gsGetter(type = "Complex")
# writeGMT(gene2path, "Protein_complex.gmt")

gene2path = gsGetter(type = "Complex")
# writeGMT(gene2path, "Protein_complex.gmt")

Package 'MAGeCKFlute'

Help Index

Kegg pathway view and arrange grobs on page

Description

Usage

Arguments

Value

Author(s)

Examples

Bar plot

Description

Usage

Arguments

Value

Author(s)

Examples

Batch effect removal

Description

Usage

Arguments

Value

Author(s)

See Also

Examples

Visualize the estimate cell cycle compared to control.

Description

Usage

Arguments

Value

Author(s)

Examples

Quantile of normal distribution.

Description

Usage

Arguments

Value

Examples

Density plot

Description

Usage

Arguments

Value

Author(s)

Examples

Density plot

Description

Usage

Arguments

Value

Author(s)

See Also

Examples

Gene set enrichment analysis

Description

Usage

Arguments

Value

Author(s)

See Also

Examples

Do enrichment analysis using hypergeometric test

Description

Usage

Arguments

Value

Author(s)

See Also

Examples

Enrichment analysis using over-representation test

Description

Usage

Arguments

Value

Author(s)

See Also

Examples

Enrichment analysis for Positive and Negative selection genes

Description

Usage

Arguments