Here we will illustrate how to
choose and use the appropriate gating methods that are pre-registered in
openCyto
package. And users can always define their own
gating
algorithms and register them as the
plugin
functions in openCyto
framework, see
?registerPlugins
for more details.
Note that all the function names illustrated below are prefixed with
.
indicating that they are simply the wrapper function
registered in openCyto
. The actual
gating engine
behind the wrapper can come from other
packages (e.g. flowCore
, flowClust
). All these
wrappers have these common interfaces: * fr
: a
flowFrame
object * pp_res
: an optional
pre-preocessing
result, which can be ignored in this
document * channels
: channel names used for gating *
...
: any other gating parameters pass on to the actual
gating engine
library(flowCore)
library(flowWorkspace)
library(openCyto)
library(ggcyto)
gs <- load_gs(system.file("extdata/gs_bcell_auto", package = "flowWorkspaceData"))
mindensity
The name of this gating function is self-explaining, that is to find
the minimum as the cutpoint between negative and postive peaks in 1d
density plot. It is fast,robust and extremely easy to use especially
when there is a good separation between +
and
-
populations/peaks.
For example, it is usually easy to gate on CD3
channel
and no need to supply any arguments to the method.
fr <- gh_pop_get_data(gs[[2]], "Live", returnType = "flowFrame")
chnl <- "CD3"
g <- openCyto:::.mindensity(fr, channels = chnl)
autoplot(fr, chnl) + geom_gate(g)
autoplot(fr, chnl, "SSC-A") + geom_gate(g)
However, it may need some guidance when there are more than
2
major peaks/populations detected in densit profile.
fr <- gh_pop_get_data(gs[[1]], "boundary", returnType = "flowFrame")
chnl <- "FSC-A"
g <- openCyto:::.mindensity(fr, channels = chnl)
mylimits <- ggcyto_par_set(limits = "instrument")
p <- autoplot(fr, chnl) + mylimits
p + geom_gate(g)
autoplot(fr, chnl, "SSC-A") + geom_gate(g)
Here we actually want to remove the debris cells
that
are represented by the first negative peak. But mindensity
cuts between the second and third peaks since they are more
predorminant. So we can simply specify a range
that will
limit the locations where the cut point should be placed.
g <- openCyto:::.mindensity(fr, channels = chnl, gate_range=c(7e4,1e5), adjust = 1.5)
p + geom_gate(g)
autoplot(fr, chnl, "SSC-A") + geom_gate(g)
And as shown, we also changed the kernal density
smoothing factor adjust
from 2
(default value
set in openCtyo
) to 1.5
to avoid
over-smoothing.
Alternatively you can achieve the same effect by setting
min
or max
to pre-filter the data before the
mindenstiy
works on it.
To choose one way or the other or combining both is highly dependent on how your data. The more contrains will give you more controls on how gating proceeds yet at cost of robustness of your gating pipeline sometime.
quantileGate
This method is an alternative to tailgate
and it
determines the cutpoint by the events quantile.
g <- openCyto:::.quantileGate(fr, channels = chnl, probs = 0.99)
p <- autoplot(fr, chnl) + mylimits
p + geom_gate(g)
autoplot(fr, chnl, "SSC-A") + geom_gate(g)
This gating method is more commonly used in gating the
rare
populations when the target population is not
prominent enough to stand out as the second peak.
(e.g. cytokine
gates in ICS
assays.)
boundary Gate
It essentially constructs a rectangle gate from input range (min, max), which is useful for filtering out very extreme signals at the bounary.
singletGate
Use the area
vs height
to gate out the
singlets. See details from ?singletGate
.
flowClust.2d
flowClust
package in itself is not limited to
2-dimensional gating. But here we are talking about a dedicated wrapper
function .flowClust.2d
from openCyto
package
that leverages flowClust
clustering engine to work on
2D
cases specifically. You won’t need to write the full
name of the function in csv
gating template, simply put
flowClust
in the gating_method
column, and
then the template parser will automatically dispatch to the right
function.
fr <- gh_pop_get_data(gs[[1]], "nonDebris", returnType = "flowFrame")
chnl <- c("FSC-A", "SSC-A")
g <- openCyto:::.flowClust.2d(fr, channels = chnl, K=2, target=c(1e5,5e4), quantile=0.95)
p <- autoplot(fr, x = chnl[1], y = chnl[2]) + mylimits
p + geom_gate(g)
K
is to tell the algorithm how many major
clusters/populations are expected in the 2d profile. target
specify the mean/center of the target population to get, which doesn’t
have to be precise. If not supplied, flowClust will pick the most
prominent cluster as the target, which would be the right choice in most
cases. quantile
specify how large the ellipse
should be. pp_res
is used to provide the prior
information for flowClust
. (More details are in
?flowClust
)
Transitional gate
flowClust.2d
can optionally construct a
Transitional gate
, which is a speical kind of polygon gate
with one edge placed diagonally that is often seen in
flowJo
. Here is an example:
fr <- gh_pop_get_data(gs[[1]], "CD19andCD20", returnType = "flowFrame")
chnl <- c("CD38", "CD24")
g <- openCyto:::.flowClust.2d(fr, channels = chnl, K=6,transitional=TRUE,target=c(3.5e3,3.5e3), quantile=0.95,translation=0.15, pp_res = NULL)
p <- autoplot(fr, x = chnl[1], y = chnl[2]) + mylimits
p + geom_gate(g)
The rational behind the algorithm is beyond the scope of this
document. Please see its detailed explainations in
?flowClust.2d
.
quadGate.tmix
This gating method identifies two quadrants (first, and third
quadrants) by fitting the data with tmixture model. It is particually
useful when the two markers are not well resolved thus the regular
quadGate method that is based on 1d
gating will not find
the perfect cut points on both dimensions.
gs <- load_gs(system.file("extdata/gs_DC_auto", package = "flowWorkspaceData"))
fr <- gh_pop_get_data(gs[[2]], "HLADR+", returnType = "flowFrame")
chnl <- c("CD11c", "CD123")
p <- autoplot(fr, chnl[1], chnl[2])
g <- openCyto:::.quadGate.tmix(fr, channels = chnl, K = 3, usePrior = "no")
p + geom_gate(g)