Package 'twoddpcr' reference manual

Title:	Classify 2-d Droplet Digital PCR (ddPCR) data and quantify the number of starting molecules
Description:	The twoddpcr package takes Droplet Digital PCR (ddPCR) droplet amplitude data from Bio-Rad's QuantaSoft and can classify the droplets. A summary of the positive/negative droplet counts can be generated, which can then be used to estimate the number of molecules using the Poisson distribution. This is the first open source package that facilitates the automatic classification of general two channel ddPCR data. Previous work includes 'definetherain' (Jones et al., 2014) and 'ddpcRquant' (Trypsteen et al., 2015) which both handle one channel ddPCR experiments only. The 'ddpcr' package available on CRAN (Attali et al., 2016) supports automatic gating of a specific class of two channel ddPCR experiments only.
Authors:	Anthony Chiu [aut, cre]
Maintainer:	Anthony Chiu <[email protected]>
License:	GPL-3
Version:	1.31.0
Built:	2025-03-19 04:03:23 UTC
Source:	https://github.com/bioc/twoddpcr

Classifying and summarising 2-d droplet digital PCR (ddPCR) data.

Description

The twoddpcr package takes droplet amplitude data from Bio-Rad's QuantaSoft and can classify the droplets. A summary of the positive/negative droplet counts can be generated, which can then be used to estimate the number of molecules using the Poisson distribution.
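
The Poisson step mentioned above can be sketched in base R. This is only an illustration and does not call any twoddpcr function: the helper name `poissonCopies`, the droplet volume of 0.00085 µl and the 20 µl reaction volume are assumptions made for this example.

```r
# Hypothetical helper (not part of twoddpcr): Poisson estimate of copies.
# positives/negatives: droplet counts for one target in one well.
# dropletVolume: assumed droplet volume in microlitres.
# totalVolume: assumed reaction volume in microlitres.
poissonCopies <- function(positives, negatives,
                          dropletVolume = 0.00085, totalVolume = 20) {
  total <- positives + negatives
  # Under a Poisson model, P(droplet is negative) = exp(-lambda),
  # so lambda = -log(fraction of negative droplets).
  lambda <- -log(negatives / total)
  list(copiesPerDroplet = lambda,
       copiesPerReaction = lambda * totalVolume / dropletVolume)
}

est <- poissonCopies(positives = 1000, negatives = 9000)
print(est)
```

The estimate depends only on the fraction of negative droplets, which is why a reliable positive/negative classification has to come first.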

`df`	A data frame corresponding to a well with droplets corresponding only to "NN" and "NP" or "NN" and "PN".
`channel`	The channel on which to classify (1 or 2).
`centres`	A data frame of centres. The data frame should have columns `Ch1.Amplitude` and `Ch2.Amplitude` and row names corresponding to the cluster label, e.g. "NN", "NP", "PN" or "PP".
`minSeparation`	The minimum distance required between two cluster centres in order for us to assume that k-means found two distinct clusters. Defaults to 2000.
`fullTable`	Whether to return a full table or just a vector. Defaults to `TRUE`.
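
The role of `minSeparation` can be illustrated with a small base R sketch. It runs `stats::kmeans` with two centres on a single channel and only accepts the split if the centres are far enough apart. The helper name `classifyOneChannel`, the simulated amplitudes and the fallback of labelling every droplet "NN" are assumptions for this example, not the package's documented behaviour.

```r
set.seed(1)
# Simulated well: droplets negative in channel 1, mixed in channel 2.
df <- data.frame(
  Ch1.Amplitude = rnorm(200, mean = 2000, sd = 100),
  Ch2.Amplitude = c(rnorm(150, mean = 1500, sd = 100),
                    rnorm(50, mean = 6000, sd = 150))
)

# Hypothetical helper: 1-d k-means on one channel with a separation check.
classifyOneChannel <- function(df, channel = 2, minSeparation = 2000) {
  amp <- df[[paste0("Ch", channel, ".Amplitude")]]
  # Start the two centres at the extremes so the result is reproducible.
  init <- matrix(c(min(amp), max(amp)), ncol = 1)
  km <- kmeans(amp, centers = init)
  centres <- as.numeric(km$centers)
  classLevels <- c("NN", "NP", "PN")
  if (abs(diff(centres)) < minSeparation) {
    # Assumed fallback: centres too close, so treat all droplets as negative.
    return(factor(rep("NN", length(amp)), levels = classLevels))
  }
  # Positive in channel 1 is "PN"; positive in channel 2 is "NP".
  positiveLabel <- if (channel == 1) "PN" else "NP"
  isPositive <- km$cluster == which.max(centres)
  factor(ifelse(isPositive, positiveLabel, "NN"), levels = classLevels)
}

cls <- classifyOneChannel(df, channel = 2)
table(cls)
```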

`cl`	List of data frames, where each data frame corresponds to a well with droplets corresponding only to "NN" and "NP" or "NN" and "PN".
`channel`	The channel on which to classify (1 or 2).
`centres`	A data frame of centres. The data frame should have columns `Ch1.Amplitude` and `Ch2.Amplitude` and row names corresponding to the cluster label, e.g. "NN", "NP", "PN" or "PP".
`minSeparation`	The minimum distance required between two cluster centres in order for us to assume that k-means found two distinct clusters. Defaults to 2000.
`fullTable`	Whether to return a full table or just a vector. Defaults to `TRUE`.

`droplets`	A data frame of droplets with `Ch1.Amplitude` and `Ch2.Amplitude` columns, as well as a class column (see the parameter `classCol`).
`cl`	The class to focus on. This should be either "NN", "NP", "PN" or "PP".
`maxDistance`	An integer corresponding to the maximum Mahalanobis distance for which we will consider points to be members of the class, i.e. this is the level outside of which we consider droplets to be too far from the cluster.
`classCol`	The column (name or number) from `droplets` representing the class.
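
A minimal base R sketch of the idea behind `maxDistance` follows. It uses `stats::mahalanobis`, which returns the squared distance; comparing that squared value directly with the cut-off is an assumption of this example, as are the simulated data and the cut-off of 30.

```r
set.seed(2)
# Simulated droplets: one "NN" cluster plus a single far-away droplet.
droplets <- data.frame(
  Ch1.Amplitude = c(rnorm(300, mean = 2000, sd = 100), 6000),
  Ch2.Amplitude = c(rnorm(300, mean = 1500, sd = 100), 4000),
  class = "NN",
  stringsAsFactors = FALSE
)

cl <- "NN"
maxDistance <- 30  # assumed cut-off for this illustration
amp <- c("Ch1.Amplitude", "Ch2.Amplitude")
inClass <- droplets$class == cl

# Mean and covariance of the chosen cluster.
mu <- colMeans(droplets[inClass, amp])
sigma <- cov(droplets[inClass, amp])

# stats::mahalanobis() returns the squared Mahalanobis distance.
d2 <- mahalanobis(droplets[inClass, amp], center = mu, cov = sigma)

# Relabel droplets beyond the cut-off as "Rain".
droplets$class[inClass][d2 > maxDistance] <- "Rain"
table(droplets$class)
```

Because the distance accounts for the shape of the cluster, elongated clusters are not trimmed along their long axis as aggressively as a circular cut-off would trim them.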

`cl`	The cluster of which to find the covariance.
`droplets`	A data frame of droplets with `Ch1.Amplitude` and `Ch2.Amplitude` columns, as well as a class column (see the parameter `classCol`).
`classCol`	The column (name or number) from `droplets` representing the class.

`theObject`	A `ddpcrPlate` object.
`aList`	A list from which we wish to extract well names.

`plate`	A `ddpcrPlate` object from which to extract the centres.
`cMethod`	The classification method to use (in the form of a character string).
`channel`	An integer 1 or 2 corresponding to the channel of interest.
`minSeparation`	The minimum distance required between two cluster centres in order for us to assume that k-means found two distinct clusters. Defaults to 2000.

`colName`	The name of the column to focus on.
`well`	The data frame from which to extract the classifications.

`df`	A data frame.
`ch1Label`	The prefix to use for the channel 1 target. Defaults to "Mt".
`ch2Label`	The prefix to use for the channel 2 target. Defaults to "Wt".

`droplets`	A data frame of droplets with `Ch1.Amplitude` and `Ch2.Amplitude` columns.
`clusStats`	A list of statistics for a cluster generated by `classStats`. This should have a `mean`, `cov` and `cov.inv` values.

`wellDf`	A data frame of the well's droplet amplitudes.
`combinedCentres`	A data frame of the combined (average) centres of the non-normalised wells.
`wellCentres`	A data frame of centres corresponding to the given `channel`.
`channel`	An integer 1 or 2 corresponding to the channel that we are interested in.

`df`	A data frame.
`n`	How many decimal places/significant figures to round to.

`droplets`	An object corresponding to droplet amplitudes and their classifications. This can be: a data frame with columns `Ch1.Amplitude`, `Ch2.Amplitude` and a classification column (see the parameter `cMethod`); a `ddpcrWell` object; a `ddpcrPlate` object; or a `ggplot` (`gg`) object, for example the output of `ggplot.well` or `ggplot.plate`. The `ggplot` form should only be needed when writing new methods to plot new data types.
`ch1Label`	The label for the channel 1 target. Defaults to "Ch1 Amplitude".
`ch2Label`	The label for the channel 2 target. Defaults to "Ch2 Amplitude".
`...`	Other plotting parameters that depend on the object type of `droplets`.
`cMethod`	This should be the name or column number of `droplets` corresponding to the classification. This column should only have entries in "NN", "PN", "NP", "PP", "Rain" and "N/A". If "None", plots the droplets with all of them classified as `N/A`. Defaults to "None".
`mapping`	A list of aesthetic mappings to use for the plot. Defaults to `ggplot2::aes_string(x="Ch2.Amplitude", y="Ch1.Amplitude", colours=cMethod)`. Not used if `droplets` is a `ggplot` object.
`finalCentres`	A data frame of final centres to plot (e.g. those returned by the k-means or c-means algorithms). If `NULL`, nothing is plotted. Defaults to `NULL`.
`initialCentres`	A data frame of initial centres to plot (e.g. initial cluster centres used in the k-means). If `NULL`, nothing is plotted. Defaults to `NULL`. This parameter is useful for illustrative reasons.
`selectedCentre`	An initial centre to highlight. This should be either "NN", "NP", "PN" or "PP". If `NULL`, nothing is highlighted. Defaults to `NULL`. This parameter is useful for illustrative reasons.
`pointSize`	The size to draw each droplet. Defaults to 1.
`plotLimits`	A list of 2-element vectors with names `x` and `y`. These are used to fix the x and y limits of the plot, which is especially useful for comparing plots. Defaults to `list(x=c(1000, 9000), y=c(3000, 13500))`.
`legendLabels`	The character vector corresponding to the labels for the legend. The elements of the vector should correspond to the NN, NP, PN, PP, Rain and N/A classes, respectively. Defaults to `ddpcr$classesRain`.

`theObject`	The dataframe to export.
`location`	The location to export to. This should be a filename if we are using `exportZip`, or we are using `exportTable` and `theObject` is a data frame or `ddpcrWell` object. If `theObject` is a `ddpcrPlate` object, this should be a directory.
`delim`	The character to use as a field separator. Defaults to ",", i.e. export a CSV.
`...`	Other options depending on the type of `theObject`.
`leadingColName`	The name of the leading column, i.e. the 'row names' of the dataframe. This could be a patient identifier or the well used in the ddPCR experiment. If `NULL`, the exported heading will be an empty string. Defaults to `NULL`.
`row.names`	If `TRUE`, exports a column corresponding to the row names; if `FALSE`, no such column is included. If `leadingColName` is not `FALSE`, `row.names` is assumed to be `FALSE`. Defaults to `TRUE`.
`cMethod`	The name or column number of the classification methods in a `ddpcrWell` or `ddpcrPlate` object to export to file. If `NULL`, all of the classification methods are exported. Defaults to `NULL`.
`prefix`	For `ddpcrPlate` objects, this is the prefix to prepend to the output filenames.
`suffix`	For `ddpcrPlate` objects, this is the suffix to append to the output filenames. This is typically the filename extension, e.g. ".csv" or ".txt". Defaults to ".csv".
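
The effect of `delim` and `leadingColName` can be pictured with plain `write.table`. This sketch does not call `exportTable` or `exportZip`; the example data frame and the way the leading column is built are assumptions made for illustration.

```r
# A small summary table whose row names are well names.
df <- data.frame(PP = c(10, 20), NN = c(900, 800),
                 row.names = c("A01", "B01"))

location <- tempfile(fileext = ".csv")  # file to export to
delim <- ","                            # field separator
leadingColName <- "Well"                # heading for the row-name column

# Turn the row names into an explicit leading column with the chosen heading.
out <- cbind(setNames(data.frame(rownames(df)), leadingColName), df)
write.table(out, file = location, sep = delim,
            row.names = FALSE, quote = FALSE)

exported <- readLines(location)
print(exported)
```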

`droplets`	A `ddpcrPlate` object or a data frame of droplet amplitudes with a "Well" column.
`ch1Label`	The label for the channel 1 target. Defaults to "Ch1 Amplitude".
`ch2Label`	The label for the channel 2 target. Defaults to "Ch2 Amplitude".
`cMethod`	This should be the name or column number of `droplets` corresponding to the classification to be plotted. This column should only have entries in "NN", "PN", "NP", "PP", "Rain" and "N/A". If "None", plots the droplets with all of them classified as `N/A`. If `NULL`, a density plot is plotted. Defaults to `NULL`.
`binwidth`	The width of each hexagonal bin in the density plot. Ignored if `cMethod` is not `NULL` (see `pointSize` instead). Defaults to 100.
`pointSize`	If `cMethod` is not `NULL`, this is the size to draw each droplet. Otherwise this parameter is ignored (see `binwidth` instead). Defaults to 0.1.
`plotLimits`	A list of 2-element vectors with names `x` and `y`. These are used to fix the x and y limits of the plot, which is especially useful for comparing plots. Defaults to `list(x=c(1000, 9000), y=c(3000, 13500))`.
`showEmptyWells`	If `TRUE`, plots a `facet_grid` of all the wells in the plate, including the empty ones. If `FALSE`, plots a `facet_wrap` of only the loaded (nonempty) wells. Defaults to `FALSE`.

`allDf`	A list of data frames, where each one corresponds to a well's droplet amplitudes.
`well`	The name or number of the well to normalise.
`combinedCentres`	A data frame of the combined (average) centres of the non-normalised wells.
`indivCentres1`	A data frame of centres corresponding to channel 1.
`indivCentres2`	A data frame of centres corresponding to channel 2.

`vec`	The vector to split.
`wellSizes`	A numeric vector corresponding to sizes of the wells.
`wellNames`	A character vector corresponding to the names of the wells.
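
One way to picture how `vec`, `wellSizes` and `wellNames` fit together is the base R sketch below, which splits a flat classification vector back into per-well pieces. The example values are invented and the use of `split` is an assumption, not necessarily how the package does it.

```r
# A flat vector of classifications for three wells merged together.
vec <- c("NN", "NN", "PN", "NP", "PP", "NN")
wellSizes <- c(2, 3, 1)                  # number of droplets in each well
wellNames <- c("A01", "A02", "A03")

# Repeat each well name once per droplet, keeping the original well order.
wellFactor <- factor(rep(wellNames, times = wellSizes), levels = wellNames)

# Split the vector into a named list with one element per well.
perWell <- split(vec, wellFactor)
print(perWell)
```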

`df`	A data frame created by calling `read.csv` on the raw ddPCR output.
`ch1Label`	The prefix to use for the channel 1 target. Defaults to "Mt".
`ch2Label`	The prefix to use for the channel 2 target. Defaults to "Wt".
`rows`	The number of rows to retain from the original data frame. If `NULL`, all of the wells are used. Defaults to `NULL`.

`theObject`	A `ddpcrWell` or `ddpcrPlate` object.
`cMethod`	The classification method for which to obtain the centres.

`well`	A well with columns `Ch1.Amplitude` and `Ch2.Amplitude` and optional classification columns. This can be in the form of a data frame or the path to a droplet amplitude CSV file.
`object`	Any R object

`droplets`	A data frame of droplet amplitudes, or a `ddpcrWell` or `ddpcrPlate` object.
`ch1Label`	The label for the channel 1 target. Defaults to "Ch1 Amplitude".
`ch2Label`	The label for the channel 2 target. Defaults to "Ch2 Amplitude".
`classString`	The class that all droplets should be classified as. Defaults to the `ddpcr$na` ("N/A") character string.
`initialCentres`	A data frame of initial centres to plot (e.g. initial cluster centres used in the k-means). This is _not_ restricted to the class `classString` only. If `NULL`, nothing is plotted. Defaults to `NULL`.
`selectedCentre`	An initial centre to highlight. This should be either "NN", "PN", "NP" or "PP", but is _not_ restricted to the class 'classString' only. If `NULL`, nothing is highlighted. Defaults to `NULL`.
`plotLimits`	A list of 2-element vectors with names `x` and `y`. These are used to fix the x and y limits of the plot, which is especially useful for comparing plots. Defaults to `list(x=c(1000, 9000), y=c(3000, 13500))`.

`df`	A data frame generated by `fullCountsSummary`.
`extraCols`	A vector of column names from `df` to include. If `NULL`, no extra columns are added. Defaults to `NULL`.

`df`	A data frame with droplet count columns in one of the following formats: `PP`, `PN`, `NP`, `NN`; `Ch1.Ch2.`, `Ch1.Ch2..1`, `Ch1.Ch2..2`, `Ch1.Ch2..3`; `Ch1+Ch2+`, `Ch1+Ch2-`, `Ch1-Ch2+`, `Ch1-Ch2-`; or `Ch1pCh2p`, `Ch1pCh2n`, `Ch1nCh2p`, `Ch1nCh2n`.
`ch1Label`	The prefix to use for the channel 1 target. Defaults to "Mt".
`ch2Label`	The prefix to use for the channel 2 target. Defaults to "Wt".
`rows`	A vector of rows (numbers or well names) to keep from the original data frame. If set to `NULL`, all wells will be used. Defaults to `NULL`.
`rowID`	If set, this field is used as the row names. If `NULL`, the existing row names from `df` are used. Defaults to `NULL`.
`keepCols`	A vector of columns to keep from `df`. If `NULL`, no extra columns are added. Defaults to `NULL`.
`keepColNames`	A vector of new column names for `keepCols`. If `NULL`, the column names from `keepCols` are reused. Defaults to `NULL`.
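
The `Ch1.Ch2.`, `Ch1.Ch2..1`, ... column format listed for `df` matches what base R produces when it makes the `Ch1+Ch2+` style headings syntactically valid and unique, as `read.csv` does by default. The short sketch below shows this with `make.names`; pairing the results with "PP", "PN", "NP" and "NN" follows the order given in the argument description and is otherwise an assumption.

```r
# Column headings containing "+" and "-" as they might appear in a CSV file.
rawNames <- c("Ch1+Ch2+", "Ch1+Ch2-", "Ch1-Ch2+", "Ch1-Ch2-")

# make.names() replaces invalid characters with "." and, with unique = TRUE,
# appends a numeric suffix to duplicates.
mangled <- make.names(rawNames, unique = TRUE)

# Pair each mangled name with the short class label, in the documented order.
lookup <- setNames(c("PP", "PN", "NP", "NN"), mangled)
print(lookup)
```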

`droplets`	A data frame of droplets with "Ch1.Amplitude" and "Ch2.Amplitude" columns, as well as a class column (see classCol).
`cl`	The class to focus on. Typically one of "NN", "PN", "NP" and "PP".
`level`	A constant by which we will multiply the standard deviation. Defaults to 5.
`classCol`	The column (name or number) from 'droplets' representing the class.
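
The `level` parameter can be read as a per-channel cut-off of `level` standard deviations from the cluster mean. The base R sketch below illustrates that reading; treating a droplet as "Rain" when it exceeds the cut-off in either channel is an assumption of this example, as are the simulated data.

```r
set.seed(3)
# Simulated "NN" cluster plus one droplet far away in channel 1.
droplets <- data.frame(
  Ch1.Amplitude = c(rnorm(300, mean = 2000, sd = 100), 6000),
  Ch2.Amplitude = c(rnorm(300, mean = 1500, sd = 100), 1500),
  class = "NN",
  stringsAsFactors = FALSE
)

cl <- "NN"
level <- 5
inClass <- droplets$class == cl
amp <- as.matrix(droplets[inClass, c("Ch1.Amplitude", "Ch2.Amplitude")])

# Absolute z-scores: distance from the cluster mean in units of
# standard deviation, computed separately for each channel.
z <- abs(scale(amp))

# Assumed rule: too far in either channel means "Rain".
tooFar <- z[, 1] > level | z[, 2] > level
droplets$class[inClass][tooFar] <- "Rain"
table(droplets$class)
```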

`data`	A `ddpcrWell` or `ddpcrPlate` object.
`mapping`	A list of aesthetic mappings to use for the plot. Defaults to `ggplot2::aes_string(x="Ch2.Amplitude", y="Ch1.Amplitude", colour=cMethod)`, where `cMethod` is taken from the parameter of the same name.
`cMethod`	The name or column number of the classification to use. This is renamed internally to "class" for use with `mapping`. Defaults to "None".
`...`	Other arguments passed onto `ggplot`.
`environment`	Where to look if a mapping variable is not defined. Defaults to `parent.frame()`, i.e. the environment in which `ggplot.well()` or `ggplot.multiwell()` is called.

Package 'twoddpcr'

Help Index

Classifying and summarising 2-d droplet digital PCR (ddPCR) data.
K-means classify a data frame where the droplets are negative in the same channels only.
K-means classify a list of data frames individually, where each data frame comprises droplets that are negative in the same channels only.
Fuzzy clusters by bivariate normal distributions.
Get the covariance of a cluster.
Get a vector of all dependent columns.
Get a vector of essential dependent columns.
Retrieve the well names to use from a given list.
Get a summary of the number of molecules in 20ul.
Find the centres of each of the wells in a given channel.
Extract a classification from a data frame.
Get the mutant copies per 20ul of a data frame.
Extract the well names from a data frame.
Get the wild type copies per 20ul of a data frame.

`droplets`	A `ddpcrWell` object or a data frame of droplet amplitudes with columns `Ch1.Amplitude` and `Ch2.Amplitude`.
`ch1NNThreshold`	The channel 1 upper bound for the NN class. Defaults to 6500.
`ch2NNThreshold`	The channel 2 upper bound for the NN class. Defaults to 1900.
`ch1NPThreshold`	The channel 1 upper bound for the NP class. Defaults to 6500.
`ch2NPThreshold`	The channel 2 lower bound for the NP class. Defaults to 5000.
`ch1PNThreshold`	The channel 1 lower bound for the PN class. Defaults to 10000.
`ch2PNThreshold`	The channel 2 upper bound for the PN class. Defaults to 2900.
`ch1PPThreshold`	The channel 1 lower bound for the PP class. Defaults to 7500.
`ch2PPThreshold`	The channel 2 lower bound for the PP class. Defaults to 5000.
`...`	Other options depending on the type of `droplets`.
`trainingData`	Whether to use the output as training data. If `TRUE`, returns the _full table_ with the "N/A" entries removed; if `FALSE`, the "N/A" entries are retained. Taken to be `FALSE` if `fullTable` is set to `FALSE`. Defaults to `TRUE`. Ignored if `droplets` is not a data frame.
`fullTable`	Whether to return a data frame including amplitude figures. If `TRUE`, a data frame with columns `Ch1.Amplitude`, `Ch2.Amplitude` and `class` is returned. If `FALSE`, a factor with levels in `ddpcr$classesRain` is returned, where each entry corresponds to each row in `droplets` (and `trainingData` is automatically set to `FALSE`). Defaults to `TRUE`. Ignored if `droplets` is not a data frame.
`naLabel`	The label to use for unclassified droplets. Should be either `ddpcr$na` ("N/A") or `ddpcr$rain` ("Rain"). Defaults to `ddpcr$rain`.
`classMethodLabel`	A name (as a character string) of the classification method. Defaults to "grid".
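
The eight thresholds above define four rectangular regions, one per class. The following base R sketch applies the documented default values to a few hand-picked droplets. The helper name `gridSketch` and the use of inclusive bounds are assumptions; the sketch does not call the package's own grid classifier.

```r
# Hypothetical helper: label droplets using the documented default thresholds.
gridSketch <- function(droplets, naLabel = "Rain") {
  ch1 <- droplets$Ch1.Amplitude
  ch2 <- droplets$Ch2.Amplitude
  cls <- rep(naLabel, nrow(droplets))     # anything unmatched keeps naLabel
  cls[ch1 <= 6500  & ch2 <= 1900] <- "NN" # low in both channels
  cls[ch1 <= 6500  & ch2 >= 5000] <- "NP" # low Ch1, high Ch2
  cls[ch1 >= 10000 & ch2 <= 2900] <- "PN" # high Ch1, low Ch2
  cls[ch1 >= 7500  & ch2 >= 5000] <- "PP" # high in both channels
  factor(cls, levels = c("NN", "NP", "PN", "PP", "Rain", "N/A"))
}

droplets <- data.frame(
  Ch1.Amplitude = c(5000, 5000, 11000, 9000, 7000),
  Ch2.Amplitude = c(1500, 6000, 2000, 6000, 3500)
)
grid <- gridSketch(droplets)
print(data.frame(droplets, class = grid))
```

The last droplet falls between the regions, so it receives the `naLabel` value.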

`droplets`	A data frame of droplet amplitudes, a `ggplot`, `ddpcrWell` or `ddpcrPlate` object.
`ch1Label`	The label for the channel 1 target. Defaults to "Ch1 Amplitude".
`ch2Label`	The label for the channel 2 target. Defaults to "Ch2 Amplitude".
`binwidth`	The width of each hexagonal bin in the 2d heat (density) plot. Defaults to 100.
`plotLimits`	A list of 2-element vectors with names `x` and `y`. These are used to fix the x and y limits of the plot, which is especially useful for comparing plots. Defaults to `list(x=c(1000, 9000), y=c(3000, 13500))`.

`droplets`	A `ddpcrWell` or `ddpcrPlate` object, or a data frame with columns `Ch1.Amplitude` and `Ch2.Amplitude`.
`centres`	Either a matrix of the initial centres to use for the k-means algorithm, or an integer giving the number of clusters, in which case the initial centres are chosen randomly. Defaults to `matrix(c(0, 0, 10000, 0, 0, 7000, 10000, 7000), ncol=2, byrow=TRUE)`.
`...`	Other options depending on the type of `droplets`.
`fullTable`	If `TRUE`, returns a full data frame of droplets and their classification; if `FALSE`, simply returns a factor corresponding to this classification. Defaults to `TRUE`.
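
To show what the default `centres` matrix does, the sketch below passes it to `stats::kmeans` on simulated droplets. Reading the matrix columns as channel 1 then channel 2, and therefore labelling its rows "NN", "PN", "NP" and "PP", is an assumption of this example, as are the simulated cluster positions.

```r
set.seed(4)
# Simulate 50 droplets around a given (Ch1, Ch2) position.
simCluster <- function(n, ch1, ch2) {
  data.frame(Ch1.Amplitude = rnorm(n, mean = ch1, sd = 300),
             Ch2.Amplitude = rnorm(n, mean = ch2, sd = 300))
}
droplets <- rbind(simCluster(50, 2000, 1500),    # NN
                  simCluster(50, 10000, 2000),   # PN
                  simCluster(50, 2500, 6000),    # NP
                  simCluster(50, 10500, 6500))   # PP

# The documented default initial centres, one row per cluster.
centres <- matrix(c(0, 0, 10000, 0, 0, 7000, 10000, 7000),
                  ncol = 2, byrow = TRUE)

km <- kmeans(droplets, centers = centres)

# Cluster i grows from row i of `centres`, so labels follow the row order.
labels <- c("NN", "PN", "NP", "PP")
cls <- factor(labels[km$cluster], levels = c("NN", "NP", "PN", "PP"))
table(cls)
```

Fixing the initial centres near the expected cluster positions makes the cluster numbering predictable, which is what allows each cluster to be given a meaningful label.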

`data`	A data frame or vector corresponding to the classification.
`centres`	A data frame listing the final centre points from the k-means algorithm with the corresponding cluster labels.

`droplets`	A `ddpcrWell` or `ddpcrPlate` object, or a data frame with columns `Ch1.Amplitude` and `Ch2.Amplitude`.
`trainData`	A data frame of training data with columns `Ch1.Amplitude` and `Ch2.Amplitude`.
`cl`	A vector of classes corresponding to `trainData`.
`k`	The number of nearest neighbours to use in the algorithm.
`prob`	The minimal proportion of votes for the winning class needed to assert that a droplet belongs to the class. This figure should be a float between 0 and 1. For example, if 0.6 then at least 60% of the k nearest neighbours need to be of one class, otherwise the droplet is classified as "Rain". Defaults to 0, i.e. we do not use "Rain".
`...`	Other options depending on the type of `droplets`.
`fullTable`	If `TRUE`, returns a full data frame of droplets and their classification; if `FALSE`, simply returns the classified vector. Defaults to `TRUE`.
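
The interplay of `k` and `prob` can be illustrated with `knn` from the recommended `class` package, which reports the winning vote share when called with `prob = TRUE`. Whether twoddpcr uses this function internally is not stated here, so the sketch is only an analogy; the training points and test droplets are invented.

```r
library(class)

# Training data: five "NN" and five "PN" droplets on a line in channel 1.
trainData <- data.frame(
  Ch1.Amplitude = c(1000, 1500, 2000, 2500, 3000,
                    9000, 9500, 10000, 10500, 11000),
  Ch2.Amplitude = 1500
)
cl <- factor(rep(c("NN", "PN"), each = 5))

# Droplets to classify: two clear cases and one in between the clusters.
droplets <- data.frame(Ch1.Amplitude = c(2100, 9900, 5900),
                       Ch2.Amplitude = c(1400, 1600, 1500))

k <- 3
prob <- 0.8

pred <- knn(train = trainData, test = droplets, cl = cl, k = k, prob = TRUE)
votes <- attr(pred, "prob")  # share of the k votes won by the predicted class

# Droplets whose winning vote share is below `prob` become "Rain".
result <- as.character(pred)
result[votes < prob] <- "Rain"
print(data.frame(droplets, class = result, votes = votes))
```

The third droplet has neighbours from both classes, so its winning share is 2/3 and it falls below the 0.8 requirement.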

`inputId`	The identifier for the numericInput object.
`label`	The text label to display alongside.
`value`	The default value of the numericInput.
`size`	The size (width) and maxlength.

`wells`	Either a `ddpcrPlate` object or a list of data frames, each of which comprises droplet amplitudes and their corresponding classifications in a given well.
`...`	Other options depending on the type of `wells`.
`ch1Label`	The prefix to use for the channel 1 target. Defaults to "Mt".
`ch2Label`	The prefix to use for the channel 2 target. Defaults to "Wt".
`sortByLetter`	If `TRUE`, the resulting data frame is sorted by the letter in the well names first, e.g. "A02" comes before "B01". If `FALSE`, the result is sorted by the numeric component of the well names first, e.g. "B01" comes before "A02". Defaults to `FALSE`.
`cMethod`	The classification method to create a summary for.

`path`	The path containing the CSV files (can be a combination of directories and individual CSV file paths). Each file will have a `Ch1.Amplitude`, `Ch2.Amplitude` and possibly classification columns, e.g. by default, QuantaSoft returns a `Cluster` column too.
`wellCol`	If `TRUE`, an additional column is added with the well name. This is useful if we need to merge all the data in the output list and we want to identify the original well of each droplet. Defaults to `FALSE`.
`sortByLetter`	If `TRUE`, the resulting list is sorted by the letter in the well names first, e.g. "A02" comes before "B01". If `FALSE`, the result is sorted by the numeric component of the well names first, e.g. "B01" comes before "A02". Defaults to `FALSE`.
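
The two orderings selected by `sortByLetter` can be reproduced in base R as follows. The well names are invented, and splitting them into a one-letter row and a two-digit column is an assumption about the naming scheme.

```r
wells <- c("B01", "A02", "A01", "B02")

# Split each well name into its letter (row) and number (column).
wellLetter <- substr(wells, 1, 1)
wellNumber <- as.integer(substr(wells, 2, 3))

# sortByLetter = TRUE: letter first, then number.
byLetter <- wells[order(wellLetter, wellNumber)]

# sortByLetter = FALSE: number first, then letter.
byNumber <- wells[order(wellNumber, wellLetter)]

print(byLetter)
print(byNumber)
```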

`droplets`	A data frame of droplet amplitudes with a classification.
`classCol`	The column (name or number) from 'droplets' representing the class.
`presentClasses`	A vector of classes that we want to label. Must be a subset of c("NN", "NP", "PN", "PP") and must have the same number of classes as the number of unique classes in the class column.

`droplets`	A data frame or list of data frames. Each data frame should have at least two columns, where the first two columns should be vectors of doubles.
`ch1`	The channel 1 label.
`ch2`	The channel 2 label.