Title: | An R package for digital data clustering |
---|---|
Description: | Polytect is an advanced computational tool designed for the analysis of multi-color digital PCR data. It provides automatic clustering and labeling of partitions into distinct groups based on clusters first identified by the flowPeaks algorithm. Polytect is particularly useful for researchers in molecular biology and bioinformatics, enabling them to gain deeper insights into their experimental results through precise partition classification and data visualization. |
Authors: | Yao Chen [aut, cre] |
Maintainer: | Yao Chen <[email protected]> |
License: | Artistic-2.0 |
Version: | 0.99.5 |
Built: | 2025-01-11 02:59:20 UTC |
Source: | https://github.com/bioc/Polytect |
A 3-color dPCR data of bovine papilloma virus assay
data(BPV)
data(BPV)
A data frame of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel.
fluorescence intensities of color 1
fluorescence intensities of color 2
fluorescence intensities of color 3
data(BPV) head(BPV)
data(BPV) head(BPV)
2-color competitive assay of competition BRAF V600E assay with 1% mutant
data(CA)
data(CA)
A data frame of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel. data is not orthogonal.
fluorescence intensities of color 1
fluorescence intensities of color 2
data(CA) head(CA)
data(CA) head(CA)
CNV 5-plex universal probes
data(CNV5plex)
data(CNV5plex)
A data frame of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel.
fluorescence intensities of color 1
fluorescence intensities of color 2
fluorescence intensities of color 3
fluorescence intensities of color 4
fluorescence intensities of color 5
data(CNV5plex) head(CNV5plex)
data(CNV5plex) head(CNV5plex)
CNV 6-plex universal probes
data(CNV6plex)
data(CNV6plex)
A data frame of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel.
fluorescence intensities of color 1
fluorescence intensities of color 2
fluorescence intensities of color 3
fluorescence intensities of color 4
fluorescence intensities of color 5
fluorescence intensities of color 6
data(CNV6plex) head(CNV6plex)
data(CNV6plex) head(CNV6plex)
This function takes a data frame of fluorescence intensities and partition clusters as input. It can be results from polytect_clust or polytect_merge. It will give the target concentration as output.
conc_cal(df_data, cluster_num, sampvol = 0.91, volmix = 20, voltemp = 20)
conc_cal(df_data, cluster_num, sampvol = 0.91, volmix = 20, voltemp = 20)
df_data |
A data frame containing partition fluorescence intensities and corresponding cluster label. This can be the output
of |
cluster_num |
the expected number of clusters |
sampvol |
The sample volume in microliters (µL) |
volmix |
The volume of the mixture |
voltemp |
The volume of the template |
a data frame of target concentration.
data(HR) df_data<-polytect_clust(HR,4) conc_cal(df_data,4)
data(HR) df_data<-polytect_clust(HR,4) conc_cal(df_data,4)
A 4-color dPCR data of intact HIV-1 proviruses
data(HIV)
data(HIV)
A data frame of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel.
fluorescence intensities of color 1
fluorescence intensities of color 2
fluorescence intensities of color 3
fluorescence intensities of color 4
https://www.biorxiv.org/content/10.1101/2023.08.18.553846v1
data(HIV) head(HIV)
data(HIV) head(HIV)
A high-resolution 2-color dPCR data of RPP30 genomic DNA assay
data(HR)
data(HR)
A data frame of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel. good separation but some crosstalk.
fluorescence intensities of color 1
fluorescence intensities of color 2
https://pubmed.ncbi.nlm.nih.gov/33992770/
data(HR) head(HR)
data(HR) head(HR)
A low-resolution 2-color dPCR data of development of genotyping assays for plants various
data(LR)
data(LR)
A data frame of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel. barely separable on x-axis.
fluorescence intensities of color 1
fluorescence intensities of color 2
data(LR) head(LR)
data(LR) head(LR)
A multi-mode 2-color dPCR data of HIV gBlock sequences
data(MM)
data(MM)
A data frame of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel. obvious multimodality.
fluorescence intensities of color 1
fluorescence intensities of color 2
https://pubmed.ncbi.nlm.nih.gov/37827643/
data(MM) head(MM)
data(MM) head(MM)
This is the main function for clustering. The function will start with flowPeaks, then merge the excess clusters. It will return a data frame of fluorescence intensities and partition labels.
polytect_clust( data, cluster_num, fp_par = "default", fp_optim = c(0.1, 1, 1.5), lambdas = rep(2, 64 - log2(64)), coefs = rep(1, 6) )
polytect_clust( data, cluster_num, fp_par = "default", fp_optim = c(0.1, 1, 1.5), lambdas = rep(2, 64 - log2(64)), coefs = rep(1, 6) )
data |
A matrix of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel. |
cluster_num |
The expected maximum number of clusters. |
fp_par |
The parameters for flowPeaks. |
fp_optim |
The paramters for flowPeaks that users have to fill in manually when |
lambdas |
The penalty terms for the deviation from the expected cluster centers. Higher |
coefs |
The coefficients to adjust for the expected cluster centers. The default is 1 which can be used for common assay designs and has to be modified for special assays such as competing assays. |
A data frame containing the original fluorescence intensity and the cluster labels.
data(HR) head(polytect_clust(HR, 4))
data(HR) head(polytect_clust(HR, 4))
This function takes the clustering result as input. Users can first perform any clustering algorithm, then use this function. It will return a data frame of fluorescence intensities and partition labels.
polytect_merge( data, cluster_num, base_clust, lambdas = rep(2, 64 - log2(64)), coefs = rep(1, 6) )
polytect_merge( data, cluster_num, base_clust, lambdas = rep(2, 64 - log2(64)), coefs = rep(1, 6) )
data |
A matrix of fluorescence intensities in each channel. Each row represents each partitions, and each column each channel. |
cluster_num |
The expected maximum number of clusters. |
base_clust |
A list that contains partition labels given by initial clustering. |
lambdas |
The penalty terms for the deviation from the expected cluster centers. Higher |
coefs |
The coefficients to adjust for the expected cluster centers. The default is 1 which can be used for common assay designs and has to be modified for special assays such as competing assays. |
A data frame containing the original fluorescence intensity and the cluster labels.
data(HR) dist_matrix <- dist(HR) hc <- hclust(dist_matrix, method = "ward.D2") hc_clusters <- cutree(hc, k = 6) base_clust<-list() base_clust$cluster<-hc_clusters head(polytect_merge(HR, 4, base_clust))
data(HR) dist_matrix <- dist(HR) hc <- hclust(dist_matrix, method = "ward.D2") hc_clusters <- cutree(hc, k = 6) base_clust<-list() base_clust$cluster<-hc_clusters head(polytect_merge(HR, 4, base_clust))
This function takes results from polytect_clust and polytect_merge, or a data frame containing flurescence intensities and partition labels. It will output all combination of 2-color plots.
polytect_plot(df_data, cluster_num, cluster_selected = TRUE)
polytect_plot(df_data, cluster_num, cluster_selected = TRUE)
df_data |
A data frame containing partition fluorescence intensities and corresponding cluster label. This can be the output
of |
cluster_num |
the expected number of clusters |
cluster_selected |
Indicator of whether all the clusters are present in the plots. If TRUE, then only selected ones (the ones only positive in the selected 2 dimensions) are shown. The default value is "TRUE". |
2-color plots.
data(HR) df_data<-polytect_clust(HR,4) polytect_plot(df_data,4)
data(HR) df_data<-polytect_clust(HR,4) polytect_plot(df_data,4)
This function takes results from polytect_clust and polytect_merge, or a data frame containing flurescence intensities and partition labels. It will summarise cluster centers, cluster sizes and cluster silhouette coefficients.
polytect_summary(df_data)
polytect_summary(df_data)
df_data |
A data frame containing partition fluorescence intensities and corresponding cluster label. This can be the output
of |
a data frame of the summary of cluster centers, cluster sizes and cluster silhouette coefficients.
data(HR) df_data<-polytect_clust(HR,4) polytect_summary(df_data)
data(HR) df_data<-polytect_clust(HR,4) polytect_summary(df_data)
This function takes results from polytect_clust and polytect_merge, or a data frame containing flurescence intensities and partition labels. It will output the silhouette coefficients of each cluster.
sil_plot(df_data)
sil_plot(df_data)
df_data |
A data frame containing partition fluorescence intensities and corresponding cluster label. This can be the output
of |
plot of silhouette coefficients for each cluster.
data(HR) df_data<-polytect_clust(HR,4) sil_plot(df_data)
data(HR) df_data<-polytect_clust(HR,4) sil_plot(df_data)