Title: | Well Plate Maker |
---|---|
Description: | The Well-Plate Maker (WPM) is a shiny application deployed as an R package. Functions for a command-line/script use are also available. The WPM allows users to generate well plate maps to carry out their experiments while improving the handling of batch effects. In particular, it helps controlling the "plate effect" thanks to its ability to randomize samples over multiple well plates. The algorithm for placing the samples is inspired by the backtracking algorithm: the samples are placed at random while respecting specific spatial constraints. |
Authors: | Helene Borges [aut, cre], Thomas Burger [aut] |
Maintainer: | Helene Borges <[email protected]> |
License: | Artistic-2.0 |
Version: | 1.17.0 |
Built: | 2024-12-30 05:22:55 UTC |
Source: | https://github.com/bioc/wpm |
Function used to launch the backtracking algorithm on a dataframe with the corresponding plate parameters, number of iterations and special wells
backtracking( max_iter = 20, user_data, wells, rows, columns, nb_plates, constraint, prog = NULL )
backtracking( max_iter = 20, user_data, wells, rows, columns, nb_plates, constraint, prog = NULL )
max_iter |
numeric, the maximal number of iterations to do, default value is 20 |
user_data |
dataframe, user samples to place randomly on the plate |
wells |
dataframe, special wells not to be placed randomly on the plate |
rows |
numeric, number of lines on the plate(s) |
columns |
numeric, number of columns on the plate(s) |
nb_plates |
numeric, number of plates |
constraint |
character, spatial mode |
prog |
progress bar used for shiny app only |
a dataframe containing user samples and special wells with their coordinates for the corresponding plates.
This function makes it possible to distribute the samples equitably on several plates, taking into account the numbers in the groups (if there are any). This means that, for example, if 2 plates are to be filled, then 50 generally, all the plates are assigned the same number of elements. When the numbers do not allow it (in particular when the total number of elements to be allocated is not a multiple of the number of plates), there will be a slight difference in the number of samples on the plates.
balancedGrpDistrib(d, nb_p, df_max_size)
balancedGrpDistrib(d, nb_p, df_max_size)
d |
the user dataframe |
nb_p |
the number of plates to fill |
df_max_size |
the maximum number of samples that can be placed on the current plate |
a list of dataframes each corresponding to a plate to fill.
Finds the neighbors of the current element (row, col) in the matrix m, depending on the chosen constraint pattern. Currently, there are only 3 valid patterns (NS, WE and NEWS)
checkConstraints(m, row, col, mode)
checkConstraints(m, row, col, mode)
m |
matrix |
row |
current selected row in the matrix m |
col |
current selected column in the matrix m |
mode |
spatial constraint |
A vector containing the neighbors of element (row,col) of the matrix m.
Checks if all the inputs given to the function WrapperWPM are correct and intercompatible.
checkWpmInputs( user_df, plate_dims, nb_plates, spatial_constraint, max_iteration )
checkWpmInputs( user_df, plate_dims, nb_plates, spatial_constraint, max_iteration )
user_df |
expected dataframe, returns adapted message error |
plate_dims |
expected list of plate dimensions (rows and columns) |
nb_plates |
expected number of plates |
spatial_constraint |
expected character for spatial constraint |
max_iteration |
expected number of iterations |
returns an error message if a problem is found with some parameter.
This function converts a CSV into a dataframe to make it usable by the shiny application of wpm as well as by the wrapper function (version of wpm in command line). Be sure that the first column of the CSV file corresponds to samples names.
convertCSV(dt_path, row_names = FALSE, gp_field = NULL, ...)
convertCSV(dt_path, row_names = FALSE, gp_field = NULL, ...)
dt_path |
file path. |
row_names |
logical value, indicates if the file has rownames or not. |
gp_field |
the column name indicating the grouping factor for the samples in the csv. If there is no grouping factor, then gp_field must be set to NULL or "none". |
... |
parameters to give to read.csv2 function |
a list containing a dataframe containing the data of the imported CSV and a dataframe containing 3 fields (Sample, Group and ID) which will be used by WPM. Or NULL if there is an error when giving wrong parameters.
test <- data.frame("Sample" = c("s1","s2","s3","s4"), "Group" = c("A","A","B", "C")) tf <- tempfile() write.csv2(test, tf, row.names = FALSE) convertCSV(tf, gp_field = "Group", header = TRUE, sep = ";") # if there are row names in the CSV file write.csv2(test, tf) convertCSV(tf, row_names = TRUE, gp_field="Group", header = TRUE, sep = ";") # if there is no grouping factor in the CSV file convertCSV(tf, row_names = TRUE, gp_field ="none", header = TRUE, sep = ";") # gives the same output as the previous example convertCSV(tf, row_names = TRUE, header = TRUE, sep = ";")
test <- data.frame("Sample" = c("s1","s2","s3","s4"), "Group" = c("A","A","B", "C")) tf <- tempfile() write.csv2(test, tf, row.names = FALSE) convertCSV(tf, gp_field = "Group", header = TRUE, sep = ";") # if there are row names in the CSV file write.csv2(test, tf) convertCSV(tf, row_names = TRUE, gp_field="Group", header = TRUE, sep = ";") # if there is no grouping factor in the CSV file convertCSV(tf, row_names = TRUE, gp_field ="none", header = TRUE, sep = ";") # gives the same output as the previous example convertCSV(tf, row_names = TRUE, header = TRUE, sep = ";")
This function converts an ExpressionSet/MsnSet object into a dataframe to make it usable by the shiny application of wpm as well as by the wrapper function (version of wpm in command line)
convertESet(eSet_obj, gp_field = NULL)
convertESet(eSet_obj, gp_field = NULL)
eSet_obj |
an ExpressionSet/MsnSet object contaning the phenotype data |
gp_field |
character, corresponding to the phenotype data used to categorize samples into distinct groups if any |
a dataframe containing 3 fields: Sample, Group and ID.
sample_names <- c("s1","s2","s3","s4", "s5") M <- matrix(NA, nrow = 4, ncol = 5) colnames(M) <- sample_names rownames(M) <- paste0("id", LETTERS[1:4]) pd <- data.frame(Environment = rep_len(LETTERS[1:3], 5), Category = rep_len(1:2, 5), row.names = sample_names) rownames(pd) <- colnames(M) x <- MSnbase::MSnSet(exprs = M,pData = pd) convertESet(x, "Environment")
sample_names <- c("s1","s2","s3","s4", "s5") M <- matrix(NA, nrow = 4, ncol = 5) colnames(M) <- sample_names rownames(M) <- paste0("id", LETTERS[1:4]) pd <- data.frame(Environment = rep_len(LETTERS[1:3], 5), Category = rep_len(1:2, 5), row.names = sample_names) rownames(pd) <- colnames(M) x <- MSnbase::MSnSet(exprs = M,pData = pd) convertESet(x, "Environment")
Convert the phenotype data of a SummarizedExperiment into a dataframe for WPM
convertSE(se_object, gp_field = NULL)
convertSE(se_object, gp_field = NULL)
se_object |
a SummarizedExperiment object containing the phenotype data |
gp_field |
character, corresponding to the phenotype data used to categorize samples into distinct groups if any |
a dataframe containing 3 fields: Sample, Group and ID.
nrows <- 200 ncols <- 6 counts <- matrix(runif(nrows * ncols, 1, 1e4), nrows) colData <- data.frame(Treatment=rep(c("ChIP", "Input"), 3), row.names=LETTERS[1:6]) se <- SummarizedExperiment::SummarizedExperiment(assays=list(counts=counts), colData=colData) convertSE(se, "Treatment")
nrows <- 200 ncols <- 6 counts <- matrix(runif(nrows * ncols, 1, 1e4), nrows) colData <- data.frame(Treatment=rep(c("ChIP", "Input"), 3), row.names=LETTERS[1:6]) se <- SummarizedExperiment::SummarizedExperiment(assays=list(counts=counts), colData=colData) convertSE(se, "Treatment")
Function converting the format of "Letter-Digit" coordinates into a dataframe containing these coordinates in Row, Column.
convertVector2Df(chr_wells, max_Row, max_Col, status = NA)
convertVector2Df(chr_wells, max_Row, max_Col, status = NA)
chr_wells |
character string containing the wells |
max_Row |
integer, maximal number of lines in the plate |
max_Col |
integer, maximal number of columns in the plate |
status |
character, the status of the wells |
result, dataframe containing wells coordinates
# convert the vector of well coordinates into a dataframe convertVector2Df("A1,C2,A3,B12,C42",3,42,"specify_status") # supports uppercase / lowercase letters convertVector2Df("a1,C2,A3,b12,C42",3,42,"specify_status")
# convert the vector of well coordinates into a dataframe convertVector2Df("A1,C2,A3,B12,C42",3,42,"specify_status") # supports uppercase / lowercase letters convertVector2Df("a1,C2,A3,b12,C42",3,42,"specify_status")
A demo dataset containing the age and other attributes of 193 fictitious patients. It aims to help the user to test the shiny application of the wpm package.
data(data_test)
data(data_test)
A data frame with 193 rows and 7 variables
the samples to be analyzed representing fictitious patients.
age of the patients under 5 age groups, in years (20-30;30-40;40-50;50-60;60-70)
gender of the patients, F (for Female) and M (for Male)
the treatment each patient received, Ctrl (Control), treatment A, treatment B and treatment C
presence of diabetes, 0 (no) and 1 (yes)
A combination between the gender and treatment fields
A combination between the age and the diabetes fields
Helene Borges [email protected]
function to place the buffer solutions on the plate according to the selected mode: it generates a dataframe containing the row and column coordinates for each buffer solution.
defineBufferCoords(p_lines, p_cols, mod = "none", start_buffer)
defineBufferCoords(p_lines, p_cols, mod = "none", start_buffer)
p_lines |
number of rows on the plate |
p_cols |
number of columns on the plate |
mod |
character, can be "none", "by_column", "by_row" or "checkerboard" |
start_buffer |
character, "even" means that the buffers will be positioned on the even rows of the plate. Otherwise, they will be positioned on the odd rows. |
a dataframe containing the buffer wells with their coordinates.
Function to plot the input dataframe containing the Sample names, the Row, Column coordinates, the group and the status
drawMap(df, sample_gps, gp_levels, plate_lines, plate_cols, project_title)
drawMap(df, sample_gps, gp_levels, plate_lines, plate_cols, project_title)
df |
dataframe containing user data and special wells if any. |
sample_gps |
number of distinct groups in the file before adding the special wells to df |
gp_levels |
is Group column levels before adding the special wells to df |
plate_lines |
integer, number of plate's lines |
plate_cols |
integer, number of plate's columns |
project_title |
character, the user's project title |
g, a ggplot object corresponding to the generated plate map.
# example of data containing 5 biological samples, 2 forbidden wells, # 2 buffers and 3 not random wells user_data <- data.frame("Sample" = c(as.character(seq_len(5)), rep_len(NA, 7)), "Group" = c(c("A","B","C","A","B"), rep_len("forbidden", 2), rep_len("buffer", 2), rep_len("fixed", 3)), "ID" = c(seq_len(5),rep_len(NA,7)), "Well" = c("A2","B3","C3","B4","A3","A1","A4","B2","C2","B1","C1","C4"), "Status" = c(rep_len("toRandom", 5), rep_len("forbidden", 2), rep_len("buffer", 2), rep_len("fixed", 3)), "Row" = c(1,2,3,2,1,1,1,2,3,2,3,3), "Column" = c(2,3,3,4,3,1,4,2,2,1,1,4)) p <- "My Project" gp_lvl <- levels(as.factor(c("A","B","C"))) drawMap(df = user_data, sample_gps = 3, gp_levels = gp_lvl, plate_lines = 3, plate_cols = 4, project_title = p) # also works when giving a plate with more wells than the number of samples to place. drawMap(df = user_data, sample_gps = 3, gp_levels = gp_lvl, plate_lines = 8, plate_cols = 12, project_title = p)
# example of data containing 5 biological samples, 2 forbidden wells, # 2 buffers and 3 not random wells user_data <- data.frame("Sample" = c(as.character(seq_len(5)), rep_len(NA, 7)), "Group" = c(c("A","B","C","A","B"), rep_len("forbidden", 2), rep_len("buffer", 2), rep_len("fixed", 3)), "ID" = c(seq_len(5),rep_len(NA,7)), "Well" = c("A2","B3","C3","B4","A3","A1","A4","B2","C2","B1","C1","C4"), "Status" = c(rep_len("toRandom", 5), rep_len("forbidden", 2), rep_len("buffer", 2), rep_len("fixed", 3)), "Row" = c(1,2,3,2,1,1,1,2,3,2,3,3), "Column" = c(2,3,3,4,3,1,4,2,2,1,1,4)) p <- "My Project" gp_lvl <- levels(as.factor(c("A","B","C"))) drawMap(df = user_data, sample_gps = 3, gp_levels = gp_lvl, plate_lines = 3, plate_cols = 4, project_title = p) # also works when giving a plate with more wells than the number of samples to place. drawMap(df = user_data, sample_gps = 3, gp_levels = gp_lvl, plate_lines = 8, plate_cols = 12, project_title = p)
Function for spatial contraints: the North, East, West and South neighbors of the current element (i,j) of the matrix m.
findNEWSneighbors(m, i, j)
findNEWSneighbors(m, i, j)
m |
matrix |
i |
integer, line index in the matrix |
j |
integer, column index in the matrix |
A vector containing the North, East, West and South neighbors of the element (i,j) of the matrix being processed.
Function for spatial constraint that only looks for North (top) and South (bottom) neighbors of the current element (i,j) of the matrix m.
findNSneighbors(m, i, j)
findNSneighbors(m, i, j)
m |
matrix |
i |
integer, line index in the matrix |
j |
integer, column index in the matrix |
A vector containing the North and South neighbors of the element (i,j) of the matrix being processed.
Function for spatial constraint that only looks for West (left) and East (right) neighbors of the current element (i,j) of the matrix m.
findWEneighbors(m, i, j)
findWEneighbors(m, i, j)
m |
matrix |
i |
integer, line index in the matrix |
j |
integer, column index in the matrix |
A vector containing the West and East neighbors of the element (i,j) of the matrix being processed
This function generates a plate map using a backtracking algorithm and returns a dataframe if success. If it fails to find a solution, returns NULL. If there are not enough wells to place all the samples, returns 0.
generateMap( user_df, nb_rows, nb_cols, df_forbidden, mod, max_it, updateProgress = NULL )
generateMap( user_df, nb_rows, nb_cols, df_forbidden, mod, max_it, updateProgress = NULL )
user_df |
dataframe containing 9 features: Sample, ID, Group, Sample.name, Well, Status, Row, Column, Plate. See details. |
nb_rows |
numeric, number of lines on the plate |
nb_cols |
numeric, number of columns on the plate |
df_forbidden |
dataframe with the same structure than user_df, but for the forbidden, buffer solutions and Not randomized wells. |
mod |
character, neighborhood spatial constraint |
max_it |
numeric, maximum number of attempts to generate a plate plan before returning a failure. |
updateProgress |
shiny object, reports progress to the user. |
The dataframe is generated using dedicated functions of the wpm package: 'convertCSV()', 'convertESet()' or 'convertSE()'. But the user can also generate it by hand.
A number of attempts is allowed. Consequently, if the maximal number if attempts is reeched and no solution was found with the backtracking (i.e. the randomWalk does not return a dataframe), this function prints a warning message and returns NULL. If a solution is found, then it returns the dataframe.
Returns a dataframe containing all the data of the plate map(s)
Function that merges dataframes that contain wells of different types. To do this, it verifies that all the conditions provided are compatible with each other in order to be able to launch WPM on this data.
joinDataframes( forbidden_w = NULL, buffer_w = NULL, fixed_w = NULL, nb_samples, totalNbWells, nb_p )
joinDataframes( forbidden_w = NULL, buffer_w = NULL, fixed_w = NULL, nb_samples, totalNbWells, nb_p )
forbidden_w |
dataframe, the forbidden wells |
buffer_w |
datarame, the buffer wells |
fixed_w |
dataframe, the quality control wells |
nb_samples |
numeric, the number of samples to place using the backtracking algorithm. |
totalNbWells |
numeric, the total number of wells that can be filled. |
nb_p |
numeric, number of plates to fill |
a dataframe containing all the special wells
Returns the user dataframe updated after choosing randomly a well on the plate (matrix) and randomly choosing a sample ID that satisfies all the constraints.
randomWalk(m, toVisit, d, constraint)
randomWalk(m, toVisit, d, constraint)
m |
is a matrix corresponding to the plate to be filled. |
toVisit |
contains the wells in form "A1", and contains only the wells authorized to be filled in |
d |
is the dataframe containing the data supplied by the user. |
constraint |
character string corresponding to the spatial constraint selected by the user |
a dataframe corresponding to the user-supplied data. This dataframe is an updated version, where the columns 'Row' and 'Column' are filled with the coordinates of the chosen well. If no solution is found for the current selected well, then this function returns 1.
This function allows to pick up the last element in a vector when the parameter size is equal to 1. Passes parameters to 'sample.int' like size.
resample(x, ...)
resample(x, ...)
x |
is a vector |
... |
parameters given to the function sample.int |
a vector of length equal to size parameter.
This function chooses a sample randomly from among those who respect the neighborhood constraints and who have not yet been assigned to a well.
solveCell(m, d, i, j, already_drawn, constraint)
solveCell(m, d, i, j, already_drawn, constraint)
m |
Matrix representing the plate plan. |
d |
Dataframe containing the samples to place. |
i |
Line index of the chosen well. |
j |
Column index of the chosen well. |
already_drawn |
Vector of samples already affected to wells. |
constraint |
Character. Corresponds to the neighborhood constraint mode. |
If there is no possibility to find a valid sample, the function returns an error value (1). If a sample is chosen, then this function returns two objects: * __m__ The matrix updated with the new added sample. * __already_drawn__ The vector of already placed samples updated.
Run the Shiny Application of Well Plate Maker
wpm(...)
wpm(...)
... |
A series of options to be used inside the app. |
a shiny application object with golem options
if(interactive()) {wpm()}
if(interactive()) {wpm()}
Wrapper function that generates plate plans like the wpm shiny application. This feature allows the user to use the wpm package from the command line rather than going through a web application.
wrapperWPM( user_df, plate_dims, nb_plates, forbidden_wells = NULL, buffer_wells = NULL, fixed_wells = NULL, spatial_constraint = "none", max_iteration = 20 )
wrapperWPM( user_df, plate_dims, nb_plates, forbidden_wells = NULL, buffer_wells = NULL, fixed_wells = NULL, spatial_constraint = "none", max_iteration = 20 )
user_df |
dataframe containing user data obtained with the 'convertCSV()' or 'convertESet()' functions. |
plate_dims |
list, containing 2 values: the first is the number of plate's lines and second is the number of plate's columns. |
nb_plates |
numeric, corresponds to the number of plates to fill |
forbidden_wells |
character, the wells that will not be used at all for the experiment. This argument needs to be a character string giving the wells coordinates of the form "LetterNumber" (eg. "A1" for the well positionned in the first row/ first column of the plate). |
buffer_wells |
character, the wells that will be used during experiment but without biological sample in it. Same input structure as for forbidden_wells parameter. |
fixed_wells |
character, the wells that will be used for Quality Control samples or standards during the Experiment. Same input structure as for forbidden_wells parameter. |
spatial_constraint |
character, is the spatial constraint used to place the samples on the plate. It can also be called neighborhood constraint. Currently, the possible values are "none", "NS" (for North-South), "WE" (for West-East) and "NEWS" (North-South-East-West). |
max_iteration |
numeric, maximal number of attemps for wpm to find a valid solution. |
a dataframe if wpm finds a solution.
# create a MSnSet toy example sample_names <- c("s1","s2","s3","s4", "s5") M <- matrix(NA, nrow = 4, ncol = 5) colnames(M) <- sample_names rownames(M) <- paste0("id", LETTERS[1:4]) pd <- data.frame(Environment = rep_len(LETTERS[1:3], 5), Category = rep_len(1:2, 5), row.names = sample_names) rownames(pd) <- colnames(M) x <- MSnbase::MSnSet(exprs = M,pData = pd) # convert it to a valid dataframe for wpm df <- convertESet(x, "Environment") # run wpm on the toy example wrapperWPM(user_df = df, plate_dims = list(8,12), nb_plates = 1, forbidden_wells = "A1,A2,A3", fixed_wells = "B1,B2", spatial_constraint = "NS")
# create a MSnSet toy example sample_names <- c("s1","s2","s3","s4", "s5") M <- matrix(NA, nrow = 4, ncol = 5) colnames(M) <- sample_names rownames(M) <- paste0("id", LETTERS[1:4]) pd <- data.frame(Environment = rep_len(LETTERS[1:3], 5), Category = rep_len(1:2, 5), row.names = sample_names) rownames(pd) <- colnames(M) x <- MSnbase::MSnSet(exprs = M,pData = pd) # convert it to a valid dataframe for wpm df <- convertESet(x, "Environment") # run wpm on the toy example wrapperWPM(user_df = df, plate_dims = list(8,12), nb_plates = 1, forbidden_wells = "A1,A2,A3", fixed_wells = "B1,B2", spatial_constraint = "NS")