Title: | Inferring tumor-specific cancer dependencies through integrating ex-vivo drug response assays and drug-protein profiling |
---|---|
Description: | DepInfeR integrates two experimentally accessible input data matrices: the drug sensitivity profiles of cancer cell lines or primary tumors ex-vivo (X), and the drug affinities of a set of proteins (Y), to infer a matrix of molecular protein dependencies of the cancers (ß). DepInfeR deconvolutes the protein inhibition effect on the viability phenotype by using regularized multivariate linear regression. It assigns a “dependence coefficient” to each protein and each sample, and therefore could be used to gain a causal and accurate understanding of functional consequences of genomic aberrations in a heterogeneous disease, as well as to guide the choice of pharmacological intervention for a specific cancer type, sub-type, or an individual patient. For more information, please read out preprint on bioRxiv: https://doi.org/10.1101/2022.01.11.475864. |
Authors: | Junyan Lu [aut, cre] , Alina Batzilla [aut] |
Maintainer: | Junyan Lu <[email protected]> |
License: | GPL-3 |
Version: | 1.11.0 |
Built: | 2024-10-30 05:26:56 UTC |
Source: | https://github.com/bioc/DepInfeR |
DepInfeR integrates two experimentally accessible input data matrices: the drug sensitivity profiles of cancer cell lines or primary tumors ex-vivo (X), and the drug affinities of a set of proteins (Y), to infer a matrix of molecular protein dependencies of the cancers (ß). DepInfeR deconvolutes the protein inhibition effect on the viability phenotype by using regularized multivariate linear regression. It assigns a “dependence coefficient” to each protein and each sample, and therefore could be used to gain a causal and accurate understanding of functional consequences of genomic aberrations in a heterogeneous disease, as well as to guide the choice of pharmacological intervention for a specific cancer type, sub-type, or an individual patient. For more information, please read out preprint on bioRxiv: https://doi.org/10.1101/2022.01.11.475864.
The main functions are:
runLASSORegression
- perform inference of target importance
processTarget
- pre-process drug-protein affinity dataset
For detailed information on usage, see the package vignette, by typing
vignette("DepInfeR")
.
All software-related questions should be posted to the Bioconductor Support Site:
https://support.bioconductor.org
The code can be viewed at the GitHub repository. https://github.com/Huber-group-EMBL/DepInfeR
Alina Batzilla, Junyan Lu
Batzilla, A. and Lu, J. et al. (2022) Inferring tumor-specific cancer dependencies through integrating ex-vivo drug response assays and drug-protein profiling. https://www.biorxiv.org/content/10.1101/2022.01.11.475864v1
This is the processed Genomics of Drug Sensitivity in Cancer (GDSC) drug sensitivity dataset. The raw dataset was downloaded from https://www.cancerrxgene.org/downloads/bulk_download. The post-processing steps can be found at: https://www.huber.embl.de/users/jlu/depInfeR/process_GDSC.html.
data(drug_response_GDSC)
data(drug_response_GDSC)
an object of "tbl_df" (tidy table)
data(drug_response_GDSC)
data(drug_response_GDSC)
This cancer type and genomic background annotation for cancer cell lines, use for the analysis of the GDSC dataset in the package vignette. The raw dataset was downloaded from https://www.cancerrxgene.org/downloads/bulk_download. The post-processing steps can be found at: https://www.huber.embl.de/users/jlu/depInfeR/process_GDSC.html.
data(mutation_GDSC)
data(mutation_GDSC)
an object of "tbl_df" (tidy table)
data(mutation_GDSC)
data(mutation_GDSC)
This function is used to preprocess the drug-protein affinity dataset including the following steps: - log-transform kd values (KdAsInput = TRUE) - arctan-transform log(kd) values (KdAsInput = TRUE) - check target similarity and remove highly correlated proteins (removeCorrelated = TRUE) - specify targets that should be kept in the matrix (keepTargets = NULL)
processTarget( targetsMat, KdAsInput = TRUE, removeCorrelated = TRUE, keepTargets = NULL, cutoff = 0.8 )
processTarget( targetsMat, KdAsInput = TRUE, removeCorrelated = TRUE, keepTargets = NULL, cutoff = 0.8 )
targetsMat |
Drug-protein affinity matrix with kd values (or optionally other affinity measurement values at roughly normal distribution). Each row is a drug and each column is a sample (cell line or tumor sample). |
KdAsInput |
A boolean value indicating whether the drug-protein affinity matrix contains kd values which should be log- and arctan-transformed. The default value is TRUE. |
removeCorrelated |
A boolean value indicating whether highly correlated proteins should be summarized in target groups. The default value is TRUE. |
keepTargets |
A character variable that specifies important proteins that should be retained in the matrix. |
cutoff |
A Cosine similarity cutoff value for clustering proteins into one target group. The value should be between 0 and 1. |
All steps within this function are optional depending on input data. The transformation steps should be performed if the affinity matrix consists of kd values. If there are highly correlated features within the affinity matrix, they can be removed using the provided function.
A list of two elements: 1)targetMatrix
Pre-processed drug-protein
affinity matrix; 2)targetCluster
, a list that contains the targets
show high correlations with each other.
data(targetMatrix) processTarget(targetsMat = targetMatrix, KdAsInput = TRUE , removeCorrelated = TRUE)
data(targetMatrix) processTarget(targetsMat = targetMatrix, KdAsInput = TRUE , removeCorrelated = TRUE)
A toy data set that contains processed drug response matrix for examples and test of runLASSORegression function. Rows contain drugs and columns contain samples.
data(responseInput)
data(responseInput)
an object of matrix
data(responseInput)
data(responseInput)
This function performs multivariate linear regression with LASSO penalty and cross-validation to infer per-sample protein dependence coefficients. Please refer to the package vignette for more detailed information about this function. For the mathematical model behind this function, please refer to our preprint on bioRxiv: https://doi.org/10.1101/2022.01.11.475864.
runLASSORegression( TargetMatrix, ResponseMatrix, repeats = 100, BPPARAM = bpparam() )
runLASSORegression( TargetMatrix, ResponseMatrix, repeats = 100, BPPARAM = bpparam() )
TargetMatrix |
Pre-processed drug-protein affinity matrix. Each row is a drug and each column is a protein target. |
ResponseMatrix |
Pre-processed drug-response viability matrix. Each row is a drug and each column is a sample (cell line or tumor sample). |
repeats |
A integer variable specifying the number of regression repeats. The default value is 100. A higher number can result in better stability but also takes longer time. |
BPPARAM |
The BiocParallel back-end. If not specified, the default
|
A list of 6 elements:
coefMat |
A matrix containing the inferred protein dependency coefficient matrix. Each row is a protein and each column is a sample. |
freqMat |
A matrix containing the selection frequency a protein (row) for each sample (column). |
lambdaList |
A vector containing the lambda values selected for all repeated runs. |
varExplain.all |
A vector containing the R2 (variance explained) values for all repeated runs. |
inputX |
A copy of the input drug-protein affinity matrix. |
inputY |
A copy of the input drug-repsonse viability matrix. |
data(responseInput) #load drug response matrix data(targetInput) #load drug-target affinity matrix runLASSORegression(TargetMatrix = targetInput, ResponseMatrix = responseInput, repeats = 5)
data(responseInput) #load drug response matrix data(targetInput) #load drug-target affinity matrix runLASSORegression(TargetMatrix = targetInput, ResponseMatrix = responseInput, repeats = 5)
A toy data set that contains processed drug-target affinity matrix for examples and test of runLASSOregression function. Rows contain drugs and columns contain targets.
data(targetInput)
data(targetInput)
an object of matrix
data(targetInput)
data(targetInput)
A toy data set that contains drug-target affinity matrix for examples and test of processTarget function. Rows contain drugs and columns contain targets.
data(targetMatrix)
data(targetMatrix)
an object of matrix
data(targetMatrix)
data(targetMatrix)
This drug-protein affinity profiling data for the analysis of the GDSC dataset - a subset of the data provided by Klaeger et al. 2017. The raw data can be found in the supplementary file of the paper (Table_S1 & Table_S2): https://science.sciencemag.org/content/358/6367/eaan4368/tab-figures-data. The post-processing steps can be found at: https://www.huber.embl.de/users/jlu/depInfeR/process_kinobeads.html.
data(targetsGDSC)
data(targetsGDSC)
an object of "tbl_df" (tidy table)
data(targetsGDSC)
data(targetsGDSC)