Title: | Bayesian analysis of Gaussian GWAS data |
---|---|
Description: | This package is built to perform GWAS analysis using Bayesian techniques. Currently, GWAS.BAYES has functionality for the implementation of BICOSS (Williams, J., Ferreira, M. A., and Ji, T. (2022). BICOSS: Bayesian iterative conditional stochastic search for GWAS. BMC Bioinformatics), BGWAS (Williams, J., Xu, S., Ferreira, M. A.. (2023) "BGWAS: Bayesian variable selection in linear mixed models with nonlocal priors for genome-wide association studies." BMC Bioinformatics), and GINA. All methods currently are for the analysis of Gaussian phenotypes The research related to this package was supported in part by National Science Foundation awards DMS 1853549, DMS 1853556, and DMS 2054173. |
Authors: | Jacob Williams [aut, cre] |
Maintainer: | Jacob Williams <[email protected]> |
License: | GPL-3 + file LICENSE |
Version: | 1.15.1 |
Built: | 2024-07-17 19:33:56 UTC |
Source: | https://github.com/bioc/GWAS.BAYES |
Performs BGWAS analysis as in the BGWAS manuscript (Williams, J., Xu, S. and Ferreira, M.A., 2023. BGWAS: Bayesian variable selection in linear mixed models with nonlocal priors for genome-wide association studies. BMC Bioinformatics, 24(1), pp.1-20.).
BGWAS( Y, SNPs, FDR_Nominal = 0.05, kinship = diag(nrow(SNPs)), tau = "IG", maxiterations = 4000, runs_til_stop = 400 )
BGWAS( Y, SNPs, FDR_Nominal = 0.05, kinship = diag(nrow(SNPs)), tau = "IG", maxiterations = 4000, runs_til_stop = 400 )
Y |
The observed numeric phenotypes |
SNPs |
The SNP matrix, where each column represents a single SNP encoded as the numeric coding 0, 1, 2. This is entered as a matrix object. |
FDR_Nominal |
The nominal false discovery rate for which SNPs are selected from in the screening step. Defaulted at 0.05. |
kinship |
The observed kinship matrix, has to be a square positive semidefinite matrix. Defaulted as the identity matrix. The function used to create the kinship matrix used in the BICOSS paper is A.mat() from package rrBLUP. |
tau |
This specifies the prior or the fixed value for the nonlocal prior on the SNP effects. To specify the Inverse Gamma prior set tau = "IG". To specify fixed values of tau set tau = 0.022. Fixed values explored in the BBGWAS manuscript were 0.022 and 0.348. |
maxiterations |
The maximum iterations the genetic algorithm in the model selection step iterates for. Defaulted at 400 which is the value used in the BICOSS paper simulation studies. |
runs_til_stop |
The number of iterations at the same best model before the genetic algorithm in the model selection step converges. Defaulted at 40 which is the value used in the BICOSS paper simulation studies. |
A named list that includes the output from the screening step and the output from the model selection step. The model proposed by the BGWAS method is the output of the model selection step.
library(GWAS.BAYES) BGWAS(Y = Y, SNPs = SNPs, kinship = kinship, FDR_Nominal = 0.05, tau = "IG", maxiterations = 400,runs_til_stop = 40)
library(GWAS.BAYES) BGWAS(Y = Y, SNPs = SNPs, kinship = kinship, FDR_Nominal = 0.05, tau = "IG", maxiterations = 400,runs_til_stop = 40)
Performs BICOSS analysis as described in Williams, J., Ferreira, M.A.R. & Ji, T. BICOSS: Bayesian iterative conditional stochastic search for GWAS. BMC Bioinformatics 23, 475 (2022). https://doi.org/10.1186/s12859-022-05030-0.
BICOSS( Y, SNPs, FDR_Nominal = 0.05, kinship = diag(nrow(SNPs)), maxiterations = 400, runs_til_stop = 40, P3D = TRUE )
BICOSS( Y, SNPs, FDR_Nominal = 0.05, kinship = diag(nrow(SNPs)), maxiterations = 400, runs_til_stop = 40, P3D = TRUE )
Y |
The observed numeric phenotypes |
SNPs |
The SNP matrix, where each column represents a single SNP encoded as the numeric coding 0, 1, 2. This is entered as a matrix object. |
FDR_Nominal |
The nominal false discovery rate for which SNPs are selected from in the screening step. |
kinship |
The observed kinship matrix, has to be a square positive semidefinite matrix. Defaulted as the identity matrix. The function used to create the kinship matrix used in the BICOSS paper is A.mat() from package rrBLUP. |
maxiterations |
The maximum iterations the genetic algorithm in the model selection step iterates for. Defaulted at 400 which is the value used in the BICOSS paper simulation studies. |
runs_til_stop |
The number of iterations at the same best model before the genetic algorithm in the model selection step converges. Defaulted at 40 which is the value used in the BICOSS paper simulation studies. |
P3D |
Population previous determined, if TRUE BICOSS uses approximated variance parameters estimated from the baseline model when conducting both the screening and the model selection steps. Setting P3D = TRUE is significantly faster. If FALSE, uses exact estimates of the variance parameters all models in both the screening and model selection step. |
The column indices of SNPs that were in the best model identified by BICOSS.
library(GWAS.BAYES) BICOSS(Y = Y, SNPs = SNPs, kinship = kinship, FDR_Nominal = 0.05,P3D = TRUE, maxiterations = 400,runs_til_stop = 40)
library(GWAS.BAYES) BICOSS(Y = Y, SNPs = SNPs, kinship = kinship, FDR_Nominal = 0.05,P3D = TRUE, maxiterations = 400,runs_til_stop = 40)
Performs GINA
GINA( Y, SNPs, kinship, FDR_Nominal = 0.05, maxiterations = 400, runs_til_stop = 40 )
GINA( Y, SNPs, kinship, FDR_Nominal = 0.05, maxiterations = 400, runs_til_stop = 40 )
Y |
The observed numeric phenotypes |
SNPs |
The SNP matrix, where each column represents a single SNP encoded as the numeric coding 0, 1, 2. This is entered as a matrix object. |
kinship |
The observed kinship matrix, has to be a square positive semidefinite matrix. Defaulted as the identity matrix. The function used to create the kinship matrix used in the BICOSS paper is A.mat() from package rrBLUP. |
FDR_Nominal |
The nominal false discovery rate for which SNPs are selected from in the screening step. |
maxiterations |
The maximum iterations the genetic algorithm in the model selection step iterates for. Defaulted at 400 which is the value used in the BICOSS paper simulation studies. |
runs_til_stop |
The number of iterations at the same best model before the genetic algorithm in the model selection step converges. Defaulted at 40 which is the value used in the BICOSS paper simulation studies. |
The column indices of SNPs that were in the best model identified by BICOSS.
library(GWAS.BAYES) GINA(Y = Y, SNPs = SNPs, kinship = kinship, FDR_Nominal = 0.05, maxiterations = 400,runs_til_stop = 40)
library(GWAS.BAYES) GINA(Y = Y, SNPs = SNPs, kinship = kinship, FDR_Nominal = 0.05, maxiterations = 400,runs_til_stop = 40)
This is a kinship matrix from the TAIR9 genotype information for 328 A. Thaliana Ecotypes from the paper Components of Root Architecture Remodeling in Response to Salt Stress. The kinship matrix was computed using all SNPs with minor allele frequency greater than 0.01.
kinship
kinship
## 'kinship' A matrix with 328 rows and 328 columns corresponding to the 328 ecotypes.
Performs Single Marker Association tests for both Linear Mixed Models and Linear models.
SMA(Y, SNPs, kinship = FALSE, P3D = FALSE)
SMA(Y, SNPs, kinship = FALSE, P3D = FALSE)
Y |
The observed numeric phenotypes |
SNPs |
The SNP matrix, where each column represents a single SNP encoded as the numeric coding 0, 1, 2. This is entered as a matrix object. |
kinship |
The observed kinship matrix, has to be a square positive semidefinite matrix. Defaulted as the identity matrix. The function used to create the kinship matrix used in the BICOSS paper is A.mat() from package rrBLUP. |
P3D |
Population previous determined, if TRUE BICOSS uses approximated variance parameters estimated from the baseline model when conducting both the screening and the model selection steps. Setting P3D = TRUE is significantly faster. If FALSE, uses exact estimates of the variance parameters all models in both the screening and model selection step. |
The p-values corresponding to every column provided in SNPs. These p-values can be used with any threshold of your choosing or with p.adjust().
This is a matrix with 328 observations and 9,000 SNPs. Each row is contains 9,000 SNPs from a single A. Thaliana ecotype in the paper Components of Root Architecture Remodeling in Response to Salt Stress.
SNPs
SNPs
## 'SNPs' A matrix with 328 observations and 9,000 SNPs.
This is a phenotype matrix simulated from the 9,000 SNPs. SNPs at positions 450, 1350, 2250, 3150, 4050, 4950, 5850, 6750, 7650, and 8550 have nonzero coefficients. Further, the data was simulated under the linear mixed model specified in the vignette and the BICOSS manuscript using the kinship matrix (kinship).
Y
Y
## 'Y' A matrix with 328 rows corresponding to the 328 ecotypes.