Package: MungeSumstats 1.15.2

Alan Murphy

MungeSumstats: Standardise summary statistics from GWAS

The *MungeSumstats* package is designed to facilitate the standardisation of GWAS summary statistics. It reformats inputted summary statisitics to include SNP, CHR, BP and can look up these values if any are missing. It also pefrorms dozens of QC and filtering steps to ensure high data quality and minimise inter-study differences.

Authors:Alan Murphy [aut, cre], Brian Schilder [aut, ctb], Nathan Skene [aut]

MungeSumstats_1.15.2.tar.gz
MungeSumstats_1.15.2.zip(r-4.5)MungeSumstats_1.15.2.zip(r-4.4)MungeSumstats_1.15.2.zip(r-4.3)
MungeSumstats_1.15.2.tgz(r-4.4-any)MungeSumstats_1.15.2.tgz(r-4.3-any)
MungeSumstats_1.15.2.tar.gz(r-4.4-noble)
MungeSumstats_1.15.2.tgz(r-4.4-emscripten)MungeSumstats_1.15.2.tgz(r-4.3-emscripten)
MungeSumstats.pdf |MungeSumstats.html
MungeSumstats/json (API)
NEWS

# Install 'MungeSumstats' in R:
install.packages('MungeSumstats', repos = c('https://bioc.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/neurogenomics/mungesumstats/issues

Datasets:

On BioConductor:MungeSumstats-1.15.0(bioc 3.21)MungeSumstats-1.14.1(bioc 3.20)

snpwholegenomegeneticscomparativegenomicsgenomewideassociationgenomicvariationpreprocessing

5.85 score 87 scripts 1.5k downloads 19 exports 91 dependencies

Last updated 22 days agofrom:12077de3ba. Checks:ERROR: 1 WARNING: 5. Indexed: yes.

TargetResultDate
Doc / VignettesFAILNov 08 2024
R-4.5-winWARNINGNov 08 2024
R-4.4-winWARNINGNov 08 2024
R-4.4-macWARNINGNov 08 2024
R-4.3-winWARNINGNov 08 2024
R-4.3-macWARNINGNov 08 2024

Exports:compute_nsizedownload_vcffind_sumstatsformat_sumstatsformatted_exampleget_genome_buildsimport_sumstatsindex_tabularinfer_effect_columnliftoverlist_sumstatsload_snp_loc_dataparse_logsread_headerread_sumstatsread_vcfstandardise_headervcf2dfwrite_sumstats

Dependencies:abindAnnotationDbiaskpassassertthatBHBiobaseBiocGenericsBiocIOBiocParallelBiostringsbitbit64bitopsblobBSgenomecachemclicodetoolscpp11crayoncurldata.tableDBIDelayedArraydigestdplyrfansifastmapformatRfsfutile.loggerfutile.optionsgarglegenericsGenomeInfoDbGenomeInfoDbDataGenomicAlignmentsGenomicFeaturesGenomicRangesgluegoogleAuthRhttrIRangesjsonliteKEGGRESTlambda.rlatticelifecyclemagrittrMatrixMatrixGenericsmatrixStatsmemoisemimeopensslpillarpkgconfigplogrpngR.methodsS3R.ooR.utilsR6rappdirsRCurlrestfulrRhtslibrjsonrlangRsamtoolsRSQLitertracklayerS4ArraysS4VectorssnowSparseArraystringistringrSummarizedExperimentsystibbletidyselectUCSC.utilsutf8VariantAnnotationvctrswithrXMLXVectoryamlzlibbioc

Readme and manuals

Help Manual

Help pageTopics
Ensures that parameters are compatible with LDSC formatcheck_ldsc_format
Check for N column if not present and user wants, impute N based on user's sample size. *NOTE* this will be the same value for each SNP which is not necessarily correct and may cause issues down the line. N can also be inputted with "ldsc", "sum", "giant" or "metal" by passing one or multiple of these.compute_nsize
Download VCF file and its index file from Open GWASdownload_vcf
Search Open GWAS for datasets matching criteriafind_sumstats
Check that summary statistics from GWAS are in a homogeneous formatformat_sumstats
Formatted exampleformatted_example
Get combinations of uncorrected allele and effect (and frq) columnsget_eff_frq_allele_combns
Infer genome buildsget_genome_builds
UCSC Chain file hg19 to hg38hg19ToHg38
UCSC Chain file hg38 to hg19hg38ToHg19
Local ieu-a-298 file from IEU Open GWASieu-a-298
Import full genome-wide GWAS summary statistics from Open GWASimport_sumstats
Tabix-index file: tableindex_tabular
Infer if effect relates to a1 or A2 if ambiguously namedinfer_effect_column
Genome build liftoverliftover
List munged summary statisticslist_sumstats
Load the reference genome data for SNPs of interestload_ref_genome_data
Loads the SNP locations and alleles for Homo sapiens extracted from NCBI dbSNP Build 144. Reference genome version is dependent on user input.load_snp_loc_data
Parse data from log filesparse_logs
GWAS Amyotrophic lateral sclerosis ieu open GWAS project - Subsetraw_ALSvcf
GWAS Educational Attainment Okbay 2016 - Subsetraw_eduAttainOkbay
Read in file headerread_header
Determine summary statistics file type and read them into memoryread_sumstats
Read in VCF fileread_vcf
Register coresregister_cores
Standardise the column headers in the Summary Statistics filesstandardise_header
Summary Statistics Column HeaderssumstatsColHeaders
VCF to DFvcf2df
Write sum stats file to diskwrite_sumstats