The MsDataHub package provides example mass spectrometry data, peptide
spectrum matches or quantitative data from proteomics and metabolomics
experiments. The data are served through the ExperimentHub
infrastructure, which downloads each file only once and caches it for
further use. The currently available data are summarised in the table
below and described in detail in the next section.
To install the package:
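The package is distributed through Bioconductor, so a typical installation goes through `BiocManager`. The sketch below assumes a working Bioconductor setup; the final call to `MsDataHub()` is expected to return a table of the data currently served by the package.

```r
## Install BiocManager from CRAN if it is not already available
if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

## Install MsDataHub (and its dependencies) from Bioconductor
BiocManager::install("MsDataHub")

## Load the package and list the first few available resources
library(MsDataHub)
head(MsDataHub())
```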
Load with
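A minimal sketch of such a call. It assumes that the two files shown below are exposed by the accessors `PestMix1_DDA.mzML()` and `PestMix1_SWATH.mzML()`; these names are an assumption, and `MsDataHub()` lists the accessors that are actually available.

```r
library(MsDataHub)
library(Spectra)

## Each accessor downloads the file on first use and returns the path of
## the locally cached copy (the accessor names are assumed here)
dda_file <- PestMix1_DDA.mzML()
swath_file <- PestMix1_SWATH.mzML()

## Represent each mzML file as a Spectra object; for file paths, Spectra()
## uses the MsBackendMzR backend by default
Spectra(dda_file)
Spectra(swath_file)
```

The same pattern (accessor first, then a reader such as `Spectra()`) should apply to the other raw data files served by the package.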
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## MSn data (Spectra) with 7602 spectra in a MsBackendMzR backend:
## msLevel rtime scanIndex
## <integer> <numeric> <integer>
## 1 1 0.231 1
## 2 1 0.351 2
## 3 1 0.471 3
## 4 1 0.591 4
## 5 1 0.711 5
## ... ... ... ...
## 7598 1 899.491 7598
## 7599 1 899.613 7599
## 7600 1 899.747 7600
## 7601 1 899.872 7601
## 7602 1 899.993 7602
## ... 33 more variables/columns.
## file(s):
## 20592f89ded4_7861
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## MSn data (Spectra) with 8999 spectra in a MsBackendMzR backend:
## msLevel rtime scanIndex
## <integer> <numeric> <integer>
## 1 2 0.203 1
## 2 2 0.300 2
## 3 2 0.397 3
## 4 2 0.494 4
## 5 2 0.591 5
## ... ... ... ...
## 8995 2 899.527 8995
## 8996 2 899.624 8996
## 8997 2 899.721 8997
## 8998 2 899.818 8998
## 8999 2 899.915 8999
## ... 33 more variables/columns.
## file(s):
## 20593d6c4887_7862
Load with
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## MSn data (Spectra) with 931 spectra in a MsBackendMzR backend:
## msLevel rtime scanIndex
## <integer> <numeric> <integer>
## 1 1 0.280 1
## 2 1 0.559 2
## 3 1 0.838 3
## 4 1 1.117 4
## 5 1 1.396 5
## ... ... ... ...
## 927 1 258.641 927
## 928 1 258.920 928
## 929 1 259.199 929
## 930 1 259.478 930
## 931 1 259.757 931
## ... 33 more variables/columns.
## file(s):
## 205913692c7e_7859
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## MSn data (Spectra) with 931 spectra in a MsBackendMzR backend:
## msLevel rtime scanIndex
## <integer> <numeric> <integer>
## 1 1 0.275 1
## 2 1 0.554 2
## 3 1 0.833 3
## 4 1 1.112 4
## 5 1 1.391 5
## ... ... ... ...
## 927 1 258.636 927
## 928 1 258.915 928
## 929 1 259.194 929
## 930 1 259.473 930
## 931 1 259.752 931
## ... 33 more variables/columns.
## file(s):
## 20592dfa9978_7860
Load with
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## MSn data (Spectra) with 7534 spectra in a MsBackendMzR backend:
## msLevel rtime scanIndex
## <integer> <numeric> <integer>
## 1 1 0.4584 1
## 2 1 0.9725 2
## 3 1 1.8524 3
## 4 1 2.7424 4
## 5 1 3.6124 5
## ... ... ... ...
## 7530 2 3600.47 7530
## 7531 2 3600.83 7531
## 7532 2 3601.18 7532
## 7533 2 3601.57 7533
## 7534 2 3601.98 7534
## ... 33 more variables/columns.
## file(s):
## 20596616feb0_7858
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## PSM with 5802 rows and 35 columns.
## names(35): sequence spectrumID ... subReplacementResidue subLocation
Load with
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## class: SummarizedExperiment
## dim: 11466 45
## metadata(0):
## assays(1): ''
## rownames(11466): 1 2 ... 11465 11466
## rowData names(143): Sequence N.term.cleavage.window ...
## MS.MS.Count
## colnames(45): Intensity.6A_1 Intensity.6A_2 ... Intensity.6E_8
## Intensity.6E_9
## colData names(0):
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## EH7804
## "/github/home/.cache/R/ExperimentHub/20596f52cac_7854"
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## EH7805
## "/github/home/.cache/R/ExperimentHub/205961578ea9_7855"
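The two results above are named character vectors: the name is the ExperimentHub identifier and the value is the path of the locally cached file. The sketch below inspects such a result and checks that a second call reuses the cache. It uses the `benchmarkingDIA.tsv()` accessor purely as an illustration, and assumes its return value has the same shape as the paths shown above.

```r
library(MsDataHub)

## The first call may download the file; later calls reuse the cached copy
f <- benchmarkingDIA.tsv()
names(f)        # ExperimentHub identifier of the resource
file.exists(f)  # the value is an ordinary path on the local disk

## Calling the accessor again should resolve to the same cached file
f_again <- benchmarkingDIA.tsv()
identical(unname(f), unname(f_again))

## Location of the ExperimentHub cache used on this machine
ExperimentHub::getExperimentHubOption("CACHE")
```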
Load with
## see ?MsDataHub and browseVignettes('MsDataHub') for documentation
## downloading 1 resources
## retrieving 1 resource
## loading from cache
## MSn data (Spectra) with 1278 spectra in a MsBackendMzR backend:
## msLevel rtime scanIndex
## <integer> <numeric> <integer>
## 1 1 2501.38 1
## 2 1 2502.94 2
## 3 1 2504.51 3
## 4 1 2506.07 4
## 5 1 2507.64 5
## ... ... ... ...
## 1274 1 4493.56 1274
## 1275 1 4495.13 1275
## 1276 1 4496.69 1276
## 1277 1 4498.26 1277
## 1278 1 4499.82 1278
## ... 33 more variables/columns.
## file(s):
## 2059374026e5_7853
Load with
library(QFeatures)
lfdia <- read.delim(MsDataHub::benchmarkingDIA.tsv())
readQFeaturesFromDIANN(lfdia)
## An instance of class QFeatures containing 24 assays:
## [1] U:\712006-Proteomics\Issues\Issue 253\DIANN\raw-data\RD139_Overlap_UPS1_0_1fmol_inj1.mzML: SummarizedExperiment with 28980 rows and 1 columns
## [2] U:\712006-Proteomics\Issues\Issue 253\DIANN\raw-data\RD139_Overlap_UPS1_0_1fmol_inj2.mzML: SummarizedExperiment with 29495 rows and 1 columns
## [3] U:\712006-Proteomics\Issues\Issue 253\DIANN\raw-data\RD139_Overlap_UPS1_0_1fmol_inj3.mzML: SummarizedExperiment with 29210 rows and 1 columns
## ...
## [22] U:\712006-Proteomics\Issues\Issue 253\DIANN\raw-data\RD139_Overlap_UPS1_5fmol_inj1.mzML: SummarizedExperiment with 30941 rows and 1 columns
## [23] U:\712006-Proteomics\Issues\Issue 253\DIANN\raw-data\RD139_Overlap_UPS1_5fmol_inj2.mzML: SummarizedExperiment with 30321 rows and 1 columns
## [24] U:\712006-Proteomics\Issues\Issue 253\DIANN\raw-data\RD139_Overlap_UPS1_5fmol_inj3.mzML: SummarizedExperiment with 24168 rows and 1 columns
plexdia <- read.delim(MsDataHub::Report.Derks2022.plexDIA.tsv())
readQFeaturesFromDIANN(plexdia, multiplexing = "mTRAQ")
## An instance of class QFeatures containing 54 assays:
## [1] F:\JD\plexDIA\nPOP\wJD1146.raw: SummarizedExperiment with 2635 rows and 3 columns
## [2] F:\JD\plexDIA\nPOP\wJD1147.raw: SummarizedExperiment with 3000 rows and 3 columns
## [3] F:\JD\plexDIA\nPOP\wJD1148.raw: SummarizedExperiment with 2676 rows and 3 columns
## ...
## [52] F:\JD\plexDIA\nPOP\wJD1203.raw: SummarizedExperiment with 4441 rows and 3 columns
## [53] F:\JD\plexDIA\nPOP\wJD1204.raw: SummarizedExperiment with 4416 rows and 3 columns
## [54] F:\JD\plexDIA\nPOP\wJD1205.raw: SummarizedExperiment with 4492 rows and 3 columns
To contribute new data, start by opening an issue
in the package’s GitHub repository and describe the new data. In
particular, provide information about its provenance, its use and its
format(s), and acknowledge that the data may be shared freely with the
community without any restrictions. You may provide an open licence
specifying the terms under which the data can be re-used, typically
CC-BY-SA. If you are familiar with R packages and GitHub pull requests,
you may directly send one that adds your data to the package. Make sure
to (1) add appropriate references in the manual page and (2) add
yourself as a contributor to the package in the DESCRIPTION file.
## R version 4.4.2 (2024-10-31)
## Platform: x86_64-pc-linux-gnu
## Running under: Ubuntu 24.04.1 LTS
## Matrix products: default
## BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/
## LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/; LAPACK version 3.12.0
## locale:
## time zone: Etc/UTC
## tzcode source: system (glibc)
## attached base packages:
## [1] stats4 stats graphics grDevices utils datasets methods
## [8] base
## other attached packages:
## [1] MsDataHub_1.7.0 QFeatures_1.17.0
## [3] MultiAssayExperiment_1.33.4 SummarizedExperiment_1.37.0
## [5] Biobase_2.67.0 GenomicRanges_1.59.1
## [7] GenomeInfoDb_1.43.2 IRanges_2.41.2
## [9] MatrixGenerics_1.19.0 matrixStats_1.4.1
## [11] PSMatch_1.11.0 Spectra_1.17.4
## [13] BiocParallel_1.41.0 S4Vectors_0.45.2
## [15] BiocGenerics_0.53.3 generics_0.1.3
## [17] BiocStyle_2.35.0
## loaded via a namespace (and not attached):
## [1] DBI_1.2.3 rlang_1.1.4 magrittr_2.0.3
## [4] clue_0.3-66 compiler_4.4.2 RSQLite_2.3.9
## [7] png_0.1-8 vctrs_0.6.5 reshape2_1.4.4
## [10] stringr_1.5.1 ProtGenerics_1.39.1 pkgconfig_2.0.3
## [13] MetaboCoreUtils_1.15.0 crayon_1.5.3 fastmap_1.2.0
## [16] dbplyr_2.5.0 XVector_0.47.1 rmarkdown_2.29
## [19] UCSC.utils_1.3.0 purrr_1.0.2 bit_4.5.0.1
## [22] xfun_0.49 cachem_1.1.0 jsonlite_1.8.9
## [25] blob_1.2.4 DelayedArray_0.33.3 parallel_4.4.2
## [28] cluster_2.1.8 R6_2.5.1 bslib_0.8.0
## [31] stringi_1.8.4 jquerylib_0.1.4 Rcpp_1.0.13-1
## [34] knitr_1.49 Matrix_1.7-1 igraph_2.1.2
## [37] tidyselect_1.2.1 abind_1.4-8 yaml_2.3.10
## [40] codetools_0.2-20 curl_6.0.1 lattice_0.22-6
## [43] tibble_3.2.1 plyr_1.8.9 withr_3.0.2
## [46] KEGGREST_1.47.0 evaluate_1.0.1 BiocFileCache_2.15.0
## [49] ExperimentHub_2.15.0 Biostrings_2.75.3 pillar_1.10.0
## [52] BiocManager_1.30.25 filelock_1.0.3 DT_0.33
## [55] ncdf4_1.23 BiocVersion_3.21.1 glue_1.8.0
## [58] lazyeval_0.2.2 maketools_1.3.1 tools_4.4.2
## [61] AnnotationHub_3.15.0 sys_3.4.3 mzR_2.41.1
## [64] buildtools_1.0.0 fs_1.6.5 grid_4.4.2
## [67] tidyr_1.3.1 crosstalk_1.2.1 MsCoreUtils_1.19.0
## [70] AnnotationDbi_1.69.0 GenomeInfoDbData_1.2.13 cli_3.6.3
## [73] rappdirs_0.3.3 S4Arrays_1.7.1 dplyr_1.1.4
## [76] AnnotationFilter_1.31.0 sass_0.4.9 digest_0.6.37
## [79] SparseArray_1.7.2 htmlwidgets_1.6.4 memoise_2.0.1
## [82] htmltools_0.5.8.1 lifecycle_1.0.4 httr_1.4.7
## [85] mime_0.12 bit64_4.5.2 MASS_7.3-61