• Media type: E-Article
  • Title: A proteomics sample metadata representation for multiomics integration and big data analysis
  • Contributor: Dai, Chengxin; Füllgrabe, Anja; Pfeuffer, Julianus; Solovyeva, Elizaveta M.; Deng, Jingwen; Moreno, Pablo; Kamatchinathan, Selvakumar; Kundu, Deepti Jaiswal; George, Nancy; Fexova, Silvie; Grüning, Björn; Föll, Melanie Christine; Griss, Johannes; Vaudel, Marc; Audain, Enrique; Locard-Paulet, Marie; Turewicz, Michael; Eisenacher, Martin; Uszkoreit, Julian; Van Den Bossche, Tim; Schwämmle, Veit; Webel, Henry; Schulze, Stefan; Bouyssié, David; [...]
  • Published: Springer Science and Business Media LLC, 2021
  • Published in: Nature Communications, 12 (2021) 1
  • Language: English
  • DOI: 10.1038/s41467-021-26111-3
  • ISSN: 2041-1723
  • Origination:
  • Footnote:
  • Description: AbstractThe amount of public proteomics data is rapidly increasing but there is no standardized format to describe the sample metadata and their relationship with the dataset files in a way that fully supports their understanding or reanalysis. Here we propose to develop the transcriptomics data format MAGE-TAB into a standard representation for proteomics sample metadata. We implement MAGE-TAB-Proteomics in a crowdsourcing project to manually curate over 200 public datasets. We also describe tools and libraries to validate and submit sample metadata-related information to the PRIDE repository. We expect that these developments will improve the reproducibility and facilitate the reanalysis and integration of public proteomics datasets.
  • Access State: Open Access