English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

High Performance Computational Analysis of Large-scale Proteome Data Sets to Assess Incremental Contribution to Coverage of the Human Genome

MPS-Authors
/persons/resource/persons78449

Neuhauser,  Nadin
Mann, Matthias / Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Max Planck Society;

/persons/resource/persons78437

Nagaraj,  Nagarjuna
Mann, Matthias / Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Max Planck Society;

/persons/resource/persons78626

Scheltema,  Richard
Mann, Matthias / Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Max Planck Society;

/persons/resource/persons77870

Cox,  Juergen
Mann, Matthias / Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Max Planck Society;

/persons/resource/persons78356

Mann,  Matthias
Mann, Matthias / Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Neuhauser, N., Nagaraj, N., McHardy, P., Zanivan, S., Scheltema, R., Cox, J., et al. (2013). High Performance Computational Analysis of Large-scale Proteome Data Sets to Assess Incremental Contribution to Coverage of the Human Genome. JOURNAL OF PROTEOME RESEARCH, 12(6), 2858-2868. doi:10.1021/pr400181q.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-FD7E-8
Abstract
Computational analysis of shotgun proteomics data can now be performed in a completely automated and statistically rigorous way, as exemplified by the freely available MaxQuant environment. The sophisticated algorithms involved and the sheer amount of data translate into very high computational demands. Here we describe parallelization and memory optimization of the MaxQuant software with the aim of executing it on a large computer cluster. We analyze and mitigate bottlenecks in overall performance and find that the most time-consuming algorithms are those detecting peptide features in the MS' data as well as the fragment spectrum search. These tasks scale with the number of raw files and can readily be distributed over many CPUs as long as memory access is properly managed. Here we compared the performance of a parallelized version of MaxQuant running on a standard desktop, an I/O performance optimized desktop computer ("game computer"), and a cluster environment. The modified gaming computer and the cluster vastly outperformed a standard desktop computer when analyzing more than 1000 raw files. We apply our high performance platform to investigate incremental coverage of the human proteome by high resolution MS data originating from in-depth cell line and cancer tissue proteome measurements.