It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
In common with other omics technologies, mass spectrometry (MS)-based proteomics produces ever-increasing amounts of raw data, making efficient analysis a principal challenge. A plethora of different computational tools can process the MS data to derive peptide and protein identification and quantification. However, during the last years there has been dramatic progress in computer science, including collaboration tools that have transformed research and industry. To leverage these advances, we develop AlphaPept, a Python-based open-source framework for efficient processing of large high-resolution MS data sets. Numba for just-in-time compilation on CPU and GPU achieves hundred-fold speed improvements. AlphaPept uses the Python scientific stack of highly optimized packages, reducing the code base to domain-specific tasks while accessing the latest advances. We provide an easy on-ramp for community contributions through the concept of literate programming, implemented in Jupyter Notebooks. Large datasets can rapidly be processed as shown by the analysis of hundreds of proteomes in minutes per file, many-fold faster than acquisition. AlphaPept can be used to build automated processing pipelines with web-serving functionality and compatibility with downstream analysis tools. It provides easy access via one-click installation, a modular Python library for advanced users, and via an open GitHub repository for developers.
Mass spectrometry-based proteomics faces the challenge of processing vast data amounts. Here, the authors introduce AlphaPept, an open-source, Python-based framework that offers high speed analysis and easy integration for large-scale proteome analysis.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details






1 Max Planck Institute of Biochemistry, Department of Proteomics and Signal Transduction, Martinsried, Germany (GRID:grid.418615.f) (ISNI:0000 0004 0491 845X); University of Copenhagen, NNF Center for Protein Research, Faculty of Health Sciences, Copenhagen, Denmark (GRID:grid.5254.6) (ISNI:0000 0001 0674 042X)
2 Max Planck Institute of Biochemistry, Department of Proteomics and Signal Transduction, Martinsried, Germany (GRID:grid.418615.f) (ISNI:0000 0004 0491 845X)
3 Nvidia Corporation, Santa Clara, USA (GRID:grid.451133.1) (ISNI:0000 0004 0458 4453)
4 Max Planck Institute of Biochemistry, Department of Proteomics and Signal Transduction, Martinsried, Germany (GRID:grid.418615.f) (ISNI:0000 0004 0491 845X); Jena University Hospital, Functional Proteomics, Jena, Germany (GRID:grid.275559.9) (ISNI:0000 0000 8517 6224)