It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
The ability to computationally generate novel yet physically foldable protein structures could lead to new biological discoveries and new treatments targeting yet incurable diseases. Despite recent advances in protein structure prediction, directly generating diverse, novel protein structures from neural networks remains difficult. In this work, we present a diffusion-based generative model that generates protein backbone structures via a procedure inspired by the natural folding process. We describe a protein backbone structure as a sequence of angles capturing the relative orientation of the constituent backbone atoms, and generate structures by denoising from a random, unfolded state towards a stable folded structure. Not only does this mirror how proteins natively twist into energetically favorable conformations, the inherent shift and rotational invariance of this representation crucially alleviates the need for more complex equivariant networks. We train a denoising diffusion probabilistic model with a simple transformer backbone and demonstrate that our resulting model unconditionally generates highly realistic protein structures with complexity and structural patterns akin to those of naturally-occurring proteins. As a useful resource, we release an open-source codebase and trained models for protein structure diffusion.
The ability to engineer novel protein structures has tremendous scientific and therapeutic impact. Here, authors develop a generative model acting upon an angular representation of protein structures to create high quality protein backbones.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Stanford University, Department of Computer Science, Stanford, USA (GRID:grid.168010.e) (ISNI:0000 0004 1936 8956); Stanford University, Center for Personal Dynamic Regulomes, Stanford, USA (GRID:grid.168010.e) (ISNI:0000 0004 1936 8956); Stanford University School of Medicine, Department of Biomedical Data Science, Stanford, USA (GRID:grid.168010.e) (ISNI:0000000419368956)
2 Microsoft Research, Cambridge, USA (GRID:grid.24488.32) (ISNI:0000 0004 0503 404X)
3 Microsoft Research, Amsterdam, Netherlands (GRID:grid.24488.32)
4 Stanford University, Department of Computer Science, Stanford, USA (GRID:grid.168010.e) (ISNI:0000 0004 1936 8956); Stanford University School of Medicine, Department of Biomedical Data Science, Stanford, USA (GRID:grid.168010.e) (ISNI:0000000419368956)