Content area
In this work, the authors developed a new method for determination of binding areas in proteins and precomplex structure with allowance for the Brownian diffusion and electrostatic interactions of proteins that occur when proteins approach one another. This method significantly simplies subsequent precise simulation and prediction of the final complex structure. The Brownian dynamics method, which can be used for predicting the structure of protein complexes, considers the interaction of only two molecules in solution. A characteristic feature and novelty of the method, as is shown below, is the possibility to use it for studying interaction of several protein molecules simultaneously. This makes it possible to simulate the formation of a large number of complexes, which takes place in solution or cell compartments and to monitor the real-time kinetics of this process.
ISSN 1607-6729, Doklady Biochemistry and Biophysics, 2009, Vol. 427, pp. 215217. Pleiades Publishing, Ltd., 2009.
Original Russian Text I.B. Kovalenko, A.M. Abaturova, G.Yu. Riznichenko, A.B. Rubin, 2009, published in Doklady Akademii Nauk, 2009, Vol. 427, No. 5, pp. 696698.
BIOCHEMISTRY, BIOPHYSICS,
AND MOLECULAR BIOLOGY
A Novel Approach to Computer Simulation of ProteinProtein Complex Formation
I. B. Kovalenko, A. M. Abaturova, G. Yu. Riznichenko, and Corresponding Member of the RAS A. B. Rubin
Received April 2, 2009
DOI: 10.1134/S1607672909040127
The majority of biochemical processes are associated with the functioning of protein molecules and their complexes in the reactions of enzymatic catalysis and cell signaling. Predicting the structure of protein complexes by their simulation is a complex problem that remains largely unsolved.
The factors that play the key role in complex formation are as follows: the rate of protein diffusion to the docking site; long-range electrostatic interactions between protein surfaces, geometric and chemical complementarity of binding areas; molecular mobility at the proteinprotein interphase, hydrogen bonds, Van der Waals interactions, hydrophobic interactions, and salt bridges. It is known that different factors play different roles at different stages of complex formation[1]. Currently, there is no universal method for simulating protein complex formation that would make it possible to take into account all these factors and accurately predict the structure of protein complex [1, 2].
Molecular diffusion and long-range electrostatic interactions as well as molecule geometry play the decisive role in precomplex formation. The electro-static interactions signicantly accelerate the process of precomplex formation and thereby make it much more effective. If the geometrical correspondence of binding areas is established at the precomplex stage, this ensures the optimal relative position of two molecules prior to subsequent nal complex formation. The hydrophobic interaction, hydrogen bonds, and molecular mobility, in turn, play the key role in the conversion of the precompex into the nal complex [1].
In this work, we developed a new method for determination of binding areas in proteins and precomplex structure with allowance for the Brownian diffusion and electrostatic interactions of proteins that occur when proteins approach one another. This method sig-
nicantly simplies subsequent precise simulation and prediction of the nal complex structure. The Brownian dynamics method, which can be used for predicting the structure of protein complexes, considers the interaction of only two molecules in solution [35]. A characteristic feature and novelty of our method, as is shown below, is the possibility to use it for studying interaction of several protein molecules simultaneously. This makes it possible to simulate the formation of a large number of complexes, which takes place in solution or cell compartments and to monitor the real-time kinetics of this process.
In our method, the process of protein complex formation is conditionally divided into several stages: (1) Brownian diffusion of proteins to the docking site, (2) their approach due to electrostatic attraction forces between molecules, relative spatial position of molecules, and precomplex formation; and (3) nal complex formation. As is shown below, relative position of proteins in precomplexes, predicted on the basis of the computer model suggested, in most cases corresponds to their real orientation in nal complexes.
Method description. The proposed approach is based on direct computer simulation of diffusion and complex formation between mobile electron-transport proteins [68]. Simulation is performed in a virtual 3D cubical reaction volume containing randomly distributed protein molecules. Movement is described by the Langevin equation, which describes changes of each coordinate in time caused by random and outer forces:
where x is the coordinate along which movement is considered; x is the factor of viscous friction along this coordinate; and fx(t) and Fx are the projections of a random and electrostatic forces, respectively, on the abscissa axis. The random force fx(t) is distributed nor
mally with a zero mean and variance . Here, k is
xdx dt
------ f x t
= ( ) Fx,
+
Biological Faculty, Moscow State University, Moscow, 119991 Russia
2kTx t
---------------
215
216
KOVALENKO et al.
Z
140 160180
180 170 160 150 140Y
190
Z
120 190 180 Y
190
180
180
170
170
160
160
150
150
140
140
130
130
170 160 150 140180 160 140
X
The probability of barstar (on the left) and barnase (on the right) binding calculated on the basis of the binding probability model and represented as a probability distribution sphere. Dark circles correspond to the atoms belonging to the contacting amino acid residues revealed on the basis of X-ray data for the barnasebarstar complex. The sectors of the sphere surrounded by dark lines correspond to protein regions with a high binding probability (according to calculations for the model).
the Boltzmann constant, T is temperature, and t is the time increment (constant in our method). To calculate the viscous friction factors in the model, the protein molecule shape is approximated by an ellipsoid of revolution rather than by a sphere, which is commonly accepted in Brownian dynamics models [35].
Three-dimensional protein molecules are constructed on the basis of data extracted from the Protein Data Bank. To calculate protein collisions, the shape of proteins is described using a small number (10100) of spheres, which sufciently adequately reects the molecule surface for calculating collisions with other molecules and simultaneously reduces the calculation time compared to Brownian dynamics methods, in which the atomic resolution of the protein surface is used. The electrostatic interactions between proteins are taken into account when proteins approach one another to distances smaller than 35 . The protein is represented as an area with the dielectric constant = 2 and spatially distributed partial charges; for the surrounding solution, = 80. The electrostatic eld created by the charges on the protein surface was calculated using the PoissonBoltzmann equation, which made it possible to take into account different values of dielectric permittivity of proteins and solution.
In this model, the movement and interaction of several hundreds of protein molecules is simulated, which provides an opportunity to directly monitor the protein interaction kinetics taking into account simultaneous interaction of several molecules and to study precomplex formation depending on the geometrical size and shape of the reaction volume. We divided the set of relative positions of two proteins into paired disjoint sub-
sets corresponding to a 12 step in the angle of rotation of one molecule relative to the other one in a spherical coordinate system (gure). These sectors, obtained as a result of small-step division, contain only one or two amino acid residues located on the molecule surface. The objective of simulation is to nd those sectors and corresponding amino acids that approach one another most closely as a result of diffusion and electrostatic interaction of proteins (i.e., to calculate the probability of approach of various amino acids and select those of them for which this probability is the highest). The movement and interaction of 100 pairs of protein molecules in 1-ms time interval at a constant step of 100 ps was simulated in all numerical experiments with the use of the computer model in a cubical reaction volume of 70 70 70 nm.
To test the proposed method, we used pairs of proteins for which the structures of complexes are known from experiments. If a precomplex is not electrostatically optimal, the nal complex determined in the model will signicantly differ from the nal complex of the test proteins in solution. In total, we studied eight pairs of proteins. For seven of these pairs (barnase barstar, colicin E9DNaseimmune protein 9, acetylcholinesterasefasciculin 2, thrombinthrombomodulin, erythropoietinerythropoietin-binding protein, interleukin 4interleukin 4 receptor, and plastocyanin cytochrome f), the binding areas predicted by simulation corresponded to the experimentally determine areas. Only for one pair (colicin E3 RNaseimmune protein 3), these areas did not coincide. The results of simulation for barnase and barstar molecules are shown in the gure, in which the frequently and rarely
DOKLADY BIOCHEMISTRY AND BIOPHYSICS Vol. 427 2009
A NOVEL APPROACH TO COMPUTER SIMULATION OF PROTEINPROTEIN COMPLEX 217
approaching areas of the proteins are shown in dark and light gray, respectively. As seen in the gure, the areas with a high binding probability, calculated on the basis of the model (the sectors contoured with dark lines) are located opposite to the experimentally determined binding areas (dark circles designating atoms), i.e., correspond to the experimental data. In the model, the predicted precomplex structures form as a result of electro-static interactions and geometrical complementarity of binding areas. Apparently, in the vast majority of cases, the electrostatic interactions between approaching proteins with the highest probability ensure the relative position of molecules that is most advantageous for their subsequent binding and formation of a procomplex that is then converted into the nal complex.
Thus, we have developed a procedure for computer simulation of proteinprotein complex formation, which takes into account simultaneous diffusion and interaction of several hundreds of protein molecules.The surface of the protein molecule is approximated by a set of spheres; the shape, by the ellipsoid of revolution. The electric eld of surface charges is calculated with allowance for different values of the dielectric constant for proteins and water. Using eight pairs of proteins as an example, we showed that this method makes it possible to predict with a sufcient accuracy the binding areas and structure of the precomplex of protein molecules and is sufciently precise to simulate the process of complex formation. The constructed model makes it possible to simulate protein interaction at different distribution of charges on proteins and different ionic strength and pH of medium and, therefore,
to predict the binding areas for various proteins at various conditions.
ACKNOWLEDGMENTSWe are grateful to G. Smith and J. Smith for discussing the structural peculiarities of the protein complexes studied. This study was supported by the Russian Foundation for Basic Research (project nos. 07-04-00375 and 08-04-00354).
REFERENCES1. Kleanthous, C., ProteinProtein Recognition, New York: Oxford Univ. Press, 2000.
2. Tramontano, A., The Ten Most Wanted Solutions in Protein Bioinformatics, Boca Raton: CRC, 2005.
3. Northrup, S.H., Luton, J.A., Boles, J.O., and Reynolds, J.C.L., J. Computer-Aided Mol. Design, 1987, vol. 1, pp. 291311.
4. Gross, E.L. and Pearson, D.C., Jr., Biophys. J., 2003, vol. 85, pp. 20552068.
5. Spaar, A., Dammer, C., Gabdoulline, R., et al., Biophys.J., 2006, vol. 90, no. 6, pp. 19131924.6. Kovalenko, I.B., Abaturova, A.M., Gromov, P.A., et al., Phys. Biol., 2006, vol. 3, pp. 121129.
7. Kovalenko, I.B., Abaturova, A.M., Grachev, E.A., et al., Biozika, 2008, vol. 53, no. 2, pp. 261270.
8. Kovalenko, I.B., Diakonova, A.N., Abaturova, A.M., and Riznichenko, G.Yu., in Proc. Flavins and Flavoproteins Int. Symp., Prensas Univ. de Zaragoza, 2008, pp. 437 442.
DOKLADY BIOCHEMISTRY AND BIOPHYSICS Vol. 427 2009
Pleiades Publishing, Ltd. 2009