Difference between revisions of "Example: Absolute Binding Affinity"
Line 75: | Line 75: | ||
In order to resolve these problems, one of the most standard solutions is to harmonically restrain the ghost state ligand to be in a certain location described by the geometry of the protein. If this is done, you will have to remove the harmonic restraint with the analytical correction shown below for the pulling method before making the transfer to the solvent only system. | In order to resolve these problems, one of the most standard solutions is to harmonically restrain the ghost state ligand to be in a certain location described by the geometry of the protein. If this is done, you will have to remove the harmonic restraint with the analytical correction shown below for the pulling method before making the transfer to the solvent only system. | ||
− | This restraint is different from the pulling method described below though. In the pulling method, the ligand is restricted to a point in space, whereas here, the ligand is confined to a geometric point in the cavity defined by the protein, which will be as close to the average bound location as possible. Although you can get away with simple translational restrictions, there is strong evidence that compuationally efficiency will improve by restraining the orientational configuration of the ligand as well. | + | This restraint is different from the pulling method described below though. In the pulling method, the ligand is restricted to a point in space, whereas here, the ligand is confined to a geometric point in the cavity defined by the protein, which will be as close to the average bound location as possible. Although you can get away with simple translational restrictions, there is strong evidence that compuationally efficiency will improve by restraining the orientational configuration of the ligand as well.{{cite|Boresh2003|Boresch, S., Tettinger, F., Leitgeb, M., and Karplus, M. (2003) Absolute binding free energies: A quantitative approach for their calculation. ''J. Phys. Chem. A'' 107, 9535–9551.|http://www.citeulike.org/group/14929/article/9052098}}{{cite|Mobley2006|Mobley, D. L., Chodera, J. D., and Dill, K. A. (2006) On the use of orientational restraints and symmetry corrections in alchemical free energy calculations. ''J. Chem. Phys.'' 125, 084902.|http://www.citeulike.org/group/14929/article/9052440}}. It should be noted that the free state should not have the restraints and that the restraints are added along intermediaries as well. Rotational and translational degrees of freedom only need to be sampled as the restraints are turned on, instead of at every intermediate. This is by no means the only restraint scheme, just one example. |
==[[Intermediate_States#Pulling_Methods | Pulling Pathway]]== | ==[[Intermediate_States#Pulling_Methods | Pulling Pathway]]== |
Revision as of 14:23, 26 October 2012
Free Energy Fundamentals |
---|
Methods of Free Energy Simulations
|
Free Energy How-to's |
---|
This example will be different still from the other two examples in that this will touch on both the pulling method and the alchemical decoupling. Either of these methods are acceptable and are the most useful for this system.
For this particular example, we will look at the absolute binding free energies of toluene in the apolar cavity of T4 Lysozyme. Furthermore, since we are also doing a solvation coupling step, the solvated environments will be visually emphasized.
What is the Thermodynamic Cycle?
For our simulation, we can use either the pulling method or the alchemical path. We will explore each one individually.
Alchemical Pathway
The alchemical cycle starts with the ligand bound to the protein to make up the complex fully interacting with the environment. We first start by decoupling the ligand from the surrounding environment, both protein and solvent. Note that we do not annihilate the ligand by turning off intramolecular interactions, just intermolecular ones. We can then transfer this "ghost" molecule from the solvent box with protein to a solvent box with no protein. This transfer has a zero free energy cost since the ligand is not interacting at all. We then recouple the ligand with its new solvent box and complete the cycle. Remember, we do not want to remove the ligand from the universe, just its interactions with the protein.
This cycle is shown below where the light blue background represents the solvent environment. This can be done in either implicit or explicit solvent but is shown implicitly to avoid clutter.
The alchemical pathway does suffer from a few problems. Firstly, as you change the interactions, the ligand has a chance of wandering off into infinity or at the least through the entirety simulation box; the ligand could also get stuck in different parts of the protein in the near-"ghost" states. Both of these actions can lead to significant correlation times and making accurate free energy calculations difficult. Furthermore, we do not collect any information about the standard state of the system (the universal constant that makes true absolute free energy calculations possible) resulting in any equilibrium constants being defined only up to the standard state.
In order to resolve these problems, one of the most standard solutions is to harmonically restrain the ghost state ligand to be in a certain location described by the geometry of the protein. If this is done, you will have to remove the harmonic restraint with the analytical correction shown below for the pulling method before making the transfer to the solvent only system.
This restraint is different from the pulling method described below though. In the pulling method, the ligand is restricted to a point in space, whereas here, the ligand is confined to a geometric point in the cavity defined by the protein, which will be as close to the average bound location as possible. Although you can get away with simple translational restrictions, there is strong evidence that compuationally efficiency will improve by restraining the orientational configuration of the ligand as well.[1][2]. It should be noted that the free state should not have the restraints and that the restraints are added along intermediaries as well. Rotational and translational degrees of freedom only need to be sampled as the restraints are turned on, instead of at every intermediate. This is by no means the only restraint scheme, just one example.
Pulling Pathway
The pulling pathway starts with the same configuration of the ligand bound to the protein. We then add a harmonic restraint to the center of mass of the ligand and move the center of the restraint away from the pocket over a series of intermediates. After the ligand is sufficiently far away, we turn off the harmonic restraints. If we are far enough away, there are effectively no interactions with the protein and ligand and the two can be treated as isolated in their own solvent boxes.
It should be noted that the above diagram is a crude representation of the pulling method and is designed to serve only as a visual cue to how the pulling method works.
|
| ||||||
|
| ||||||
|
The effect of the harmonic constraint can be removed analytically by calculating
- [math]\displaystyle{ \displaystyle \Delta A_{unconstrain}^{solvent} = kT\ln\left[\frac{3}{V}\left(\frac{\pi kT}{K}\right)^{3/2}\right] }[/math]
where [math]\displaystyle{ K }[/math] is the force constant of the spring, and [math]\displaystyle{ V }[/math] is the volume of the reference state. The average volume per molecule for a reference concentration of [math]\displaystyle{ 1 M }[/math] is 1.661 x 103 Å3/molecule.
Using this method does have its drawbacks as you will need a very large simulation box and the pathway the ligand should take is not always intuitive. For instance, this particular system has a burred binding site, making the path very difficult to define without running into large repulsion barriers.
What are the End States?
There are two real end states we need here. Since the T4 lysozyme and solvent are not changing in either case, they will be present in both end states.
- State A) T4 lysozyme in solvent with an intermediate molecule that looks like toluene in the pocket.
- State B) T4 lysozyme in solvent with an intermediate molecule that looks like phenol in the pocket.
[[Image:Toluene_Phenol_Topology.png.png|right|thumb|300px|Different topology choices. Even though the intermolecular forces will be for the real atoms, there are still dummy intramolecular forces. Please see [[the full page on topologies for a more in depth description.]] What does it mean to have an "intermediate state look like" a molecule as an end state? This intermediate state could well be just toluene or phenol, however, there will still be the dummy atoms on it based on the topology choice. Recall from the figure on the right that, even though the interacting molecule will be a physical molecule, there are still the dummy interactions that must be built in and considered.
For this example, a dual state topology is shown on the right, but it is by no means the only set of end states one could chose; an ortho- or meta- arrangement could just as easily be selected with the dual topology scheme. There is also not a limitation on just dual topologies as single topologies will also work in any of the arrangements.
Although beyond the scope of this example, it is also worth mentioning there is also research ongoing in predicting how a real molecule will behave and the end point will not have the full interactions of a real molecule, but some unphysical intermediate. The free energies of the real molecule are extrapolated from these results. This is particularly helpful for doing more than one transformation as you would not need a unique set of intermediate states for each pair of real molecule end points.
What are the Intermediate States?
For this example, we will assume that the phenol is the initial state, although we could have just as easily assumed it was the toluene. Observe that the OH moiety must disappear and the methyl must appear to make the transformation complete. A good first approach sample set of intermediates can be constructed that follow this process:
- Turn the charge on the OH and the para- hydrogen to zero. Recall it is important to make sure the overall system is at the same total charge at each intermediate.
- Next, you will want to turn the Lennard-Jones [math]\displaystyle{ \epsilon }[/math] on the hydroxyl and para- hydrogen turned to zero.
- You can simultaneously turn off angle, torsional, and bonded terms linearly.
- Now turn on the methyl's Lennard-Jones [math]\displaystyle{ \epsilon }[/math] and its corresponding para- hydrogen.
- The corresponding angle, torsional, and bonded terms will also need turned on.
- Lastly, turn on the charges of the methyl and corresponding para- hydrogen.
Independent of which pathway you select, you will still need a sufficient number of intermediates. By choosing BAR or MBAR as our method, we may need 2-4 intermediates for the charge and 4-6 for the Lennard-Jones changes. This is less than the ethanol solvation because there are fewer atoms changing, corresponding to better phase space overlap. TI would take roughly 2-3 times the number of intermediates. You could run both Lennard-Jones transformations at once, but you would have to first disable improper torsions first otherwise the atoms will collide with each other.
What Simulations to Run?
As with all free energy methods, start by equilibrating all intermediate states. Unlike in the solvation example though, the starting configuration is more complex since we are in a binding site as opposed to a homogeneous fluid.
The ideal starting point would be a crystal structure configuration, or a homologous ligand to which the target ligand can be approximated as without distorting either protein or ligand. When the binding site is not know, accurate free energies are not easy to obtain, if at all, and docking is not suitable for picking a single "best" binding site. That said, if the binding site is known but no crystal structure is available, then docking can be used to generate a set of potential starting configurations, You will need several nanoseconds (tens of ns ideally) to see if the starting positions interconvert. If they do not, you may have to run multiple simulations for each binding site.[3]
After starting configurations are selected, the best option would be to start from a fully interacting state and run short simulations at each [math]\displaystyle{ \lambda }[/math] values to allow partial equilibration, then run long equilibrations at each [math]\displaystyle{ \lambda }[/math] state at constant pressure to find the equilibrium densities. For the example here, you will need 2-4 ns of equilibration because correlation times are guaranteed to be longer than the small molecule solvation test. Even if one has a crystal structure, you should test multiple starting configurations (from multiple crystal structures if possible) to see if the ligands all sample the same conformational states. [3][4] This is done because different starting configurations may have very different confrontational space they sample and so you will want multiple runs to access the largest space. [4]
Box size should again be large enough for the protein and ligand to not interact with their respective selves, which means the box's width should be twice the cutoff plus the longest width of the collective protein/ligand width. Given the size of this example, around 50 ns of simulation may be required to get consistent results (solvent representation will also affect this). At the time of writing (Oct 2012), even with relatively rigid proteins, getting results that have uncertainties less than 0.5 kcal/mol is very challenging.
What Analysis do we Perform?
For a more diverse example, we will assume we want to analyze this system by TI instead of BAR or MBAR. For this, we can usually assume that {{#tag:math|{{dudl} }} is calculated and printed at each step. If it were not calculated and we were using soft core transformations, TI would be impossible to calculate in post-processing. Because we already have the derivatives, all we have to do is average them for a given state, then perform numeric integration and error estimations by the methods discussed previously.
Relative binding free energy is then the difference between the two transformations.
Catches, Traps, and Anything to Watch out for
Visualize what your system is doing by either observing trajectories or tracking some position properties like RMSD to ensure odd things are not happening. If for instance you are trying to bind p-xylene to this pocket, you would expect the Val111 side chain to change orientation. [5]
More generally though, very little was said about the protein for our example. One can think of the protein as just another environment that the change is occurring within. One difference that occasionally has some relevance is the location of the binding state. If the ligand is a tight binder, it will always remain very localized around the binding site, making binding site irrelevant. As for a weak binding ligand ([math]\displaystyle{ K_d }[/math] 100 µM), comparatively more time is spend outside the binding site, making the identification of a "bound" configuration very hard to define. Scientists who run simulations and those that run experiments both suffer from this difficult definition though, so you will not be alone in this regard. For weak binders, careful comparison of exactly the physical phenomena leading to signalling binding must be exported and getting quantitative results will be difficult (e.g. the Val111 and p-xylene example). For most problems of interest, determining the precise binding affinities of weak binders is not important, but the ability to distinguish between a tight and weak binder will be.
Lastly, starting simulations from the different configurations will provide information about convergence if they all converge to the same point.
References
- ↑ Boresch, S., Tettinger, F., Leitgeb, M., and Karplus, M. (2003) Absolute binding free energies: A quantitative approach for their calculation. J. Phys. Chem. A 107, 9535–9551. - Find at Cite-U-Like
- ↑ Mobley, D. L., Chodera, J. D., and Dill, K. A. (2006) On the use of orientational restraints and symmetry corrections in alchemical free energy calculations. J. Chem. Phys. 125, 084902. - Find at Cite-U-Like
- ↑ 3.0 3.1 Roux, B., and Faraldo-Gómez, J. D. (2007) Characterization of conformational equilibria through Hamiltonian and temperature replica-exchange simulations: Assessing entropic and environmental effects. J. Comput. Chem. 28, 1634–1647. - Find at Cite-U-Like
- ↑ 4.0 4.1 Andersen, H. C. (1980) Molecular dynamics simulations at constant pressure and/or temperature. J. Chem. Phys. 72, 2384–2393. - Find at Cite-U-Like
- ↑ Jiang, W., and Roux, B. (2010) Free Energy Perturbation Hamiltonian Replica-Exchange Molecular Dynamics (FEP/H-REMD) for Absolute Ligand Binding Free Energy Calculations. J. Chem. Theory Comput. 6 (9), 2559–2565. - Find at Cite-U-Like