|
Introduction to RESOLVE
Although density modification (solvent flattening, non-crystallographic symmetry, phase extension, histogram matching, etc.) has been a very powerful tool, its potential is much greater than has been achieved so far. There are two reasons for this:
Problems with the phase recombination approach to density modification.
RESOLVE uses a statistical approach to density modification, while other methods use an approach in which a map is modified to meet expectations and the new phases are recombined with experimental phases. For the mathematical details, see the references for RESOLVE . You might also wish to see the discussion and extensions in Kevin Cowtan's article "Gaussian Likelihoods in real and reciprocal space" in the CCP4 newsletter.
Principal problems with the phase recombination method
What is the optimal relative weighting of modified and experimental phases? | Incorrect relative weighting means that the
final results will not be optimal
Incorrect weighting terms mean that the final figures of merit are almost always inflated |
When do you stop iterating? | In some approaches the maps initially get better, then get worse unless you stop |
Density modification can be thought of as a way to adjust crystallographic phases (or amplitudes) to make them simultaneously consistent with the experimental data and with our expectations of what an electron density map should look like. The statistical approach is a mathematical way to formulate this statement. By using this formulation, the weighting factors and problems with convergence are taken care of automatically.
In RESOLVE, any set of structure factor amplitudes and phases has an associated probability composed of two simple parts:
Probability of a set of phases (and amplitudes)
The probability of the experimental phases | This is the probability that you would have observed your experimental data if this set of phases (and amplitudes) were correct |
The lprobability of the map | This is the probability that the electron density map calculated from this set of phases is drawn from the set of plausible electron density maps for this structure |
RESOLVE adjusts your crystallographic phases so as to maximize the total (posterier) probability of those phases. The mathematics is a little complicated but the idea is very simple. To see the mathematics in detail, have a look at references for RESOLVE .
Note on terminology: The approach used by resolve is now called "Statistical density modification," a name suggested by Kevin Cowtan. It used to be called "Maximum-likelihood density modification", using the term "likelihood" in a colloquial sense of probability. The old name (as pointed out by Gerard Bricogne and others) is confusing because the maximum-likelihood method is a specific technique that uses a specific definition of "likelihood" that is not used in this approach. Sorry to all for the confusion, and hoping that it will now be more clear. The mathematics remains exactly the same.
Using all the available information for density modification
Density modification is usually thought of as a process that is carried out on an experimental electron density map prior to model building, but iterative model-building methods such as ARP/wARP can also be thought of as density modification techniques. With the statistical approach, partial model information can be seamlessly incorporated into the total expression for the probability of the phases. This allows a hierachical approach to incorporating information about phase probability:
Types of information that can be used in statistical density modification
Experimental phases (if available) |
Low-resolution structural information (solvent boundary) |
Non-crystallographic symmetry |
Partial model information (molecular replacement or model building) |
Full atomic model information |
The current version of RESOLVE can incorporate all of these types of information.
Electron density maps obtained using phases calculated from atomic models often show peaks at the coordinates of atoms in the models, even when those atoms are incorrectly placed. This effect can be reduced by careful weighting such as can be accomplished by Randy Read's SIGMAA approach, but it cannot be eliminated unless the phases are changed.
Prime-and-switch phasing is a way to remove model bias by using statistical density modification, but without including the phase information coming from the model once an initial map has been calculated.
The basic procedure is simple:
The initial biased phase information from the model is required
to get the procedure going. The final phases are essentially unbiased
by the model because they are based on the features of the map, not on
the prior phase probabilities.
The final phases are generally improved the most when:
Non-crystallographic symmetry is an important source of information about the probabiltiy of an electron density map. RESOLVE can begin with transformation matrices and an estimate of the center-of-mass of molecule 1 that you input. RESOLVE can also figure out the transformations and center-of-mass automatically from the NCS in heavy-atom sites in a PDB file (if the default file "ha.pdb" exists and you don't specify NCS transformations, RESOLVE will try to find the NCS in those sites).
RESOLVE uses NCS information in the following way.
After the completion of density modification, RESOLVE builds a model of your structure. For versions 2.02 and higher, the model needs sequence information from you. You specify a file with the keyword "seq_file" and RESOLVE expects a sequence of amino acids in 1-letter format. If there are more than one type of chain, RESOLVE expects them separated by a line containing ">>>". . Typically RESOLVE can build 70-90% of the residues for a good map at 2-3 A resolution. You can tell if the model is correct by noting how good the match is to the sequence and by noting the NCS correspondence among chains (if NCS exists). The PDB file that RESOLVE writes out will have the model and also as HETATM records at the end with the heavy atom sites from SOLVE output file ha.pdb.
RESOLVE builds a model in the following way.
On all other cycles (except every n_loop'th cycle)..RESOLVE takes the most recent unique refined models and create a density map ("image") everywhere the models have atoms. RESOLVE applies statistical density modification starting with current best phases, the image, but no prior probabilities. RESOLVE then builds a model, the model is refined with refmac5, and the model is then extended and rebuilt.
On every n_loop'th cycle, the same process is carried out except RESOLVE uses the average of the past 20 models to create the composite density map.
This process is repeated (typically 100 cycles)