|  | Determination of positioning accuracy of ligands in protein active sites

This protocol assesses how accurately ligands are positioned in protein active sites. Accuracy is measured by the root mean square deviation (RMSD) between docked ligand poses and the experimental ligand poses taken from the Protein Data Bank (PDB). The main difficulty of this validation is selecting a suitable test set of protein-ligand complexes. We selected complexes according to the following requirements: high-quality experimental structures with high crystallographic resolution, to exclude structures with missing amino acids and atoms; and diversity of ligands, from small ones (several dozen atoms) to large ones (more than one hundred atoms). Creating an original validation set was not our goal, so part of the complexes were taken from the test sets described in refs. 25 and 26. Metalloproteins were excluded from our set because the docking procedure does not work correctly for such complexes, owing to the complexity of calculating ligand-metal interactions. Complexes containing cofactors such as heme, ATP, NADP and others were excluded as well. In the end we selected 80 complexes, with experimental structures taken directly from the PDB (ref. 27). The complexes were prepared according to the rules described above (see the first protocol).

Preliminary investigations of the docking procedure of the SOL program showed that only 3 of its 20 control parameters substantially influence the docking success rate: the number of independent runs (NUMBER OF RUNS), the population size, i.e. the number of individuals (POPULATION SIZE), and the number of generations participating in global optimization (NUMBER OF GENERATIONS). During validation these parameters were set as follows: NUMBER OF RUNS: 50; POPULATION SIZE: 30000; NUMBER OF GENERATIONS: 500. 
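The RMSD criterion used throughout this protocol can be illustrated with a minimal sketch. It assumes that both poses list the same atoms in the same order and are expressed in the same receptor coordinate frame, so no superposition is applied. The function name `pose_rmsd` and the toy coordinates are hypothetical; the text does not state which atoms SOL or AutoDock include in the calculation, nor how symmetric ligands are treated.

```python
import math


def pose_rmsd(docked, experimental):
    """RMSD in Å between two poses given as lists of (x, y, z) tuples.

    Assumption: atoms are matched by position in the list and both poses
    share one coordinate frame, so no fitting is performed.
    """
    if len(docked) != len(experimental) or not docked:
        raise ValueError("poses must be non-empty and contain the same atoms")
    squared_sum = 0.0
    for (x1, y1, z1), (x2, y2, z2) in zip(docked, experimental):
        squared_sum += (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
    return math.sqrt(squared_sum / len(docked))


# Toy example (not real data): a two-atom "ligand" shifted by 1 Å along z.
experimental_pose = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)]
docked_pose = [(0.0, 0.0, 1.0), (1.5, 0.0, 1.0)]
print(pose_rmsd(docked_pose, experimental_pose))
```

Because no superposition is done, a rigid shift of the whole ligand counts fully toward the RMSD, which is the behaviour one normally wants when judging where a ligand sits in the active site.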
For comparison, AutoDock 3.05 was run on the same test set with the same number of independent runs (NUMBER OF RUNS = 50). The docking quality criterion for this validation protocol was the RMSD between the docked and experimental poses of the native ligands taken from the respective PDB complexes. Four levels of docking quality can be distinguished: RMSD < 1 Å, excellent; 1 Å < RMSD < 2 Å, good; 2 Å < RMSD < 3 Å, satisfactory; and RMSD > 3 Å, unsatisfactory. The results for the 80 complexes are presented in Table 2.

Table 2. PDB IDs of the calculated complexes, RMSD between the docked and experimental positions of the native ligands obtained with SOL and AutoDock 3.05, and the number of torsional degrees of freedom for heavy atoms of the respective ligands. The data are sorted by increasing RMSD obtained with SOL.  
            
              | PDB ID | RMSD by SOL, Å | RMSD by AutoDock, Å | N torsions of heavy atoms | PDB ID | RMSD by SOL, Å | RMSD by AutoDock, Å | N torsions of heavy atoms |  
              | 1pax | 0.31 | 0.57 | 0 | 1mor | 1.18 | 1.92 | 3 |  
              | 3pax | 0.31 | 0.8 | 2 | 1qkq | 1.22 | 4.83 | 1 |  
              | 1fh7 | 0.35 | 3.47 | 2 | 3kiv | 1.45 | 1.51 | 5 |  
              | 1a28 | 0.48 | 0.75 | 1 | 1k1j | 1.63 | 3.85 | 10 |  
              | 1ju4 | 0.5 | 1.38 | 1 | 1icm | 1.69 | 1.79 | 12 |  
              | 1exa | 0.5 | 0.78 | 5 | 1br6 | 1.85 | 2.76 | 4 |  
              | 1h70 | 0.51 | 0.64 | 6 | 1br5 | 1.85 | 1.13 | 3 |  
              | 1mq6 | 0.51 | 1.21 | 10 | 1lif | 1.87 | 1.12 | 16 |  
              | 1c83 | 0.55 | 0.66 | 4 | 1ezq | 1.87 | 2.34 | 11 |  
              | 4rsk | 0.55 | 4.26 | 4 | 2sak | 1.93 | 1.78 | 3 |  
              | 1abe | 0.57 | 1.5 | 0 | 2hmb | 2.04 | 4.06 | 14 |  
              | 1j01 | 0.59 | 3.63 | 2 | 1mq5 | 2.53 | 1.31 | 8 |  
              | 1ifu | 0.63 | 8.87 | 1 | 1l8g | 2.87 | 1.62 | 7 |  
              | 1jd3 | 0.64 | 0.56 | 1 | 1ppa | 2.93 | 6.07 | 0 |  
              | 2pax | 0.69 | 0.41 | 0 | 1art | 2.94 | 0.88 | 3 |  
              | 2dri | 0.7 | 0.57 | 0 | 1ifs | 3.14 | 1.05 | 0 |  
              | 1fm6 | 0.73 | 3.93 | 7 | 1nli | 3.14 | 2.09 | 0 |  
              | 1h1s | 0.73 | 1.49 | 6 | 1f4g | 3.17 | 2.61 | 14 |  
              | 1tsy | 0.74 | 6.49 | 4 | 3jdw | 3.43 | 4.08 | 4 |  
              | 1qpe | 0.74 | 0.51 | 2 | 1mrg | 3.44 | 0.78 | 0 |  
              | 3eng | 0.8 | 1.18 | 4 | 2cmd | 3.44 | 8.0 | 5 |  
              | 1ydr | 0.83 | 1.45 | 2 | 2enb | 3.48 | 2.34 | 6 |  
              | 1i7z | 0.86 | 3.44 | 5 | 1hi3 | 3.83 | 3.76 | 6 |  
              | 1fut | 0.88 | 1.17 | 4 | 1ikg | 4.13 | 0.82 | 15 |  
              | 1b9v | 0.92 | 1.37 | 8 | 1fao | 4.2 | 0.83 | 8 |  
              | 1hpv | 0.94 | 0.99 | 13 | 1rob | 4.39 | 4.64 | 4 |  
              | 1efy | 0.95 | 0.76 | 3 | 4dfr | 4.57 | 1.13 | 10 |  
              | 1h52 | 0.95 | 11.01 | 2 | 1oxp | 4.61 | 1.33 | 8 |  
              | 1mai | 0.96 | 1.09 | 6 | 1d6v | 4.62 | 3.59 | 6 |  
              | 1pot | 0.98 | 0.63 | 7 | 1pph | 4.73 | 3.49 | 8 |  
              | 1jgi | 0.99 | 1.64 | 5 | 1jj0 | 5.18 | 1.88 | 5 |  
              | 1lqd | 0.99 | 0.41 | 7 | 1akb | 5.75 | 1.69 | 9 |  
              | 1ane | 1.01 | 0.36 | 1 | 1a4k | 5.75 | 1.69 | 6 |  
              | 1afq | 1.02 | 3.06 | 11 | 1h1p | 6.06 | 4.12 | 3 |  
              | 2cgr | 1.05 | 1.12 | 10 | 1gor | 6.48 | 3.75 | 2 |  
              | 3ert | 1.09 | 1.6 | 10 | 1htf | 6.85 | 2.74 | 15 |  
              | 1flz | 1.1 | 7.56 | 0 | 1lzg | 7.73 | 4.47 | 8 |  
              | 1fkg | 1.14 | 1.27 | 12 | 1gc5 | 9.27 | 5.44 | 6 |  
              | 2ifb | 1.17 | 1.31 | 14 | 2ovw | 10.06 | 1.5 | 4 |  
              | 1ppc | 1.17 | 5.45 | 11 | 1lr4 | 13.64 | 9.14 | 1 |  

The data in Table 2 show that SOL docked 50 of the 80 native ligands with excellent or good quality (RMSD not exceeding 2 Å), while AutoDock 3.05 achieved this quality for 48 native ligands. Table 3 gives the number of complexes in each quality class and their share of the total number of complexes, in percent.

Table 3.  
            
              | RMSD, Å (quality criterion) | Number of complexes in each quality class |  | Relative number of complexes with excellent, good or satisfactory quality (RMSD < 3 Å, upper cell) and with unsatisfactory quality (RMSD > 3 Å, lower cell) |  |  
              |  | SOL | AutoDock | SOL | AutoDock |  
              | < 1 | 32 | 19 | 55 (68.7%) | 54 (67.5%) |  
              | 1 < RMSD < 2 | 18 | 29 |  |  |  
              | 2 < RMSD < 3 | 5 | 6 |  |  |  
              | > 3 | 25 | 26 | 25 (31.3%) | 26 (32.5%) |  

The comparison of docking quality for the two programs is illustrated in Fig. 2. The number of native ligands docked by SOL with RMSD below 1 Å (32) is about 1.7 times larger than the corresponding number for AutoDock 3.05 (19). For 1 Å < RMSD < 2 Å the situation is reversed (18 versus 29). We did not observe a correlation between the docking quality of SOL and AutoDock for the same complexes; that is, the RMSD for a given complex can differ greatly between the two programs. For example, for complex 1ifu SOL gives an RMSD of 0.63 Å, whereas AutoDock gives 8.87 Å. By the criterion RMSD < 1.5 Å, the docking quality of SOL is better than that of AutoDock. 
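The four quality classes used for Table 3 can be written as a small classifier. This is only an illustrative sketch: the function name `quality_class` is hypothetical, and because the text defines the classes with strict inequalities, assigning a boundary value (exactly 1, 2 or 3 Å) to the worse class is an assumption made here. The sample rows are copied from Table 2.

```python
from collections import Counter


def quality_class(rmsd):
    """Map an RMSD in Å to a docking quality class.

    Assumption: a value exactly on a boundary falls into the worse class.
    """
    if rmsd < 1.0:
        return "excellent"
    if rmsd < 2.0:
        return "good"
    if rmsd < 3.0:
        return "satisfactory"
    return "unsatisfactory"


# (PDB ID, RMSD by SOL, RMSD by AutoDock 3.05): a few rows from Table 2.
ROWS = [
    ("1pax", 0.31, 0.57),
    ("1fh7", 0.35, 3.47),
    ("1ifu", 0.63, 8.87),
    ("1mor", 1.18, 1.92),
    ("1mq5", 2.53, 1.31),
    ("1lr4", 13.64, 9.14),
]

# Count how many of the sample complexes fall into each class per program.
sol_counts = Counter(quality_class(sol) for _, sol, _ in ROWS)
autodock_counts = Counter(quality_class(ad) for _, _, ad in ROWS)

for pdb_id, sol, ad in ROWS:
    print(pdb_id, quality_class(sol), quality_class(ad))
```

Applied to all 80 rows of Table 2, the same counting would be expected to give the per-class totals listed in Table 3.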
  Figure 2. Quality of ligand positioning for the validation set with the docking programs SOL and AutoDock 3.05. The curves show the relative number of ligands (Y-axis) with RMSD below a given value (X-axis). |  |
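Three points on each cumulative curve of the kind shown in Fig. 2 can be recovered from the class counts in Table 3. The sketch below is an illustration, not the procedure used to draw the figure; the names `TABLE3_COUNTS` and `cumulative_fractions` are hypothetical.

```python
from itertools import accumulate

# Counts per quality class from Table 3, in the order <1, 1-2, 2-3, >3 Å.
TABLE3_COUNTS = {
    "SOL": [32, 18, 5, 25],
    "AutoDock 3.05": [19, 29, 6, 26],
}
THRESHOLDS = [1.0, 2.0, 3.0]  # RMSD thresholds in Å separating the classes


def cumulative_fractions(class_counts):
    """Fraction of complexes with RMSD below each class boundary."""
    total = sum(class_counts)
    running = list(accumulate(class_counts))
    # The last running total equals `total` (no upper RMSD bound), so drop it.
    return [count / total for count in running[:-1]]


for program, counts in TABLE3_COUNTS.items():
    for threshold, fraction in zip(THRESHOLDS, cumulative_fractions(counts)):
        print(f"{program}: fraction with RMSD < {threshold:.0f} Å = {fraction:.4f}")
```

A full curve would need the per-complex RMSD values from Table 2 evaluated on a finer grid of thresholds; the class counts only fix the curve at 1, 2 and 3 Å.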