<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://wiki.docking.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Mysinger</id>
	<title>DISI - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="http://wiki.docking.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Mysinger"/>
	<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Special:Contributions/Mysinger"/>
	<updated>2026-05-24T00:44:25Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.39.1</generator>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4125</id>
		<title>Preparing the protein</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4125"/>
		<updated>2012-05-10T21:44:01Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: switch to solvmap_sev&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Preparing the protein=&lt;br /&gt;
&lt;br /&gt;
Items which are prefixed with &#039;AH&#039; are relevant for docking [[HEI]]s to amidohydrolases and can safely be ignored for most metal-free proteins. &lt;br /&gt;
&lt;br /&gt;
==Modifying the PDB file==&lt;br /&gt;
 &lt;br /&gt;
*prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt;  by removing all lines that do not commence with &#039;ATOM&#039;, all columns to the right of the z-coordinate and the TER statements.  &lt;br /&gt;
*treat all selenomethionines (MSE) as methionines (MET) by replacing the selenium atom (SE&amp;amp;curren;) with sulphur (&amp;amp;curren;SD). Be careful about the correct alignment!  &lt;br /&gt;
*atom enumeration does not matter, so don&#039;t bother to renumber after any of the following steps. Unique numbers are a good idea, presumably.  &lt;br /&gt;
*select the protonation states of HIS residues to be either &amp;amp;delta;- (rename residue to HID), &amp;amp;epsilon;- (rename residue to HIE) or doubly protonated (rename residue to HIP). HIS on the surface should be HIP. HIS residues coordinating the metal ions should have their protons pointing away from the ions. Base your decision on the immediate environment of the HIS residue: are there potential hydrogen bonds that can be formed?; are there charged residues close by?; would a certain protonation lead to clashes with other residues?; etc.&lt;br /&gt;
*AH: the carboxylated LYS of subtype I is CYK, but this is not tolerated by &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt; , so store and delete the 3 surplus atoms and call the residue LYS.  &lt;br /&gt;
*AH: the more buried metal ion is ZB (charge 1.4), the other one ZA (charge 1.3). Atom names are right-aligned!&lt;br /&gt;
&lt;br /&gt;
==Running startdockblaster5==&lt;br /&gt;
   &lt;br /&gt;
*generate the file &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; , which should only contain atoms of the MMFF. Prepare it in the same way as above: remove all columns to the right of the z-coordinate and the TER statements. Change HETATM to ATOM.&lt;br /&gt;
*generate the files &amp;lt;tt&amp;gt;.only_spheres&amp;lt;/tt&amp;gt; and &amp;amp;ndash; in case you would like the matching spheres to be based on the heavy atoms in &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; &amp;amp;ndash; &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; and write `on&#039; to the latter. Be careful not to add blank lines at the end, this will not be understood by &amp;lt;tt&amp;gt;makespheres2.pl&amp;lt;/tt&amp;gt; . In any case, the entry in &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt;  will be ignored by &amp;lt;tt&amp;gt;makespheres1.pl&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*on sgehead (or, as of [[dock67]], on any machine), run &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  to set up the data structure and copy all relevant files. It is a good idea to use csh and to &amp;lt;tt&amp;gt;source .login&amp;lt;/tt&amp;gt; beforehand.  &lt;br /&gt;
*if &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  doesn&#039;t finish for any obvious reason and with no clear error message, or &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; has very funny hydrogen placements, make sure that you have no non-printing characters in &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt;. Do that by running your file through &amp;lt;tt&amp;gt;pc2unix rec.pdb&amp;lt;/tt&amp;gt;. Check that your file is clean by looking at it with &amp;lt;tt&amp;gt;od -c rec.pdb | less &amp;lt;/tt&amp;gt;. The only character with a backslash should be \n &amp;amp;mdash; you should see no \t, \r, etc. If this doesn&#039;t solve the problem, your best bet is to re-prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; from scratch &amp;amp;mdash; it is likely that there are some blanks or hidden characters that are causing the problems.  &lt;br /&gt;
*Take any WARNING messages emitted seriously, and continue only if you know why each one is there. Furthermore, verify that &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; still contains &#039;&#039;all&#039;&#039; atoms.  &lt;br /&gt;
*if you do not want to do anything special with the protein, like tarting some residues or modifying the spheres, go directly to chapter [[Running DOCK|3]].&lt;br /&gt;
&lt;br /&gt;
==Removing and modifying files==&lt;br /&gt;
   &lt;br /&gt;
*go to &amp;lt;tt&amp;gt;./grids&amp;lt;/tt&amp;gt;  and remove the surplus files from this directory (some would cause error messages from the subsequent programs):&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;rm -f PDBPARM chem.* rec+sph.phi solvmap_sev tart.txt OUT*&amp;lt;/tt&amp;gt;&lt;br /&gt;
*modify &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;: &lt;br /&gt;
**AH: CYK: put the three missing atoms, delete the surplus hydrogens specific for LYS and rename the carboxylated lysine residue CYK.  &lt;br /&gt;
**remove all TER statements that might have been added.  &lt;br /&gt;
**AH: set the atom names of the metal ions to ZA and ZB and the residue name to ZN.  &lt;br /&gt;
**take care of disulfide bonds. Remove the thiol hydrogens (if they have been added) and change the residue name from CYS to CYX.  &lt;br /&gt;
*look at the &amp;lt;tt&amp;gt;box&amp;lt;/tt&amp;gt; and maybe move it, so that the ligands won&#039;t stick out. Modify the &#039;center&#039; and &#039;coordinates&#039; statement in the preamble.  &lt;br /&gt;
*all residues and atoms have to be listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;vdw.parms.amb.mindock&amp;lt;/tt&amp;gt;, respectively &amp;amp;rArr; do not tart any residues in this file!&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[chemgrid]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt; and check &amp;lt;tt&amp;gt;OUTPARM&amp;lt;/tt&amp;gt; for the correct van der Waals parameters of all residues.  &lt;br /&gt;
*grep for &amp;lt;tt&amp;gt;0.000&amp;lt;/tt&amp;gt;  in &amp;lt;tt&amp;gt;PDBPARM&amp;lt;/tt&amp;gt;: if any atom has this value in the 3&amp;lt;sup&amp;gt;rd&amp;lt;/sup&amp;gt; and 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt; column, it has not been recognized by &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  (because it is not listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt;) and is thus &#039;&#039;ignored&#039;&#039; in the van der Waals-maps. There will be no other errors, the docking will finish showing some &amp;quot;bumping&amp;quot; ligands which have extremely favorable energies (&amp;amp;le; -200).&lt;br /&gt;
*Another sign of a problem with atomic radii are any &#039;WARNING&#039;s issued in OUTPARM&lt;br /&gt;
*if one has to run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  again, first remove &amp;lt;tt&amp;gt;PDBPARM OUTPARM OUTCHEM&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;chem.*&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Tarting the protein==&lt;br /&gt;
   &lt;br /&gt;
*cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and continue with the latter file.&lt;br /&gt;
* tarted residues can be found in &amp;lt;tt&amp;gt;$DOCK_BASE/scripts/grids&amp;lt;/tt&amp;gt;, they are the files with the extension &amp;lt;tt&amp;gt;prot2&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* add the relevant resides to the bottom of your &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; file, being very precise to match the current formatting&lt;br /&gt;
* generate the new &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; from the edited &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; using:&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;$mud/prot2crg.py &amp;lt; prot.table.ambcrg.ambH &amp;gt; amb.crg.oxt&amp;lt;/tt&amp;gt;&lt;br /&gt;
*AH: select the appropriate version of &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; depending on the subtype. Files are called &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;, where &amp;lt;tt&amp;gt;N&amp;lt;/tt&amp;gt; can be &amp;lt;tt&amp;gt;I, III&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;VI&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*AH: edit the residues in the binding site (i.e., all residues complexing the metal ions in the binding site), so that their names conform to the names of the modified residues in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  &lt;br /&gt;
*optionally tart the residues that are in contact with a crystallographic ligand, if any.  &lt;br /&gt;
*AH: check that ZA and ZB, respectively (left-aligned in the atom column), have corresponding entries in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;vdw.siz&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Modifying the Delphi spheres==&lt;br /&gt;
   &lt;br /&gt;
*load &amp;lt;tt&amp;gt;match1.sph.pdb&amp;lt;/tt&amp;gt; (i.e., the DelPhi spheres).  &lt;br /&gt;
*delete the spheres that are too close to the solvent.  &lt;br /&gt;
*(AH:) add spheres so that there is one sphere &#039;&#039;between&#039;&#039; the metals, several spheres &#039;&#039;around&#039;&#039; the metals and some spheres close to polar residues.  &lt;br /&gt;
*a good number for DelPhi spheres is 120.  &lt;br /&gt;
*append the spheres to the end of &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to make &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and put a TER statement after each sphere. Don&#039;t use tabs for whitespace, can cause problems with DelPhi!  &lt;br /&gt;
&lt;br /&gt;
==Modifying the Matching spheres==&lt;br /&gt;
&lt;br /&gt;
*load &amp;lt;tt&amp;gt;match2.sph.pdb&amp;lt;/tt&amp;gt; for sparse initial spheres or &amp;lt;tt&amp;gt;match3.sph.pdb&amp;lt;/tt&amp;gt; denser spheres.&lt;br /&gt;
*If you selected &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; be careful not to move any spheres based on the ligand atoms.  &lt;br /&gt;
*(AH:) put at least one sphere between the metals and increase the sampling in the region around the metal ions by putting some spheres there.&lt;br /&gt;
*a good number for matching spheres is 50-60.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;pdbtosph matchN.sph.pdb mysph.sph&amp;lt;/tt&amp;gt; to generate the files that will be read by [[DOCK]].   &lt;br /&gt;
*if color matching is desired, run &amp;lt;tt&amp;gt;colorspheres.pl sph/match2.sph&amp;lt;/tt&amp;gt; in the parent directory of the docking run (i.e., &amp;lt;tt&amp;gt;..&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;sph&amp;lt;/tt&amp;gt; ) to put some color on your spheres.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;cat $mud/header.sph match2.sph&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[http://bcr.musc.edu/manuals/delphi.htm DelPhi]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
&lt;br /&gt;
*if you changed rec+sph.crg above, you need to run Delphi   &lt;br /&gt;
*if necessary, modify &amp;lt;tt&amp;gt;delphi.com&amp;lt;/tt&amp;gt; so that all the paths and file names are appropriate.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;delphi.com &amp;gt; delphi.log&amp;lt;/tt&amp;gt;  and check the output.&lt;br /&gt;
*any &#039;WARNING&#039; in the log is an indication that some atomic charges might not be correct.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[newsolv.sev]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*if you changed rec.crg or the box above, you need to run newsolv.sev   &lt;br /&gt;
*check that all atoms are present in &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; and run &amp;lt;tt&amp;gt;newsolv.sev&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4124</id>
		<title>Preparing the protein</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4124"/>
		<updated>2012-05-10T21:42:58Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: change solvmap to newsolv.sev in 2 places&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Preparing the protein=&lt;br /&gt;
&lt;br /&gt;
Items which are prefixed with &#039;AH&#039; are relevant for docking [[HEI]]s to amidohydrolases and can safely be ignored for most metal-free proteins. &lt;br /&gt;
&lt;br /&gt;
==Modifying the PDB file==&lt;br /&gt;
 &lt;br /&gt;
*prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt;  by removing all lines that do not commence with &#039;ATOM&#039;, all columns to the right of the z-coordinate and the TER statements.  &lt;br /&gt;
*treat all selenomethionines (MSE) as methionines (MET) by replacing the selenium atom (SE&amp;amp;curren;) with sulphur (&amp;amp;curren;SD). Be careful about the correct alignment!  &lt;br /&gt;
*atom enumeration does not matter, so don&#039;t bother to renumber after any of the following steps. Unique numbers are a good idea, presumably.  &lt;br /&gt;
*select the protonation states of HIS residues to be either &amp;amp;delta;- (rename residue to HID), &amp;amp;epsilon;- (rename residue to HIE) or doubly protonated (rename residue to HIP). HIS on the surface should be HIP. HIS residues coordinating the metal ions should have their protons pointing away from the ions. Base your decision on the immediate environment of the HIS residue: are there potential hydrogen bonds that can be formed?; are there charged residues close by?; would a certain protonation lead to clashes with other residues?; etc.&lt;br /&gt;
*AH: the carboxylated LYS of subtype I is CYK, but this is not tolerated by &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt; , so store and delete the 3 surplus atoms and call the residue LYS.  &lt;br /&gt;
*AH: the more buried metal ion is ZB (charge 1.4), the other one ZA (charge 1.3). Atom names are right-aligned!&lt;br /&gt;
&lt;br /&gt;
==Running startdockblaster5==&lt;br /&gt;
   &lt;br /&gt;
*generate the file &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; , which should only contain atoms of the MMFF. Prepare it in the same way as above: remove all columns to the right of the z-coordinate and the TER statements. Change HETATM to ATOM.&lt;br /&gt;
*generate the files &amp;lt;tt&amp;gt;.only_spheres&amp;lt;/tt&amp;gt; and &amp;amp;ndash; in case you would like the matching spheres to be based on the heavy atoms in &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; &amp;amp;ndash; &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; and write `on&#039; to the latter. Be careful not to add blank lines at the end, this will not be understood by &amp;lt;tt&amp;gt;makespheres2.pl&amp;lt;/tt&amp;gt; . In any case, the entry in &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt;  will be ignored by &amp;lt;tt&amp;gt;makespheres1.pl&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*on sgehead (or, as of [[dock67]], on any machine), run &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  to set up the data structure and copy all relevant files. It is a good idea to use csh and to &amp;lt;tt&amp;gt;source .login&amp;lt;/tt&amp;gt; beforehand.  &lt;br /&gt;
*if &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  doesn&#039;t finish for any obvious reason and with no clear error message, or &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; has very funny hydrogen placements, make sure that you have no non-printing characters in &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt;. Do that by running your file through &amp;lt;tt&amp;gt;pc2unix rec.pdb&amp;lt;/tt&amp;gt;. Check that your file is clean by looking at it with &amp;lt;tt&amp;gt;od -c rec.pdb | less &amp;lt;/tt&amp;gt;. The only character with a backslash should be \n &amp;amp;mdash; you should see no \t, \r, etc. If this doesn&#039;t solve the problem, your best bet is to re-prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; from scratch &amp;amp;mdash; it is likely that there are some blanks or hidden characters that are causing the problems.  &lt;br /&gt;
*Take any WARNING messages emitted seriously, and continue only if you know why each one is there. Furthermore, verify that &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; still contains &#039;&#039;all&#039;&#039; atoms.  &lt;br /&gt;
*if you do not want to do anything special with the protein, like tarting some residues or modifying the spheres, go directly to chapter [[Running DOCK|3]].&lt;br /&gt;
&lt;br /&gt;
==Removing and modifying files==&lt;br /&gt;
   &lt;br /&gt;
*go to &amp;lt;tt&amp;gt;./grids&amp;lt;/tt&amp;gt;  and remove the surplus files from this directory (some would cause error messages from the subsequent programs):&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;rm -f PDBPARM chem.* rec+sph.phi solvmap tart.txt OUT*&amp;lt;/tt&amp;gt;&lt;br /&gt;
*modify &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;: &lt;br /&gt;
**AH: CYK: put the three missing atoms, delete the surplus hydrogens specific for LYS and rename the carboxylated lysine residue CYK.  &lt;br /&gt;
**remove all TER statements that might have been added.  &lt;br /&gt;
**AH: set the atom names of the metal ions to ZA and ZB and the residue name to ZN.  &lt;br /&gt;
**take care of disulfide bonds. Remove the thiol hydrogens (if they have been added) and change the residue name from CYS to CYX.  &lt;br /&gt;
*look at the &amp;lt;tt&amp;gt;box&amp;lt;/tt&amp;gt; and maybe move it, so that the ligands won&#039;t stick out. Modify the &#039;center&#039; and &#039;coordinates&#039; statement in the preamble.  &lt;br /&gt;
*all residues and atoms have to be listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;vdw.parms.amb.mindock&amp;lt;/tt&amp;gt;, respectively &amp;amp;rArr; do not tart any residues in this file!    &lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[chemgrid]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt; and check &amp;lt;tt&amp;gt;OUTPARM&amp;lt;/tt&amp;gt; for the correct van der Waals parameters of all residues.  &lt;br /&gt;
*grep for &amp;lt;tt&amp;gt;0.000&amp;lt;/tt&amp;gt;  in &amp;lt;tt&amp;gt;PDBPARM&amp;lt;/tt&amp;gt;: if any atom has this value in the 3&amp;lt;sup&amp;gt;rd&amp;lt;/sup&amp;gt; and 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt; column, it has not been recognized by &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  (because it is not listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt;) and is thus &#039;&#039;ignored&#039;&#039; in the van der Waals-maps. There will be no other errors, the docking will finish showing some &amp;quot;bumping&amp;quot; ligands which have extremely favorable energies (&amp;amp;le; -200).&lt;br /&gt;
*Another sign of a problem with atomic radii are any &#039;WARNING&#039;s issued in OUTPARM&lt;br /&gt;
*if one has to run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  again, first remove &amp;lt;tt&amp;gt;PDBPARM OUTPARM OUTCHEM&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;chem.*&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Tarting the protein==&lt;br /&gt;
   &lt;br /&gt;
*cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and continue with the latter file.&lt;br /&gt;
* tarted residues can be found in &amp;lt;tt&amp;gt;$DOCK_BASE/scripts/grids&amp;lt;/tt&amp;gt;, they are the files with the extension &amp;lt;tt&amp;gt;prot2&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* add the relevant resides to the bottom of your &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; file, being very precise to match the current formatting&lt;br /&gt;
* generate the new &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; from the edited &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; using:&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;$mud/prot2crg.py &amp;lt; prot.table.ambcrg.ambH &amp;gt; amb.crg.oxt&amp;lt;/tt&amp;gt;&lt;br /&gt;
*AH: select the appropriate version of &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; depending on the subtype. Files are called &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;, where &amp;lt;tt&amp;gt;N&amp;lt;/tt&amp;gt; can be &amp;lt;tt&amp;gt;I, III&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;VI&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*AH: edit the residues in the binding site (i.e., all residues complexing the metal ions in the binding site), so that their names conform to the names of the modified residues in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  &lt;br /&gt;
*optionally tart the residues that are in contact with a crystallographic ligand, if any.  &lt;br /&gt;
*AH: check that ZA and ZB, respectively (left-aligned in the atom column), have corresponding entries in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;vdw.siz&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Modifying the Delphi spheres==&lt;br /&gt;
   &lt;br /&gt;
*load &amp;lt;tt&amp;gt;match1.sph.pdb&amp;lt;/tt&amp;gt; (i.e., the DelPhi spheres).  &lt;br /&gt;
*delete the spheres that are too close to the solvent.  &lt;br /&gt;
*(AH:) add spheres so that there is one sphere &#039;&#039;between&#039;&#039; the metals, several spheres &#039;&#039;around&#039;&#039; the metals and some spheres close to polar residues.  &lt;br /&gt;
*a good number for DelPhi spheres is 120.  &lt;br /&gt;
*append the spheres to the end of &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to make &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and put a TER statement after each sphere. Don&#039;t use tabs for whitespace, can cause problems with DelPhi!  &lt;br /&gt;
&lt;br /&gt;
==Modifying the Matching spheres==&lt;br /&gt;
&lt;br /&gt;
*load &amp;lt;tt&amp;gt;match2.sph.pdb&amp;lt;/tt&amp;gt; for sparse initial spheres or &amp;lt;tt&amp;gt;match3.sph.pdb&amp;lt;/tt&amp;gt; denser spheres.&lt;br /&gt;
*If you selected &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; be careful not to move any spheres based on the ligand atoms.  &lt;br /&gt;
*(AH:) put at least one sphere between the metals and increase the sampling in the region around the metal ions by putting some spheres there.&lt;br /&gt;
*a good number for matching spheres is 50-60.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;pdbtosph matchN.sph.pdb mysph.sph&amp;lt;/tt&amp;gt; to generate the files that will be read by [[DOCK]].   &lt;br /&gt;
*if color matching is desired, run &amp;lt;tt&amp;gt;colorspheres.pl sph/match2.sph&amp;lt;/tt&amp;gt; in the parent directory of the docking run (i.e., &amp;lt;tt&amp;gt;..&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;sph&amp;lt;/tt&amp;gt; ) to put some color on your spheres.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;cat $mud/header.sph match2.sph&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[http://bcr.musc.edu/manuals/delphi.htm DelPhi]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
&lt;br /&gt;
*if you changed rec+sph.crg above, you need to run Delphi   &lt;br /&gt;
*if necessary, modify &amp;lt;tt&amp;gt;delphi.com&amp;lt;/tt&amp;gt; so that all the paths and file names are appropriate.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;delphi.com &amp;gt; delphi.log&amp;lt;/tt&amp;gt;  and check the output.&lt;br /&gt;
*any &#039;WARNING&#039; in the log is an indication that some atomic charges might not be correct.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[newsolv.sev]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*if you changed rec.crg or the box above, you need to run newsolv.sev   &lt;br /&gt;
*check that all atoms are present in &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; and run &amp;lt;tt&amp;gt;newsolv.sev&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=ZINC_processing_pipeline&amp;diff=4909</id>
		<title>ZINC processing pipeline</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=ZINC_processing_pipeline&amp;diff=4909"/>
		<updated>2012-03-15T20:52:04Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: Adjust current ring puckering settings&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Each molecule in ZINC is processed via our ZINC processing pipeline.  This process is embodied in a set of scripts that we continue to refine as we discover problems.&lt;br /&gt;
&lt;br /&gt;
Frankly, we hope people will simply use ZINC rather than trying to reproduce it. Still, in the interests of clarity, transparency, truth, justice and the Canadian Way (TM), here is our current protocol.&lt;br /&gt;
&lt;br /&gt;
* 1. If you have 2D SDF, convert it to isomeric SMILES.&lt;br /&gt;
&lt;br /&gt;
* 2. sed -e &#039;s/N=S=N/nsn/g&#039; 2.ism &amp;gt; 2-out.ism &lt;br /&gt;
&lt;br /&gt;
* 3. Use molinspiration mitools/mib to eliminate broken SMILES:&lt;br /&gt;
 java -jar /raid1/soft/mitools/mib.jar -singlepart -onlyOrganic -normalizeCharges -f $1 -out smi&lt;br /&gt;
&lt;br /&gt;
* 4. Use OEChem to remove molecules with problematic functional groups:  &lt;br /&gt;
 filter.py rules.txt 4.ism 4-out.ism  &amp;gt; filterlog.txt&lt;br /&gt;
see http://blaster.docking.org/filtering/rules_default.txt for current rules.&lt;br /&gt;
&lt;br /&gt;
* 5. select only 4 of stereochemical expansions from previous step. We just take the first 4, but you can imagine better ways of making the selection.&lt;br /&gt;
&lt;br /&gt;
* 6. get rid of bogus stereochemistry at nitrogen:&lt;br /&gt;
 sed -e &#039;s/\[N@\]/N/g&#039; -e &#039;s/\[N@@\]/N/g&#039; -e &#039;s/\[N@H+\]/\[NH+\]/g&#039; -e &#039;s/\[N@@H+\]/\[NH+\]/g&#039; -e &#039;s/\[N@@+\]/\[N+\]/g&#039; -e &#039;s/\[N@+\]/\[N+\]/g&#039; $1 &amp;gt;  d.ism&lt;br /&gt;
&lt;br /&gt;
* 7. If the molecule is already in ZINC, eliminate it from the list.&lt;br /&gt;
&lt;br /&gt;
* 8. Generate trial 3D structure with corina.&lt;br /&gt;
 corina -d neu,wh,rc,mc=1,canon -i t=smiles -o t=sdf &amp;lt; 1a.ism &amp;gt; 2.sdf&lt;br /&gt;
&lt;br /&gt;
* 9. generate reference pH state using Schrodinger&#039;s Epik. &lt;br /&gt;
 epik -ph 7.05 -ms 1 -imae A.mae -omae B.mae -WAIT&lt;br /&gt;
&lt;br /&gt;
* 10. generate mid, hi and lo pH subsets&lt;br /&gt;
 mid: setenv EPIK &amp;quot;-ph 7.0 -pht 1 -tp 0.20&amp;quot;&lt;br /&gt;
 hi: setenv EPIK &amp;quot;-ph 8.5 -pht 0.75 -tp 0.20&amp;quot;&lt;br /&gt;
 lo:  setenv EPIK &amp;quot;-ph 5.5 -pht 0.75 -tp 0.20&amp;quot;&lt;br /&gt;
 epik $EPIK -imae A.mae -omae B.mae -WAIT&lt;br /&gt;
&lt;br /&gt;
* 11. For each subset (ref, mid, hi, lo) use Corina to generate 3D model of the relevant protonated state.&lt;br /&gt;
 corina -d rc,flapn,de=6,mc=4 -i t=mol2 -o t=mol2&lt;br /&gt;
&lt;br /&gt;
That&#039;s really it.  There is more to do with loading ZINC, but to generate the models, that is what we think you need to know.  Good luck!&lt;br /&gt;
&lt;br /&gt;
-- John Irwin. March 2009.&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4123</id>
		<title>Preparing the protein</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4123"/>
		<updated>2012-01-25T23:39:57Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Running &amp;lt;tt&amp;gt;solvmap&amp;lt;/tt&amp;gt; */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Preparing the protein=&lt;br /&gt;
&lt;br /&gt;
Items which are prefixed with &#039;AH&#039; are relevant for docking [[HEI]]s to amidohydrolases and can safely be ignored for most metal-free proteins. &lt;br /&gt;
&lt;br /&gt;
==Modifying the PDB file==&lt;br /&gt;
 &lt;br /&gt;
*prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt;  by removing all lines that do not commence with &#039;ATOM&#039;, all columns to the right of the z-coordinate and the TER statements.  &lt;br /&gt;
*treat all selenomethionines (MSE) as methionines (MET) by replacing the selenium atom (SE&amp;amp;curren;) with sulphur (&amp;amp;curren;SD). Be careful about the correct alignment!  &lt;br /&gt;
*atom enumeration does not matter, so don&#039;t bother to renumber after any of the following steps. Unique numbers are a good idea, presumably.  &lt;br /&gt;
*select the protonation states of HIS residues to be either &amp;amp;delta;- (rename residue to HID), &amp;amp;epsilon;- (rename residue to HIE) or doubly protonated (rename residue to HIP). HIS on the surface should be HIP. HIS residues coordinating the metal ions should have their protons pointing away from the ions. Base your decision on the immediate environment of the HIS residue: are there potential hydrogen bonds that can be formed?; are there charged residues close by?; would a certain protonation lead to clashes with other residues?; etc.&lt;br /&gt;
*AH: the carboxylated LYS of subtype I is CYK, but this is not tolerated by &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt; , so store and delete the 3 surplus atoms and call the residue LYS.  &lt;br /&gt;
*AH: the more buried metal ion is ZB (charge 1.4), the other one ZA (charge 1.3). Atom names are right-aligned!&lt;br /&gt;
&lt;br /&gt;
==Running startdockblaster5==&lt;br /&gt;
   &lt;br /&gt;
*generate the file &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; , which should only contain atoms of the MMFF. Prepare it in the same way as above: remove all columns to the right of the z-coordinate and the TER statements. Change HETATM to ATOM.&lt;br /&gt;
*generate the files &amp;lt;tt&amp;gt;.only_spheres&amp;lt;/tt&amp;gt; and &amp;amp;ndash; in case you would like the matching spheres to be based on the heavy atoms in &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; &amp;amp;ndash; &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; and write `on&#039; to the latter. Be careful not to add blank lines at the end, this will not be understood by &amp;lt;tt&amp;gt;makespheres2.pl&amp;lt;/tt&amp;gt; . In any case, the entry in &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt;  will be ignored by &amp;lt;tt&amp;gt;makespheres1.pl&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*on sgehead (or, as of [[dock67]], on any machine), run &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  to set up the data structure and copy all relevant files. It is a good idea to use csh and to &amp;lt;tt&amp;gt;source .login&amp;lt;/tt&amp;gt; beforehand.  &lt;br /&gt;
*if &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  doesn&#039;t finish for any obvious reason and with no clear error message, or &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; has very funny hydrogen placements, make sure that you have no non-printing characters in &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt;. Do that by running your file through &amp;lt;tt&amp;gt;pc2unix rec.pdb&amp;lt;/tt&amp;gt;. Check that your file is clean by looking at it with &amp;lt;tt&amp;gt;od -c rec.pdb | less &amp;lt;/tt&amp;gt;. The only character with a backslash should be \n &amp;amp;mdash; you should see no \t, \r, etc. If this doesn&#039;t solve the problem, your best bet is to re-prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; from scratch &amp;amp;mdash; it is likely that there are some blanks or hidden characters that are causing the problems.  &lt;br /&gt;
*Take any WARNING messages emitted seriously, and continue only if you know why each one is there. Furthermore, verify that &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; still contains &#039;&#039;all&#039;&#039; atoms.  &lt;br /&gt;
*if you do not want to do anything special with the protein, like tarting some residues or modifying the spheres, go directly to chapter [[Running DOCK|3]].&lt;br /&gt;
&lt;br /&gt;
==Removing and modifying files==&lt;br /&gt;
   &lt;br /&gt;
*go to &amp;lt;tt&amp;gt;./grids&amp;lt;/tt&amp;gt;  and remove the surplus files from this directory (some would cause error messages from the subsequent programs):&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;rm -f PDBPARM chem.* rec+sph.phi solvmap tart.txt OUT*&amp;lt;/tt&amp;gt;&lt;br /&gt;
*modify &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;: &lt;br /&gt;
**AH: CYK: put the three missing atoms, delete the surplus hydrogens specific for LYS and rename the carboxylated lysine residue CYK.  &lt;br /&gt;
**remove all TER statements that might have been added.  &lt;br /&gt;
**AH: set the atom names of the metal ions to ZA and ZB and the residue name to ZN.  &lt;br /&gt;
**take care of disulfide bonds. Remove the thiol hydrogens (if they have been added) and change the residue name from CYS to CYX.  &lt;br /&gt;
*look at the &amp;lt;tt&amp;gt;box&amp;lt;/tt&amp;gt; and maybe move it, so that the ligands won&#039;t stick out. Modify the &#039;center&#039; and &#039;coordinates&#039; statement in the preamble.  &lt;br /&gt;
*all residues and atoms have to be listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;vdw.parms.amb.mindock&amp;lt;/tt&amp;gt;, respectively &amp;amp;rArr; do not tart any residues in this file!    &lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[chemgrid]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt; and check &amp;lt;tt&amp;gt;OUTPARM&amp;lt;/tt&amp;gt; for the correct van der Waals parameters of all residues.  &lt;br /&gt;
*grep for &amp;lt;tt&amp;gt;0.000&amp;lt;/tt&amp;gt;  in &amp;lt;tt&amp;gt;PDBPARM&amp;lt;/tt&amp;gt;: if any atom has this value in the 3&amp;lt;sup&amp;gt;rd&amp;lt;/sup&amp;gt; and 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt; column, it has not been recognized by &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  (because it is not listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt;) and is thus &#039;&#039;ignored&#039;&#039; in the van der Waals-maps. There will be no other errors, the docking will finish showing some &amp;quot;bumping&amp;quot; ligands which have extremely favorable energies (&amp;amp;le; -200).&lt;br /&gt;
*Another sign of a problem with atomic radii are any &#039;WARNING&#039;s issued in OUTPARM&lt;br /&gt;
*if one has to run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  again, first remove &amp;lt;tt&amp;gt;PDBPARM OUTPARM OUTCHEM&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;chem.*&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Tarting the protein==&lt;br /&gt;
   &lt;br /&gt;
*cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and continue with the latter file.&lt;br /&gt;
* tarted residues can be found in &amp;lt;tt&amp;gt;$DOCK_BASE/scripts/grids&amp;lt;/tt&amp;gt;, they are the files with the extension &amp;lt;tt&amp;gt;prot2&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* add the relevant resides to the bottom of your &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; file, being very precise to match the current formatting&lt;br /&gt;
* generate the new &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; from the edited &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; using:&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;$mud/prot2crg.py &amp;lt; prot.table.ambcrg.ambH &amp;gt; amb.crg.oxt&amp;lt;/tt&amp;gt;&lt;br /&gt;
*AH: select the appropriate version of &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; depending on the subtype. Files are called &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;, where &amp;lt;tt&amp;gt;N&amp;lt;/tt&amp;gt; can be &amp;lt;tt&amp;gt;I, III&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;VI&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*AH: edit the residues in the binding site (i.e., all residues complexing the metal ions in the binding site), so that their names conform to the names of the modified residues in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  &lt;br /&gt;
*optionally tart the residues that are in contact with a crystallographic ligand, if any.  &lt;br /&gt;
*AH: check that ZA and ZB, respectively (left-aligned in the atom column), have corresponding entries in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;vdw.siz&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Modifying the Delphi spheres==&lt;br /&gt;
   &lt;br /&gt;
*load &amp;lt;tt&amp;gt;match1.sph.pdb&amp;lt;/tt&amp;gt; (i.e., the DelPhi spheres).  &lt;br /&gt;
*delete the spheres that are too close to the solvent.  &lt;br /&gt;
*(AH:) add spheres so that there is one sphere &#039;&#039;between&#039;&#039; the metals, several spheres &#039;&#039;around&#039;&#039; the metals and some spheres close to polar residues.  &lt;br /&gt;
*a good number for DelPhi spheres is 120.  &lt;br /&gt;
*append the spheres to the end of &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to make &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and put a TER statement after each sphere. Don&#039;t use tabs for whitespace, can cause problems with DelPhi!  &lt;br /&gt;
&lt;br /&gt;
==Modifying the Matching spheres==&lt;br /&gt;
&lt;br /&gt;
*load &amp;lt;tt&amp;gt;match2.sph.pdb&amp;lt;/tt&amp;gt; for sparse initial spheres or &amp;lt;tt&amp;gt;match3.sph.pdb&amp;lt;/tt&amp;gt; denser spheres.&lt;br /&gt;
*If you selected &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; be careful not to move any spheres based on the ligand atoms.  &lt;br /&gt;
*(AH:) put at least one sphere between the metals and increase the sampling in the region around the metal ions by putting some spheres there.&lt;br /&gt;
*a good number for matching spheres is 50-60.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;pdbtosph matchN.sph.pdb mysph.sph&amp;lt;/tt&amp;gt; to generate the files that will be read by [[DOCK]].   &lt;br /&gt;
*if color matching is desired, run &amp;lt;tt&amp;gt;colorspheres.pl sph/match2.sph&amp;lt;/tt&amp;gt; in the parent directory of the docking run (i.e., &amp;lt;tt&amp;gt;..&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;sph&amp;lt;/tt&amp;gt; ) to put some color on your spheres.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;cat $mud/header.sph match2.sph&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[http://bcr.musc.edu/manuals/delphi.htm DelPhi]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
&lt;br /&gt;
*if you changed rec+sph.crg above, you need to run Delphi   &lt;br /&gt;
*if necessary, modify &amp;lt;tt&amp;gt;delphi.com&amp;lt;/tt&amp;gt; so that all the paths and file names are appropriate.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;delphi.com &amp;gt; delphi.log&amp;lt;/tt&amp;gt;  and check the output.&lt;br /&gt;
*any &#039;WARNING&#039; in the log is an indication that some atomic charges might not be correct.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[solvmap]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*if you changed rec.crg or the box above, you need to run solvmap   &lt;br /&gt;
*check that all atoms are present in &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; and run &amp;lt;tt&amp;gt;newsolv.sev&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=INDOCK_for_DOCK_3.6&amp;diff=3356</id>
		<title>INDOCK for DOCK 3.6</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=INDOCK_for_DOCK_3.6&amp;diff=3356"/>
		<updated>2011-10-26T20:45:32Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: change file date to match sphere change&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;What follows is a documented sample INDOCK file for [[DOCK 3.6]]. Many lines are required, lines starting with # are comments.&lt;br /&gt;
&lt;br /&gt;
Required first line:&lt;br /&gt;
&lt;br /&gt;
 DOCK 3.5 parameter&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ################## DOCK 3.5 INPUT PARAMETERS 2011/10/26 #######################&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                          INPUT/OUTPUT&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
This is the path to the receptor matching spheres file. Most scripts make a set of directories and copy the INDOCK file into them, so this path sometimes has an extra set of &amp;quot;../&amp;quot; in it compared to what you might think. If you use [[DOCK Blaster]]. Generally, match3 has more spheres than match2, so produces more possible orientations. These spheres are matched to ligand spheres, generated from heavy atoms in the &amp;quot;rigid component&amp;quot; of each ligand. For more about the rigid component, see [[Flexibase Format]].&lt;br /&gt;
&lt;br /&gt;
 receptor_sphere_file          ../../sph/match2.sph&lt;br /&gt;
&lt;br /&gt;
The next line is always 1, and is marked for deprecation.&lt;br /&gt;
&lt;br /&gt;
 cluster_numbers               1&lt;br /&gt;
&lt;br /&gt;
The next line refers to which ligand file to use. If using many of the automated scripts, split_database_index is used, as this allows many ligand files (or just 1) to be placed in the split_database_index file and read in one after another during a DOCK run. If docking small things on your own, you can change this to any file.&lt;br /&gt;
&lt;br /&gt;
 # NOTE: split_database_index is reserved to specify a list of files&lt;br /&gt;
 ligand_atom_file              split_database_index&lt;br /&gt;
&lt;br /&gt;
This will control the file output, again many of the automated scripts expect it to be test. OUTDOCK files are always named OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
 output_file_prefix            test.&lt;br /&gt;
&lt;br /&gt;
This controls the random seed used in the minimization procedure. Changing this will produce slightly different results.&lt;br /&gt;
&lt;br /&gt;
 random_seed                   777&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             MATCHING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
distance_tolerance is how different the distances can be between a pair of receptor matching spheres and a pair of ligand matching spheres for them to still be considered matched.&lt;br /&gt;
&lt;br /&gt;
 distance_tolerance            1.5&lt;br /&gt;
&lt;br /&gt;
This changes how many spheres must be matched to generate an orientation. 3 as a minimum, 4 as a maximum is generally accepted as the right thing to use. Less than 3 is too degenerate to generate an actual orientation, and requiring more than 4 matched spheres does not work well, since we only use heavy atoms in ring systems to generate ligand matching spheres.&lt;br /&gt;
&lt;br /&gt;
 nodes_maximum                 4&lt;br /&gt;
 nodes_minimum                 3&lt;br /&gt;
&lt;br /&gt;
The next 4 parameters control how the histograms of distance differences are generated. The binsize is how big the bins are, the overlap controls if a sphere can be put into multiple bins. The ligand &amp;amp; receptor parameters are not required to be the same. &lt;br /&gt;
&lt;br /&gt;
 ligand_binsize                0.4&lt;br /&gt;
 ligand_overlap                0.2&lt;br /&gt;
 receptor_binsize              0.4&lt;br /&gt;
 receptor_overlap              0.2&lt;br /&gt;
&lt;br /&gt;
Bumping is using a quick check of distances when placing ligand atoms in the binding site to determine if they have a steric clash. The maximum is how many can be &#039;bumped&#039; or in close steric contact per rigid or flexible component of the ligand, as per the [[Flexibase Format]]. Even ligands with some steric clashes can sometimes be rescued by minimization. Setting this number very high will cause many clashed orientations to be scored, which can be prohibitively slow.&lt;br /&gt;
&lt;br /&gt;
 bump_maximum                  1&lt;br /&gt;
&lt;br /&gt;
The next four parameters are unused and unsupported.&lt;br /&gt;
&lt;br /&gt;
 focus_cycles                  0&lt;br /&gt;
 focus_bump                    0 &lt;br /&gt;
 focus_type                    energy&lt;br /&gt;
 critical_clusters             no&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             COLORING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
This controls whether chemical matching or coloring is used at all. If yes, many match lines are necessary. These may not be perfect, but [[DOCK Blaster]] has been using these for a long time. Setting this to no produces many more matched orientations, which can be slow, but can help you understand exactly what the energy function is doing.&lt;br /&gt;
&lt;br /&gt;
 chemical_matching             yes&lt;br /&gt;
 case_sensitive                no&lt;br /&gt;
 #                             ligand color, receptor color&lt;br /&gt;
 match                         positive negative&lt;br /&gt;
 match                         positive negative_or_acceptor&lt;br /&gt;
 match                         positive not_neutral&lt;br /&gt;
 match                         negative positive&lt;br /&gt;
 match                         negative positive_or_donor&lt;br /&gt;
 match                         negative not_neutral&lt;br /&gt;
 match                         donor acceptor&lt;br /&gt;
 match                         donor donacc&lt;br /&gt;
 match                         donor negative_or_acceptor&lt;br /&gt;
 match                         donor neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         donor not_neutral&lt;br /&gt;
 match                         acceptor donor&lt;br /&gt;
 match                         acceptor donacc&lt;br /&gt;
 match                         acceptor positive_or_donor&lt;br /&gt;
 match                         acceptor neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         acceptor not_neutral&lt;br /&gt;
 match                         neutral neutral&lt;br /&gt;
 match                         neutral neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         ester_o donor&lt;br /&gt;
 match                         ester_o donacc&lt;br /&gt;
 match                         ester_o positive_or_donor&lt;br /&gt;
 match                         ester_o not_neutral&lt;br /&gt;
 match                         amide_o donor&lt;br /&gt;
 match                         amide_o donacc&lt;br /&gt;
 match                         amide_o positive_or_donor&lt;br /&gt;
 match                         amide_o not_neutral&lt;br /&gt;
&lt;br /&gt;
Single mode is deprecated, these parameters won&#039;t work. See [[Dock Ligand Clustering]]&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SINGLE MODE&lt;br /&gt;
 #&lt;br /&gt;
 #rmsd_override                 0.0&lt;br /&gt;
 #contact_minimum               0&lt;br /&gt;
 #energy_maximum               1.0e+6&lt;br /&gt;
 ##truncate_output              1000.0&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
Search mode is now the default/only mode of docking. Each parameter is described below.&lt;br /&gt;
&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SEARCH MODE&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
The ratio_minimum parameter has been slated for deprecation.&lt;br /&gt;
&lt;br /&gt;
 ratio_minimum                 0.0&lt;br /&gt;
&lt;br /&gt;
These parameters control how many atoms are necessary in the ligand for it to be docked.&lt;br /&gt;
&lt;br /&gt;
 atom_minimum                  5 &lt;br /&gt;
 atom_maximum                  100&lt;br /&gt;
&lt;br /&gt;
How many of the top molecules will be saved in the output test.* file. &lt;br /&gt;
&lt;br /&gt;
 number_save                   50000&lt;br /&gt;
&lt;br /&gt;
The maximum number of molecules that will be scored in any given run.&lt;br /&gt;
&lt;br /&gt;
 molecules_maximum             300000 &lt;br /&gt;
&lt;br /&gt;
How many molecules will be skipped, this feature currently does not work.&lt;br /&gt;
&lt;br /&gt;
 initial_skip                  0&lt;br /&gt;
&lt;br /&gt;
How long a molecule is processed before quitting. This feature currently may not work as expected.&lt;br /&gt;
&lt;br /&gt;
 timeout                       180&lt;br /&gt;
&lt;br /&gt;
There are many scoring options:&lt;br /&gt;
&lt;br /&gt;
 # &lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SCORING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
Valid options for ligand_desolvation are &#039;volume&#039; (partial desolvation a la Mysinger &amp;amp; Shoichet 2010), &#039;full&#039; meaning that the entire ligand is assumed to be desolvated in the binding site and &#039;none&#039;, where no desolvation penalties are applied.&lt;br /&gt;
&lt;br /&gt;
 ligand_desolvation            volume&lt;br /&gt;
&lt;br /&gt;
See the note about relative paths for the matching spheres above, the same comments apply here. There are 2 ways to run &#039;volume&#039; or partial desolvation, one is to use one grid for every ligand atom like this:&lt;br /&gt;
&lt;br /&gt;
 solvmap_file                  ../../grids/solvmap_sev&lt;br /&gt;
&lt;br /&gt;
The other option is to use one grid for ligand heavy atoms and one for ligand hydrogen atoms, you&#039;ll want to uncomment these lines to use them (and comment out the other solvmap_file line).&lt;br /&gt;
&lt;br /&gt;
 #solvmap_file                  ../../grids/solvmap_sev.heavy&lt;br /&gt;
 #hydrogen_solvmap_file         ../../grids/solvmap.sev.hydrogen&lt;br /&gt;
&lt;br /&gt;
This is the phimap file used for electrostatic scoring. For a better understanding of this grid, see [[Visualizing delphi]]. Sometimes this will change if you are using the new Qnifft Delphi maps, see [[Qnifft DOCK 3.6 conversion]].&lt;br /&gt;
&lt;br /&gt;
 delphi_file                   ../../grids/rec+sph.phi&lt;br /&gt;
&lt;br /&gt;
This controls the chemgrid file, which contains the van der Waals scoring for every coordinate (chem.vdw will be called) as well as the distance map grids that will be used for deciphering bumping (chem.bmp will be called).&lt;br /&gt;
&lt;br /&gt;
 chemgrid_file_prefix          ../../grids/chem&lt;br /&gt;
&lt;br /&gt;
This is the parameter file that contains the atom type definitions:&lt;br /&gt;
&lt;br /&gt;
 vdw_parameter_file            ../../grids/vdw.parms.amb.mindock&lt;br /&gt;
&lt;br /&gt;
The following options allow the electrostatics and van der Waals parameters to be scaled relative to each other and the solvation scoring.&lt;br /&gt;
&lt;br /&gt;
 electrostatic_scale           1.0&lt;br /&gt;
 vdw_scale                     1.0&lt;br /&gt;
&lt;br /&gt;
The following parameter lets ligands with internal steric clashes attempt to find a ligand conformation that scores well but does not have any internal clashes. Sometimes this procedure will fail in circumstances where there are many flexible branches, or where a ligand that is too large for the binding site is being docked.&lt;br /&gt;
&lt;br /&gt;
 check_clashes                 yes&lt;br /&gt;
&lt;br /&gt;
If set to yes, this removes the positive solvation from each ligand atom and spreads it evenly over the molecule. This is deprecated because it does unexpected things to solvation, and will be removed entirely soon.&lt;br /&gt;
&lt;br /&gt;
 remove_positive_solvation     no&lt;br /&gt;
&lt;br /&gt;
After each orientation of the rigid component is processed and the many ligand conformations have been examined, the best ligand conformation for that orientation can be minimized using the following parameters.&lt;br /&gt;
&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             MINIMIZATION&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
No turns off minimization completely.&lt;br /&gt;
&lt;br /&gt;
 minimize                      yes&lt;br /&gt;
&lt;br /&gt;
Don&#039;t minimize molecules that score above the minimization_max.&lt;br /&gt;
&lt;br /&gt;
 minimization_max             1.0e15&lt;br /&gt;
&lt;br /&gt;
If set to yes, this checks to see if the orientation has already been scored and quits. This has not been tested recently.&lt;br /&gt;
&lt;br /&gt;
 check_degeneracy              no&lt;br /&gt;
&lt;br /&gt;
How many iterations of minimization to do. More means longer run times, but potentially better poses.&lt;br /&gt;
&lt;br /&gt;
 simplex_iterations            250&lt;br /&gt;
&lt;br /&gt;
How much the total energy can changed to be considered converged. Setting this higher will stop faster, setting it lower will cause it to do more iterations before converging (or potentially hitting the iteration max above).&lt;br /&gt;
&lt;br /&gt;
 simplex_convergence           0.1&lt;br /&gt;
&lt;br /&gt;
If the energy changes by this much, restart the minimizer from this newest position.&lt;br /&gt;
&lt;br /&gt;
 simplex_restart               1.0&lt;br /&gt;
&lt;br /&gt;
This is the initial distance in angstroms the molecule is translated (note that translation and rotation used to be swapped for many releases of DOCK).&lt;br /&gt;
&lt;br /&gt;
 simplex_initial_translation   0.2&lt;br /&gt;
&lt;br /&gt;
How many degrees of initial rotation are done.&lt;br /&gt;
&lt;br /&gt;
 simplex_initial_rotation      5.0&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ###############################################################################&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=INDOCK_for_DOCK_3.6&amp;diff=3355</id>
		<title>INDOCK for DOCK 3.6</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=INDOCK_for_DOCK_3.6&amp;diff=3355"/>
		<updated>2011-10-26T20:41:22Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: change match3 to match2&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;What follows is a documented sample INDOCK file for [[DOCK 3.6]]. Many lines are required, lines starting with # are comments.&lt;br /&gt;
&lt;br /&gt;
Required first line:&lt;br /&gt;
&lt;br /&gt;
 DOCK 3.5 parameter&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ################## DOCK 3.5 INPUT PARAMETERS 2011/09/07 #######################&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                          INPUT/OUTPUT&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
This is the path to the receptor matching spheres file. Most scripts make a set of directories and copy the INDOCK file into them, so this path sometimes has an extra set of &amp;quot;../&amp;quot; in it compared to what you might think. If you use [[DOCK Blaster]]. Generally, match3 has more spheres than match2, so produces more possible orientations. These spheres are matched to ligand spheres, generated from heavy atoms in the &amp;quot;rigid component&amp;quot; of each ligand. For more about the rigid component, see [[Flexibase Format]].&lt;br /&gt;
&lt;br /&gt;
 receptor_sphere_file          ../../sph/match2.sph&lt;br /&gt;
&lt;br /&gt;
The next line is always 1, and is marked for deprecation.&lt;br /&gt;
&lt;br /&gt;
 cluster_numbers               1&lt;br /&gt;
&lt;br /&gt;
The next line refers to which ligand file to use. If using many of the automated scripts, split_database_index is used, as this allows many ligand files (or just 1) to be placed in the split_database_index file and read in one after another during a DOCK run. If docking small things on your own, you can change this to any file.&lt;br /&gt;
&lt;br /&gt;
 # NOTE: split_database_index is reserved to specify a list of files&lt;br /&gt;
 ligand_atom_file              split_database_index&lt;br /&gt;
&lt;br /&gt;
This will control the file output, again many of the automated scripts expect it to be test. OUTDOCK files are always named OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
 output_file_prefix            test.&lt;br /&gt;
&lt;br /&gt;
This controls the random seed used in the minimization procedure. Changing this will produce slightly different results.&lt;br /&gt;
&lt;br /&gt;
 random_seed                   777&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             MATCHING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
distance_tolerance is how different the distances can be between a pair of receptor matching spheres and a pair of ligand matching spheres for them to still be considered matched.&lt;br /&gt;
&lt;br /&gt;
 distance_tolerance            1.5&lt;br /&gt;
&lt;br /&gt;
This changes how many spheres must be matched to generate an orientation. 3 as a minimum, 4 as a maximum is generally accepted as the right thing to use. Less than 3 is too degenerate to generate an actual orientation, and requiring more than 4 matched spheres does not work well, since we only use heavy atoms in ring systems to generate ligand matching spheres.&lt;br /&gt;
&lt;br /&gt;
 nodes_maximum                 4&lt;br /&gt;
 nodes_minimum                 3&lt;br /&gt;
&lt;br /&gt;
The next 4 parameters control how the histograms of distance differences are generated. The binsize is how big the bins are, the overlap controls if a sphere can be put into multiple bins. The ligand &amp;amp; receptor parameters are not required to be the same. &lt;br /&gt;
&lt;br /&gt;
 ligand_binsize                0.4&lt;br /&gt;
 ligand_overlap                0.2&lt;br /&gt;
 receptor_binsize              0.4&lt;br /&gt;
 receptor_overlap              0.2&lt;br /&gt;
&lt;br /&gt;
Bumping is using a quick check of distances when placing ligand atoms in the binding site to determine if they have a steric clash. The maximum is how many can be &#039;bumped&#039; or in close steric contact per rigid or flexible component of the ligand, as per the [[Flexibase Format]]. Even ligands with some steric clashes can sometimes be rescued by minimization. Setting this number very high will cause many clashed orientations to be scored, which can be prohibitively slow.&lt;br /&gt;
&lt;br /&gt;
 bump_maximum                  1&lt;br /&gt;
&lt;br /&gt;
The next four parameters are unused and unsupported.&lt;br /&gt;
&lt;br /&gt;
 focus_cycles                  0&lt;br /&gt;
 focus_bump                    0 &lt;br /&gt;
 focus_type                    energy&lt;br /&gt;
 critical_clusters             no&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             COLORING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
This controls whether chemical matching or coloring is used at all. If yes, many match lines are necessary. These may not be perfect, but [[DOCK Blaster]] has been using these for a long time. Setting this to no produces many more matched orientations, which can be slow, but can help you understand exactly what the energy function is doing.&lt;br /&gt;
&lt;br /&gt;
 chemical_matching             yes&lt;br /&gt;
 case_sensitive                no&lt;br /&gt;
 #                             ligand color, receptor color&lt;br /&gt;
 match                         positive negative&lt;br /&gt;
 match                         positive negative_or_acceptor&lt;br /&gt;
 match                         positive not_neutral&lt;br /&gt;
 match                         negative positive&lt;br /&gt;
 match                         negative positive_or_donor&lt;br /&gt;
 match                         negative not_neutral&lt;br /&gt;
 match                         donor acceptor&lt;br /&gt;
 match                         donor donacc&lt;br /&gt;
 match                         donor negative_or_acceptor&lt;br /&gt;
 match                         donor neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         donor not_neutral&lt;br /&gt;
 match                         acceptor donor&lt;br /&gt;
 match                         acceptor donacc&lt;br /&gt;
 match                         acceptor positive_or_donor&lt;br /&gt;
 match                         acceptor neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         acceptor not_neutral&lt;br /&gt;
 match                         neutral neutral&lt;br /&gt;
 match                         neutral neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         ester_o donor&lt;br /&gt;
 match                         ester_o donacc&lt;br /&gt;
 match                         ester_o positive_or_donor&lt;br /&gt;
 match                         ester_o not_neutral&lt;br /&gt;
 match                         amide_o donor&lt;br /&gt;
 match                         amide_o donacc&lt;br /&gt;
 match                         amide_o positive_or_donor&lt;br /&gt;
 match                         amide_o not_neutral&lt;br /&gt;
&lt;br /&gt;
Single mode is deprecated, these parameters won&#039;t work. See [[Dock Ligand Clustering]]&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SINGLE MODE&lt;br /&gt;
 #&lt;br /&gt;
 #rmsd_override                 0.0&lt;br /&gt;
 #contact_minimum               0&lt;br /&gt;
 #energy_maximum               1.0e+6&lt;br /&gt;
 ##truncate_output              1000.0&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
Search mode is now the default/only mode of docking. Each parameter is described below.&lt;br /&gt;
&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SEARCH MODE&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
The ratio_minimum parameter has been slated for deprecation.&lt;br /&gt;
&lt;br /&gt;
 ratio_minimum                 0.0&lt;br /&gt;
&lt;br /&gt;
These parameters control how many atoms are necessary in the ligand for it to be docked.&lt;br /&gt;
&lt;br /&gt;
 atom_minimum                  5 &lt;br /&gt;
 atom_maximum                  100&lt;br /&gt;
&lt;br /&gt;
How many of the top molecules will be saved in the output test.* file. &lt;br /&gt;
&lt;br /&gt;
 number_save                   50000&lt;br /&gt;
&lt;br /&gt;
The maximum number of molecules that will be scored in any given run.&lt;br /&gt;
&lt;br /&gt;
 molecules_maximum             300000 &lt;br /&gt;
&lt;br /&gt;
How many molecules will be skipped, this feature currently does not work.&lt;br /&gt;
&lt;br /&gt;
 initial_skip                  0&lt;br /&gt;
&lt;br /&gt;
How long a molecule is processed before quitting. This feature currently may not work as expected.&lt;br /&gt;
&lt;br /&gt;
 timeout                       180&lt;br /&gt;
&lt;br /&gt;
There are many scoring options:&lt;br /&gt;
&lt;br /&gt;
 # &lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SCORING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
Valid options for ligand_desolvation are &#039;volume&#039; (partial desolvation a la Mysinger &amp;amp; Shoichet 2010), &#039;full&#039; meaning that the entire ligand is assumed to be desolvated in the binding site and &#039;none&#039;, where no desolvation penalties are applied.&lt;br /&gt;
&lt;br /&gt;
 ligand_desolvation            volume&lt;br /&gt;
&lt;br /&gt;
See the note about relative paths for the matching spheres above, the same comments apply here. There are 2 ways to run &#039;volume&#039; or partial desolvation, one is to use one grid for every ligand atom like this:&lt;br /&gt;
&lt;br /&gt;
 solvmap_file                  ../../grids/solvmap_sev&lt;br /&gt;
&lt;br /&gt;
The other option is to use one grid for ligand heavy atoms and one for ligand hydrogen atoms, you&#039;ll want to uncomment these lines to use them (and comment out the other solvmap_file line).&lt;br /&gt;
&lt;br /&gt;
 #solvmap_file                  ../../grids/solvmap_sev.heavy&lt;br /&gt;
 #hydrogen_solvmap_file         ../../grids/solvmap.sev.hydrogen&lt;br /&gt;
&lt;br /&gt;
This is the phimap file used for electrostatic scoring. For a better understanding of this grid, see [[Visualizing delphi]]. Sometimes this will change if you are using the new Qnifft Delphi maps, see [[Qnifft DOCK 3.6 conversion]].&lt;br /&gt;
&lt;br /&gt;
 delphi_file                   ../../grids/rec+sph.phi&lt;br /&gt;
&lt;br /&gt;
This controls the chemgrid file, which contains the van der Waals scoring for every coordinate (chem.vdw will be called) as well as the distance map grids that will be used for deciphering bumping (chem.bmp will be called).&lt;br /&gt;
&lt;br /&gt;
 chemgrid_file_prefix          ../../grids/chem&lt;br /&gt;
&lt;br /&gt;
This is the parameter file that contains the atom type definitions:&lt;br /&gt;
&lt;br /&gt;
 vdw_parameter_file            ../../grids/vdw.parms.amb.mindock&lt;br /&gt;
&lt;br /&gt;
The following options allow the electrostatics and van der Waals parameters to be scaled relative to each other and the solvation scoring.&lt;br /&gt;
&lt;br /&gt;
 electrostatic_scale           1.0&lt;br /&gt;
 vdw_scale                     1.0&lt;br /&gt;
&lt;br /&gt;
The following parameter lets ligands with internal steric clashes attempt to find a ligand conformation that scores well but does not have any internal clashes. Sometimes this procedure will fail in circumstances where there are many flexible branches, or where a ligand that is too large for the binding site is being docked.&lt;br /&gt;
&lt;br /&gt;
 check_clashes                 yes&lt;br /&gt;
&lt;br /&gt;
If set to yes, this removes the positive solvation from each ligand atom and spreads it evenly over the molecule. This is deprecated because it does unexpected things to solvation, and will be removed entirely soon.&lt;br /&gt;
&lt;br /&gt;
 remove_positive_solvation     no&lt;br /&gt;
&lt;br /&gt;
After each orientation of the rigid component is processed and the many ligand conformations have been examined, the best ligand conformation for that orientation can be minimized using the following parameters.&lt;br /&gt;
&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             MINIMIZATION&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
No turns off minimization completely.&lt;br /&gt;
&lt;br /&gt;
 minimize                      yes&lt;br /&gt;
&lt;br /&gt;
Don&#039;t minimize molecules that score above the minimization_max.&lt;br /&gt;
&lt;br /&gt;
 minimization_max             1.0e15&lt;br /&gt;
&lt;br /&gt;
If set to yes, this checks to see if the orientation has already been scored and quits. This has not been tested recently.&lt;br /&gt;
&lt;br /&gt;
 check_degeneracy              no&lt;br /&gt;
&lt;br /&gt;
How many iterations of minimization to do. More means longer run times, but potentially better poses.&lt;br /&gt;
&lt;br /&gt;
 simplex_iterations            250&lt;br /&gt;
&lt;br /&gt;
How much the total energy can changed to be considered converged. Setting this higher will stop faster, setting it lower will cause it to do more iterations before converging (or potentially hitting the iteration max above).&lt;br /&gt;
&lt;br /&gt;
 simplex_convergence           0.1&lt;br /&gt;
&lt;br /&gt;
If the energy changes by this much, restart the minimizer from this newest position.&lt;br /&gt;
&lt;br /&gt;
 simplex_restart               1.0&lt;br /&gt;
&lt;br /&gt;
This is the initial distance in angstroms the molecule is translated (note that translation and rotation used to be swapped for many releases of DOCK).&lt;br /&gt;
&lt;br /&gt;
 simplex_initial_translation   0.2&lt;br /&gt;
&lt;br /&gt;
How many degrees of initial rotation are done.&lt;br /&gt;
&lt;br /&gt;
 simplex_initial_rotation      5.0&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ###############################################################################&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=INDOCK_for_DOCK_3.6&amp;diff=3353</id>
		<title>INDOCK for DOCK 3.6</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=INDOCK_for_DOCK_3.6&amp;diff=3353"/>
		<updated>2011-10-17T01:43:10Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: note that remove_positive_solvation will be removed&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;What follows is a documented sample INDOCK file for [[DOCK 3.6]]. Many lines are required, lines starting with # are comments.&lt;br /&gt;
&lt;br /&gt;
Required first line:&lt;br /&gt;
&lt;br /&gt;
 DOCK 3.5 parameter&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ################## DOCK 3.5 INPUT PARAMETERS 2011/09/07 #######################&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                          INPUT/OUTPUT&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
This is the path to the receptor matching spheres file. Most scripts make a set of directories and copy the INDOCK file into them, so this path sometimes has an extra set of &amp;quot;../&amp;quot; in it compared to what you might think. If you use [[DOCK Blaster]]. Generally, match3 has more spheres than match2, so produces more possible orientations. These spheres are matched to ligand spheres, generated from heavy atoms in the &amp;quot;rigid component&amp;quot; of each ligand. For more about the rigid component, see [[Flexibase Format]].&lt;br /&gt;
&lt;br /&gt;
 receptor_sphere_file          ../../sph/match3.sph&lt;br /&gt;
&lt;br /&gt;
The next line is always 1, and is marked for deprecation.&lt;br /&gt;
&lt;br /&gt;
 cluster_numbers               1&lt;br /&gt;
&lt;br /&gt;
The next line refers to which ligand file to use. If using many of the automated scripts, split_database_index is used, as this allows many ligand files (or just 1) to be placed in the split_database_index file and read in one after another during a DOCK run. If docking small things on your own, you can change this to any file.&lt;br /&gt;
&lt;br /&gt;
 # NOTE: split_database_index is reserved to specify a list of files&lt;br /&gt;
 ligand_atom_file              split_database_index&lt;br /&gt;
&lt;br /&gt;
This will control the file output, again many of the automated scripts expect it to be test. OUTDOCK files are always named OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
 output_file_prefix            test.&lt;br /&gt;
&lt;br /&gt;
This controls the random seed used in the minimization procedure. Changing this will produce slightly different results.&lt;br /&gt;
&lt;br /&gt;
 random_seed                   777&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             MATCHING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
distance_tolerance is how different the distances can be between a pair of receptor matching spheres and a pair of ligand matching spheres for them to still be considered matched.&lt;br /&gt;
&lt;br /&gt;
 distance_tolerance            1.5&lt;br /&gt;
&lt;br /&gt;
This changes how many spheres must be matched to generate an orientation. 3 as a minimum, 4 as a maximum is generally accepted as the right thing to use. Less than 3 is too degenerate to generate an actual orientation, and requiring more than 4 matched spheres does not work well, since we only use heavy atoms in ring systems to generate ligand matching spheres.&lt;br /&gt;
&lt;br /&gt;
 nodes_maximum                 4&lt;br /&gt;
 nodes_minimum                 3&lt;br /&gt;
&lt;br /&gt;
The next 4 parameters control how the histograms of distance differences are generated. The binsize is how big the bins are, the overlap controls if a sphere can be put into multiple bins. The ligand &amp;amp; receptor parameters are not required to be the same. &lt;br /&gt;
&lt;br /&gt;
 ligand_binsize                0.4&lt;br /&gt;
 ligand_overlap                0.2&lt;br /&gt;
 receptor_binsize              0.4&lt;br /&gt;
 receptor_overlap              0.2&lt;br /&gt;
&lt;br /&gt;
Bumping is using a quick check of distances when placing ligand atoms in the binding site to determine if they have a steric clash. The maximum is how many can be &#039;bumped&#039; or in close steric contact per rigid or flexible component of the ligand, as per the [[Flexibase Format]]. Even ligands with some steric clashes can sometimes be rescued by minimization. Setting this number very high will cause many clashed orientations to be scored, which can be prohibitively slow.&lt;br /&gt;
&lt;br /&gt;
 bump_maximum                  1&lt;br /&gt;
&lt;br /&gt;
The next four parameters are unused and unsupported.&lt;br /&gt;
&lt;br /&gt;
 focus_cycles                  0&lt;br /&gt;
 focus_bump                    0 &lt;br /&gt;
 focus_type                    energy&lt;br /&gt;
 critical_clusters             no&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             COLORING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
This controls whether chemical matching or coloring is used at all. If yes, many match lines are necessary. These may not be perfect, but [[DOCK Blaster]] has been using these for a long time. Setting this to no produces many more matched orientations, which can be slow, but can help you understand exactly what the energy function is doing.&lt;br /&gt;
&lt;br /&gt;
 chemical_matching             yes&lt;br /&gt;
 case_sensitive                no&lt;br /&gt;
 #                             ligand color, receptor color&lt;br /&gt;
 match                         positive negative&lt;br /&gt;
 match                         positive negative_or_acceptor&lt;br /&gt;
 match                         positive not_neutral&lt;br /&gt;
 match                         negative positive&lt;br /&gt;
 match                         negative positive_or_donor&lt;br /&gt;
 match                         negative not_neutral&lt;br /&gt;
 match                         donor acceptor&lt;br /&gt;
 match                         donor donacc&lt;br /&gt;
 match                         donor negative_or_acceptor&lt;br /&gt;
 match                         donor neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         donor not_neutral&lt;br /&gt;
 match                         acceptor donor&lt;br /&gt;
 match                         acceptor donacc&lt;br /&gt;
 match                         acceptor positive_or_donor&lt;br /&gt;
 match                         acceptor neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         acceptor not_neutral&lt;br /&gt;
 match                         neutral neutral&lt;br /&gt;
 match                         neutral neutral_or_acceptor_or_donor&lt;br /&gt;
 match                         ester_o donor&lt;br /&gt;
 match                         ester_o donacc&lt;br /&gt;
 match                         ester_o positive_or_donor&lt;br /&gt;
 match                         ester_o not_neutral&lt;br /&gt;
 match                         amide_o donor&lt;br /&gt;
 match                         amide_o donacc&lt;br /&gt;
 match                         amide_o positive_or_donor&lt;br /&gt;
 match                         amide_o not_neutral&lt;br /&gt;
&lt;br /&gt;
Single mode is deprecated, these parameters won&#039;t work. See [[Dock Ligand Clustering]]&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SINGLE MODE&lt;br /&gt;
 #&lt;br /&gt;
 #rmsd_override                 0.0&lt;br /&gt;
 #contact_minimum               0&lt;br /&gt;
 #energy_maximum               1.0e+6&lt;br /&gt;
 ##truncate_output              1000.0&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
Search mode is now the default/only mode of docking. Each parameter is described below.&lt;br /&gt;
&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SEARCH MODE&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
The ratio_minimum parameter has been slated for deprecation.&lt;br /&gt;
&lt;br /&gt;
 ratio_minimum                 0.0&lt;br /&gt;
&lt;br /&gt;
These parameters control how many atoms are necessary in the ligand for it to be docked.&lt;br /&gt;
&lt;br /&gt;
 atom_minimum                  5 &lt;br /&gt;
 atom_maximum                  100&lt;br /&gt;
&lt;br /&gt;
How many of the top molecules will be saved in the output test.* file. &lt;br /&gt;
&lt;br /&gt;
 number_save                   50000&lt;br /&gt;
&lt;br /&gt;
The maximum number of molecules that will be scored in any given run.&lt;br /&gt;
&lt;br /&gt;
 molecules_maximum             300000 &lt;br /&gt;
&lt;br /&gt;
How many molecules will be skipped, this feature currently does not work.&lt;br /&gt;
&lt;br /&gt;
 initial_skip                  0&lt;br /&gt;
&lt;br /&gt;
How long a molecule is processed before quitting. This feature currently may not work as expected.&lt;br /&gt;
&lt;br /&gt;
 timeout                       180&lt;br /&gt;
&lt;br /&gt;
There are many scoring options:&lt;br /&gt;
&lt;br /&gt;
 # &lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             SCORING&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
Valid options for ligand_desolvation are &#039;volume&#039; (partial desolvation a la Mysinger &amp;amp; Shoichet 2010), &#039;full&#039; meaning that the entire ligand is assumed to be desolvated in the binding site and &#039;none&#039;, where no desolvation penalties are applied.&lt;br /&gt;
&lt;br /&gt;
 ligand_desolvation            volume&lt;br /&gt;
&lt;br /&gt;
See the note about relative paths for the matching spheres above, the same comments apply here. There are 2 ways to run &#039;volume&#039; or partial desolvation, one is to use one grid for every ligand atom like this:&lt;br /&gt;
&lt;br /&gt;
 solvmap_file                  ../../grids/solvmap_sev&lt;br /&gt;
&lt;br /&gt;
The other option is to use one grid for ligand heavy atoms and one for ligand hydrogen atoms, you&#039;ll want to uncomment these lines to use them (and comment out the other solvmap_file line).&lt;br /&gt;
&lt;br /&gt;
 #solvmap_file                  ../../grids/solvmap_sev.heavy&lt;br /&gt;
 #hydrogen_solvmap_file         ../../grids/solvmap.sev.hydrogen&lt;br /&gt;
&lt;br /&gt;
This is the phimap file used for electrostatic scoring. For a better understanding of this grid, see [[Visualizing delphi]].&lt;br /&gt;
&lt;br /&gt;
 delphi_file                   ../../grids/rec+sph.phi&lt;br /&gt;
&lt;br /&gt;
This controls the chemgrid file, which contains the van der Waals scoring for every coordinate (chem.vdw will be called) as well as the distance map grids that will be used for deciphering bumping (chem.bmp will be called).&lt;br /&gt;
&lt;br /&gt;
 chemgrid_file_prefix          ../../grids/chem&lt;br /&gt;
&lt;br /&gt;
This is the parameter file that contains the atom type definitions:&lt;br /&gt;
&lt;br /&gt;
 vdw_parameter_file            ../../grids/vdw.parms.amb.mindock&lt;br /&gt;
&lt;br /&gt;
The following options allow the electrostatics and van der Waals parameters to be scaled relative to each other and the solvation scoring.&lt;br /&gt;
&lt;br /&gt;
 electrostatic_scale           1.0&lt;br /&gt;
 vdw_scale                     1.0&lt;br /&gt;
&lt;br /&gt;
The following parameter lets ligands with internal steric clashes attempt to find a ligand conformation that scores well but does not have any internal clashes. Sometimes this procedure will fail in circumstances where there are many flexible branches, or where a ligand that is too large for the binding site is being docked.&lt;br /&gt;
&lt;br /&gt;
 check_clashes                 yes&lt;br /&gt;
&lt;br /&gt;
If set to yes, this removes the positive solvation from each ligand atom and spreads it evenly over the molecule. This is deprecated because it does unexpected things to solvation, and will be removed entirely soon.&lt;br /&gt;
&lt;br /&gt;
 remove_positive_solvation     no&lt;br /&gt;
&lt;br /&gt;
After each orientation of the rigid component is processed and the many ligand conformations have been examined, the best ligand conformation for that orientation can be minimized using the following parameters.&lt;br /&gt;
&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 #                             MINIMIZATION&lt;br /&gt;
 #&lt;br /&gt;
&lt;br /&gt;
No turns off minimization completely.&lt;br /&gt;
&lt;br /&gt;
 minimize                      yes&lt;br /&gt;
&lt;br /&gt;
Don&#039;t minimize molecules that score above the minimization_max.&lt;br /&gt;
&lt;br /&gt;
 minimization_max             1.0e15&lt;br /&gt;
&lt;br /&gt;
If set to yes, this checks to see if the orientation has already been scored and quits. This has not been tested recently.&lt;br /&gt;
&lt;br /&gt;
 check_degeneracy              no&lt;br /&gt;
&lt;br /&gt;
How many iterations of minimization to do. More means longer run times, but potentially better poses.&lt;br /&gt;
&lt;br /&gt;
 simplex_iterations            250&lt;br /&gt;
&lt;br /&gt;
How much the total energy can changed to be considered converged. Setting this higher will stop faster, setting it lower will cause it to do more iterations before converging (or potentially hitting the iteration max above).&lt;br /&gt;
&lt;br /&gt;
 simplex_convergence           0.1&lt;br /&gt;
&lt;br /&gt;
If the energy changes by this much, restart the minimizer from this newest position.&lt;br /&gt;
&lt;br /&gt;
 simplex_restart               1.0&lt;br /&gt;
&lt;br /&gt;
This is the initial distance in angstroms the molecule is translated (note that translation and rotation used to be swapped for many releases of DOCK).&lt;br /&gt;
&lt;br /&gt;
 simplex_initial_translation   0.2&lt;br /&gt;
&lt;br /&gt;
How many degrees of initial rotation are done.&lt;br /&gt;
&lt;br /&gt;
 simplex_initial_rotation      5.0&lt;br /&gt;
 #&lt;br /&gt;
 ###############################################################################&lt;br /&gt;
 ###############################################################################&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4283</id>
		<title>Running DOCK</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4283"/>
		<updated>2011-09-20T23:58:46Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Running DOCK */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Running DOCK=&lt;br /&gt;
   &lt;br /&gt;
*modify &amp;lt;tt&amp;gt;$mud/INDOCK&amp;lt;/tt&amp;gt; and set up the desired directory structure &amp;amp;ndash; either manually or by running &#039;&amp;lt;tt&amp;gt;md4db.csh bysubset N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt; N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt; Type&amp;lt;/tt&amp;gt;&#039;, where &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the identifier of the library (1: lead-like; 2: fragment-like), &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the number of chunks (i.e., jobs you can run in parallel), and &amp;lt;tt&amp;gt;Type&amp;lt;/tt&amp;gt; is the category of library (i.e., bysubset, byvendor, etc).&lt;br /&gt;
* if it hasn&#039;t been generated by a script, create the file &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;, which conatins the list of the directories (i.e., chunks of the database) that you want to dock.&lt;br /&gt;
*if you plan to use any of John&#039;s scripts in the downstream    processing, leave the output file prefixes at &amp;lt;tt&amp;gt;test.&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*take care that the paths to the &amp;lt;tt&amp;gt;.db.gz&amp;lt;/tt&amp;gt;  files in &amp;lt;tt&amp;gt;split_database_index&amp;lt;/tt&amp;gt; do not get too long. If they do, go via    links.  &lt;br /&gt;
*submit the calculations to the cluster with &amp;lt;tt&amp;gt;$mud/submit.csh&amp;lt;/tt&amp;gt; from the directory in which your data (most importantly, &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;) resides. See [[MUD - Michael&#039;s Utilities for Docking]] for setting the $mud variable.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4122</id>
		<title>Preparing the protein</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4122"/>
		<updated>2011-09-20T23:57:09Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: Remove distmap references, add prot2crg.py&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Preparing the protein=&lt;br /&gt;
&lt;br /&gt;
Items which are prefixed with &#039;AH&#039; are relevant for docking [[HEI]]s to amidohydrolases and can safely be ignored for most metal-free proteins. &lt;br /&gt;
&lt;br /&gt;
==Modifying the PDB file==&lt;br /&gt;
 &lt;br /&gt;
*prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt;  by removing all lines that do not commence with &#039;ATOM&#039;, all columns to the right of the z-coordinate and the TER statements.  &lt;br /&gt;
*treat all selenomethionines (MSE) as methionines (MET) by replacing the selenium atom (SE&amp;amp;curren;) with sulphur (&amp;amp;curren;SD). Be careful about the correct alignment!  &lt;br /&gt;
*atom enumeration does not matter, so don&#039;t bother to renumber after any of the following steps. Unique numbers are a good idea, presumably.  &lt;br /&gt;
*select the protonation states of HIS residues to be either &amp;amp;delta;- (rename residue to HID), &amp;amp;epsilon;- (rename residue to HIE) or doubly protonated (rename residue to HIP). HIS on the surface should be HIP. HIS residues coordinating the metal ions should have their protons pointing away from the ions. Base your decision on the immediate environment of the HIS residue: are there potential hydrogen bonds that can be formed?; are there charged residues close by?; would a certain protonation lead to clashes with other residues?; etc.&lt;br /&gt;
*AH: the carboxylated LYS of subtype I is CYK, but this is not tolerated by &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt; , so store and delete the 3 surplus atoms and call the residue LYS.  &lt;br /&gt;
*AH: the more buried metal ion is ZB (charge 1.4), the other one ZA (charge 1.3). Atom names are right-aligned!&lt;br /&gt;
&lt;br /&gt;
==Running startdockblaster5==&lt;br /&gt;
   &lt;br /&gt;
*generate the file &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; , which should only contain atoms of the MMFF. Prepare it in the same way as above: remove all columns to the right of the z-coordinate and the TER statements. Change HETATM to ATOM.&lt;br /&gt;
*generate the files &amp;lt;tt&amp;gt;.only_spheres&amp;lt;/tt&amp;gt; and &amp;amp;ndash; in case you would like the matching spheres to be based on the heavy atoms in &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; &amp;amp;ndash; &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; and write `on&#039; to the latter. Be careful not to add blank lines at the end, this will not be understood by &amp;lt;tt&amp;gt;makespheres2.pl&amp;lt;/tt&amp;gt; . In any case, the entry in &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt;  will be ignored by &amp;lt;tt&amp;gt;makespheres1.pl&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*on sgehead (or, as of [[dock67]], on any machine), run &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  to set up the data structure and copy all relevant files. It is a good idea to use csh and to &amp;lt;tt&amp;gt;source .login&amp;lt;/tt&amp;gt; beforehand.  &lt;br /&gt;
*if &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  doesn&#039;t finish for any obvious reason and with no clear error message, or &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; has very funny hydrogen placements, make sure that you have no non-printing characters in &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt;. Do that by running your file through &amp;lt;tt&amp;gt;pc2unix rec.pdb&amp;lt;/tt&amp;gt;. Check that your file is clean by looking at it with &amp;lt;tt&amp;gt;od -c rec.pdb | less &amp;lt;/tt&amp;gt;. The only character with a backslash should be \n &amp;amp;mdash; you should see no \t, \r, etc. If this doesn&#039;t solve the problem, your best bet is to re-prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; from scratch &amp;amp;mdash; it is likely that there are some blanks or hidden characters that are causing the problems.  &lt;br /&gt;
*Take any WARNING messages emitted seriously, and continue only if you know why each one is there. Furthermore, verify that &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; still contains &#039;&#039;all&#039;&#039; atoms.  &lt;br /&gt;
*if you do not want to do anything special with the protein, like tarting some residues or modifying the spheres, go directly to chapter [[Running DOCK|3]].&lt;br /&gt;
&lt;br /&gt;
==Removing and modifying files==&lt;br /&gt;
   &lt;br /&gt;
*go to &amp;lt;tt&amp;gt;./grids&amp;lt;/tt&amp;gt;  and remove the surplus files from this directory (some would cause error messages from the subsequent programs):&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;rm -f PDBPARM chem.* rec+sph.phi solvmap tart.txt OUT*&amp;lt;/tt&amp;gt;&lt;br /&gt;
*modify &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;: &lt;br /&gt;
**AH: CYK: put the three missing atoms, delete the surplus hydrogens specific for LYS and rename the carboxylated lysine residue CYK.  &lt;br /&gt;
**remove all TER statements that might have been added.  &lt;br /&gt;
**AH: set the atom names of the metal ions to ZA and ZB and the residue name to ZN.  &lt;br /&gt;
**take care of disulfide bonds. Remove the thiol hydrogens (if they have been added) and change the residue name from CYS to CYX.  &lt;br /&gt;
*look at the &amp;lt;tt&amp;gt;box&amp;lt;/tt&amp;gt; and maybe move it, so that the ligands won&#039;t stick out. Modify the &#039;center&#039; and &#039;coordinates&#039; statement in the preamble.  &lt;br /&gt;
*all residues and atoms have to be listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;vdw.parms.amb.mindock&amp;lt;/tt&amp;gt;, respectively &amp;amp;rArr; do not tart any residues in this file!    &lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[chemgrid]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt; and check &amp;lt;tt&amp;gt;OUTPARM&amp;lt;/tt&amp;gt; for the correct van der Waals parameters of all residues.  &lt;br /&gt;
*grep for &amp;lt;tt&amp;gt;0.000&amp;lt;/tt&amp;gt;  in &amp;lt;tt&amp;gt;PDBPARM&amp;lt;/tt&amp;gt;: if any atom has this value in the 3&amp;lt;sup&amp;gt;rd&amp;lt;/sup&amp;gt; and 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt; column, it has not been recognized by &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  (because it is not listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt;) and is thus &#039;&#039;ignored&#039;&#039; in the van der Waals-maps. There will be no other errors, the docking will finish showing some &amp;quot;bumping&amp;quot; ligands which have extremely favorable energies (&amp;amp;le; -200).&lt;br /&gt;
*Another sign of a problem with atomic radii are any &#039;WARNING&#039;s issued in OUTPARM&lt;br /&gt;
*if one has to run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  again, first remove &amp;lt;tt&amp;gt;PDBPARM OUTPARM OUTCHEM&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;chem.*&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Tarting the protein==&lt;br /&gt;
   &lt;br /&gt;
*cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and continue with the latter file.&lt;br /&gt;
* tarted residues can be found in &amp;lt;tt&amp;gt;$DOCK_BASE/scripts/grids&amp;lt;/tt&amp;gt;, they are the files with the extension &amp;lt;tt&amp;gt;prot2&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* add the relevant resides to the bottom of your &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; file, being very precise to match the current formatting&lt;br /&gt;
* generate the new &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; from the edited &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; using:&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;$mud/prot2crg.py &amp;lt; prot.table.ambcrg.ambH &amp;gt; amb.crg.oxt&amp;lt;/tt&amp;gt;&lt;br /&gt;
*AH: select the appropriate version of &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; depending on the subtype. Files are called &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;, where &amp;lt;tt&amp;gt;N&amp;lt;/tt&amp;gt; can be &amp;lt;tt&amp;gt;I, III&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;VI&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*AH: edit the residues in the binding site (i.e., all residues complexing the metal ions in the binding site), so that their names conform to the names of the modified residues in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  &lt;br /&gt;
*optionally tart the residues that are in contact with a crystallographic ligand, if any.  &lt;br /&gt;
*AH: check that ZA and ZB, respectively (left-aligned in the atom column), have corresponding entries in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;vdw.siz&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Modifying the Delphi spheres==&lt;br /&gt;
   &lt;br /&gt;
*load &amp;lt;tt&amp;gt;match1.sph.pdb&amp;lt;/tt&amp;gt; (i.e., the DelPhi spheres).  &lt;br /&gt;
*delete the spheres that are too close to the solvent.  &lt;br /&gt;
*(AH:) add spheres so that there is one sphere &#039;&#039;between&#039;&#039; the metals, several spheres &#039;&#039;around&#039;&#039; the metals and some spheres close to polar residues.  &lt;br /&gt;
*a good number for DelPhi spheres is 120.  &lt;br /&gt;
*append the spheres to the end of &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to make &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and put a TER statement after each sphere. Don&#039;t use tabs for whitespace, can cause problems with DelPhi!  &lt;br /&gt;
&lt;br /&gt;
==Modifying the Matching spheres==&lt;br /&gt;
&lt;br /&gt;
*load &amp;lt;tt&amp;gt;match2.sph.pdb&amp;lt;/tt&amp;gt; for sparse initial spheres or &amp;lt;tt&amp;gt;match3.sph.pdb&amp;lt;/tt&amp;gt; denser spheres.&lt;br /&gt;
*If you selected &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; be careful not to move any spheres based on the ligand atoms.  &lt;br /&gt;
*(AH:) put at least one sphere between the metals and increase the sampling in the region around the metal ions by putting some spheres there.&lt;br /&gt;
*a good number for matching spheres is 50-60.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;pdbtosph matchN.sph.pdb mysph.sph&amp;lt;/tt&amp;gt; to generate the files that will be read by [[DOCK]].   &lt;br /&gt;
*if color matching is desired, run &amp;lt;tt&amp;gt;colorspheres.pl sph/match2.sph&amp;lt;/tt&amp;gt; in the parent directory of the docking run (i.e., &amp;lt;tt&amp;gt;..&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;sph&amp;lt;/tt&amp;gt; ) to put some color on your spheres.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;cat $mud/header.sph match2.sph&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[http://bcr.musc.edu/manuals/delphi.htm DelPhi]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
&lt;br /&gt;
*if you changed rec+sph.crg above, you need to run Delphi   &lt;br /&gt;
*if necessary, modify &amp;lt;tt&amp;gt;delphi.com&amp;lt;/tt&amp;gt; so that all the paths and file names are appropriate.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;delphi.com &amp;gt; delphi.log&amp;lt;/tt&amp;gt;  and check the output.&lt;br /&gt;
*any &#039;WARNING&#039; in the log is an indication that some atomic charges might not be correct.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[solvmap]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*if you changed rec.crg or the box above, you need to run solvmap   &lt;br /&gt;
*check that all atoms are present in &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; and run &amp;lt;tt&amp;gt;solvmap&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4121</id>
		<title>Preparing the protein</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4121"/>
		<updated>2011-06-08T06:06:02Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Preparing the protein=&lt;br /&gt;
&lt;br /&gt;
Items which are prefixed with &#039;AH&#039; are relevant for docking [[HEI]]s to amidohydrolases and can safely be ignored for most metal-free proteins. &lt;br /&gt;
&lt;br /&gt;
==Modifying the PDB file==&lt;br /&gt;
 &lt;br /&gt;
*prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt;  by removing all lines that do not commence with &#039;ATOM&#039;, the chain column, all columns to the right of the z-coordinate and the TER statements.  &lt;br /&gt;
*treat all selenomethionines (MSE) as methionines (MET) by replacing the selenium atom (SE&amp;amp;curren;) with sulphur    (&amp;amp;curren;SD). Be careful about the correct alignment!  &lt;br /&gt;
*atom enumeration does not matter, so don&#039;t bother to renumber    after any of the following steps. Unique numbers are a good idea,    presumably.  &lt;br /&gt;
*select the protonation states of HIS residues to be either &amp;amp;delta;- (rename residue to HID), &amp;amp;epsilon;- (rename residue to HIE) or doubly protonated (rename residue to HIP). HIS on the surface should be HIP. HIS residues coordinating    the metal ions should have their protons pointing away from the    ions. Base your decision on the immediate environment of the HIS residue: are there potential hydrogen bonds that can be formed?; are there charged residues close by?; would a certain protonation lead to clashes with other residues?; etc.&lt;br /&gt;
*AH: the carboxylated LYS of subtype I is CYK, but this is not    tolerated by &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt; , so store and delete the 3    surplus atoms and call the residue LYS.  &lt;br /&gt;
*AH: the more buried metal ion is ZB (charge 1.4), the other one ZA    (charge 1.3). Atom names are right-aligned!&lt;br /&gt;
&lt;br /&gt;
==Running startdockblaster5==&lt;br /&gt;
   &lt;br /&gt;
*generate the file &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; , which should only contain    atoms of the MMFF. Prepare it in the same way as above: remove the    chain column, all columns to the right of the z-coordinate and the    TER statements.  &lt;br /&gt;
*generate the files &amp;lt;tt&amp;gt;.only_spheres&amp;lt;/tt&amp;gt;  and &amp;amp;ndash; in case you    would like the matching spheres to be based on the heavy atoms in &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; &amp;amp;ndash; &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; and write `on&#039; to the    latter. Be careful not to add blank lines at the end, this will not    be understood by &amp;lt;tt&amp;gt;makespheres2.pl&amp;lt;/tt&amp;gt; . In any case, the entry in    &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt;  will be ignored by &amp;lt;tt&amp;gt;makespheres1.pl&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*on sgehead (or, as of [[dock67]], on any machine), run &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  to set up the data structure and copy all relevant files. It is a good idea to use csh and to &amp;lt;tt&amp;gt;source .login&amp;lt;/tt&amp;gt;  beforehand.  &lt;br /&gt;
*if &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  doesn&#039;t finish for any obvious reason and with no clear error message, or &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; has very funny hydrogen placements, make sure that you have no non-printing characters in &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt;. Do that by running your file through &amp;lt;tt&amp;gt;pc2unix rec.pdb&amp;lt;/tt&amp;gt;. Check that your file is clean by looking at it with &amp;lt;tt&amp;gt;od -c rec.pdb | less &amp;lt;/tt&amp;gt;. The only character with a backslash should be \n &amp;amp;mdash; you should see no \t, \r, etc. If this doesn&#039;t solve the problem, your best bet is to    reprepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; from scratch &amp;amp;mdash; it is    likely that there are some blanks or hidden characters that are    causing the problems.  &lt;br /&gt;
*check the files &amp;lt;tt&amp;gt;stdout&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;stderr&amp;lt;/tt&amp;gt;  after the    run for potential mistakes and error messages. Furthermore, verify    that &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;  still contains &#039;&#039;all&#039;&#039; atoms.  &lt;br /&gt;
*if you do not want to do anything special with the protein, like    tarting some residues or modifying the spheres, go directly to chapter [[Running DOCK|3]].&lt;br /&gt;
&lt;br /&gt;
==Removing and modifying files==&lt;br /&gt;
   &lt;br /&gt;
*go to &amp;lt;tt&amp;gt;./grids&amp;lt;/tt&amp;gt;  and remove the surplus files from this    directory (some would cause error messages from the subsequent    programs):&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;rm -f PDBPARM chem.* distmap.box  distmap    distmap.log rec+sph.phi solvmap tart.txt OUT*&amp;lt;/tt&amp;gt;&lt;br /&gt;
*modify &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;:      &lt;br /&gt;
**AH: CYK: put the three missing atoms, delete the surplus hydrogens specific for LYS and rename the carboxylated lysine residue CYK.  &lt;br /&gt;
**remove all TER statements that might have been added.  &lt;br /&gt;
**AH: set the atom names of the metal ions to ZA and ZB and the    residue name to ZN.  &lt;br /&gt;
**take care of disulfide bonds. Remove the thiol hydrogens (if    they have been added) and change the residue name from CYS to CYX.  &lt;br /&gt;
*look at the &amp;lt;tt&amp;gt;box&amp;lt;/tt&amp;gt;  and maybe move it, so that the ligands    won&#039;t stick out. Modify the &#039;center&#039; and &#039;coordinates&#039; statement in    the preamble.  &lt;br /&gt;
*all residues and atoms have to be listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;vdw.parms.amb.mindock&amp;lt;/tt&amp;gt;, respectively &amp;amp;rArr; do not tart any residues in this file!    &lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[chemgrid]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt; and check &amp;lt;tt&amp;gt;OUTPARM&amp;lt;/tt&amp;gt; for the correct charges of all residues.  &lt;br /&gt;
*grep for &amp;lt;tt&amp;gt;0.000&amp;lt;/tt&amp;gt;  in &amp;lt;tt&amp;gt;PDBPARM&amp;lt;/tt&amp;gt;: if any atom has this    value in the 3&amp;lt;sup&amp;gt;rd&amp;lt;/sup&amp;gt; and 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt; column, it has not been    recognized by &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  (because it is not listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt;) and is thus &#039;&#039;ignored&#039;&#039; in the van der    Waals-maps. There will be no other errors, the docking will finish    showing some &amp;quot;bumping&amp;quot; ligands which have extremely favorable    energies (&amp;amp;le; -200).&lt;br /&gt;
*Another sign of a problem with atomic radii are any &#039;WARNING&#039; issued in OUTPARM&lt;br /&gt;
*if one has to run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  again, remove &amp;lt;tt&amp;gt;PDBPARM OUTPARM OUTCHEM&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;chem.*&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
 &lt;br /&gt;
* the default is to run &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt; on &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;. If you modified this file, rerun by simply typing &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* AH: cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec-dist.crg&amp;lt;/tt&amp;gt;  and remove the Zn atoms in the latter file (otherwise there will be lots of bumping ligands). Edit &amp;lt;tt&amp;gt;INDIST&amp;lt;/tt&amp;gt; to update the filename.&lt;br /&gt;
*run &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tarting the protein==&lt;br /&gt;
   &lt;br /&gt;
*cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt;  and continue with the latter file.&lt;br /&gt;
* tarted residues can be found in &amp;lt;tt&amp;gt;$DOCK_BASE/scripts/grids&amp;lt;/tt&amp;gt;, they are the files with the extension &amp;lt;tt&amp;gt;prot2&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* take care that the format of the &amp;lt;tt&amp;gt;.prot2&amp;lt;/tt&amp;gt; file is consistent with the format in the &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; file, e.g., that there is no leading space before an atom name etc.&lt;br /&gt;
*AH: select the appropriate version of &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt;  depending on    the subtype. Files are called &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt; , where &amp;lt;tt&amp;gt;N&amp;lt;/tt&amp;gt;  can    be &amp;lt;tt&amp;gt;I, III&amp;lt;/tt&amp;gt;  or &amp;lt;tt&amp;gt;VI&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*AH: edit the residues in the binding site (i.e., all residues    complexing the metal ions in the binding site), so that their    names conform to the names of the modified residues in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  &lt;br /&gt;
*tart the residues that are in contact with a crystallographic    ligand, if any.  &lt;br /&gt;
*AH: check that ZA and ZB, respectively (left-aligned in the atom    column), have corresponding entries in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;vdw.siz&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Modifying the Delphi spheres==&lt;br /&gt;
   &lt;br /&gt;
*load &amp;lt;tt&amp;gt;match1.sph.pdb&amp;lt;/tt&amp;gt; (i.e., the DelPhi spheres).  &lt;br /&gt;
*delete the spheres that are too close to the solvent.  &lt;br /&gt;
*(AH:) add spheres so that there is one sphere &#039;&#039;between&#039;&#039; the metals, several spheres &#039;&#039;around&#039;&#039; the metals and some spheres close to polar residues.  &lt;br /&gt;
*a good number for DelPhi spheres is 120.  &lt;br /&gt;
*append the spheres to the end of &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to make &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and put a TER statement after each sphere. Don&#039;t use tabs for whitespace, can cause problems with DelPhi!  &lt;br /&gt;
&lt;br /&gt;
==Modifying the Matching spheres==&lt;br /&gt;
&lt;br /&gt;
*load &amp;lt;tt&amp;gt;match2.sph.pdb&amp;lt;/tt&amp;gt; for sparse initial spheres or &amp;lt;tt&amp;gt;match3.sph.pdb&amp;lt;/tt&amp;gt; denser spheres.&lt;br /&gt;
*If you selected &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; be careful not to move any spheres based on the ligand atoms.  &lt;br /&gt;
*(AH:) put at least one sphere between the metals and increase the sampling in the region around the metal ions by putting some spheres there.&lt;br /&gt;
*a good number for matching spheres is 50-60.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;pdbtosph matchN.sph.pdb mysph.sph&amp;lt;/tt&amp;gt; to generate the files that will be read by [[DOCK]].   &lt;br /&gt;
*if color matching is desired, run &amp;lt;tt&amp;gt;colorspheres.pl sph/match2.sph&amp;lt;/tt&amp;gt; in the parent directory of the docking run (i.e., &amp;lt;tt&amp;gt;..&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;sph&amp;lt;/tt&amp;gt; ) to put some color on your spheres.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;cat $mud/header.sph match2.sph&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[http://bcr.musc.edu/manuals/delphi.htm DelPhi]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
&lt;br /&gt;
*if you changed rec+sph.crg above, you need to run Delphi   &lt;br /&gt;
*if necessary, modify &amp;lt;tt&amp;gt;delphi.com&amp;lt;/tt&amp;gt; so that all the paths and file names are appropriate.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;delphi.com &amp;gt; delphi.log&amp;lt;/tt&amp;gt;  and check the output.&lt;br /&gt;
*any &#039;WARNING&#039; in the log is an indication that some atomic charges might not be correct.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[solvmap]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*if you changed rec.crg or the box above, you need to run solvmap   &lt;br /&gt;
*check that all atoms are present in &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; and run &amp;lt;tt&amp;gt;solvmap&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4120</id>
		<title>Preparing the protein</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4120"/>
		<updated>2011-06-08T06:02:26Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Modifying the spheres */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Preparing the protein=&lt;br /&gt;
&lt;br /&gt;
Items which are prefixed with &#039;AH&#039; are relevant for docking [[HEI]]s to amidohydrolases and can safely be ignored for most metal-free proteins. &lt;br /&gt;
&lt;br /&gt;
==Modifying the PDB file==&lt;br /&gt;
 &lt;br /&gt;
*prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt;  by removing all lines that do not commence with &#039;ATOM&#039;, the chain column, all columns to the right of the z-coordinate and the TER statements.  &lt;br /&gt;
*treat all selenomethionines (MSE) as methionines (MET) by replacing the selenium atom (SE&amp;amp;curren;) with sulphur    (&amp;amp;curren;SD). Be careful about the correct alignment!  &lt;br /&gt;
*atom enumeration does not matter, so don&#039;t bother to renumber    after any of the following steps. Unique numbers are a good idea,    presumably.  &lt;br /&gt;
*select the protonation states of HIS residues to be either &amp;amp;delta;- (rename residue to HID), &amp;amp;epsilon;- (rename residue to HIE) or doubly protonated (rename residue to HIP). HIS on the surface should be HIP. HIS residues coordinating    the metal ions should have their protons pointing away from the    ions. Base your decision on the immediate environment of the HIS residue: are there potential hydrogen bonds that can be formed?; are there charged residues close by?; would a certain protonation lead to clashes with other residues?; etc.&lt;br /&gt;
*AH: the carboxylated LYS of subtype I is CYK, but this is not    tolerated by &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt; , so store and delete the 3    surplus atoms and call the residue LYS.  &lt;br /&gt;
*AH: the more buried metal ion is ZB (charge 1.4), the other one ZA    (charge 1.3). Atom names are right-aligned!&lt;br /&gt;
&lt;br /&gt;
==Running startdockblaster5==&lt;br /&gt;
   &lt;br /&gt;
*generate the file &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; , which should only contain    atoms of the MMFF. Prepare it in the same way as above: remove the    chain column, all columns to the right of the z-coordinate and the    TER statements.  &lt;br /&gt;
*generate the files &amp;lt;tt&amp;gt;.only_spheres&amp;lt;/tt&amp;gt;  and &amp;amp;ndash; in case you    would like the matching spheres to be based on the heavy atoms in &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; &amp;amp;ndash; &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; and write `on&#039; to the    latter. Be careful not to add blank lines at the end, this will not    be understood by &amp;lt;tt&amp;gt;makespheres2.pl&amp;lt;/tt&amp;gt; . In any case, the entry in    &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt;  will be ignored by &amp;lt;tt&amp;gt;makespheres1.pl&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*on sgehead (or, as of [[dock67]], on any machine), run &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  to set up the data structure and copy all relevant files. It is a good idea to use csh and to &amp;lt;tt&amp;gt;source .login&amp;lt;/tt&amp;gt;  beforehand.  &lt;br /&gt;
*if &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  doesn&#039;t finish for any obvious reason and with no clear error message, or &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; has very funny hydrogen placements, make sure that you have no non-printing characters in &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt;. Do that by running your file through &amp;lt;tt&amp;gt;pc2unix rec.pdb&amp;lt;/tt&amp;gt;. Check that your file is clean by looking at it with &amp;lt;tt&amp;gt;od -c rec.pdb | less &amp;lt;/tt&amp;gt;. The only character with a backslash should be \n &amp;amp;mdash; you should see no \t, \r, etc. If this doesn&#039;t solve the problem, your best bet is to    reprepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; from scratch &amp;amp;mdash; it is    likely that there are some blanks or hidden characters that are    causing the problems.  &lt;br /&gt;
*check the files &amp;lt;tt&amp;gt;stdout&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;stderr&amp;lt;/tt&amp;gt;  after the    run for potential mistakes and error messages. Furthermore, verify    that &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;  still contains &#039;&#039;all&#039;&#039; atoms.  &lt;br /&gt;
*if you do not want to do anything special with the protein, like    tarting some residues or modifying the spheres, go directly to chapter [[Running DOCK|3]].&lt;br /&gt;
&lt;br /&gt;
==Removing and modifying files==&lt;br /&gt;
   &lt;br /&gt;
*go to &amp;lt;tt&amp;gt;./grids&amp;lt;/tt&amp;gt;  and remove the surplus files from this    directory (some would cause error messages from the subsequent    programs):&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;rm -f PDBPARM chem.* distmap.box  distmap    distmap.log rec+sph.phi solvmap tart.txt OUT*&amp;lt;/tt&amp;gt;&lt;br /&gt;
*modify &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;:      &lt;br /&gt;
**AH: CYK: put the three missing atoms, delete the surplus hydrogens specific for LYS and rename the carboxylated lysine residue CYK.  &lt;br /&gt;
**remove all TER statements that might have been added.  &lt;br /&gt;
**AH: set the atom names of the metal ions to ZA and ZB and the    residue name to ZN.  &lt;br /&gt;
**take care of disulfide bonds. Remove the thiol hydrogens (if    they have been added) and change the residue name from CYS to CYX.  &lt;br /&gt;
*look at the &amp;lt;tt&amp;gt;box&amp;lt;/tt&amp;gt;  and maybe move it, so that the ligands    won&#039;t stick out. Modify the &#039;center&#039; and &#039;coordinates&#039; statement in    the preamble.  &lt;br /&gt;
*all residues and atoms have to be listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;vdw.parms.amb.mindock&amp;lt;/tt&amp;gt;, respectively &amp;amp;rArr; do not tart any residues in this file!    &lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[chemgrid]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt; and check &amp;lt;tt&amp;gt;OUTPARM&amp;lt;/tt&amp;gt; for the correct charges of all residues.  &lt;br /&gt;
*grep for &amp;lt;tt&amp;gt;0.000&amp;lt;/tt&amp;gt;  in &amp;lt;tt&amp;gt;PDBPARM&amp;lt;/tt&amp;gt;: if any atom has this    value in the 3&amp;lt;sup&amp;gt;rd&amp;lt;/sup&amp;gt; and 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt; column, it has not been    recognized by &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  (because it is not listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt;) and is thus &#039;&#039;ignored&#039;&#039; in the van der    Waals-maps. There will be no other errors, the docking will finish    showing some &amp;quot;bumping&amp;quot; ligands which have extremely favorable    energies (&amp;amp;le; -200).&lt;br /&gt;
*Another sign of a problem with atomic radii are any &#039;WARNING&#039; issued in OUTPARM&lt;br /&gt;
*if one has to run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  again, remove &amp;lt;tt&amp;gt;PDBPARM OUTPARM OUTCHEM&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;chem.*&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
 &lt;br /&gt;
* the default is to run &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt; on &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;. If you modified this file, rerun by simply typing &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* AH: cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec-dist.crg&amp;lt;/tt&amp;gt;  and remove the Zn atoms in the latter file (otherwise there will be lots of bumping ligands). Edit &amp;lt;tt&amp;gt;INDIST&amp;lt;/tt&amp;gt; to update the filename.&lt;br /&gt;
*run &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tarting the protein==&lt;br /&gt;
   &lt;br /&gt;
*cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt;  and continue with the latter file.&lt;br /&gt;
* tarted residues can be found in &amp;lt;tt&amp;gt;$DOCK_BASE/scripts/grids&amp;lt;/tt&amp;gt;, they are the files with the extension &amp;lt;tt&amp;gt;prot2&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* take care that the format of the &amp;lt;tt&amp;gt;.prot2&amp;lt;/tt&amp;gt; file is consistent with the format in the &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; file, e.g., that there is no leading space before an atom name etc.&lt;br /&gt;
*AH: select the appropriate version of &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt;  depending on    the subtype. Files are called &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt; , where &amp;lt;tt&amp;gt;N&amp;lt;/tt&amp;gt;  can    be &amp;lt;tt&amp;gt;I, III&amp;lt;/tt&amp;gt;  or &amp;lt;tt&amp;gt;VI&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*AH: edit the residues in the binding site (i.e., all residues    complexing the metal ions in the binding site), so that their    names conform to the names of the modified residues in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  &lt;br /&gt;
*tart the residues that are in contact with a crystallographic    ligand, if any.  &lt;br /&gt;
*AH: check that ZA and ZB, respectively (left-aligned in the atom    column), have corresponding entries in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;vdw.siz&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Modifying the Delphi spheres==&lt;br /&gt;
   &lt;br /&gt;
*load &amp;lt;tt&amp;gt;match1.sph.pdb&amp;lt;/tt&amp;gt; (i.e., the DelPhi spheres).  &lt;br /&gt;
*delete the spheres that are too close to the solvent.  &lt;br /&gt;
*(AH:) add spheres so that there is one sphere &#039;&#039;between&#039;&#039; the metals, several spheres &#039;&#039;around&#039;&#039; the metals and some spheres close to polar residues.  &lt;br /&gt;
*a good number for DelPhi spheres is 120.  &lt;br /&gt;
*append the spheres to the end of &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to make &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt; and put a TER statement after each sphere. Don&#039;t use tabs for whitespace, can cause problems with DelPhi!  &lt;br /&gt;
&lt;br /&gt;
==Modifying the Matching spheres==&lt;br /&gt;
&lt;br /&gt;
*load &amp;lt;tt&amp;gt;match2.sph.pdb&amp;lt;/tt&amp;gt; for sparse initial spheres or &amp;lt;tt&amp;gt;match3.sph.pdb&amp;lt;/tt&amp;gt; denser spheres.&lt;br /&gt;
*If you selected &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; be careful not to move any spheres based on the ligand atoms.  &lt;br /&gt;
*(AH:) put at least one sphere between the metals and increase the sampling in the region around the metal ions by putting some spheres there.&lt;br /&gt;
*a good number for matching spheres is 50-60.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;pdbtosph matchN.sph.pdb mysph.sph&amp;lt;/tt&amp;gt; to generate the files that will be read by [[DOCK]].   &lt;br /&gt;
*if color matching is desired, run &amp;lt;tt&amp;gt;colorspheres.pl sph/match2.sph&amp;lt;/tt&amp;gt; in the parent directory of the docking run (i.e., &amp;lt;tt&amp;gt;..&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;sph&amp;lt;/tt&amp;gt; ) to put some color on your spheres.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;cat $mud/header.sph match2.sph&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[http://bcr.musc.edu/manuals/delphi.htm DelPhi]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*if necessary, modify &amp;lt;tt&amp;gt;delphi.com&amp;lt;/tt&amp;gt;  so that all the paths and file names are    appropriate.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;delphi.com &amp;gt; delphi.log&amp;lt;/tt&amp;gt;  and check the output.&lt;br /&gt;
*any &#039;WARNING&#039; in the log is an indication that some atomic charges might not be correct.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[solvmap]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*check that all atoms are present in &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; and run &amp;lt;tt&amp;gt;solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*after the run, make sure that the file &amp;lt;tt&amp;gt;solvmap&amp;lt;/tt&amp;gt; contains &#039;&#039;&#039;no&#039;&#039;&#039; blank lines.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=103</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=103"/>
		<updated>2011-05-28T03:47:18Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Atomic contributions to the desolvation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
&lt;br /&gt;
See [[MUD - Michael&#039;s Utilities for Docking]] for a lot of tools to help with analyzing DOCK runs.&lt;br /&gt;
&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
=====For the xtal-lig.mol2 in its crystallographic pose=====&lt;br /&gt;
&lt;br /&gt;
New way that outputs your.eel1 starting from your.pdb directly&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/to_eel1.csh your.pdb&amp;lt;/tt&amp;gt;&#039;. &lt;br /&gt;
&lt;br /&gt;
If that fails, use the old way to convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
=====For molecules that have already been docked=====&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Overall molecular score compiled from all scoreopt_so options===&lt;br /&gt;
&lt;br /&gt;
For default grids&lt;br /&gt;
*run &amp;lt;tt&amp;gt;&#039;$mud/doscoreopt.csh your.eel1 ../path/to/grids&#039;&amp;lt;/tt&amp;gt;&lt;br /&gt;
Or for custom grids, used below to run SEV-based desolvation grids&lt;br /&gt;
*run &amp;lt;tt&amp;gt;&#039;$mud/doscoreopt.csh your.eel1 ../path/to/grids rec+sph.phi chem solvmap_sev&#039;&amp;lt;/tt&amp;gt; &lt;br /&gt;
The summary for the whole molecule is output to your.eel1.scores in combine.scores format &lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the coulombic energy===&lt;br /&gt;
&lt;br /&gt;
In your.eel1.delphi from the wrapper &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.delphi&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the van der Waals energy===&lt;br /&gt;
&lt;br /&gt;
In your.eel1.vdw from the wrapper  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
In your.eel1.solv from the wrapper  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;grids/solvmap_sev&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=102</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=102"/>
		<updated>2011-05-28T03:46:45Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Atomic contributions to the coulombic energy */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
&lt;br /&gt;
See [[MUD - Michael&#039;s Utilities for Docking]] for a lot of tools to help with analyzing DOCK runs.&lt;br /&gt;
&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
=====For the xtal-lig.mol2 in its crystallographic pose=====&lt;br /&gt;
&lt;br /&gt;
New way that outputs your.eel1 starting from your.pdb directly&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/to_eel1.csh your.pdb&amp;lt;/tt&amp;gt;&#039;. &lt;br /&gt;
&lt;br /&gt;
If that fails, use the old way to convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
=====For molecules that have already been docked=====&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Overall molecular score compiled from all scoreopt_so options===&lt;br /&gt;
&lt;br /&gt;
For default grids&lt;br /&gt;
*run &amp;lt;tt&amp;gt;&#039;$mud/doscoreopt.csh your.eel1 ../path/to/grids&#039;&amp;lt;/tt&amp;gt;&lt;br /&gt;
Or for custom grids, used below to run SEV-based desolvation grids&lt;br /&gt;
*run &amp;lt;tt&amp;gt;&#039;$mud/doscoreopt.csh your.eel1 ../path/to/grids rec+sph.phi chem solvmap_sev&#039;&amp;lt;/tt&amp;gt; &lt;br /&gt;
The summary for the whole molecule is output to your.eel1.scores in combine.scores format &lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the coulombic energy===&lt;br /&gt;
&lt;br /&gt;
In your.eel1.delphi from the wrapper &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.delphi&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the van der Waals energy===&lt;br /&gt;
&lt;br /&gt;
In your.eel1.vdw from the wrapper  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
In your.eel1.solv from the wrapper  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=101</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=101"/>
		<updated>2011-05-28T03:46:00Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: Update to the modern way to scoreopt&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
&lt;br /&gt;
See [[MUD - Michael&#039;s Utilities for Docking]] for a lot of tools to help with analyzing DOCK runs.&lt;br /&gt;
&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
=====For the xtal-lig.mol2 in its crystallographic pose=====&lt;br /&gt;
&lt;br /&gt;
New way that outputs your.eel1 starting from your.pdb directly&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/to_eel1.csh your.pdb&amp;lt;/tt&amp;gt;&#039;. &lt;br /&gt;
&lt;br /&gt;
If that fails, use the old way to convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
=====For molecules that have already been docked=====&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Overall molecular score compiled from all scoreopt_so options===&lt;br /&gt;
&lt;br /&gt;
For default grids&lt;br /&gt;
*run &amp;lt;tt&amp;gt;&#039;$mud/doscoreopt.csh your.eel1 ../path/to/grids&#039;&amp;lt;/tt&amp;gt;&lt;br /&gt;
Or for custom grids, used below to run SEV-based desolvation grids&lt;br /&gt;
*run &amp;lt;tt&amp;gt;&#039;$mud/doscoreopt.csh your.eel1 ../path/to/grids rec+sph.phi chem solvmap_sev&#039;&amp;lt;/tt&amp;gt; &lt;br /&gt;
The summary for the whole molecule is output to your.eel1.scores in combine.scores format &lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the coulombic energy===&lt;br /&gt;
&lt;br /&gt;
In your.eel1.delphi from the wrapper &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the van der Waals energy===&lt;br /&gt;
&lt;br /&gt;
In your.eel1.vdw from the wrapper  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Atomic contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
In your.eel1.solv from the wrapper  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
Or to generate this data yourself&lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=320</id>
		<title>Chembl2pdb</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=320"/>
		<updated>2011-03-18T21:43:09Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* GENERATION PROCEDURE */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== CURRENT DATA ==&lt;br /&gt;
&lt;br /&gt;
__ Updated 02/24/2011 __&lt;br /&gt;
&lt;br /&gt;
The current data relating the ChEMBL09 protein targets to structures on the PDB can be found at:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;/raid3/people/mysinger/pxc/pdb_to_chembl/chembl09&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
There are 3 subfolders:&lt;br /&gt;
        &lt;br /&gt;
  - &#039;&#039;&#039;uniprot&#039;&#039;&#039;: categorized by target uniprot id&lt;br /&gt;
         &lt;br /&gt;
  - &#039;&#039;&#039;pdb_ligand&#039;&#039;&#039;: all pdb codes that have a bound ligand (as defined by be_blasti.csh script from DOCKBlaster)&lt;br /&gt;
                    with the corresponding activity data from ChEMBL (actives.smi)&lt;br /&gt;
          &lt;br /&gt;
  - &#039;&#039;&#039;pdb_other&#039;&#039;&#039;: all pdb codes that do NOT have a bound crystal ligand (as defined by be_blasti.csh script from DOCKBlaster) &lt;br /&gt;
                      with the corresponding actives from chEMBL(actives.smi)&lt;br /&gt;
&lt;br /&gt;
In order to get some statistics: how many pdb codes, how many targets have ChEMBL ligands, you can simply count the number of subfolders in each &amp;quot;byXXX&amp;quot; folder.&lt;br /&gt;
 &lt;br /&gt;
  eg: How many UniProt targets have ChEMBL ligands?&lt;br /&gt;
        % cd uniprot&lt;br /&gt;
        % wc -l uniprot&lt;br /&gt;
         &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives and a bound crystal ligand?&lt;br /&gt;
        % cd bypdb_ligand/&lt;br /&gt;
        % ls -d ????| wc -l&lt;br /&gt;
  &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives BUT WITHOUT a bound crystal ligand?&lt;br /&gt;
        % cd pdb_other/&lt;br /&gt;
        % ls -d ???? | wc -l&lt;br /&gt;
&lt;br /&gt;
== GENERATION PROCEDURE ==&lt;br /&gt;
&lt;br /&gt;
In future, if you want to generate the data again, you need to do the following:&lt;br /&gt;
   &lt;br /&gt;
*Step I: Load new ChEMBL SQL database into zincdb1 ( do this only if there is a new ChEMBL release)   &lt;br /&gt;
*Step II.: Make a new directory, run the script pointing to the new sql database name, and wait a day or two for it to finish&lt;br /&gt;
          mkdir chembl10&lt;br /&gt;
          cd chembl10&lt;br /&gt;
          /raid3/people/mysinger/pxc/pdb_to_chembl/generate_chembl_map.csh chembl10&lt;br /&gt;
&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=319</id>
		<title>Chembl2pdb</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=319"/>
		<updated>2011-03-18T21:42:29Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* GENERATION PROCEDURE */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== CURRENT DATA ==&lt;br /&gt;
&lt;br /&gt;
__ Updated 02/24/2011 __&lt;br /&gt;
&lt;br /&gt;
The current data relating the ChEMBL09 protein targets to structures on the PDB can be found at:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;/raid3/people/mysinger/pxc/pdb_to_chembl/chembl09&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
There are 3 subfolders:&lt;br /&gt;
        &lt;br /&gt;
  - &#039;&#039;&#039;uniprot&#039;&#039;&#039;: categorized by target uniprot id&lt;br /&gt;
         &lt;br /&gt;
  - &#039;&#039;&#039;pdb_ligand&#039;&#039;&#039;: all pdb codes that have a bound ligand (as defined by be_blasti.csh script from DOCKBlaster)&lt;br /&gt;
                    with the corresponding activity data from ChEMBL (actives.smi)&lt;br /&gt;
          &lt;br /&gt;
  - &#039;&#039;&#039;pdb_other&#039;&#039;&#039;: all pdb codes that do NOT have a bound crystal ligand (as defined by be_blasti.csh script from DOCKBlaster) &lt;br /&gt;
                      with the corresponding actives from chEMBL(actives.smi)&lt;br /&gt;
&lt;br /&gt;
In order to get some statistics: how many pdb codes, how many targets have ChEMBL ligands, you can simply count the number of subfolders in each &amp;quot;byXXX&amp;quot; folder.&lt;br /&gt;
 &lt;br /&gt;
  eg: How many UniProt targets have ChEMBL ligands?&lt;br /&gt;
        % cd uniprot&lt;br /&gt;
        % wc -l uniprot&lt;br /&gt;
         &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives and a bound crystal ligand?&lt;br /&gt;
        % cd bypdb_ligand/&lt;br /&gt;
        % ls -d ????| wc -l&lt;br /&gt;
  &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives BUT WITHOUT a bound crystal ligand?&lt;br /&gt;
        % cd pdb_other/&lt;br /&gt;
        % ls -d ???? | wc -l&lt;br /&gt;
&lt;br /&gt;
== GENERATION PROCEDURE ==&lt;br /&gt;
&lt;br /&gt;
In future, if you want to generate the data again, you need to do the following:&lt;br /&gt;
   &lt;br /&gt;
*Step I: Load new ChEMBL SQL database into zincdb1 ( do this only if there is a new ChEMBL release)   &lt;br /&gt;
*Step II.: Make a new directory, run the script, and wait a day or two for it to finish&lt;br /&gt;
          mkdir chembl10&lt;br /&gt;
          cd chembl10&lt;br /&gt;
          /raid3/people/mysinger/pxc/pdb_to_chembl/generate_chembl_map.csh chembl10&lt;br /&gt;
&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=318</id>
		<title>Chembl2pdb</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=318"/>
		<updated>2011-03-18T21:41:50Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* GENERATION PROCEDURE */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== CURRENT DATA ==&lt;br /&gt;
&lt;br /&gt;
__ Updated 02/24/2011 __&lt;br /&gt;
&lt;br /&gt;
The current data relating the ChEMBL09 protein targets to structures on the PDB can be found at:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;/raid3/people/mysinger/pxc/pdb_to_chembl/chembl09&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
There are 3 subfolders:&lt;br /&gt;
        &lt;br /&gt;
  - &#039;&#039;&#039;uniprot&#039;&#039;&#039;: categorized by target uniprot id&lt;br /&gt;
         &lt;br /&gt;
  - &#039;&#039;&#039;pdb_ligand&#039;&#039;&#039;: all pdb codes that have a bound ligand (as defined by be_blasti.csh script from DOCKBlaster)&lt;br /&gt;
                    with the corresponding activity data from ChEMBL (actives.smi)&lt;br /&gt;
          &lt;br /&gt;
  - &#039;&#039;&#039;pdb_other&#039;&#039;&#039;: all pdb codes that do NOT have a bound crystal ligand (as defined by be_blasti.csh script from DOCKBlaster) &lt;br /&gt;
                      with the corresponding actives from chEMBL(actives.smi)&lt;br /&gt;
&lt;br /&gt;
In order to get some statistics: how many pdb codes, how many targets have ChEMBL ligands, you can simply count the number of subfolders in each &amp;quot;byXXX&amp;quot; folder.&lt;br /&gt;
 &lt;br /&gt;
  eg: How many UniProt targets have ChEMBL ligands?&lt;br /&gt;
        % cd uniprot&lt;br /&gt;
        % wc -l uniprot&lt;br /&gt;
         &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives and a bound crystal ligand?&lt;br /&gt;
        % cd bypdb_ligand/&lt;br /&gt;
        % ls -d ????| wc -l&lt;br /&gt;
  &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives BUT WITHOUT a bound crystal ligand?&lt;br /&gt;
        % cd pdb_other/&lt;br /&gt;
        % ls -d ???? | wc -l&lt;br /&gt;
&lt;br /&gt;
== GENERATION PROCEDURE ==&lt;br /&gt;
&lt;br /&gt;
In future, if you want to generate the data again, you need to do the following:&lt;br /&gt;
   &lt;br /&gt;
*Step I: Load new ChEMBL SQL database into zincdb1 ( do this only if there is a new ChEMBL release)   &lt;br /&gt;
*Step II.: Make a new directory, run the script, and wait a day or two for it to finish&lt;br /&gt;
          ```mkdir chembl10```&lt;br /&gt;
          ```cd chembl10```&lt;br /&gt;
          ```/raid3/people/mysinger/pxc/pdb_to_chembl/generate_chembl_map.csh chembl10```&lt;br /&gt;
&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=317</id>
		<title>Chembl2pdb</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=317"/>
		<updated>2011-03-18T21:41:30Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* GENERATION PROCEDURE */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== CURRENT DATA ==&lt;br /&gt;
&lt;br /&gt;
__ Updated 02/24/2011 __&lt;br /&gt;
&lt;br /&gt;
The current data relating the ChEMBL09 protein targets to structures on the PDB can be found at:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;/raid3/people/mysinger/pxc/pdb_to_chembl/chembl09&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
There are 3 subfolders:&lt;br /&gt;
        &lt;br /&gt;
  - &#039;&#039;&#039;uniprot&#039;&#039;&#039;: categorized by target uniprot id&lt;br /&gt;
         &lt;br /&gt;
  - &#039;&#039;&#039;pdb_ligand&#039;&#039;&#039;: all pdb codes that have a bound ligand (as defined by be_blasti.csh script from DOCKBlaster)&lt;br /&gt;
                    with the corresponding activity data from ChEMBL (actives.smi)&lt;br /&gt;
          &lt;br /&gt;
  - &#039;&#039;&#039;pdb_other&#039;&#039;&#039;: all pdb codes that do NOT have a bound crystal ligand (as defined by be_blasti.csh script from DOCKBlaster) &lt;br /&gt;
                      with the corresponding actives from chEMBL(actives.smi)&lt;br /&gt;
&lt;br /&gt;
In order to get some statistics: how many pdb codes, how many targets have ChEMBL ligands, you can simply count the number of subfolders in each &amp;quot;byXXX&amp;quot; folder.&lt;br /&gt;
 &lt;br /&gt;
  eg: How many UniProt targets have ChEMBL ligands?&lt;br /&gt;
        % cd uniprot&lt;br /&gt;
        % wc -l uniprot&lt;br /&gt;
         &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives and a bound crystal ligand?&lt;br /&gt;
        % cd bypdb_ligand/&lt;br /&gt;
        % ls -d ????| wc -l&lt;br /&gt;
  &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives BUT WITHOUT a bound crystal ligand?&lt;br /&gt;
        % cd pdb_other/&lt;br /&gt;
        % ls -d ???? | wc -l&lt;br /&gt;
&lt;br /&gt;
== GENERATION PROCEDURE ==&lt;br /&gt;
&lt;br /&gt;
In future, if you want to generate the data again, you need to do the following:&lt;br /&gt;
   &lt;br /&gt;
 Step I: Load new ChEMBL SQL database into zincdb1 ( do this only if there is a new ChEMBL release)   &lt;br /&gt;
 Step II.: Make a new directory, run the script, and wait a day or two for it to finish&lt;br /&gt;
          ```mkdir chembl10```&lt;br /&gt;
          ```cd chembl10```&lt;br /&gt;
          ```/raid3/people/mysinger/pxc/pdb_to_chembl/generate_chembl_map.csh chembl10```&lt;br /&gt;
&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=316</id>
		<title>Chembl2pdb</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Chembl2pdb&amp;diff=316"/>
		<updated>2011-03-18T21:41:04Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: New simplified generation procedure&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== CURRENT DATA ==&lt;br /&gt;
&lt;br /&gt;
__ Updated 02/24/2011 __&lt;br /&gt;
&lt;br /&gt;
The current data relating the ChEMBL09 protein targets to structures on the PDB can be found at:&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;/raid3/people/mysinger/pxc/pdb_to_chembl/chembl09&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
There are 3 subfolders:&lt;br /&gt;
        &lt;br /&gt;
  - &#039;&#039;&#039;uniprot&#039;&#039;&#039;: categorized by target uniprot id&lt;br /&gt;
         &lt;br /&gt;
  - &#039;&#039;&#039;pdb_ligand&#039;&#039;&#039;: all pdb codes that have a bound ligand (as defined by be_blasti.csh script from DOCKBlaster)&lt;br /&gt;
                    with the corresponding activity data from ChEMBL (actives.smi)&lt;br /&gt;
          &lt;br /&gt;
  - &#039;&#039;&#039;pdb_other&#039;&#039;&#039;: all pdb codes that do NOT have a bound crystal ligand (as defined by be_blasti.csh script from DOCKBlaster) &lt;br /&gt;
                      with the corresponding actives from chEMBL(actives.smi)&lt;br /&gt;
&lt;br /&gt;
In order to get some statistics: how many pdb codes, how many targets have ChEMBL ligands, you can simply count the number of subfolders in each &amp;quot;byXXX&amp;quot; folder.&lt;br /&gt;
 &lt;br /&gt;
  eg: How many UniProt targets have ChEMBL ligands?&lt;br /&gt;
        % cd uniprot&lt;br /&gt;
        % wc -l uniprot&lt;br /&gt;
         &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives and a bound crystal ligand?&lt;br /&gt;
        % cd bypdb_ligand/&lt;br /&gt;
        % ls -d ????| wc -l&lt;br /&gt;
  &lt;br /&gt;
  eg: How many pdb structures have ChEMBL actives BUT WITHOUT a bound crystal ligand?&lt;br /&gt;
        % cd pdb_other/&lt;br /&gt;
        % ls -d ???? | wc -l&lt;br /&gt;
&lt;br /&gt;
== GENERATION PROCEDURE ==&lt;br /&gt;
&lt;br /&gt;
In future, if you want to generate the data again, you need to do the following:&lt;br /&gt;
   &lt;br /&gt;
Step I: Load new ChEMBL SQL database into zincdb1 ( do this only if there is a new ChEMBL release)   &lt;br /&gt;
Step II.: Make a new directory, run the script, and wait a day or two for it to finish&lt;br /&gt;
          ```mkdir chembl10```&lt;br /&gt;
          ```cd chembl10```&lt;br /&gt;
          ```/raid3/people/mysinger/pxc/pdb_to_chembl/generate_chembl_map.csh chembl10```&lt;br /&gt;
&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=MUD_-_Michael%27s_Utilities_for_Docking&amp;diff=3545</id>
		<title>MUD - Michael&#039;s Utilities for Docking</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=MUD_-_Michael%27s_Utilities_for_Docking&amp;diff=3545"/>
		<updated>2010-01-13T00:39:31Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Computing Enrichments */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==What&#039;s in MUD?==&lt;br /&gt;
&lt;br /&gt;
*Tools to start, check, and restart dock jobs&lt;br /&gt;
*Tools to combine, enrich, plot, and view docking results&lt;br /&gt;
&lt;br /&gt;
==Setting up MUD==&lt;br /&gt;
&lt;br /&gt;
*For convenience, point a shell variable to the base mud directory to save typing&lt;br /&gt;
 set mud=~mysinger/code/mud/trunk&lt;br /&gt;
*If you use MUD a lot, you can add this to your ~/.login&lt;br /&gt;
*Then simply run commands like this:&lt;br /&gt;
 $mud/submit.csh&lt;br /&gt;
 $mud/check.py -h&lt;br /&gt;
*Use -h or --help to get full help information for the .py (python) scripts&lt;br /&gt;
*The .csh scripts will automatically print usage information if mis-used&lt;br /&gt;
*The scripts automatically use their invocation path to find other scripts and libraries they depend on.&lt;br /&gt;
&lt;br /&gt;
==Job Control==&lt;br /&gt;
   &lt;br /&gt;
===Main Workflow===&lt;br /&gt;
&lt;br /&gt;
For a quick summary of what to do first see [[SGE_Cluster_Docking]]. For a detailed look at how to get the details right see [[How to run and analyze a DOCK run by hand]].&lt;br /&gt;
&lt;br /&gt;
*Submit a parallel job to the cluser&lt;br /&gt;
 $mud/submit.csh&lt;br /&gt;
Uses &#039;dirlist&#039; to determine which directories to run. Similar to startdockbksX, but also indicates job submission by touching a submitted file in each directory.&lt;br /&gt;
*Check parallel job status&lt;br /&gt;
 $mud/check.py&lt;br /&gt;
Indicates the status of unfinished (or unsubmitted) jobs. Note that it simply returns nothing if everything is finished.&lt;br /&gt;
*Restart all failed subjobs&lt;br /&gt;
 $mud/restart.py&lt;br /&gt;
This works even if some subjobs are still running. Occasionally, however, jobs can fail with no detectable remnants. To force those jobs to restart you can use the -f option, but beware that this will also restart all subjobs that are still running.&lt;br /&gt;
&lt;br /&gt;
===Specialized Commands===&lt;br /&gt;
*Submit job to the local machine&lt;br /&gt;
 $mud/sublocal.csh&lt;br /&gt;
*Submit a single directory to the cluster&lt;br /&gt;
 qsub $mud/runsge.csh&lt;br /&gt;
*Submit a single directory to the local machine&lt;br /&gt;
 $mud/runsubdir.csh&lt;br /&gt;
*Remove docking output leaving only input - will DELETE even completed jobs&lt;br /&gt;
 $mud/clean.py&lt;br /&gt;
*Restart single directory&lt;br /&gt;
 $mud/restartdir.py&lt;br /&gt;
&lt;br /&gt;
==Job Analysis==&lt;br /&gt;
&lt;br /&gt;
*Enrichment plots are sensitive to consistent treatment and proper accounting for all docked molecules. The combine script properly accounts for all docked molecules by detecting bumped out, no matched, and timed out molecules. &lt;br /&gt;
&lt;br /&gt;
To achieve consistency, you have two options:&lt;br /&gt;
1. Write coordinates for all molecules (what I use)&lt;br /&gt;
In INDOCK, set number_save to 50000 or something high enough to capture all dockable hierarchies. DOCK output is now gzipped so this is cheaper in disk space than it used to be.&lt;br /&gt;
2. Do not check for broken molecules&lt;br /&gt;
Use the -b option when running combine.py&lt;br /&gt;
&lt;br /&gt;
===Combining Parallel Jobs===&lt;br /&gt;
*Merge all parallel jobs into a single set of unique scores.&lt;br /&gt;
 $mud/combine.py&lt;br /&gt;
This combine carefully accounts for all docked molecules, for more informative enrichment plots.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -b or --broken to skip finding broken molecules. Use -d or --done to indicate that all subjobs are complete, for the case where you did not submit with a MUD submission script. Use -p or --prefix if your output files are named something other than test. Use --box if your  box file is not at ../../grids/box relative to your subjob directories.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#combine.scores - fully processed scores, using the best one for each id&lt;br /&gt;
#combine.raw - contains all scores as scrapped from DOCK output&lt;br /&gt;
#combine.broken - broken molecules and the reason they failed&lt;br /&gt;
#combine.zeroes - important sanity check&lt;br /&gt;
&lt;br /&gt;
format of combine.scores:&lt;br /&gt;
 &amp;lt;id&amp;gt; &amp;lt;shape&amp;gt; &amp;lt;elect&amp;gt; &amp;lt;VdW&amp;gt; &amp;lt;polar solv&amp;gt; &amp;lt;apolar solv&amp;gt; &amp;lt;total&amp;gt; &amp;lt;subdir&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The .zeroes file is a sanity check because it lists the number of molecules followed by the number of zeroes in each scoring column. Past experience has shown that when DOCK fails randomly and silently, it often generates a large  number of zero scores. If this happens, simply re-running the job will give better results. &lt;br /&gt;
&lt;br /&gt;
===Computing Enrichments===&lt;br /&gt;
*Compute enrichment starting from the combined scores.&lt;br /&gt;
 $mud/enrich.py -s -l LIGAND_FILE&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/enrich.py -l LIGAND_FILE -d DECOY_FILE&lt;br /&gt;
Generates both enrichment and roc curves, both for the ligands against all molecules and for the ligands versus just the decoys. It will try to run combine if it has not been run yet, but will do so only with defaults for every option.&lt;br /&gt;
&lt;br /&gt;
*Input:&lt;br /&gt;
Use -l to specify the ligand identifier file and -d to specify the decoy identifier file.&lt;br /&gt;
&lt;br /&gt;
The identifier files simply contain an id for each known ligand that matched the one in the docking databases. The script is smart enough to match &amp;quot;ZINC12345678&amp;quot; to &amp;quot;C12345678&amp;quot;, so either form is acceptable.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip consideration of decoys and thus generation of _own curves. Use -f to force combine to run again.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;span id=&amp;quot;Enrich_Types&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
*Creates:&lt;br /&gt;
#enrich.txt - Enrichment curve for ligands versus all molecules&lt;br /&gt;
#roc.txt - ROC curve for ligands versus all molecules&lt;br /&gt;
#enrich_own.txt - Enrichment curve for ligands versus only the decoys&lt;br /&gt;
#roc_own.txt - ROC curve for ligands versus only the decoys&lt;br /&gt;
_own files are not generate is the -s option is used.&lt;br /&gt;
&lt;br /&gt;
format for output files:&lt;br /&gt;
 #AUC 50.00  LogAUC 0.00&lt;br /&gt;
 &amp;lt;x&amp;gt; &amp;lt;y&amp;gt;&lt;br /&gt;
 &amp;lt;x&amp;gt; &amp;lt;y&amp;gt;&lt;br /&gt;
  ...&lt;br /&gt;
AUC is area under the curve and the random expectation value is 50%. [[LogAUC]] is the area between the log curve and the log random curve, so the random expectation value is 0%. &amp;lt;y&amp;gt; is always &amp;quot;% ligands found&amp;quot;, and &amp;lt;x&amp;gt; is either &amp;quot;% database searched&amp;quot; for enrichment plots or &amp;quot;% non-ligands found&amp;quot; for ROC plots.&lt;br /&gt;
&lt;br /&gt;
===Plotting Enrichments===&lt;br /&gt;
Easily plot enrichment and roc curves from one or more jobs.&lt;br /&gt;
 $mud/plots.py -i . -l New_Run -i ../old_run_dir -l Old_Run -t AmpC&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/plots.py -i .&lt;br /&gt;
Generates plots with one curve for each -i input_directory.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip _own curves, especially if they don&#039;t exist because enrich.py was run with -s. You can either label each -i INDIR with a -l LABEL, or use no -l options to get the default labels based on parent directory names. Use -t TITLE to change the plot title and filename. Use -o to specify a different output directory. Use -n to get normal instead of semi-log plots (and AUC in place of LogAUC).&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#[title_]enrich.png&lt;br /&gt;
#[title_]roc.png&lt;br /&gt;
#[title_]enrich_own.png&lt;br /&gt;
#[title_]roc_own.png&lt;br /&gt;
&lt;br /&gt;
The various graphs have the same meaning as their respective cures from [[#Computing Enrichments]]. [title_] is optional and exists when a custom title is given with the -t option.&lt;br /&gt;
&lt;br /&gt;
===Computing Energy Histograms===&lt;br /&gt;
*Compute energy distributions starting from the combined scores.&lt;br /&gt;
 $mud/energies.py -s -l LIGAND_FILE&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/energies.py -l LIGAND_FILE -d DECOY_FILE&lt;br /&gt;
Generates the energy distributions for the ligands, decoys, and all the other molecules.&lt;br /&gt;
&lt;br /&gt;
*Input:&lt;br /&gt;
Use -l to specify the ligand identifier file and -d to specify the decoy identifier file.&lt;br /&gt;
&lt;br /&gt;
The identifier files simply contain an id for each known ligand that matched the one in the docking databases. The script is smart enough to match &amp;quot;ZINC12345678&amp;quot; to &amp;quot;C12345678&amp;quot;, so either form is acceptable.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip consideration of decoys.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#counts.txt - Energy distributions&lt;br /&gt;
&lt;br /&gt;
format for output:&lt;br /&gt;
 number_of_sections number_of_bins min_energy_threshold max_energy_threshold&lt;br /&gt;
 ##### section_name&lt;br /&gt;
 bin_upper_edge1 count_below_edge1&lt;br /&gt;
 ...&lt;br /&gt;
 bin_upper_edgeN count_below_edgeN&lt;br /&gt;
 ABOVE count_above_last_edge&lt;br /&gt;
The sections are for ligands, decoys (optional), and others. The bins and counts define the energy histogram. The bins are finely spaced here in order to have more resolution when combine with other runs, whose energy ranges may be different.&lt;br /&gt;
&lt;br /&gt;
===Plotting Energy Histograms===&lt;br /&gt;
Easily plot energy histograms from one or more jobs.&lt;br /&gt;
 $mud/eplots.py -i . -l New_Run -i ../old_run_dir -l Old_Run -t AmpC&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/eplots.py -i .&lt;br /&gt;
Generates plots with energy distributions for each -i input_directory.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
You can either label each -i INDIR with a -l LABEL, or use no -l options to get the default labels based on parent directory names. Use -t TITLE to change the plot title and filename. Use -o to specify a different output directory.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#[title_]counts.png&lt;br /&gt;
&lt;br /&gt;
===Visualizing Molecule by Molecule Results===&lt;br /&gt;
Create a DOCK 4,5,6 type pdb file for use in Chimera&#039;s ViewDOCK.&lt;br /&gt;
 $mud/topdock.py -o topdock.pdb&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -o to specify an output file besides stdout. Use -t NUMBER to get whatever number of top scoring molecules.&lt;br /&gt;
&lt;br /&gt;
&amp;amp;rarr; Back to [[Tutorials]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3479</id>
		<title>LogAUC</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3479"/>
		<updated>2010-01-13T00:37:18Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==What is LogAUC?==&lt;br /&gt;
&lt;br /&gt;
LogAUC is a metric to evaluate virtual screening performance that has many of the same advantages as area under the curve (AUC), but is based on a plot where the x-axis is semilog in order to focus on early enrichment.&lt;br /&gt;
&lt;br /&gt;
==Motivation==&lt;br /&gt;
&lt;br /&gt;
When we look at virtual screening performance, we plot an ROC curve (or enrichment curve) with a base 10 semilog x-axis, because this has the advantage of focusing the graph on &amp;quot;early enrichment&amp;quot;, where molecules are most likely to be selected for further testing. If we had instead plotted the curve with the usual linear x-axis, then the area under the curve (AUC) is a well-regarded metric to summarize the overall performance of a virtual screening campaign as a single number&amp;lt;sup&amp;gt;1&amp;lt;/sup&amp;gt;. While AUC can be formulated alternate ways&amp;lt;sup&amp;gt;2,3&amp;lt;/sup&amp;gt;, it can be mechanically constructed by simply integrating under the curve, and interpreted as the fraction of the area under the curve over the area under the best possible ROC curve. It just happens that in a linear ROC plot, the AUC of the best possible curve is the entire unit square, with an area of 1. By analogy, in our typical semilog plots, we can construct the same fraction of the area under the log curve, over the area under the perfect log curve, and define that fraction as the logAUC. The lone nuisance is that the area under the log curve is infinite in general. However, if we are practical and limit our focus to a region of log space that we can actually measure, say above a certain threshold &amp;lt;math&amp;gt;\lambda&amp;lt;/math&amp;gt;, then the perfect log area is finite.&lt;br /&gt;
&lt;br /&gt;
==Definition==&lt;br /&gt;
&lt;br /&gt;
Formally, we define &amp;lt;math&amp;gt;logAUC_\lambda&amp;lt;/math&amp;gt;, where the log area computations run from &amp;lt;math&amp;gt;\lambda&amp;lt;/math&amp;gt; to 1.0, and we typically refer to &amp;lt;math&amp;gt;logAUC_{0.001}&amp;lt;/math&amp;gt; as simply &amp;lt;math&amp;gt;logAUC&amp;lt;/math&amp;gt;, where the area is integrated from 0.1 percent (0.001) to 100 percent (1.0) of decoys found. For integrating the area under the curve, we use the trapezoidal rule as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;LogAUC_\lambda=\frac{\displaystyle \sum_{i}^{where~x_i\ge\lambda} (\log_{10} x_{i+1} - \log_{10} x_i)(\frac{y_{i+1}+y_i}{2})}{\log_{10}\frac{1}{\lambda}}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Discussion==&lt;br /&gt;
&lt;br /&gt;
From similar reasoning based on semilog ROC plots, Clark and Webster-Clark construct the pROC AUC metric&amp;lt;sup&amp;gt;2&amp;lt;/sup&amp;gt;, which is similar to the numerator of logAUC except that the integration is done over horizontal bars instead of vertical trapezoids. The advantage of constructing logAUC as a fraction over the ideal area is that the choice of base for the logarithm is irrelevant, because changing base simply results in a constant that cancels between numerator and denominator.  Also, by explicitly defining the area of interest using λ and integrating vertically, we are able to avoid the singularity at &amp;lt;math&amp;gt;x_i=0&amp;lt;/math&amp;gt; encountered in pROC. More importantly, the fixed integration area means we can more directly compare &amp;lt;math&amp;gt;logAUC_\lambda&amp;lt;/math&amp;gt; values across databases of different sizes and across targets with different ratios of actives to inactives. The final advantage of logAUC is that if you are used to looking at semilog ROC plots plotted from λ to 1, and understand that logAUC is just the percentage of the total area below the curve, then you can at some point gain the same intuitive feel as AUC has for linear ROC plots. In a semilog ROC plot the random line occupies only a sliver of the total area, and indeed its logAUC is just 14.462%. In order to more easily compare a given logAUC to this random value, we instead report the “adjusted logAUC” as the calculated value minus 14.462%, so that positive values mean overall enrichments better than random. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;Adjusted~LogAUC=LogAUC_{0.001}-0.14462&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
## Nicholls, A., What do we know and when do we know it? J Comput Aided Mol Des 2008, 22, (3-4), 239-55.&lt;br /&gt;
## Clark, R. D.; Webster-Clark, D. J., Managing bias in ROC curves. J Comput Aided Mol Des 2008, 22, (3-4), 141-6.&lt;br /&gt;
## Truchon, J. F.; Bayly, C. I., Evaluating virtual screening methods: good and bad metrics for the &amp;quot;early recognition&amp;quot; problem. J Chem Inf Model 2007, 47, (2), 488-508.&lt;br /&gt;
&lt;br /&gt;
==Citation==&lt;br /&gt;
Michael Mysinger, Brian Shoichet. &amp;quot;Rapid Context-Dependent Ligand Desolvation in Molecular Docking&amp;quot;. 2010. (in preparation for J Chem Inf Model)&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3478</id>
		<title>LogAUC</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3478"/>
		<updated>2010-01-13T00:32:03Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==What is LogAUC?==&lt;br /&gt;
&lt;br /&gt;
LogAUC is a metric to evaluate virtual screening performance that has many of the same advantages as area under the curve (AUC), but is based on a plot where the x-axis is semilog in order to focus on early enrichment.&lt;br /&gt;
&lt;br /&gt;
==Motivation==&lt;br /&gt;
&lt;br /&gt;
When we look at virtual screening performance, we plot an ROC curve (or enrichment curve) with a base 10 semilog x-axis, because this has the advantage of focusing the graph on &amp;quot;early enrichment&amp;quot;, where molecules are most likely to be selected for further testing. If we had instead plotted the curve with the usual linear x-axis, then the area under the curve (AUC) is a well-regarded metric to summarize the overall performance of a virtual screening campaign as a single number&amp;lt;sup&amp;gt;1&amp;lt;/sup&amp;gt;. While AUC can be formulated alternate ways&amp;lt;sup&amp;gt;2,3&amp;lt;/sup&amp;gt;, it can be mechanically constructed by simply integrating under the curve, and interpreted as the fraction of the area under the curve over the area under the best possible ROC curve. It just happens that in a linear ROC plot, the AUC of the best possible curve is the entire unit square, with an area of 1. By analogy, in our typical semilog plots, we can construct the same fraction of the area under the log curve, over the area under the perfect log curve, and define that fraction as the logAUC. The lone nuisance is that the area under the log curve is infinite in general. However, if we are practical and limit our focus to a region of log space that we can actually measure, say above a certain threshold &amp;lt;math&amp;gt;\lambda&amp;lt;/math&amp;gt;, then the perfect log area is finite.&lt;br /&gt;
&lt;br /&gt;
==Definition==&lt;br /&gt;
&lt;br /&gt;
Formally, we define &amp;lt;math&amp;gt;logAUC_\lambda&amp;lt;/math&amp;gt;, where the log area computations run from &amp;lt;math&amp;gt;\lambda&amp;lt;/math&amp;gt; to 1.0, and we typically refer to &amp;lt;math&amp;gt;logAUC_{0.001}&amp;lt;/math&amp;gt; as simply &amp;lt;math&amp;gt;logAUC&amp;lt;/math&amp;gt;, where the area is integrated from 0.1 percent (0.001) to 100 percent (1.0) of decoys found. For integrating the area under the curve, we use the trapezoidal rule as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;LogAUC_\lambda=\frac{\displaystyle \sum_{i}^{where~x_i\ge\lambda} (\log_{10} x_{i+1} - \log_{10} x_i)(\frac{y_{i+1}+y_i}{2})}{\log_{10}\frac{1}{\lambda}}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Discussion==&lt;br /&gt;
&lt;br /&gt;
From similar reasoning based on semilog ROC plots, Clark and Webster-Clark construct the pROC AUC metric&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt;, which is similar to the numerator of logAUC except that the integration is done over horizontal bars instead of vertical trapezoids. The advantage of constructing logAUC as a fraction over the ideal area is that the choice of base for the logarithm is irrelevant, because changing base simply results in a constant that cancels between numerator and denominator.  Also, by explicitly defining the area of interest using λ and integrating vertically, we are able to avoid the singularity at &amp;lt;math&amp;gt;x_i=0&amp;lt;/math&amp;gt; encountered in pROC. More importantly, the fixed integration area means we can more directly compare &amp;lt;math&amp;gt;logAUC_\lambda&amp;lt;/math&amp;gt; values across databases of different sizes and across targets with different ratios of actives to inactives. The final advantage of logAUC is that if you are used to looking at semilog ROC plots plotted from λ to 1, and understand that logAUC is just the percentage of the total area below the curve, then you can at some point gain the same intuitive feel as AUC has for linear ROC plots. In a semilog ROC plot the random line occupies only a sliver of the total area, and indeed its logAUC is just 14.462%. In order to more easily compare a given logAUC to this random value, we instead report the “adjusted logAUC” as the calculated value minus 14.462%, so that positive values mean overall enrichments better than random. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;Adjusted~LogAUC=LogAUC_{0.001}-.14462&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
## Nicholls, A., What do we know and when do we know it? J Comput Aided Mol Des 2008, 22, (3-4), 239-55.&lt;br /&gt;
## Clark, R. D.; Webster-Clark, D. J., Managing bias in ROC curves. J Comput Aided Mol Des 2008, 22, (3-4), 141-6.&lt;br /&gt;
## Truchon, J. F.; Bayly, C. I., Evaluating virtual screening methods: good and bad metrics for the &amp;quot;early recognition&amp;quot; problem. J Chem Inf Model 2007, 47, (2), 488-508.&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3477</id>
		<title>LogAUC</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3477"/>
		<updated>2010-01-13T00:15:46Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==What is LogAUC?==&lt;br /&gt;
&lt;br /&gt;
LogAUC is a metric to evaluate virtual screening performance that has some nice characteristics. It is intuitive to use &lt;br /&gt;
&lt;br /&gt;
==Motivation==&lt;br /&gt;
&lt;br /&gt;
When we look at virtual screening performance, we plot an ROC curve (or enrichment curve) with a base 10 semilog x-axis, because this has the advantage of focusing the graph on &amp;quot;early enrichment&amp;quot;, where molecules are most likely to be selected for further testing. If we had instead plotted the curve with the usual linear x-axis, then the area under the curve (AUC) is a well-regarded metric to summarize the overall performance of a virtual screening campaign as a single number&amp;lt;sup&amp;gt;1&amp;lt;/sup&amp;gt;. While AUC can be formulated alternate ways&amp;lt;sup&amp;gt;2,3&amp;lt;/sup&amp;gt;, it can be mechanically constructed by simply integrating under the curve, and interpreted as the fraction of the area under the curve over the area under the best possible ROC curve. It just happens that in a linear ROC plot, the AUC of the best possible curve is the entire unit square, with an area of 1. By analogy, in our typical semilog plots, we can construct the same fraction of the area under the log curve, over the area under the perfect log curve, and define that fraction as the logAUC. The lone nuisance is that the area under the log curve is infinite in general. However, if we are practical and limit our focus to a region of log space that we can actually measure, say above a certain threshold &amp;lt;math&amp;gt;\lambda&amp;lt;/math&amp;gt;, then the perfect log area is finite.&lt;br /&gt;
&lt;br /&gt;
==Definition==&lt;br /&gt;
&lt;br /&gt;
Formally, we define &amp;lt;math&amp;gt;logAUC_\lambda&amp;lt;/math&amp;gt;, where the log area computations run from &amp;lt;math&amp;gt;\lambda&amp;lt;/math&amp;gt; to 1.0, and we typically refer to &amp;lt;math&amp;gt;logAUC_{0.001}&amp;lt;/math&amp;gt; as simply logAUC, where the area is integrated from 0.1 percent (0.001) to 100 percent (1.0) of decoys found. For integrating the area under the curve, we use the trapezoidal rule as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;LogAUC_\lambda=\frac{\displaystyle \sum_{i}^{where~x_i\ge\lambda} (\log_{10} x_{i+1} - \log_{10} x_i)(\frac{y_{i+1}+y_i}{2})}{\log_{10}\frac{1}{\lambda}}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
1. Nicholls, A., What do we know and when do we know it? J Comput Aided Mol Des 2008, 22, (3-4), 239-55.&lt;br /&gt;
2.&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3476</id>
		<title>LogAUC</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3476"/>
		<updated>2010-01-13T00:00:40Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==What is LogAUC?==&lt;br /&gt;
&lt;br /&gt;
LogAUC is a metric to evaluate virtual screening performance that has some nice characteristics. It is intuitive to use &lt;br /&gt;
&lt;br /&gt;
==Motivation==&lt;br /&gt;
&lt;br /&gt;
When we look at virtual screening performance, we plot an ROC curve (or enrichment curve) with a base 10 semilog x-axis, because this has the advantage of focusing the graph on &amp;quot;early enrichment&amp;quot;, where molecules are most likely to be selected for further testing. If we had instead plotted the curve with the usual linear x-axis, then the area under the curve (AUC) is a well-regarded metric to summarize the overall performance of a virtual screening campaign as a single number&amp;lt;sup&amp;gt;1&amp;lt;/sup&amp;gt;. While ROC AUC can be formulated alternate ways, it can be &lt;br /&gt;
&lt;br /&gt;
&amp;lt;math&amp;gt;LogAUC_\lambda=\frac{\displaystyle \sum_{i}^{where~x_i\ge\lambda} (\log_{10} x_{i+1} - \log_{10} x_i)(\frac{y_{i+1}+y_i}{2})}{\log_{10}\frac{1}{\lambda}}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3475</id>
		<title>LogAUC</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=LogAUC&amp;diff=3475"/>
		<updated>2010-01-12T23:41:33Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
==What is LogAUC?==&lt;br /&gt;
&lt;br /&gt;
LogAUC is a metric to evaluate virtual screening performance that has some nice characteristics. It is intuitive to use &lt;br /&gt;
&lt;br /&gt;
==Motivation==&lt;br /&gt;
&lt;br /&gt;
When we look at virtual screening performance, we plot an ROC curve (or enrichment curve) with a base 10 semilog x-axis, because this has the advantage of focusing the graph on &amp;quot;early enrichment&amp;quot;, where molecules are most likely to be selected for further testing. If we had instead plotted the curve with the usual linear x-axis, then the area under the curve (AUC) is a well-regarded metric to summarize the overall performance of a virtual screening campaign as a single number.&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=How_To_Guides&amp;diff=3191</id>
		<title>How To Guides</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=How_To_Guides&amp;diff=3191"/>
		<updated>2010-01-12T22:56:00Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;What do you want to do? &lt;br /&gt;
&lt;br /&gt;
{{TOCright}}&lt;br /&gt;
&lt;br /&gt;
= Tutorials =&lt;br /&gt;
&lt;br /&gt;
* [[Tutorials]]&lt;br /&gt;
&lt;br /&gt;
= Protocols =&lt;br /&gt;
&lt;br /&gt;
* [[Automated Database Preparation]] - unix protocol&lt;br /&gt;
* [[LogAUC]] - metric to measure virtual screening performance&lt;br /&gt;
* [[DOCK Blaster:Protocols]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4282</id>
		<title>Running DOCK</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4282"/>
		<updated>2009-12-05T02:56:41Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Running DOCK */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Running DOCK=&lt;br /&gt;
   &lt;br /&gt;
*modify &amp;lt;tt&amp;gt;INDOCK&amp;lt;/tt&amp;gt;  and set up the desired directory structure &amp;amp;ndash; either manually or by running &#039;&amp;lt;tt&amp;gt;md4db.csh bysubset N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt; N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt; Type&amp;lt;/tt&amp;gt;&#039;, where &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the identifier of the library (1: lead-like; 2: fragment-like), &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the number of chunks (i.e., jobs you can run in parallel), and &amp;lt;tt&amp;gt;Type&amp;lt;/tt&amp;gt; is the category of library (i.e., bysubset, byvendor, etc).&lt;br /&gt;
* if it hasn&#039;t been generated by a script, create the file &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;, which conatins the list of the directories (i.e., chunks of the database) that you want to dock.&lt;br /&gt;
*if you plan to use any of John&#039;s scripts in the downstream    processing, leave the output file prefixes at &amp;lt;tt&amp;gt;test.&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*take care that the paths to the &amp;lt;tt&amp;gt;.db.gz&amp;lt;/tt&amp;gt;  files in &amp;lt;tt&amp;gt;split_database_index&amp;lt;/tt&amp;gt; do not get too long. If they do, go via    links.  &lt;br /&gt;
*submit the calculations to the cluster with &amp;lt;tt&amp;gt;$mud/submit.csh&amp;lt;/tt&amp;gt; from the directory in which your data (most importantly, &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;) resides. See [[MUD - Michael&#039;s Utilities for Docking]] for setting the $mud variable.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4281</id>
		<title>Running DOCK</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4281"/>
		<updated>2009-12-05T02:56:27Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Running DOCK */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Running DOCK=&lt;br /&gt;
   &lt;br /&gt;
*modify &amp;lt;tt&amp;gt;INDOCK&amp;lt;/tt&amp;gt;  and set up the desired directory structure &amp;amp;ndash; either manually or by running &amp;lt;tt&amp;gt;md4db.csh bysubset N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt; N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt; Type&amp;lt;/tt&amp;gt;, where &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the identifier of the library (1: lead-like; 2: fragment-like), &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the number of chunks (i.e., jobs you can run in parallel), and &amp;lt;tt&amp;gt;Type&amp;lt;/tt&amp;gt; is the category of library (i.e., bysubset, byvendor, etc).&lt;br /&gt;
* if it hasn&#039;t been generated by a script, create the file &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;, which conatins the list of the directories (i.e., chunks of the database) that you want to dock.&lt;br /&gt;
*if you plan to use any of John&#039;s scripts in the downstream    processing, leave the output file prefixes at &amp;lt;tt&amp;gt;test.&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*take care that the paths to the &amp;lt;tt&amp;gt;.db.gz&amp;lt;/tt&amp;gt;  files in &amp;lt;tt&amp;gt;split_database_index&amp;lt;/tt&amp;gt; do not get too long. If they do, go via    links.  &lt;br /&gt;
*submit the calculations to the cluster with &amp;lt;tt&amp;gt;$mud/submit.csh&amp;lt;/tt&amp;gt; from the directory in which your data (most importantly, &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;) resides. See [[MUD - Michael&#039;s Utilities for Docking]] for setting the $mud variable.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4280</id>
		<title>Running DOCK</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4280"/>
		<updated>2009-12-05T02:55:58Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Running DOCK=&lt;br /&gt;
   &lt;br /&gt;
*modify &amp;lt;tt&amp;gt;INDOCK&amp;lt;/tt&amp;gt;  and set up the desired directory structure &amp;amp;ndash; either manually or by running &amp;lt;tt&amp;gt;mksdir3.csh N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt; N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt; Type&amp;lt;/tt&amp;gt;, where &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the identifier of the library (1: lead-like; 2: fragment-like), &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the number of chunks (i.e., jobs you can run in parallel), and &amp;lt;tt&amp;gt;Type&amp;lt;/tt&amp;gt; is the category of library (i.e., bysubset, byvendor, etc).&lt;br /&gt;
* if it hasn&#039;t been generated by a script, create the file &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;, which conatins the list of the directories (i.e., chunks of the database) that you want to dock.&lt;br /&gt;
*if you plan to use any of John&#039;s scripts in the downstream    processing, leave the output file prefixes at &amp;lt;tt&amp;gt;test.&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*take care that the paths to the &amp;lt;tt&amp;gt;.db.gz&amp;lt;/tt&amp;gt;  files in &amp;lt;tt&amp;gt;split_database_index&amp;lt;/tt&amp;gt; do not get too long. If they do, go via    links.  &lt;br /&gt;
*submit the calculations to the cluster with &amp;lt;tt&amp;gt;$mud/submit.csh&amp;lt;/tt&amp;gt; from the directory in which your data (most importantly, &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;) resides. See [[MUD - Michael&#039;s Utilities for Docking]] for setting the $mud variable.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Preparing_the_ligand&amp;diff=4089</id>
		<title>Preparing the ligand</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Preparing_the_ligand&amp;diff=4089"/>
		<updated>2009-12-05T02:55:07Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Preparing a ligand=&lt;br /&gt;
&lt;br /&gt;
==Automatic way, starting from SMILES==&lt;br /&gt;
&lt;br /&gt;
This way will make use of John&#039;s automatic scripts for database&lt;br /&gt;
preparation and actually upload new molecules to a special section of&lt;br /&gt;
[http://zinc.docking.org/ ZINC].&lt;br /&gt;
   &lt;br /&gt;
*it is advisable to create a special subdirectory, since many new    files will be generated.  &lt;br /&gt;
*the file containing the [http://www.daylight.com/smiles/ SMILES] strings should contain a string    followed by an identifier on each line.  &lt;br /&gt;
*OPTIONAL: run &amp;lt;tt&amp;gt;convert.py --i=yourname.smi --o=yourname.ism&amp;lt;/tt&amp;gt; . This will  convert your SMILES to &#039;&#039;isomeric&#039;&#039; SMILES.&lt;br /&gt;
*run &amp;lt;tt&amp;gt;dbgen.csh yourname.smi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*you should obtain a file &amp;lt;tt&amp;gt;somename.db.gz&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
==Manual way==&lt;br /&gt;
&lt;br /&gt;
===Isolating the ligand as &amp;lt;tt&amp;gt;.mol2&amp;lt;/tt&amp;gt;  file===&lt;br /&gt;
   &lt;br /&gt;
*extract the ligand structure from the &amp;lt;tt&amp;gt;.pdb&amp;lt;/tt&amp;gt;  file.  &lt;br /&gt;
*assign hydrogens.  &lt;br /&gt;
*assign all atom ([http://www.tripos.com/mol2/atom_types.html Sybyl/TAFF]) and bond types.  &lt;br /&gt;
*save it as &amp;lt;tt&amp;gt;ligandname.mol2&amp;lt;/tt&amp;gt;  file.  &lt;br /&gt;
&lt;br /&gt;
===Running &amp;lt;tt&amp;gt;omega&amp;lt;/tt&amp;gt; ===&lt;br /&gt;
   &lt;br /&gt;
*run [http://www.eyesopen.com/products/applications/omega.html OMEGA], but don&#039;t ask me how to do that yet.&lt;br /&gt;
&lt;br /&gt;
===Running amsol===&lt;br /&gt;
&lt;br /&gt;
*find more information about amsol [http://comp.chem.umn.edu/amsol/ on its homepage].  &lt;br /&gt;
*&amp;lt;tt&amp;gt;mkdir ./amsol2&amp;lt;/tt&amp;gt;   &lt;br /&gt;
*Use file2file.py to get the right formal charge to feed to AMSOL. It is also important to change the name, otherwise the original &amp;lt;tt&amp;gt;.mol2&amp;lt;/tt&amp;gt; file will be overwritten!&lt;br /&gt;
&amp;lt;tt&amp;gt;file2file.py -g ligandname.mol2 ./amsol2/someothername.mol2&amp;lt;/tt&amp;gt;      &lt;br /&gt;
*edit &amp;lt;tt&amp;gt;./amsol2/someothername.mol2&amp;lt;/tt&amp;gt; :     &lt;br /&gt;
*&lt;br /&gt;
*delete all lines prior to &amp;lt;tt&amp;gt;@&amp;lt;TRIPOS&amp;gt;MOLECULE&amp;lt;/tt&amp;gt;   &lt;br /&gt;
*&lt;br /&gt;
*change line 2 (molecule name) to something of the format &amp;lt;tt&amp;gt;ABCD12345678&amp;lt;/tt&amp;gt; (four capital letters followed by eight numbers).  &lt;br /&gt;
*&lt;br /&gt;
*line 3 should be &amp;lt;tt&amp;gt;n&amp;lt;sub&amp;gt;atoms&amp;lt;/sub&amp;gt; n&amp;lt;sub&amp;gt;bonds&amp;lt;/sub&amp;gt; 0 0 0&amp;lt;/tt&amp;gt;&lt;br /&gt;
*&lt;br /&gt;
*the &amp;lt;tt&amp;gt;@&amp;lt;TRIPOS&amp;gt;MOLECULE&amp;lt;/tt&amp;gt;  section must consist of exactly &#039;&#039;&#039;5&#039;&#039;&#039; lines (adjust by adding/deleting blanks).  &lt;br /&gt;
*&lt;br /&gt;
*remove all sections after the &amp;lt;tt&amp;gt;@&amp;lt;TRIPOS&amp;gt;BOND&amp;lt;/tt&amp;gt;  section.&lt;br /&gt;
*&lt;br /&gt;
*delete the blank lines between the &amp;lt;tt&amp;gt;ATOM&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;BOND&amp;lt;/tt&amp;gt;     sections, if there are any.    &lt;br /&gt;
*run &amp;lt;tt&amp;gt;RunAMSOL3.csh WAIT&amp;lt;/tt&amp;gt;   &lt;br /&gt;
*the output &amp;lt;tt&amp;gt;someothername.solv&amp;lt;/tt&amp;gt;  file will contain the following:&lt;br /&gt;
{| style=&amp;quot;text-align: center; border:1px solid #aaa; margin: 1em 1em 1em 0; background: #f9f9f9; border-collapse: collapse;&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;0&amp;quot; &lt;br /&gt;
|+ &#039;&#039;&#039;AMSOL output&#039;&#039;&#039;&lt;br /&gt;
|-&lt;br /&gt;
! style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | line #1&lt;br /&gt;
| style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | molname || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | &amp;lt;math&amp;gt;n_{atoms}&amp;lt;/math&amp;gt; || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | charge || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | pol_solv || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | ? || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | apol_solv || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | total_solv&lt;br /&gt;
|-&lt;br /&gt;
! style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | other lines &lt;br /&gt;
| style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | charge || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | pol_solv || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | ? || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | apol_solv || style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | total_solv&lt;br /&gt;
|-&lt;br /&gt;
| style=&amp;quot;border:1px #aaa solid; padding: 0.2em;&amp;quot; | &#039;&#039;(per_atom)&#039;&#039;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*furthermore, there will be &amp;lt;tt&amp;gt;someothername.nmol2&amp;lt;/tt&amp;gt;  file    which contains the correct partial charges.&lt;br /&gt;
&lt;br /&gt;
===Running &amp;lt;tt&amp;gt;mol2db&amp;lt;/tt&amp;gt; ===&lt;br /&gt;
   &lt;br /&gt;
*edit &amp;lt;tt&amp;gt;someothername.nmol2&amp;lt;/tt&amp;gt;  so that the &amp;lt;tt&amp;gt;@&amp;lt;TRIPOS&amp;gt;MOLECULE&amp;lt;/tt&amp;gt; section consists of exactly &#039;&#039;&#039;6&#039;&#039;&#039; lines.  &lt;br /&gt;
*edit the &amp;lt;tt&amp;gt;inhier&amp;lt;/tt&amp;gt;  file so that the &#039;mol2_file&#039;,    &#039;db_file&#039; and &#039;solvation_table&#039; entries are correct.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;mol2db inhier&amp;lt;/tt&amp;gt;   &lt;br /&gt;
*add the preamble at the top of the file.  &lt;br /&gt;
*&amp;lt;tt&amp;gt;gzip&amp;lt;/tt&amp;gt;  the resulting file so that it can be used by &amp;lt;tt&amp;gt;DOCK&amp;lt;/tt&amp;gt; .&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=100</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=100"/>
		<updated>2009-12-05T02:54:53Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
&lt;br /&gt;
See [[MUD - Michael&#039;s Utilities for Docking]] for a lot of tools to help with analyzing DOCK runs.&lt;br /&gt;
&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
=====For the xtal-lig.mol2 in its crystallographic pose=====&lt;br /&gt;
&lt;br /&gt;
Convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
=====For molecules that have already been docked=====&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4117</id>
		<title>Preparing the protein</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Preparing_the_protein&amp;diff=4117"/>
		<updated>2009-12-05T02:54:32Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Preparing the protein=&lt;br /&gt;
&lt;br /&gt;
Items which are prefixed with &#039;AH&#039; are relevant for docking [[HEI]]s to amidohydrolases and can safely be ignored for most metal-free proteins. &lt;br /&gt;
&lt;br /&gt;
==Modifying the PDB file==&lt;br /&gt;
 &lt;br /&gt;
*prepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt;  by removing all lines that do not commence with &#039;ATOM&#039;, the chain column, all columns to the right of the z-coordinate and the TER statements.  &lt;br /&gt;
*treat all selenomethionines (MSE) as methionines (MET) by replacing the selenium atom (SE&amp;amp;curren;) with sulphur    (&amp;amp;curren;SD). Be careful about the correct alignment!  &lt;br /&gt;
*atom enumeration does not matter, so don&#039;t bother to renumber    after any of the following steps. Unique numbers are a good idea,    presumably.  &lt;br /&gt;
*select the protonation states of HIS residues to be either &amp;amp;delta;- (rename residue to HID), &amp;amp;epsilon;- (rename residue to HIE) or doubly protonated (rename residue to HIP). HIS on the surface should be HIP. HIS residues coordinating    the metal ions should have their protons pointing away from the    ions. Base your decision on the immediate environment of the HIS residue: are there potential hydrogen bonds that can be formed?; are there charged residues close by?; would a certain protonation lead to clashes with other residues?; etc.&lt;br /&gt;
*AH: the carboxylated LYS of subtype I is CYK, but this is not    tolerated by &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt; , so store and delete the 3    surplus atoms and call the residue LYS.  &lt;br /&gt;
*AH: the more buried metal ion is ZB (charge 1.4), the other one ZA    (charge 1.3). Atom names are right-aligned!&lt;br /&gt;
&lt;br /&gt;
==Running startdockblaster5==&lt;br /&gt;
   &lt;br /&gt;
*generate the file &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; , which should only contain    atoms of the MMFF. Prepare it in the same way as above: remove the    chain column, all columns to the right of the z-coordinate and the    TER statements.  &lt;br /&gt;
*generate the files &amp;lt;tt&amp;gt;.only_spheres&amp;lt;/tt&amp;gt;  and &amp;amp;ndash; in case you    would like the matching spheres to be based on the heavy atoms in &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; &amp;amp;ndash; &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; and write `on&#039; to the    latter. Be careful not to add blank lines at the end, this will not    be understood by &amp;lt;tt&amp;gt;makespheres2.pl&amp;lt;/tt&amp;gt; . In any case, the entry in    &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt;  will be ignored by &amp;lt;tt&amp;gt;makespheres1.pl&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*on sgehead (or, as of [[dock67]], on any machine), run &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  to set up the data structure and copy all relevant files. It is a good idea to use csh and to &amp;lt;tt&amp;gt;source .login&amp;lt;/tt&amp;gt;  beforehand.  &lt;br /&gt;
*if &amp;lt;tt&amp;gt;startdockblaster5&amp;lt;/tt&amp;gt;  doesn&#039;t finish for any obvious reason and with no clear error message, or &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; has very funny hydrogen placements, make sure that you have no non-printing characters in &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt;. Do that by running your file through &amp;lt;tt&amp;gt;pc2unix rec.pdb&amp;lt;/tt&amp;gt;. Check that your file is clean by looking at it with &amp;lt;tt&amp;gt;od -c rec.pdb | less &amp;lt;/tt&amp;gt;. The only character with a backslash should be \n &amp;amp;mdash; you should see no \t, \r, etc. If this doesn&#039;t solve the problem, your best bet is to    reprepare &amp;lt;tt&amp;gt;rec.pdb&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;xtal-lig.pdb&amp;lt;/tt&amp;gt; from scratch &amp;amp;mdash; it is    likely that there are some blanks or hidden characters that are    causing the problems.  &lt;br /&gt;
*check the files &amp;lt;tt&amp;gt;stdout&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;stderr&amp;lt;/tt&amp;gt;  after the    run for potential mistakes and error messages. Furthermore, verify    that &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;  still contains &#039;&#039;all&#039;&#039; atoms.  &lt;br /&gt;
*if you do not want to do anything special with the protein, like    tarting some residues or modifying the spheres, go directly to chapter [[Running DOCK|3]].&lt;br /&gt;
&lt;br /&gt;
==Removing and modifying files==&lt;br /&gt;
   &lt;br /&gt;
*go to &amp;lt;tt&amp;gt;./grids&amp;lt;/tt&amp;gt;  and remove the surplus files from this    directory (some would cause error messages from the subsequent    programs):&amp;lt;br&amp;gt;&amp;lt;tt&amp;gt;rm -f PDBPARM chem.* distmap.box  distmap    distmap.log rec+sph.phi solvmap tart.txt OUT*&amp;lt;/tt&amp;gt;&lt;br /&gt;
*modify &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;:      &lt;br /&gt;
*&lt;br /&gt;
*AH: CYK: put the three missing atoms, delete the surplus hydrogens specific for LYS and rename the carboxylated lysine residue CYK.  &lt;br /&gt;
*&lt;br /&gt;
*remove all TER statements that might have been added.  &lt;br /&gt;
*&lt;br /&gt;
*AH: set the atom names of the metal ions to ZA and ZB and the    residue name to ZN.  &lt;br /&gt;
*&lt;br /&gt;
*take care of disulfide bonds. Remove the thiol hydrogens (if    they have been added) and change the residue name from CYS to CYX.  &lt;br /&gt;
*&lt;br /&gt;
*look at the &amp;lt;tt&amp;gt;box&amp;lt;/tt&amp;gt;  and maybe move it, so that the ligands    won&#039;t stick out. Modify the &#039;center&#039; and &#039;coordinates&#039; statement in    the preamble.  &lt;br /&gt;
*&lt;br /&gt;
*all residues and atoms have to be listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;vdw.parms.amb.mindock&amp;lt;/tt&amp;gt;, respectively &amp;amp;rArr; do not tart any residues in this file!    &lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[chemgrid]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt; and check &amp;lt;tt&amp;gt;OUTPARM&amp;lt;/tt&amp;gt; for the correct charges of all residues.  &lt;br /&gt;
*grep for &amp;lt;tt&amp;gt;0.000&amp;lt;/tt&amp;gt;  in &amp;lt;tt&amp;gt;PDBPARM&amp;lt;/tt&amp;gt;: if any atom has this    value in the 3&amp;lt;sup&amp;gt;rd&amp;lt;/sup&amp;gt; and 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt; column, it has not been    recognized by &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  (because it is not listed in &amp;lt;tt&amp;gt;prot.table.ambcrg.ambH&amp;lt;/tt&amp;gt;) and is thus &#039;&#039;ignored&#039;&#039; in the van der    Waals-maps. There will be no other errors, the docking will finish    showing some &amp;quot;bumping&amp;quot; ligands which have extremely favorable    energies (&amp;amp;le; -200).&lt;br /&gt;
*Another sign of a problem with atomic radii are any &#039;WARNING&#039; issued in OUTPARM&lt;br /&gt;
*if one has to run &amp;lt;tt&amp;gt;chemgrid&amp;lt;/tt&amp;gt;  again, remove &amp;lt;tt&amp;gt;PDBPARM OUTPARM OUTCHEM&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;chem.*&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
 &lt;br /&gt;
* the default is to run &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt; on &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt;. If you modified this file, rerun by simply typing &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* AH: cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec-dist.crg&amp;lt;/tt&amp;gt;  and remove the Zn atoms in the latter file (otherwise there will be lots of bumping ligands). Edit &amp;lt;tt&amp;gt;INDIST&amp;lt;/tt&amp;gt; to update the filename.&lt;br /&gt;
*run &amp;lt;tt&amp;gt;distmap&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tarting the protein==&lt;br /&gt;
   &lt;br /&gt;
*cp &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt;  and continue with the latter file.&lt;br /&gt;
* tarted residues can be found in &amp;lt;tt&amp;gt;$DOCK_BASE/scripts/grids&amp;lt;/tt&amp;gt;, they are the files with the extension &amp;lt;tt&amp;gt;prot2&amp;lt;/tt&amp;gt;.&lt;br /&gt;
* take care that the format of the &amp;lt;tt&amp;gt;.prot2&amp;lt;/tt&amp;gt; file is consistent with the format in the &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt; file, e.g., that there is no leading space before an atom name etc.&lt;br /&gt;
*AH: select the appropriate version of &amp;lt;tt&amp;gt;amb.crg.oxt&amp;lt;/tt&amp;gt;  depending on    the subtype. Files are called &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt; , where &amp;lt;tt&amp;gt;N&amp;lt;/tt&amp;gt;  can    be &amp;lt;tt&amp;gt;I, III&amp;lt;/tt&amp;gt;  or &amp;lt;tt&amp;gt;VI&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*AH: edit the residues in the binding site (i.e., all residues    complexing the metal ions in the binding site), so that their    names conform to the names of the modified residues in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  &lt;br /&gt;
*tart the residues that are in contact with a crystallographic    ligand, if any.  &lt;br /&gt;
*AH: check that ZA and ZB, respectively (left-aligned in the atom    column), have corresponding entries in &amp;lt;tt&amp;gt;amb.crg.oxt.N&amp;lt;/tt&amp;gt;  and &amp;lt;tt&amp;gt;vdw.siz&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Modifying the spheres==&lt;br /&gt;
   &lt;br /&gt;
*load &amp;lt;tt&amp;gt;match1.sph.pdb&amp;lt;/tt&amp;gt;  (i.e., the DelPhi spheres).  &lt;br /&gt;
*delete the spheres that are too close to the solvent.  &lt;br /&gt;
*(AH:) add spheres so that there is one sphere &#039;&#039;between&#039;&#039; the metals, several spheres &#039;&#039;around&#039;&#039; the metals and some spheres    close to polar residues.  &lt;br /&gt;
*a good number for DelPhi spheres is 120.  &lt;br /&gt;
*append the spheres to the end of &amp;lt;tt&amp;gt;rec+sph.crg&amp;lt;/tt&amp;gt;  and put a TER    statement after each sphere. Don&#039;t use tabs for whitespace, can    cause problems with DelPhi!  &lt;br /&gt;
*do the same for &amp;lt;tt&amp;gt;match2.sph.pdb&amp;lt;/tt&amp;gt;  (i.e., the matching    spheres); put at least one sphere between the metals and increase    the sampling in the region around the metal ions by putting some    spheres there. If you selected &amp;lt;tt&amp;gt;.useligsph&amp;lt;/tt&amp;gt; . be careful not to    move any spheres based on the ligand atoms.  &lt;br /&gt;
*a good number for matching spheres is 40.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;pdb_to_spheres.py matchN.sph.pdb matchN.sph&amp;lt;/tt&amp;gt;  to    generate the files that will be read by DelPhi/[[DOCK]].  &lt;br /&gt;
*if desired, run &amp;lt;tt&amp;gt;colorspheres.pl sph/match2.sph&amp;lt;/tt&amp;gt;  in the    parent directory of the docking run (i.e., &amp;lt;tt&amp;gt;..&amp;lt;/tt&amp;gt;  to &amp;lt;tt&amp;gt;grids&amp;lt;/tt&amp;gt; )    to put some color on your spheres.  &lt;br /&gt;
*in any case, put the preamble (&amp;quot;DOCK 5.2 ligand_atoms...&amp;quot;)    into &amp;lt;tt&amp;gt;match2.sph&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[http://bcr.musc.edu/manuals/delphi.htm DelPhi]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*if necessary, modify &amp;lt;tt&amp;gt;delphi.com&amp;lt;/tt&amp;gt;  so that all the paths and file names are    appropriate.  &lt;br /&gt;
*run &amp;lt;tt&amp;gt;delphi.com &amp;gt; delphi.log&amp;lt;/tt&amp;gt;  and check the output.&lt;br /&gt;
*any &#039;WARNING&#039; in the log is an indication that some atomic charges might not be correct.&lt;br /&gt;
&lt;br /&gt;
==Running &amp;lt;tt&amp;gt;[[solvmap]]&amp;lt;/tt&amp;gt; ==&lt;br /&gt;
   &lt;br /&gt;
*check that all atoms are present in &amp;lt;tt&amp;gt;rec.crg&amp;lt;/tt&amp;gt; and run &amp;lt;tt&amp;gt;solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*after the run, make sure that the file &amp;lt;tt&amp;gt;solvmap&amp;lt;/tt&amp;gt; contains &#039;&#039;&#039;no&#039;&#039;&#039; blank lines.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=99</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=99"/>
		<updated>2009-12-05T02:53:11Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Getting individual atom contributions with scoreopt_so */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
&lt;br /&gt;
See [[MUD - Michael&#039;s Utilities for Docking]] for a lot of tools to help with analyzing DOCK runs.&lt;br /&gt;
&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
=====For the xtal-lig.mol2 in its crystallographic pose=====&lt;br /&gt;
&lt;br /&gt;
Convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
=====For molecules that have already been docked=====&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=98</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=98"/>
		<updated>2009-12-05T02:52:29Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Some analyses that can be performed */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
&lt;br /&gt;
See [[MUD - Michael&#039;s Utilities for Docking]] for a lot of tools to help with analyzing DOCK runs.&lt;br /&gt;
&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need and &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
=====For the xtal-lig.mol2 in its crystallographic pose=====&lt;br /&gt;
&lt;br /&gt;
Convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
=====For molecules that have already been docked=====&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=97</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=97"/>
		<updated>2009-12-05T02:50:23Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Getting individual atom contributions with scoreopt_so */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need and &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
=====For the xtal-lig.mol2 in its crystallographic pose=====&lt;br /&gt;
&lt;br /&gt;
Convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
=====For molecules that have already been docked=====&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=96</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=96"/>
		<updated>2009-12-05T02:50:01Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need and &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
====For the xtal-lig.mol2 in its crystallographic pose====&lt;br /&gt;
&lt;br /&gt;
Convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
====For molecules that have already been docked====&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=95</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=95"/>
		<updated>2009-12-05T02:49:07Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Getting individual atom contributions with scoreopt_so */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===First you need and &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file to be scored===&lt;br /&gt;
&lt;br /&gt;
=For the xtal-lig.mol2 in its crystallographic pose=&lt;br /&gt;
&lt;br /&gt;
Convert an input &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt; file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file&lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
=For molecules that have already been docked=&lt;br /&gt;
&lt;br /&gt;
*run &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -e -o top500.eel1&#039; to generate an .eel1 containing the top 500 docked molecules.&lt;br /&gt;
*or unzip the dock output &#039;&amp;lt;tt&amp;gt;gunzip -c test.eel1.gz &amp;gt; test.eel1&amp;lt;/tt&amp;gt;&#039;&lt;br /&gt;
*or to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &#039;&amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;&#039;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;top500.eel1&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=94</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=94"/>
		<updated>2009-12-05T02:39:01Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Obtaining the net charge of a docked molecule */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===Converting a &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt;  file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt;  file===&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of atoms for every molecule in the file.&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=93</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=93"/>
		<updated>2009-12-05T02:37:08Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Combining the results of all subdirectories */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &#039;&amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;&#039;, which you can read into ViewDOCK in chimera as a DOCK 4, 5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===Converting a &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt;  file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt;  file===&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the    molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of    atoms for every molecule in the file. This script is called by &amp;lt;tt&amp;gt;combine10.csh&amp;lt;/tt&amp;gt; and the output is called &amp;lt;tt&amp;gt;FF.new.chg&amp;lt;/tt&amp;gt; (cf. section [[#Combining the results of all subdirectories|5.1]]).&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=92</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=92"/>
		<updated>2009-12-05T02:36:36Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Combining the results of all subdirectories */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;, which you can read into ViewDOCK in chimera as a DOCK 4,5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;tt&amp;gt;$mud/topdock.py -e&amp;lt;/tt&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===Converting a &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt;  file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt;  file===&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the    molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of    atoms for every molecule in the file. This script is called by &amp;lt;tt&amp;gt;combine10.csh&amp;lt;/tt&amp;gt; and the output is called &amp;lt;tt&amp;gt;FF.new.chg&amp;lt;/tt&amp;gt; (cf. section [[#Combining the results of all subdirectories|5.1]]).&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=91</id>
		<title>Analysing the results</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Analysing_the_results&amp;diff=91"/>
		<updated>2009-12-05T02:36:05Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Combining the results of all subdirectories */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Some analyses that can be performed=&lt;br /&gt;
==Combining the results of all subdirectories==&lt;br /&gt;
   &lt;br /&gt;
*in the subdirectory that contains all the individual directories for each chunk of the library, run &amp;lt;tt&amp;gt;$mud/combine.py&amp;lt;/tt&amp;gt;. Then generate a file containing the top 500 molecules using &amp;lt;tt&amp;gt;$mud/topdock.py -o top500.pdb&amp;lt;/tt&amp;gt;, which you can read into ViewDOCK in chimera as a DOCK 4,5, or 6 style file.&lt;br /&gt;
*to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file containing the top 500 molecules just run &amp;lt;&amp;lt;tt&amp;gt;&amp;gt;$mud/topdock.py -e&amp;lt;&amp;lt;/tt&amp;gt;&amp;gt;. If one wants to create an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file for a different subset of the molecules, first create the list of molecule names plus their energies (on one line) and then feed it to &amp;lt;tt&amp;gt;getxpdb.pl name_energy.list &amp;lt; FF.test.eel1 &amp;gt; subset_name.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Getting individual atom contributions with scoreopt_so==&lt;br /&gt;
&lt;br /&gt;
===Converting a &amp;lt;tt&amp;gt;[http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;amp;page=sup_mol2&amp;amp;s=0 .mol2]&amp;lt;/tt&amp;gt;  file into an &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt;  file===&lt;br /&gt;
   &lt;br /&gt;
*run &amp;lt;tt&amp;gt;amsol&amp;lt;/tt&amp;gt;  as described [[Preparing_the_ligand#Running amsol|here]] to calculate atomic solvation energies.&lt;br /&gt;
*run &amp;lt;tt&amp;gt;file2file.py -s path/to/amsol.solv path/to/amsol.nmol2 ligand.eel1&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the coulombic energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;2&#039; in the first menu.  &lt;br /&gt;
*enter the name of the DelPhi potential file, presumably &amp;lt;tt&amp;gt;grids/rec+sph.phi&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.elec&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10 and 11 are the partial charge, the electrostatic field and the energy in kT (i.e., 9 &amp;amp;times; 10) of the atom, respectively.  &lt;br /&gt;
*the DelPhi electrostatic score is the sum over the entries in column 11 times 0.5924 (conversion from kT to kcal/mol) and can be compared to the elect column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the van der Waals energy===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;3&#039; in the first menu.  &lt;br /&gt;
*enter the prefix name of grids for ff scoring as a full path,    i.e., &amp;lt;tt&amp;gt;grids/chem&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the van der Waals parameter file, presumably    &amp;lt;tt&amp;gt;grids/vdw.parms.amb.mindock&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*answer the question about interpolation with &#039;yes&#039;.  &lt;br /&gt;
*enter a sufficiently large number as maximal van der Waals    energy, e.g. 10000.  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.vdw&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*be adequately [http://www.merriam-webster.com/dictionary/scared scared].  &lt;br /&gt;
*the van der Waals interaction energy is calculated as  &amp;lt;math&amp;gt;{vdW}_{(r)}=\frac{A}{r^{12}}-\frac{B}{r^6}=a-b&amp;lt;/math&amp;gt;. In every ATOM line, columns 9, 10 and 11 are &amp;lt;math&amp;gt;a&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;b&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;a-b&amp;lt;/math&amp;gt;,    respectively.&lt;br /&gt;
* DO NOT use the interaction energy, as we only use the vdw component now. Instead, use the vdwsum to compare with the vdW column in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
===Individual contributions to the desolvation===&lt;br /&gt;
   &lt;br /&gt;
*start &amp;lt;tt&amp;gt;scoreopt_so&amp;lt;/tt&amp;gt;  and choose option &#039;4&#039; in the first menu.  &lt;br /&gt;
*enter the name of the grid for partial desolvation, presumably &amp;lt;tt&amp;gt;grids/solvmap&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the ligand file, i.e., &amp;lt;tt&amp;gt;ligand.eel1&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*enter the name of the output file, e.g. &amp;lt;tt&amp;gt;ligand.solv&amp;lt;/tt&amp;gt; .  &lt;br /&gt;
*in every ATOM line, columns 9, 10, and 11 are the total atomic solvation energy (polar + apolar), percentage desolvation, and atomic desolvation energy (i.e. - 9 &amp;amp;times; 10) of the atom, respectively.&lt;br /&gt;
*the total desolvation is the sum over the entries in column 11 and can be compared to the sum of the polsol and apolsol columns in OUTDOCK.&lt;br /&gt;
&lt;br /&gt;
==Other small useful things==&lt;br /&gt;
===Obtaining the net charge of a docked molecule===&lt;br /&gt;
   &lt;br /&gt;
*take the output &amp;lt;tt&amp;gt;.eel1&amp;lt;/tt&amp;gt; file and run &amp;lt;tt&amp;gt;molcharge_pdb.pl &amp;lt; output.eel1&amp;lt;/tt&amp;gt;. This will output the sequential number of the    molecule, the [http://zinc.docking.org/ ZINC] identifier, the total charge and the number of    atoms for every molecule in the file. This script is called by &amp;lt;tt&amp;gt;combine10.csh&amp;lt;/tt&amp;gt; and the output is called &amp;lt;tt&amp;gt;FF.new.chg&amp;lt;/tt&amp;gt; (cf. section [[#Combining the results of all subdirectories|5.1]]).&lt;br /&gt;
&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4279</id>
		<title>Running DOCK</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Running_DOCK&amp;diff=4279"/>
		<updated>2009-12-05T02:28:43Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Running DOCK */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Running DOCK=&lt;br /&gt;
   &lt;br /&gt;
*modify &amp;lt;tt&amp;gt;INDOCK&amp;lt;/tt&amp;gt;  and set up the desired directory structure &amp;amp;ndash; either manually or by running &amp;lt;tt&amp;gt;mksdir3.csh N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt; N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt; Type&amp;lt;/tt&amp;gt;, where &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;1&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the identifier of the library (1: lead-like; 2: fragment-like), &amp;lt;tt&amp;gt;N&amp;lt;sub&amp;gt;2&amp;lt;/sub&amp;gt;&amp;lt;/tt&amp;gt; is the number of chunks (i.e., jobs you can run in parallel), and &amp;lt;tt&amp;gt;Type&amp;lt;/tt&amp;gt; is the category of library (i.e., bysubset, byvendor, etc).&lt;br /&gt;
* if it hasn&#039;t been generated by a script, create the file &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;, which conatins the list of the directories (i.e., chunks of the database) that you want to dock.&lt;br /&gt;
*if you plan to use any of John&#039;s scripts in the downstream    processing, leave the output file prefixes at &amp;lt;tt&amp;gt;test.&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
*take care that the paths to the &amp;lt;tt&amp;gt;.db.gz&amp;lt;/tt&amp;gt;  files in &amp;lt;tt&amp;gt;split_database_index&amp;lt;/tt&amp;gt; do not get too long. If they do, go via    links.  &lt;br /&gt;
*submit the calculations to the cluster with &amp;lt;tt&amp;gt;$mud/submit.csh&amp;lt;/tt&amp;gt; from the directory in which your data (most importantly, &amp;lt;tt&amp;gt;dirlist&amp;lt;/tt&amp;gt;) resides. See [[MUD - Michael&#039;s Utilities for Docking]] for setting the $mud variable.&lt;br /&gt;
[[Category:Manual_DOCK]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=MUD_-_Michael%27s_Utilities_for_Docking&amp;diff=3544</id>
		<title>MUD - Michael&#039;s Utilities for Docking</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=MUD_-_Michael%27s_Utilities_for_Docking&amp;diff=3544"/>
		<updated>2009-12-05T01:59:28Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==What&#039;s in MUD?==&lt;br /&gt;
&lt;br /&gt;
*Tools to start, check, and restart dock jobs&lt;br /&gt;
*Tools to combine, enrich, plot, and view docking results&lt;br /&gt;
&lt;br /&gt;
==Setting up MUD==&lt;br /&gt;
&lt;br /&gt;
*For convenience, point a shell variable to the base mud directory to save typing&lt;br /&gt;
 set mud=~mysinger/code/mud/trunk&lt;br /&gt;
*If you use MUD a lot, you can add this to your ~/.login&lt;br /&gt;
*Then simply run commands like this:&lt;br /&gt;
 $mud/submit.csh&lt;br /&gt;
 $mud/check.py -h&lt;br /&gt;
*Use -h or --help to get full help information for the .py (python) scripts&lt;br /&gt;
*The .csh scripts will automatically print usage information if mis-used&lt;br /&gt;
*The scripts automatically use their invocation path to find other scripts and libraries they depend on.&lt;br /&gt;
&lt;br /&gt;
==Job Control==&lt;br /&gt;
   &lt;br /&gt;
===Main Workflow===&lt;br /&gt;
&lt;br /&gt;
For a quick summary of what to do first see [[SGE_Cluster_Docking]]. For a detailed look at how to get the details right see [[How to run and analyze a DOCK run by hand]].&lt;br /&gt;
&lt;br /&gt;
*Submit a parallel job to the cluser&lt;br /&gt;
 $mud/submit.csh&lt;br /&gt;
Uses &#039;dirlist&#039; to determine which directories to run. Similar to startdockbksX, but also indicates job submission by touching a submitted file in each directory.&lt;br /&gt;
*Check parallel job status&lt;br /&gt;
 $mud/check.py&lt;br /&gt;
Indicates the status of unfinished (or unsubmitted) jobs. Note that it simply returns nothing if everything is finished.&lt;br /&gt;
*Restart all failed subjobs&lt;br /&gt;
 $mud/restart.py&lt;br /&gt;
This works even if some subjobs are still running. Occasionally, however, jobs can fail with no detectable remnants. To force those jobs to restart you can use the -f option, but beware that this will also restart all subjobs that are still running.&lt;br /&gt;
&lt;br /&gt;
===Specialized Commands===&lt;br /&gt;
*Submit job to the local machine&lt;br /&gt;
 $mud/sublocal.csh&lt;br /&gt;
*Submit a single directory to the cluster&lt;br /&gt;
 qsub $mud/runsge.csh&lt;br /&gt;
*Submit a single directory to the local machine&lt;br /&gt;
 $mud/runsubdir.csh&lt;br /&gt;
*Remove docking output leaving only input - will DELETE even completed jobs&lt;br /&gt;
 $mud/clean.py&lt;br /&gt;
*Restart single directory&lt;br /&gt;
 $mud/restartdir.py&lt;br /&gt;
&lt;br /&gt;
==Job Analysis==&lt;br /&gt;
&lt;br /&gt;
*Enrichment plots are sensitive to consistent treatment and proper accounting for all docked molecules. The combine script properly accounts for all docked molecules by detecting bumped out, no matched, and timed out molecules. &lt;br /&gt;
&lt;br /&gt;
To achieve consistency, you have two options:&lt;br /&gt;
1. Write coordinates for all molecules (what I use)&lt;br /&gt;
In INDOCK, set number_save to 50000 or something high enough to capture all dockable hierarchies. DOCK output is now gzipped so this is cheaper in disk space than it used to be.&lt;br /&gt;
2. Do not check for broken molecules&lt;br /&gt;
Use the -b option when running combine.py&lt;br /&gt;
&lt;br /&gt;
===Combining Parallel Jobs===&lt;br /&gt;
*Merge all parallel jobs into a single set of unique scores.&lt;br /&gt;
 $mud/combine.py&lt;br /&gt;
This combine carefully accounts for all docked molecules, for more informative enrichment plots.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -b or --broken to skip finding broken molecules. Use -d or --done to indicate that all subjobs are complete, for the case where you did not submit with a MUD submission script. Use -p or --prefix if your output files are named something other than test. Use --box if your  box file is not at ../../grids/box relative to your subjob directories.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#combine.scores - fully processed scores, using the best one for each id&lt;br /&gt;
#combine.raw - contains all scores as scrapped from DOCK output&lt;br /&gt;
#combine.broken - broken molecules and the reason they failed&lt;br /&gt;
#combine.zeroes - important sanity check&lt;br /&gt;
&lt;br /&gt;
format of combine.scores:&lt;br /&gt;
 &amp;lt;id&amp;gt; &amp;lt;shape&amp;gt; &amp;lt;elect&amp;gt; &amp;lt;VdW&amp;gt; &amp;lt;polar solv&amp;gt; &amp;lt;apolar solv&amp;gt; &amp;lt;total&amp;gt; &amp;lt;subdir&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The .zeroes file is a sanity check because it lists the number of molecules followed by the number of zeroes in each scoring column. Past experience has shown that when DOCK fails randomly and silently, it often generates a large  number of zero scores. If this happens, simply re-running the job will give better results. &lt;br /&gt;
&lt;br /&gt;
===Computing Enrichments===&lt;br /&gt;
*Compute enrichment starting from the combined scores.&lt;br /&gt;
 $mud/enrich.py -s -l LIGAND_FILE&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/enrich.py -l LIGAND_FILE -d DECOY_FILE&lt;br /&gt;
Generates both enrichment and roc curves, both for the ligands against all molecules and for the ligands versus just the decoys. It will try to run combine if it has not been run yet, but will do so only with defaults for every option.&lt;br /&gt;
&lt;br /&gt;
*Input:&lt;br /&gt;
Use -l to specify the ligand identifier file and -d to specify the decoy identifier file.&lt;br /&gt;
&lt;br /&gt;
The identifier files simply contain an id for each known ligand that matched the one in the docking databases. The script is smart enough to match &amp;quot;ZINC12345678&amp;quot; to &amp;quot;C12345678&amp;quot;, so either form is acceptable.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip consideration of decoys and thus generation of _own curves. Use -f to force combine to run again.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;span id=&amp;quot;Enrich_Types&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
*Creates:&lt;br /&gt;
#enrich.txt - Enrichment curve for ligands versus all molecules&lt;br /&gt;
#roc.txt - ROC curve for ligands versus all molecules&lt;br /&gt;
#enrich_own.txt - Enrichment curve for ligands versus only the decoys&lt;br /&gt;
#roc_own.txt - ROC curve for ligands versus only the decoys&lt;br /&gt;
_own files are not generate is the -s option is used.&lt;br /&gt;
&lt;br /&gt;
format for output files:&lt;br /&gt;
 #AUC 50.00  LogAUC 0.00&lt;br /&gt;
 &amp;lt;x&amp;gt; &amp;lt;y&amp;gt;&lt;br /&gt;
 &amp;lt;x&amp;gt; &amp;lt;y&amp;gt;&lt;br /&gt;
  ...&lt;br /&gt;
AUC is area under the curve and the random expectation value is 50%. LogAUC is the area between the log curve and the log random curve, so the random expectation value is 0%. &amp;lt;y&amp;gt; is always &amp;quot;% ligands found&amp;quot;, and &amp;lt;x&amp;gt; is either &amp;quot;% database searched&amp;quot; for enrichment plots or &amp;quot;% non-ligands found&amp;quot; for ROC plots.&lt;br /&gt;
&lt;br /&gt;
===Plotting Enrichments===&lt;br /&gt;
Easily plot enrichment and roc curves from one or more jobs.&lt;br /&gt;
 $mud/plots.py -i . -l New_Run -i ../old_run_dir -l Old_Run -t AmpC&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/plots.py -i .&lt;br /&gt;
Generates plots with one curve for each -i input_directory.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip _own curves, especially if they don&#039;t exist because enrich.py was run with -s. You can either label each -i INDIR with a -l LABEL, or use no -l options to get the default labels based on parent directory names. Use -t TITLE to change the plot title and filename. Use -o to specify a different output directory. Use -n to get normal instead of semi-log plots (and AUC in place of LogAUC).&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#[title_]enrich.png&lt;br /&gt;
#[title_]roc.png&lt;br /&gt;
#[title_]enrich_own.png&lt;br /&gt;
#[title_]roc_own.png&lt;br /&gt;
&lt;br /&gt;
The various graphs have the same meaning as their respective cures from [[#Computing Enrichments]]. [title_] is optional and exists when a custom title is given with the -t option.&lt;br /&gt;
&lt;br /&gt;
===Computing Energy Histograms===&lt;br /&gt;
*Compute energy distributions starting from the combined scores.&lt;br /&gt;
 $mud/energies.py -s -l LIGAND_FILE&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/energies.py -l LIGAND_FILE -d DECOY_FILE&lt;br /&gt;
Generates the energy distributions for the ligands, decoys, and all the other molecules.&lt;br /&gt;
&lt;br /&gt;
*Input:&lt;br /&gt;
Use -l to specify the ligand identifier file and -d to specify the decoy identifier file.&lt;br /&gt;
&lt;br /&gt;
The identifier files simply contain an id for each known ligand that matched the one in the docking databases. The script is smart enough to match &amp;quot;ZINC12345678&amp;quot; to &amp;quot;C12345678&amp;quot;, so either form is acceptable.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip consideration of decoys.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#counts.txt - Energy distributions&lt;br /&gt;
&lt;br /&gt;
format for output:&lt;br /&gt;
 number_of_sections number_of_bins min_energy_threshold max_energy_threshold&lt;br /&gt;
 ##### section_name&lt;br /&gt;
 bin_upper_edge1 count_below_edge1&lt;br /&gt;
 ...&lt;br /&gt;
 bin_upper_edgeN count_below_edgeN&lt;br /&gt;
 ABOVE count_above_last_edge&lt;br /&gt;
The sections are for ligands, decoys (optional), and others. The bins and counts define the energy histogram. The bins are finely spaced here in order to have more resolution when combine with other runs, whose energy ranges may be different.&lt;br /&gt;
&lt;br /&gt;
===Plotting Energy Histograms===&lt;br /&gt;
Easily plot energy histograms from one or more jobs.&lt;br /&gt;
 $mud/eplots.py -i . -l New_Run -i ../old_run_dir -l Old_Run -t AmpC&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/eplots.py -i .&lt;br /&gt;
Generates plots with energy distributions for each -i input_directory.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
You can either label each -i INDIR with a -l LABEL, or use no -l options to get the default labels based on parent directory names. Use -t TITLE to change the plot title and filename. Use -o to specify a different output directory.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#[title_]counts.png&lt;br /&gt;
&lt;br /&gt;
===Visualizing Molecule by Molecule Results===&lt;br /&gt;
Create a DOCK 4,5,6 type pdb file for use in Chimera&#039;s ViewDOCK.&lt;br /&gt;
 $mud/topdock.py -o topdock.pdb&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -o to specify an output file besides stdout. Use -t NUMBER to get whatever number of top scoring molecules.&lt;br /&gt;
&lt;br /&gt;
&amp;amp;rarr; Back to [[Tutorials]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=SGE_Cluster_Docking&amp;diff=4312</id>
		<title>SGE Cluster Docking</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=SGE_Cluster_Docking&amp;diff=4312"/>
		<updated>2009-12-05T01:58:23Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== SGE Cluster Information ==&lt;br /&gt;
&lt;br /&gt;
*&#039;sgehead.compbio.ucsf.edu&#039; is the submit machine for the Sun Grid Engine (SGE) cluster. wilco is also authorized to submit jobs.&lt;br /&gt;
*There are around 250 cluster nodes providing 600 total cores to run jobs in the sge queue as of May, 2009, named like &#039;node-1-1&#039; through &#039;node-3-36&#039; where the first number is the rack # and the second is the slot # in that rack.&lt;br /&gt;
&lt;br /&gt;
== SGE Commands ==&lt;br /&gt;
*sgestat: high level overview of cluster status&lt;br /&gt;
*qsub: submit jobs&amp;lt;br&amp;gt;&lt;br /&gt;
*qstat: check job status&amp;lt;br&amp;gt;&lt;br /&gt;
*qdel: remove jobs&amp;lt;br&amp;gt;&lt;br /&gt;
*qhost: check cluster status&amp;lt;br&amp;gt; &lt;br /&gt;
*man sge_intro: start of manpage documentation&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Typical Docking Workflow ==&lt;br /&gt;
&lt;br /&gt;
*Generate spheres and grids - See [[Using MakeDOCK]] for more information, including how to prepare the receptor and ligand&lt;br /&gt;
 ssh sgehead.compbio.ucsf.edu   # ssh to SGE submit machine&lt;br /&gt;
 mkdir example                  # make docking directory&lt;br /&gt;
 cd example                     # change to docking directory&lt;br /&gt;
 cp &amp;lt;somedir&amp;gt;/rec.pdb .         # copy or create rec.pdb&lt;br /&gt;
 cp &amp;lt;somedir&amp;gt;/xtal-lig.mol2 .   # copy or create xtal-lig.mol2 (or even xtal-lig.pdb)&lt;br /&gt;
 startdockblaster5              # create spheres and grids                 &lt;br /&gt;
                                # Check output for WARNING messages, correct as needed&lt;br /&gt;
&lt;br /&gt;
* Setting up a docking run&lt;br /&gt;
 cp calibrate/INDOCK.1.A INDOCK # copy or create INDOCK&lt;br /&gt;
 md4db.csh bysubset 2 100       # create directories for docking run with 100 chunks&lt;br /&gt;
                                # 2 indicates we want the fragment-like subset of ZINC (See http://zinc.docking.org/subset1)&lt;br /&gt;
 cd run.2                       # chdir into run.2 directory&lt;br /&gt;
&lt;br /&gt;
* Everything else&lt;br /&gt;
 See [[MUD - Michael&#039;s Utilities for Docking]] for how to submit, check, and analyse the docking run.&lt;br /&gt;
&lt;br /&gt;
For information on which ZINC&lt;br /&gt;
&lt;br /&gt;
[[Category:Internal]]&lt;br /&gt;
[[Category:Tutorials]]&lt;br /&gt;
[[Category:Cluster]]&lt;br /&gt;
[[Category:Unix]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=SGE_Cluster_Docking&amp;diff=4311</id>
		<title>SGE Cluster Docking</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=SGE_Cluster_Docking&amp;diff=4311"/>
		<updated>2009-12-05T01:54:22Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: Update to modern workflow&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== SGE Cluster Information ==&lt;br /&gt;
&lt;br /&gt;
*&#039;sgehead.compbio.ucsf.edu&#039; is the submit machine for the Sun Grid Engine (SGE) cluster. wilco is also authorized to submit jobs.&lt;br /&gt;
*There are around 250 cluster nodes providing 600 total cores to run jobs in the sge queue as of May, 2009, named like &#039;node-1-1&#039; through &#039;node-3-36&#039; where the first number is the rack # and the second is the slot # in that rack.&lt;br /&gt;
&lt;br /&gt;
== SGE Commands ==&lt;br /&gt;
*sgestat: high level overview of cluster status&lt;br /&gt;
*qsub: submit jobs&amp;lt;br&amp;gt;&lt;br /&gt;
*qstat: check job status&amp;lt;br&amp;gt;&lt;br /&gt;
*qdel: remove jobs&amp;lt;br&amp;gt;&lt;br /&gt;
*qhost: check cluster status&amp;lt;br&amp;gt; &lt;br /&gt;
*man sge_intro: start of manpage documentation&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Typical Docking Workflow ==&lt;br /&gt;
&lt;br /&gt;
*Generate spheres and grids - See [[Using MakeDOCK]] for more information, including how to prepare the receptor and ligand&lt;br /&gt;
 ssh sgehead.compbio.ucsf.edu   # ssh to SGE submit machine&lt;br /&gt;
 mkdir example                  # make docking directory&lt;br /&gt;
 cd example                     # change to docking directory&lt;br /&gt;
 cp &amp;lt;somedir&amp;gt;/rec.pdb .         # copy or create rec.pdb&lt;br /&gt;
 cp &amp;lt;somedir&amp;gt;/xtal-lig.mol2 .   # copy or create xtal-lig.mol2 (or even xtal-lig.pdb)&lt;br /&gt;
 startdockblaster5              # create spheres and grids                 &lt;br /&gt;
                                # Check output for WARNING messages, correct as needed&lt;br /&gt;
&lt;br /&gt;
* Setting up a docking run&lt;br /&gt;
 cp calibrate/INDOCK.1.A INDOCK # copy or create INDOCK&lt;br /&gt;
 md4db.csh bysubset 2 100       # create directories for docking run with 100 chunks&lt;br /&gt;
                                # 2 indicates we want the fragment-like subset of ZINC (See http://zinc.docking.org/subset1)&lt;br /&gt;
 cd run.2                       # chdir into run.2 directory&lt;br /&gt;
&lt;br /&gt;
* Everything else&lt;br /&gt;
 See [[MUD - Michael&#039;s Utilities for Docking]] for how to submit, check, and analyse the docking run.&lt;br /&gt;
&lt;br /&gt;
For information on which ZINC&lt;br /&gt;
&lt;br /&gt;
[[Category:Internal]]&lt;br /&gt;
[[Category:Cluster]]&lt;br /&gt;
[[Category:Unix]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=MUD_-_Michael%27s_Utilities_for_Docking&amp;diff=3543</id>
		<title>MUD - Michael&#039;s Utilities for Docking</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=MUD_-_Michael%27s_Utilities_for_Docking&amp;diff=3543"/>
		<updated>2009-12-05T01:44:35Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: Add energy histogram programs&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==What&#039;s in MUD?==&lt;br /&gt;
&lt;br /&gt;
*Tools to start, check, and restart dock jobs&lt;br /&gt;
*Tools to combine, enrich, plot, and view docking results&lt;br /&gt;
&lt;br /&gt;
==Setting up MUD==&lt;br /&gt;
&lt;br /&gt;
*For convenience, point a shell variable to the base mud directory to save typing&lt;br /&gt;
 set mud=~mysinger/code/mud/trunk&lt;br /&gt;
*If you use MUD a lot, you can add this to your ~/.login&lt;br /&gt;
*Then simply run commands like this:&lt;br /&gt;
 $mud/submit.csh&lt;br /&gt;
 $mud/check.py -h&lt;br /&gt;
*Use -h or --help to get full help information for the .py (python) scripts&lt;br /&gt;
*The .csh scripts will automatically print usage information if mis-used&lt;br /&gt;
*The scripts automatically use their invocation path to find other scripts and libraries they depend on.&lt;br /&gt;
&lt;br /&gt;
==Job Control==&lt;br /&gt;
   &lt;br /&gt;
===Main Workflow===&lt;br /&gt;
*Submit a parallel job to the cluser&lt;br /&gt;
 $mud/submit.csh&lt;br /&gt;
Uses &#039;dirlist&#039; to determine which directories to run. Similar to startdockbksX, but also indicates job submission by touching a submitted file in each directory.&lt;br /&gt;
*Check parallel job status&lt;br /&gt;
 $mud/check.py&lt;br /&gt;
Indicates the status of unfinished (or unsubmitted) jobs. Note that it simply returns nothing if everything is finished.&lt;br /&gt;
*Restart all failed subjobs&lt;br /&gt;
 $mud/restart.py&lt;br /&gt;
This works even if some subjobs are still running. Occasionally, however, jobs can fail with no detectable remnants. To force those jobs to restart you can use the -f option, but beware that this will also restart all subjobs that are still running.&lt;br /&gt;
&lt;br /&gt;
===Specialized Commands===&lt;br /&gt;
*Submit job to the local machine&lt;br /&gt;
 $mud/sublocal.csh&lt;br /&gt;
*Submit a single directory to the cluster&lt;br /&gt;
 qsub $mud/runsge.csh&lt;br /&gt;
*Submit a single directory to the local machine&lt;br /&gt;
 $mud/runsubdir.csh&lt;br /&gt;
*Remove docking output leaving only input - will DELETE even completed jobs&lt;br /&gt;
 $mud/clean.py&lt;br /&gt;
*Restart single directory&lt;br /&gt;
 $mud/restartdir.py&lt;br /&gt;
&lt;br /&gt;
==Job Analysis==&lt;br /&gt;
&lt;br /&gt;
*Enrichment plots are sensitive to consistent treatment and proper accounting for all docked molecules. The combine script properly accounts for all docked molecules by detecting bumped out, no matched, and timed out molecules. &lt;br /&gt;
&lt;br /&gt;
To achieve consistency, you have two options:&lt;br /&gt;
1. Write coordinates for all molecules (what I use)&lt;br /&gt;
In INDOCK, set number_save to 50000 or something high enough to capture all dockable hierarchies. DOCK output is now gzipped so this is cheaper in disk space than it used to be.&lt;br /&gt;
2. Do not check for broken molecules&lt;br /&gt;
Use the -b option when running combine.py&lt;br /&gt;
&lt;br /&gt;
===Combining Parallel Jobs===&lt;br /&gt;
*Merge all parallel jobs into a single set of unique scores.&lt;br /&gt;
 $mud/combine.py&lt;br /&gt;
This combine carefully accounts for all docked molecules, for more informative enrichment plots.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -b or --broken to skip finding broken molecules. Use -d or --done to indicate that all subjobs are complete, for the case where you did not submit with a MUD submission script. Use -p or --prefix if your output files are named something other than test. Use --box if your  box file is not at ../../grids/box relative to your subjob directories.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#combine.scores - fully processed scores, using the best one for each id&lt;br /&gt;
#combine.raw - contains all scores as scrapped from DOCK output&lt;br /&gt;
#combine.broken - broken molecules and the reason they failed&lt;br /&gt;
#combine.zeroes - important sanity check&lt;br /&gt;
&lt;br /&gt;
format of combine.scores:&lt;br /&gt;
 &amp;lt;id&amp;gt; &amp;lt;shape&amp;gt; &amp;lt;elect&amp;gt; &amp;lt;VdW&amp;gt; &amp;lt;polar solv&amp;gt; &amp;lt;apolar solv&amp;gt; &amp;lt;total&amp;gt; &amp;lt;subdir&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The .zeroes file is a sanity check because it lists the number of molecules followed by the number of zeroes in each scoring column. Past experience has shown that when DOCK fails randomly and silently, it often generates a large  number of zero scores. If this happens, simply re-running the job will give better results. &lt;br /&gt;
&lt;br /&gt;
===Computing Enrichments===&lt;br /&gt;
*Compute enrichment starting from the combined scores.&lt;br /&gt;
 $mud/enrich.py -s -l LIGAND_FILE&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/enrich.py -l LIGAND_FILE -d DECOY_FILE&lt;br /&gt;
Generates both enrichment and roc curves, both for the ligands against all molecules and for the ligands versus just the decoys. It will try to run combine if it has not been run yet, but will do so only with defaults for every option.&lt;br /&gt;
&lt;br /&gt;
*Input:&lt;br /&gt;
Use -l to specify the ligand identifier file and -d to specify the decoy identifier file.&lt;br /&gt;
&lt;br /&gt;
The identifier files simply contain an id for each known ligand that matched the one in the docking databases. The script is smart enough to match &amp;quot;ZINC12345678&amp;quot; to &amp;quot;C12345678&amp;quot;, so either form is acceptable.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip consideration of decoys and thus generation of _own curves. Use -f to force combine to run again.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;span id=&amp;quot;Enrich_Types&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
*Creates:&lt;br /&gt;
#enrich.txt - Enrichment curve for ligands versus all molecules&lt;br /&gt;
#roc.txt - ROC curve for ligands versus all molecules&lt;br /&gt;
#enrich_own.txt - Enrichment curve for ligands versus only the decoys&lt;br /&gt;
#roc_own.txt - ROC curve for ligands versus only the decoys&lt;br /&gt;
_own files are not generate is the -s option is used.&lt;br /&gt;
&lt;br /&gt;
format for output files:&lt;br /&gt;
 #AUC 50.00  LogAUC 0.00&lt;br /&gt;
 &amp;lt;x&amp;gt; &amp;lt;y&amp;gt;&lt;br /&gt;
 &amp;lt;x&amp;gt; &amp;lt;y&amp;gt;&lt;br /&gt;
  ...&lt;br /&gt;
AUC is area under the curve and the random expectation value is 50%. LogAUC is the area between the log curve and the log random curve, so the random expectation value is 0%. &amp;lt;y&amp;gt; is always &amp;quot;% ligands found&amp;quot;, and &amp;lt;x&amp;gt; is either &amp;quot;% database searched&amp;quot; for enrichment plots or &amp;quot;% non-ligands found&amp;quot; for ROC plots.&lt;br /&gt;
&lt;br /&gt;
===Plotting Enrichments===&lt;br /&gt;
Easily plot enrichment and roc curves from one or more jobs.&lt;br /&gt;
 $mud/plots.py -i . -l New_Run -i ../old_run_dir -l Old_Run -t AmpC&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/plots.py -i .&lt;br /&gt;
Generates plots with one curve for each -i input_directory.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip _own curves, especially if they don&#039;t exist because enrich.py was run with -s. You can either label each -i INDIR with a -l LABEL, or use no -l options to get the default labels based on parent directory names. Use -t TITLE to change the plot title and filename. Use -o to specify a different output directory. Use -n to get normal instead of semi-log plots (and AUC in place of LogAUC).&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#[title_]enrich.png&lt;br /&gt;
#[title_]roc.png&lt;br /&gt;
#[title_]enrich_own.png&lt;br /&gt;
#[title_]roc_own.png&lt;br /&gt;
&lt;br /&gt;
The various graphs have the same meaning as their respective cures from [[#Computing Enrichments]]. [title_] is optional and exists when a custom title is given with the -t option.&lt;br /&gt;
&lt;br /&gt;
===Computing Energy Histograms===&lt;br /&gt;
*Compute energy distributions starting from the combined scores.&lt;br /&gt;
 $mud/energies.py -s -l LIGAND_FILE&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/energies.py -l LIGAND_FILE -d DECOY_FILE&lt;br /&gt;
Generates the energy distributions for the ligands, decoys, and all the other molecules.&lt;br /&gt;
&lt;br /&gt;
*Input:&lt;br /&gt;
Use -l to specify the ligand identifier file and -d to specify the decoy identifier file.&lt;br /&gt;
&lt;br /&gt;
The identifier files simply contain an id for each known ligand that matched the one in the docking databases. The script is smart enough to match &amp;quot;ZINC12345678&amp;quot; to &amp;quot;C12345678&amp;quot;, so either form is acceptable.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -s or --skip-own-curves to skip consideration of decoys.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#counts.txt - Energy distributions&lt;br /&gt;
&lt;br /&gt;
format for output:&lt;br /&gt;
 number_of_sections number_of_bins min_energy_threshold max_energy_threshold&lt;br /&gt;
 ##### section_name&lt;br /&gt;
 bin_upper_edge1 count_below_edge1&lt;br /&gt;
 ...&lt;br /&gt;
 bin_upper_edgeN count_below_edgeN&lt;br /&gt;
 ABOVE count_above_last_edge&lt;br /&gt;
The sections are for ligands, decoys (optional), and others. The bins and counts define the energy histogram. The bins are finely spaced here in order to have more resolution when combine with other runs, whose energy ranges may be different.&lt;br /&gt;
&lt;br /&gt;
===Plotting Energy Histograms===&lt;br /&gt;
Easily plot energy histograms from one or more jobs.&lt;br /&gt;
 $mud/eplots.py -i . -l New_Run -i ../old_run_dir -l Old_Run -t AmpC&lt;br /&gt;
 &amp;lt; or &amp;gt;&lt;br /&gt;
 $mud/eplots.py -i .&lt;br /&gt;
Generates plots with energy distributions for each -i input_directory.&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
You can either label each -i INDIR with a -l LABEL, or use no -l options to get the default labels based on parent directory names. Use -t TITLE to change the plot title and filename. Use -o to specify a different output directory.&lt;br /&gt;
&lt;br /&gt;
*Creates:&lt;br /&gt;
#[title_]counts.png&lt;br /&gt;
&lt;br /&gt;
===Visualizing Molecule by Molecule Results===&lt;br /&gt;
Create a DOCK 4,5,6 type pdb file for use in Chimera&#039;s ViewDOCK.&lt;br /&gt;
 $mud/topdock.py -o topdock.pdb&lt;br /&gt;
&lt;br /&gt;
*Options:&lt;br /&gt;
Use -o to specify an output file besides stdout. Use -t NUMBER to get whatever number of top scoring molecules.&lt;br /&gt;
&lt;br /&gt;
&amp;amp;rarr; Back to [[Tutorials]]&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=How_to_compile_DOCK&amp;diff=3204</id>
		<title>How to compile DOCK</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=How_to_compile_DOCK&amp;diff=3204"/>
		<updated>2009-12-05T01:19:03Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
This is for the Shoichet Lab local version of DOCK 3.5.54 trunk. &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Checking out the source files&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
Commands:&lt;br /&gt;
 csh&lt;br /&gt;
 mkdir /where/to/put&lt;br /&gt;
 cd /where/to/put&lt;br /&gt;
 svn checkout file:///raid4/svn/dock&lt;br /&gt;
 svn checkout file:///raid4/svn/libfgz&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Compiling the program on our cluster&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
Commands:&lt;br /&gt;
 ssh sgehead&lt;br /&gt;
 # You should see &amp;quot;Enabling pgf compiler&amp;quot; when you login, otherwise seek help&lt;br /&gt;
 cd /where/to/put/libfgz/trunk&lt;br /&gt;
 make&lt;br /&gt;
 cd ../../dock/trunk/i386&lt;br /&gt;
 make&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Compiling the program on the shared QB3 cluster&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
On one of the compilation nodes on the shared QB3 cluster (optint1 or optint2):&lt;br /&gt;
&lt;br /&gt;
 ssh optint2&lt;br /&gt;
 cd /where/to/put/libfgz/trunk&lt;br /&gt;
 cp Makefile Makefile.old&lt;br /&gt;
 modify Makefile:&lt;br /&gt;
  uncomment the following:&lt;br /&gt;
  FC = ifort -O3&lt;br /&gt;
  CC = icc -O3&lt;br /&gt;
 make&lt;br /&gt;
 cd ../../dock/trunk/i386&lt;br /&gt;
 cp Makefile Makefile.old&lt;br /&gt;
 modify Makefile&lt;br /&gt;
  uncomment the following:&lt;br /&gt;
  F77 = ifort&lt;br /&gt;
  FFLAGS = -O3 -convert big_endian&lt;br /&gt;
 make dock&lt;br /&gt;
&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=How_to_compile_DOCK&amp;diff=3203</id>
		<title>How to compile DOCK</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=How_to_compile_DOCK&amp;diff=3203"/>
		<updated>2009-12-05T01:14:15Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: Change to subversion&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;&#039;Checking out the source files&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
Commands:&lt;br /&gt;
 csh&lt;br /&gt;
 mkdir /where/to/put&lt;br /&gt;
 cd /where/to/put&lt;br /&gt;
 svn checkout file:///raid4/svn/dock&lt;br /&gt;
 svn checkout file:///raid4/svn/libfgz&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Compiling the program on our cluster&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
Commands:&lt;br /&gt;
 cd /where/to/put/libfgz/trunk&lt;br /&gt;
 make&lt;br /&gt;
 cd ../../dock/trunk/i386&lt;br /&gt;
 make&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Compiling the program on the shared QB3 cluster&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
On one of the compilation nodes on the shared QB3 cluster (optint1 or optint2):&lt;br /&gt;
&lt;br /&gt;
 ssh optint2&lt;br /&gt;
 cd /where/to/put/libfgz/trunk&lt;br /&gt;
 cp Makefile Makefile.old&lt;br /&gt;
 modify Makefile:&lt;br /&gt;
  uncomment the following:&lt;br /&gt;
  FC = ifort -O3&lt;br /&gt;
  CC = icc -O3&lt;br /&gt;
 make&lt;br /&gt;
 cd ../../dock/trunk/i386&lt;br /&gt;
 cp Makefile Makefile.old&lt;br /&gt;
 modify Makefile&lt;br /&gt;
  uncomment the following:&lt;br /&gt;
  F77 = ifort&lt;br /&gt;
  FFLAGS = -O3 -convert big_endian&lt;br /&gt;
 make dock&lt;br /&gt;
&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=SGE_Cluster_Docking&amp;diff=4310</id>
		<title>SGE Cluster Docking</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=SGE_Cluster_Docking&amp;diff=4310"/>
		<updated>2009-09-25T21:15:23Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: /* Typical Docking Workflow */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== SGE Cluster Information ==&lt;br /&gt;
&lt;br /&gt;
*&#039;sgehead.compbio.ucsf.edu&#039; is the submit machine for the Sun Grid Engine (SGE) cluster. wilco is also authorized to submit jobs.&lt;br /&gt;
*&#039;sgemaster.compbio.ucsf.edu&#039; is the admin machine for the SGE cluster.&lt;br /&gt;
*There are around 250 cluster nodes providing 600 total cores to run jobs in the sge queue as of May, 2009, named like &#039;node-1-1&#039; through &#039;node-3-36&#039; where the first number is the rack # and the second is the slot # in that rack.&lt;br /&gt;
&lt;br /&gt;
== SGE Commands ==&lt;br /&gt;
*qsub: submit jobs&amp;lt;br&amp;gt;&lt;br /&gt;
*qstat: check job status&amp;lt;br&amp;gt;&lt;br /&gt;
*qdel: remove jobs&amp;lt;br&amp;gt;&lt;br /&gt;
*qhost: check cluster status&amp;lt;br&amp;gt; &lt;br /&gt;
*man sge_intro: start of manpage documentation&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Typical Docking Workflow ==&lt;br /&gt;
&lt;br /&gt;
*Generate spheres and grids - See [[Using MakeDOCK]] for more information, including how to prepare the receptor and ligand&lt;br /&gt;
 ssh sgehead.compbio.ucsf.edu   # ssh to SGE submit machine&lt;br /&gt;
 mkdir example                  # make docking directory&lt;br /&gt;
 cd example                     # change to docking directory&lt;br /&gt;
 cp &amp;lt;somedir&amp;gt;/rec.pdb .         # copy or create rec.pdb&lt;br /&gt;
 cp &amp;lt;somedir&amp;gt;/xtal-lig.mol2 .   # copy or create xtal-lig.mol2 (or even xtal-lig.pdb)&lt;br /&gt;
 startdockblaster4              # create spheres and grids                 &lt;br /&gt;
                                # Check output for WARNING messages, correct as needed&lt;br /&gt;
&lt;br /&gt;
* Submit docking run&lt;br /&gt;
 cp calibrate/INDOCK.1.A INDOCK # copy or create INDOCK&lt;br /&gt;
 md4db.csh bysubset 2 50        # create directories for docking run with 50 chunks&lt;br /&gt;
                                # 2 indicates we want the fragment-like subset of ZINC (See http://zinc.docking.org/subset1)&lt;br /&gt;
 cd run.2                       # chdir into run.2 directory&lt;br /&gt;
 startdockbks3 .                # submit database chunks to SGE cluster&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For information on which ZINC&lt;br /&gt;
&lt;br /&gt;
[[Category:Internal]]&lt;br /&gt;
[[Category:Cluster]]&lt;br /&gt;
[[Category:Unix]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=How_to_compile_DOCK&amp;diff=3202</id>
		<title>How to compile DOCK</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=How_to_compile_DOCK&amp;diff=3202"/>
		<updated>2009-08-28T00:30:52Z</updated>

		<summary type="html">&lt;p&gt;Mysinger: bugfix&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;&#039;Checking out the source files&#039;&#039;&#039;&lt;br /&gt;
* change to cshell.&lt;br /&gt;
* create a directory for the source files.&lt;br /&gt;
* change to this directory.&lt;br /&gt;
* set the environment variable for CVS.&lt;br /&gt;
* check out the dock sources.&lt;br /&gt;
* check out the auxilliary libraries.&lt;br /&gt;
&lt;br /&gt;
As commands:&lt;br /&gt;
 csh&lt;br /&gt;
 mkdir /where/to/put/dock35&lt;br /&gt;
 cd /where/to/put/dock35&lt;br /&gt;
 setenv CVSROOT /raid1/cvs&lt;br /&gt;
 cvs co dock&lt;br /&gt;
 cvs co libfgz&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Compiling the program&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
On a 64-bit machine, e.g. one of the compilation nodes on the shared QB3 cluster (optint1 or optint2):&lt;br /&gt;
&lt;br /&gt;
 ssh optint2&lt;br /&gt;
 cd /where/to/put/dock35&lt;br /&gt;
 cd libfgz/&lt;br /&gt;
 cp Makefile Makefile.old&lt;br /&gt;
 modify Makefile:&lt;br /&gt;
  comment out the following:&lt;br /&gt;
  #FC = gfortran -O3&lt;br /&gt;
  #CC = gcc -O3&lt;br /&gt;
  uncomment the following:&lt;br /&gt;
  FC = ifort -O3&lt;br /&gt;
  CC = icc -O3&lt;br /&gt;
 make&lt;br /&gt;
 cd ../dock/i386/&lt;br /&gt;
 cp Makefile Makefile.old&lt;br /&gt;
 modify Makefile&lt;br /&gt;
  comment out the following:&lt;br /&gt;
  #F77 = pgf77&lt;br /&gt;
  #FFLIBS = -lc -lgcc_eh -lgfortran&lt;br /&gt;
  #FFLAGS = -byteswapio ...&lt;br /&gt;
  uncomment the following:&lt;br /&gt;
  F77 = ifort&lt;br /&gt;
  FFLAGS = -O3 -convert big_endian&lt;br /&gt;
 make&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Tutorials]]&lt;/div&gt;</summary>
		<author><name>Mysinger</name></author>
	</entry>
</feed>