ZINC15 chembl curation pipeline

From DISI
Revision as of 22:15, 5 July 2015 by Frodo (talk | contribs) (asdf)
Jump to navigation Jump to search

When a new version of ChEMBL appears, the following steps need to be taken, in order.

Download and load ChEMBL as psql and sdf

  • download the psql and SDF files
  • load the SDF using the usual pipeline
  • load the psql into a chembl20 schema. This requires editing the psql file as distributed.

Extract, process and load ChEMBL annotations

  • as xyz, ~/store/chemblload/40/ README and files cited therein


Chemical clustering

  • export chembl once loaded, then load the clusters back into ZINC.

SEA calculation

TeeTee SEA calculation