ZINC15 chembl curation pipeline: Difference between revisions

From DISI
Jump to navigation Jump to search
(asdf)
 
(asdf)
Line 1: Line 1:
Step by step instructions, from the arrival of a new version of chembl to all tables updated.  
When a new version of ChEMBL appears, the following steps need to be taken, in order.


* download chembl
== Download and load ChEMBL as psql and sdf ==
* download the psql and SDF files
* load the SDF using the usual pipeline
* load the psql into a chembl20 schema.  This requires editing the psql file as distributed.


* load chembl  (permissions, location, ownership)
== Extract, process and load ChEMBL annotations ==


* export chembl
* as xyz, ~/store/chemblload/40/ README and files cited therein


* clustering


* sea calculation
== Chemical clustering ==
* export chembl once loaded, then load the clusters back into ZINC.


* sea loading
== SEA calculation ==


* other arrangements.
 
== TeeTee SEA calculation ==




[[Category:ZINC15]]
[[Category:ZINC15]]
[[Category:Curator]]
[[Category:Curator]]

Revision as of 22:15, 5 July 2015

When a new version of ChEMBL appears, the following steps need to be taken, in order.

Download and load ChEMBL as psql and sdf

  • download the psql and SDF files
  • load the SDF using the usual pipeline
  • load the psql into a chembl20 schema. This requires editing the psql file as distributed.

Extract, process and load ChEMBL annotations

  • as xyz, ~/store/chemblload/40/ README and files cited therein


Chemical clustering

  • export chembl once loaded, then load the clusters back into ZINC.

SEA calculation

TeeTee SEA calculation