Rsyncing zinc15

From DISI
Revision as of 14:21, 29 April 2020 by Dudenko (talk | contribs)

The examples below show how to set up a ZINC mirror. I am looking at you, Uppsala, Beijing, and Kiev.

# What is available?
rsync --list-only rsync://files.docking.org
# Download 2D
rsync -L -a --progress --include="[ABCDEFG][A-G]*.smi" --exclude="[HIJK]*" rsync://files.docking.org/ZINC-2D/ zinc
# Download 3D
rsync -L -a --progress --include="[ABCDEFG][A-G]*.db2.gz" --exclude="[HIJK]*" rsync://files.docking.org/ZINC-3D/ zinc
# Example (download 3D):

Suppose you are in the folder /ZINC15DB, which contains your 3D database in a subdirectory named 3D, and you want to sync only the *.db2.gz files from the CA tranche:

rsync -L -a --progress --prune-empty-dirs --delete-excluded --include="*/" --include="CA[ABCE][A-D][RM][LMNOP].*.db2.gz" --exclude="*" rsync://files.docking.org/ZINC-3D/CA 3D
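Because --delete-excluded removes files in the destination that the filters exclude, it may be worth previewing the transfer first. The following is a minimal sketch, not part of the original page: the helper name sync_tranche and its argument order are hypothetical, while --dry-run is a standard rsync option that reports what would happen without changing anything.

```shell
# Hypothetical helper, not part of the original page.
# Arguments: tranche prefix, include pattern, destination directory,
# and an optional extra rsync flag such as --dry-run.
sync_tranche() {
    tranche="$1"
    pattern="$2"
    dest="$3"
    extra="${4:-}"
    # $extra is deliberately unquoted so that an empty value adds no argument.
    rsync -L -a --progress --prune-empty-dirs --delete-excluded $extra \
        --include="*/" --include="$pattern" --exclude="*" \
        "rsync://files.docking.org/ZINC-3D/${tranche}" "$dest"
}
```

Calling sync_tranche CA 'CA[ABCE][A-D][RM][LMNOP].*.db2.gz' 3D --dry-run should only list what would be transferred or deleted; dropping the last argument runs the real sync.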


# Special subsets, e.g. metabolites, biogenic

For instance, to get all metabolites in PDBQT format:

rsync -L -a --progress --prune-empty-dirs --include="*/" --include="*.pdbqt.gz" --exclude="*" rsync://files.docking.org/ZINC-special/metabolites zinc

To get all biogenic compounds in SDF format:

rsync -L -a --progress --prune-empty-dirs --include="*/" --include="*.sdf.gz" --exclude="*" rsync://files.docking.org/ZINC-special/biogenic zinc

To get all aggregators in DB2 format:

rsync -L -a --progress --prune-empty-dirs --include="*/" --include="*.db.gz" --exclude="*" rsync://files.docking.org/ZINC-special/aggregators zinc
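The three commands above differ only in the subset name and the file suffix, so they can be folded into one small function. This is a sketch under assumptions: sync_special is a hypothetical name, and it does not check which formats a given subset actually provides.

```shell
# Hypothetical helper, not part of the original page.
# Arguments: subset name, file suffix to keep, local destination directory.
sync_special() {
    subset="$1"   # e.g. metabolites, biogenic, aggregators
    ext="$2"      # e.g. pdbqt.gz or sdf.gz
    dest="$3"     # e.g. zinc
    # Keep directories and files with the requested suffix; exclude everything else.
    rsync -L -a --progress --prune-empty-dirs \
        --include="*/" --include="*.${ext}" --exclude="*" \
        "rsync://files.docking.org/ZINC-special/${subset}" "$dest"
}
```

For example, sync_special metabolites pdbqt.gz zinc builds the same command as the metabolites example, and sync_special biogenic sdf.gz zinc the same as the biogenic one.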

To get SMILES and other details:

wget http://files.docking.org/special/current/biogenic/biogenic.info.txt.gz
wget http://files.docking.org/special/current/metabolites/metabolites.info.txt.gz
wget http://files.docking.org/special/current/aggregators/aggregators.info.txt.gz
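The three info files follow the same URL pattern, so the downloads can be written as a loop, and a downloaded file can be inspected without unpacking it on disk. This is a minimal sketch: the function names fetch_special_info and peek_gz are hypothetical, and nothing is assumed here about the column layout of the files.

```shell
# Hypothetical helpers, not part of the original page.

# Download the info file for each of the three special subsets.
fetch_special_info() {
    for subset in biogenic metabolites aggregators; do
        wget "http://files.docking.org/special/current/${subset}/${subset}.info.txt.gz"
    done
}

# Print the first lines (default 5) of a gzip-compressed text file.
peek_gz() {
    gzip -dc "$1" | head -n "${2:-5}"
}
```

After fetch_special_info has run, peek_gz biogenic.info.txt.gz prints the first five lines of the biogenic file.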