Rsyncing zinc15

Revision as of 07:21, 29 April 2020 by Dudenko (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

A few examples below: This is how you set up a ZINC Mirror. I am looking at you Uppsala, Beijing, and Kiev.

  1. what is available?
rsync --list-only rsync://
  1. download 2D
rsync -L  -a --progress  --include="[ABCDEFG][A-G]*.smi" --exclude="[HIJK]*" rsync:// zinc
  1. download 3D
rsync -L  -a --progress  --include="[ABCDEFG][A-G]*.db2.gz" --exclude="[HIJK]*" rsync://  zinc
  1. Example (download 3D):

Let's say, you are in folder /ZINC15DB, which contains your 3D database (named as 3D), and you want to sync only *.db2.gz files from CA-tranche, then:

rsync -L  -a --progress --prune-empty-dirs --delete-excluded --include="*/" --include="CA[ABCE][A-D][RM][LMNOP].*.db2.gz" --exclude="*" rsync:// 3D

  1. special subsets. e.g. metabolites, biogenic

For instance, to get all metabolites in PDBQT format:

rsync -L -a --progress --prune-empty-dirs --include="*/" --include="*.pdbqt.gz" --exclude="*" rsync:// zinc

To get all biogenic in SDF format

rsync -L -a --progress --prune-empty-dirs --include="*/" --include="*.sdf.gz" --exclude="*" rsync:// zinc

To get all aggregators in DB2 format

rsync -L -a --progress --prune-empty-dirs --include="*/" --include="*.db.gz" --exclude="*" rsync:// zinc

To get SMILES and other details: