Resources: Difference between revisions

(28 intermediate revisions by the same user not shown)
Line 1: Line 1:
A resource is the type of thing you are asking for (substance, catalog, gene, etc).  Resources are written as plurals in the URL, indicating that the result is always a selection from among them.  Each ZINC URL may specify only one resource.  The URL syntax here covers both the web pages and the API. We currently support the following resources:
A resource is the type of thing you are asking about, such as substance, catalog, gene, etc.  Resources are written as plurals in the URL, indicating that the result is always a selection from among them.  Each ZINC15 URL begins with a resource.  A second resource may be specified later in the URL, as may constraints derived from other resources. There are nearly 20 resources in ZINC15 already (see below), but we do not expect the final total to exceed 40.  Each resource is documented online at its help endpoint; for example, the help page for the substances resource is http://zinc15.docking.org/substances/help.  The URL syntax here covers both the web pages and the API. We currently support the following resources:


{| class="wikitable"
{| class="wikitable"
Line 5: Line 5:
! Resource !! Is a set of !! Approximate<br />Number !! Example use of this resource when the answer to your question is:<br /> a) a list of these things<br /> b) one of these things, or<br /> c) things derived from a single one of these things.  
! Resource !! Is a set of !! Approximate<br />Number !! Example use of this resource when the answer to your question is:<br /> a) a list of these things<br /> b) one of these things, or<br /> c) things derived from a single one of these things.  
|-
|-
|  substances || molecules || 200,000,000 || a) purchasable compounds [http://zinc15.docking.org/substances/subsets/purchasable]<br />b) ZINC ID 53 [http://zinc15.docking.org/substances/53]<br />c) All genes hit by compound ZINC597013  [http://zinc15.docking.org/substances/ZINC000000597013/genes]
| [http://zinc15.docking.org/substances substances] || molecules || 200,000,000 || a) compounds you can buy [http://zinc15.docking.org/substances/subsets/for-sale]<br />b) ZINC ID 53 [http://zinc15.docking.org/substances/53]<br />c) All genes hit by compound ZINC597013  [http://zinc15.docking.org/substances/ZINC000000597013/genes]
|-
|-
| catalogs || vendor and annotated catalogs || 400 || a) catalogs whose members are biogenic (but see also endogenous, metabolite) [http://zinc15.docking.org/catalogs/subsets/biogenic]<br />b) The ChEMBL20 catalog [http://zinc15.docking.org/catalogs/chembl20/]<br />c) Items in the DrugBank FDA catalogs [http://zinc15.docking.org/catalogs/dbfda/items/]  Interestingly, this shows clearly when a DrugBank ID has been expanded into multiple ZINC IDs.
| [http://zinc15.docking.org/catalogs catalogs] || vendor and annotated catalogs || 400 || a) catalogs whose members are biogenic (but see also endogenous, metabolite) [http://zinc15.docking.org/catalogs/subsets/biogenic]<br />b) The ChEMBL20 catalog [http://zinc15.docking.org/catalogs/chembl20/]<br />c) Items in the DrugBank FDA catalogs [http://zinc15.docking.org/catalogs/dbfda/items/]  Interestingly, this shows clearly when a DrugBank ID has been expanded into multiple ZINC IDs.
|-
|-
| genes || UniProt Gene Symbols || 2800 || a) All Class A GPCR genes [http://zinc15.docking.org/genes/subsets/gpcra/]<br />b) The BTK gene [http://zinc15.docking.org/genes/BTK]<br />c) SEA Predicted ligands for 5-HT2A [http://zinc15.docking.org/genes/HTR2A/predictions/]
| [http://zinc15.docking.org/genes genes] || UniProt Gene Symbols || 2800 || a) All Class A GPCR genes [http://zinc15.docking.org/genes/subsets/gpcra/]<br />b) The BTK gene [http://zinc15.docking.org/genes/BTK]<br />c) SEA Predicted ligands for 5-HT2A [http://zinc15.docking.org/genes/HTR2A/predictions/]
|-
|-
| catitems || What they are called in the source catalog || 1 billion || about original catalog information
| [http://zinc15.docking.org/catitems catitems] || What vendors and annotated catalogs call the molecules in their source catalogs || 1 billion || a) All molecules where the original supplier code begins with B1234 [http://zinc15.docking.org/catitems/?supplier_code-startswith=B1234] <br />b) N.B. There is no unique key for supplier codes, so you cannot necessarily select one.<br />The catitem code lets you write queries that originate in another catalog:<br />Items in the HMDB database where the catalog code starts with HMDB1 and the gene it is active against starts with ADR  [http://zinc15.docking.org/catitems/?supplier_code-startswith=HMDB1&catalog.short_name=hmdb&gene.name-startswith=ADR]<br />Compounds sold by Frontier whose code begins with B123 and that contain a pyridine ring  [http://zinc15.docking.org/catitems/?supplier_code-startswith=B123&catalog.short_name=frontier&substance.structure-contains=c1ccncc1]
|-
|-
| protomers || 3D representations || 6 million and growing || various protonated and tautomerized forms of the molecule in 3D
| [http://zinc15.docking.org/protomers protomers] || 3D representations || 6 million and growing || a) Molecules with a charge of minus 2 at pH 7.4. [http://zinc15.docking.org/protomers/subsets/ref/?net_charge=-2]<br />b) One individual representation in 3D SDF format [http://zinc15.docking.org/protomers/3913415.sdf]<br />c) Catalogs that this molecule can be found in [http://zinc15.docking.org/protomers/3913415/catalogs/]
|-
|-
| targets || UniProt accession codes|| 3800 || species-specific molecular targets
| [http://zinc15.docking.org/orthologs orthologs] || UniProt accession codes, thus species specific|| 3800 || a) Information about orthologs by uniprot code in ZINC as a text file [http://zinc15.docking.org/orthologs.txt]<br />b) Information about the FTO_HUMAN ortholog. [http://zinc15.docking.org/orthologs/FTO_HUMAN.txt] <br />c) Substances annotated to bind FTO_HUMAN [http://zinc15.docking.org/orthologs/FTO_HUMAN/substances/]
|-
|-
| activities || Observed || 220,000 || about individual observations of biological activity of small molecules.
| [http://zinc15.docking.org/activities activities] || best ligand-gene affinity. See observations for each individual value || 220,000 ||a) Individual data points of compound-gene associations [http://zinc15.docking.org/activities]<br />c) All activities better than 1 nM [http://zinc15.docking.org/activities/subsets/1nM/]<br />
|-
|-
| clusterreps || Cluster representatives || 3900 || about chemical diversity of ligands for genes
| [http://zinc15.docking.org/majorclasses majorclasses] || Major classes || 15 || a) A list of major classes [http://zinc15.docking.org/majorclasses/]<br />b) The membrane_receptor major class [http://zinc15.docking.org/majorclasses/membrane_receptor]<br />c) Genes in the enzyme major class [http://zinc15.docking.org/majorclasses/enzyme/]<br />Another example: substances for the genes in the ion channel major class [http://zinc15.docking.org/majorclasses/ion_channel/substances]
|-
|-
| clusters || Clustered compounds by gene || 220,000 || about compounds for genes organized by chemotype.
| [http://zinc15.docking.org/subclasses subclasses] ||Subclasses|| 44 || a) A list of subclasses [http://zinc15.docking.org/subclasses]<br />b) The GPCR-A subclass [http://zinc15.docking.org/subclasses/gpcr-a/]<br />c) The substances that bind the GPCR F subclass [http://zinc15.docking.org/subclasses/gpcr-f/substances]
|-
|-
| majorclasses ||asdf asdf || 15 || asdfasfd sadf
| [http://zinc15.docking.org/organisms organisms] ||asdf asdf || 5 ||a) List of supported organisms [http://zinc15.docking.org/organisms.txt]<br />b) not supported<br />c) not supported
|-
|-
| subclasses ||asdf asdf || 44 || asdfasfd sadf
| [http://zinc15.docking.org/ecfp4s.txt ecfp4s]  || lala || 100,000,000 ||a) List of fingerprints [http://zinc15.docking.org/ecfp4s.txt]<br />b) Fingerprint for ZINC53 [http://zinc15.docking.org/ecfp4s/53.txt]<br />c) Fingerprints for all ligands for MR [http://zinc15.docking.org/ecfp4s/genes/MR.txt]
|-
|-
| organisms ||asdf asdf || 5 || asdfasfd sadf
| [http://zinc15.docking.org/predictions predictions] || SEA predictions || 1 billion || a) List of predictions [http://zinc15.docking.org/predictions]<br />b) Prediction #1 [http://zinc15.docking.org/predictions/1]<br />c) Predictions for ADRB2 [http://zinc15.docking.org/predictions/genes/DRD2]
|-
|-
| tools ||asdf asdf || 10,000 || Reserved for future use
| [http://zinc15.docking.org/atccodes atccodes] || atc codes || 1500 || a) List of ATC codes [http://zinc15.docking.org/atccodes/]<br />b) ATC code L01A (cancer) [http://zinc15.docking.org/atccodes/L01A/]<br />c) something goes here...
|-
|-
| ecfp4s  || lala || 100,000,000 || fingerprints
| [http://zinc15.docking.org/generelations generelations] || gene_relations || 250,000 || a) List of gene-gene relations [http://zinc15.docking.org/generelations/]<br />b) Gene relation #1 [http://zinc15.docking.org/generelations/1]<br />c) Similar genes by the ligands they bind for OPRK [http://zinc15.docking.org/generelations/graph/OPRK1/prediction]  other options are sequence-similarity and coexpression and ppi
|-
|-
| predictions || SEA predictions || 1 billion || sea
| [http://zinc15.docking.org/observations observations] || Individual reports of ligand-gene associations.  See activities if you just want the best value for each gene || 280,000 || a) List of observations [http://zinc15.docking.org/observations/]<br />b) Something goes here<br />c) Something goes here too.
|-
|-
| atccodes || atc codes || 1500 || atccodes
| [http://zinc15.docking.org/patterns patterns] || SMARTS patterns || 535 patterns, 2.5 M entries ||a) List of patterns [http://zinc15.docking.org/patterns/]<br />b) Something here<br />c) Something here
|-
|-
| generelations || gene_relations || 250,000 || gene_relations
| [http://zinc15.docking.org/rings rings] || Ring systems || 10,000 || a) List of rings [http://zinc15.docking.org/rings/]<br />b) something here<br />c) something here
|-
|-
| observations || obs || 280,000 || obs
| [http://zinc15.docking.org/toolcompounds toolcompounds] || Tool compounds || 3000 || 3 chosen, 10 short listed, max, per gene<br />a) List of tool compounds [http://zinc15.docking.org/toolcompounds/]<br />b) something goes here<br />c) something here.
|}
|}


Historical note:  In ZINC 12, the reference-table was always substances, and thus the result of any query was always a set of molecules. A few reports such as vendor and target listings were available via the webpages, but were not supported by the API.  This is a substantial change that appeared first in ZINC 15.
There is also a pseudo-resource, [http://zinc15.docking.org/tranches/home tranches], which is physicochemical property space binned by logP and molecular weight.
 
Resources in development for 2015: clusters, clusterreps, nplikeness and druglikeness.  Others will be announced later.
 
Historical note:  In ZINC 12, the reference-table was always substances, and thus the result of any query was always a set of molecules. A few reports such as vendor and gene listings were available via the webpages, but were not supported by the API.  This is a substantial change that appeared first in ZINC 15.




[[Category:ZINC15]]
[[Category:ZINC15]]

Latest revision as of 18:30, 9 February 2017

A resource is the type of thing you are asking about, such as substance, catalog, gene, etc. Resources are written as plurals in the URL, indicating that the result is always a selection from among them. Each ZINC15 URL begins with a resource. A second resource may be specified later in the URL, as may constraints derived from other resources. There are nearly 20 resources in ZINC15 already (see below), but we do not expect the final total to exceed 40. Each resource is documented online at its help endpoint; for example, the help page for the substances resource is http://zinc15.docking.org/substances/help. The URL syntax here covers both the web pages and the API. We currently support the following resources:

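{| class="wikitable"
! Resource !! Is a set of !! Approximate<br />Number !! Example use of this resource when the answer to your question is:<br /> a) a list of these things<br /> b) one of these things, or<br /> c) things derived from a single one of these things.
|-
| [http://zinc15.docking.org/substances substances] || molecules || 200,000,000 || a) Compounds you can buy [http://zinc15.docking.org/substances/subsets/for-sale]<br />b) ZINC ID 53 [http://zinc15.docking.org/substances/53]<br />c) All genes hit by compound ZINC597013 [http://zinc15.docking.org/substances/ZINC000000597013/genes]
|-
| [http://zinc15.docking.org/catalogs catalogs] || vendor and annotated catalogs || 400 || a) Catalogs whose members are biogenic (but see also endogenous, metabolite) [http://zinc15.docking.org/catalogs/subsets/biogenic]<br />b) The ChEMBL20 catalog [http://zinc15.docking.org/catalogs/chembl20/]<br />c) Items in the DrugBank FDA catalogs [http://zinc15.docking.org/catalogs/dbfda/items/]. This listing shows clearly when a DrugBank ID has been expanded into multiple ZINC IDs.
|-
| [http://zinc15.docking.org/genes genes] || UniProt Gene Symbols || 2800 || a) All Class A GPCR genes [http://zinc15.docking.org/genes/subsets/gpcra/]<br />b) The BTK gene [http://zinc15.docking.org/genes/BTK]<br />c) SEA-predicted ligands for 5-HT2A (gene HTR2A) [http://zinc15.docking.org/genes/HTR2A/predictions/]
|-
| [http://zinc15.docking.org/catitems catitems] || What vendors and annotated catalogs call the molecules in their source catalogs || 1 billion || a) All molecules where the original supplier code begins with B1234 [http://zinc15.docking.org/catitems/?supplier_code-startswith=B1234]<br />b) N.B. There is no unique key for supplier codes, so you cannot necessarily select one.<br />The catitem code lets you write queries that originate in another catalog:<br />Items in the HMDB database where the catalog code starts with HMDB1 and the gene it is active against starts with ADR [http://zinc15.docking.org/catitems/?supplier_code-startswith=HMDB1&catalog.short_name=hmdb&gene.name-startswith=ADR]<br />Compounds sold by Frontier whose code begins with B123 and that contain a pyridine ring [http://zinc15.docking.org/catitems/?supplier_code-startswith=B123&catalog.short_name=frontier&substance.structure-contains=c1ccncc1]
|-
| [http://zinc15.docking.org/protomers protomers] || 3D representations || 6 million and growing || a) Molecules with a charge of minus 2 at pH 7.4 [http://zinc15.docking.org/protomers/subsets/ref/?net_charge=-2]<br />b) One individual representation in 3D SDF format [http://zinc15.docking.org/protomers/3913415.sdf]<br />c) Catalogs that this molecule can be found in [http://zinc15.docking.org/protomers/3913415/catalogs/]
|-
| [http://zinc15.docking.org/orthologs orthologs] || UniProt accession codes, thus species specific || 3800 || a) Information about orthologs by UniProt code in ZINC as a text file [http://zinc15.docking.org/orthologs.txt]<br />b) Information about the FTO_HUMAN ortholog [http://zinc15.docking.org/orthologs/FTO_HUMAN.txt]<br />c) Substances annotated to bind FTO_HUMAN [http://zinc15.docking.org/orthologs/FTO_HUMAN/substances/]
|-
| [http://zinc15.docking.org/activities activities] || best ligand-gene affinity. See observations for each individual value || 220,000 || a) Individual data points of compound-gene associations [http://zinc15.docking.org/activities]<br />c) All activities better than 1 nM [http://zinc15.docking.org/activities/subsets/1nM/]
|-
| [http://zinc15.docking.org/majorclasses majorclasses] || Major classes || 15 || a) A list of major classes [http://zinc15.docking.org/majorclasses/]<br />b) The membrane_receptor major class [http://zinc15.docking.org/majorclasses/membrane_receptor]<br />c) Genes in the enzyme major class [http://zinc15.docking.org/majorclasses/enzyme/]<br />Another example: substances for the genes in the ion channel major class [http://zinc15.docking.org/majorclasses/ion_channel/substances]
|-
| [http://zinc15.docking.org/subclasses subclasses] || Subclasses || 44 || a) A list of subclasses [http://zinc15.docking.org/subclasses]<br />b) The GPCR-A subclass [http://zinc15.docking.org/subclasses/gpcr-a/]<br />c) The substances that bind the GPCR F subclass [http://zinc15.docking.org/subclasses/gpcr-f/substances]
|-
| [http://zinc15.docking.org/organisms organisms] || supported organisms || 5 || a) List of supported organisms [http://zinc15.docking.org/organisms.txt]<br />b) not supported<br />c) not supported
|-
| [http://zinc15.docking.org/ecfp4s.txt ecfp4s] || ECFP4 fingerprints || 100,000,000 || a) List of fingerprints [http://zinc15.docking.org/ecfp4s.txt]<br />b) Fingerprint for ZINC53 [http://zinc15.docking.org/ecfp4s/53.txt]<br />c) Fingerprints for all ligands for MR [http://zinc15.docking.org/ecfp4s/genes/MR.txt]
|-
| [http://zinc15.docking.org/predictions predictions] || SEA predictions || 1 billion || a) List of predictions [http://zinc15.docking.org/predictions]<br />b) Prediction #1 [http://zinc15.docking.org/predictions/1]<br />c) Predictions for DRD2 [http://zinc15.docking.org/predictions/genes/DRD2]
|-
| [http://zinc15.docking.org/atccodes atccodes] || ATC codes || 1500 || a) List of ATC codes [http://zinc15.docking.org/atccodes/]<br />b) ATC code L01A (cancer) [http://zinc15.docking.org/atccodes/L01A/]<br />c) (example to be added)
|-
| [http://zinc15.docking.org/generelations generelations] || gene-gene relations || 250,000 || a) List of gene-gene relations [http://zinc15.docking.org/generelations/]<br />b) Gene relation #1 [http://zinc15.docking.org/generelations/1]<br />c) Genes similar to OPRK1 by the ligands they bind [http://zinc15.docking.org/generelations/graph/OPRK1/prediction]. Other options are sequence-similarity, coexpression and ppi.
|-
| [http://zinc15.docking.org/observations observations] || Individual reports of ligand-gene associations. See activities if you just want the best value for each gene || 280,000 || a) List of observations [http://zinc15.docking.org/observations/]<br />b) (example to be added)<br />c) (example to be added)
|-
| [http://zinc15.docking.org/patterns patterns] || SMARTS patterns || 535 patterns, 2.5 M entries || a) List of patterns [http://zinc15.docking.org/patterns/]<br />b) (example to be added)<br />c) (example to be added)
|-
| [http://zinc15.docking.org/rings rings] || Ring systems || 10,000 || a) List of rings [http://zinc15.docking.org/rings/]<br />b) (example to be added)<br />c) (example to be added)
|-
| [http://zinc15.docking.org/toolcompounds toolcompounds] || Tool compounds || 3000 || 3 chosen, 10 shortlisted, maximum, per gene<br />a) List of tool compounds [http://zinc15.docking.org/toolcompounds/]<br />b) (example to be added)<br />c) (example to be added)
|}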
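
The URL shapes in the table (a list or subset, one item, things derived from one item, and an optional format suffix such as .txt or .sdf) can be assembled mechanically. The following is a minimal Python sketch of that pattern, not part of ZINC15 itself: the function names `resource_url` and `zinc_id` are hypothetical, and the 12-digit zero padding is an assumption inferred from the single long-form ID shown in the table. It only builds strings and makes no network requests.

```python
from typing import Optional

# Host used by every example URL in the table above.
BASE_URL = "http://zinc15.docking.org"


def zinc_id(number: int) -> str:
    """Return the long-form ID, e.g. 597013 -> ZINC000000597013.

    Assumption: long-form IDs are zero-padded to 12 digits, as in the
    one example given in the substances row.
    """
    return f"ZINC{number:012d}"


def resource_url(resource: str,
                 key: Optional[str] = None,
                 related: Optional[str] = None,
                 subset: Optional[str] = None,
                 fmt: Optional[str] = None) -> str:
    """Build a URL that begins with a resource (hypothetical helper).

    resource: plural resource name, e.g. "substances"
    subset:   named subset of the resource, e.g. "for-sale"
    key:      identifier of one item, e.g. "53" or "FTO_HUMAN"
    related:  second resource derived from that item, e.g. "genes"
    fmt:      format suffix without the dot, e.g. "txt" or "sdf"
    """
    parts = [BASE_URL, resource]
    if subset is not None:
        parts += ["subsets", subset]
    if key is not None:
        parts.append(key)
    if related is not None:
        parts.append(related)
    url = "/".join(parts)
    if fmt is not None:
        url += "." + fmt
    return url


if __name__ == "__main__":
    # a) a list of these things
    print(resource_url("substances", subset="for-sale"))
    # b) one of these things
    print(resource_url("substances", key="53"))
    # c) things derived from a single one of these things
    print(resource_url("substances", key=zinc_id(597013), related="genes"))
    # a format suffix selects a text or structure file
    print(resource_url("protomers", key="3913415", fmt="sdf"))
```

Each printed URL matches one of the examples in the table. Whether other combinations are accepted by the server is not stated here, so check the help endpoint of the resource before relying on them.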

There is also a pseudo-resource, [http://zinc15.docking.org/tranches/home tranches], which is physicochemical property space binned by logP and molecular weight.

Resources in development for 2015: clusters, clusterreps, nplikeness and druglikeness. Others will be announced later.

Historical note: In ZINC 12, the reference table was always substances, and thus the result of any query was always a set of molecules. A few reports, such as vendor and gene listings, were available via the web pages but were not supported by the API. Querying resources other than substances through both the web pages and the API is a substantial change that first appeared in ZINC 15.
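
Several examples in the table (the catitems and protomers rows) constrain a resource with query parameters such as supplier_code-startswith, catalog.short_name and net_charge. As a rough sketch of how such a URL could be put together in Python, the hypothetical helper `filtered_url` below uses the standard library's `urllib.parse.urlencode`. Only the parameter names that appear in the table are taken from the source; the helper name and the idea of passing the filters as a dictionary are assumptions for illustration.

```python
from urllib.parse import urlencode

# Host used by every example URL in the table above.
BASE_URL = "http://zinc15.docking.org"


def filtered_url(resource_path: str, filters: dict) -> str:
    """Append query-parameter constraints to a resource path (hypothetical helper).

    resource_path: e.g. "catitems" or "protomers/subsets/ref"
    filters:       mapping of constraint name to value; a dict is used
                   because names such as "supplier_code-startswith" and
                   "catalog.short_name" are not valid Python keyword names
    """
    # urlencode keeps letters, digits, "-", "." and "_" as they are and
    # percent-encodes anything else, such as "=" or "#" in a SMILES string.
    return f"{BASE_URL}/{resource_path}/?{urlencode(filters)}"


if __name__ == "__main__":
    # Frontier compounds whose supplier code begins with B123 and that
    # contain a pyridine ring (the last catitems example in the table).
    print(filtered_url("catitems", {
        "supplier_code-startswith": "B123",
        "catalog.short_name": "frontier",
        "substance.structure-contains": "c1ccncc1",
    }))
    # Protomers with a net charge of -2 (the first protomers example).
    print(filtered_url("protomers/subsets/ref", {"net_charge": -2}))
```

Both printed URLs are identical to the corresponding links in the table. Structure queries that contain characters such as "=" or "#" will be percent-encoded by `urlencode`; how the server treats those is not covered by this page.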