Chemical Compound Search
in PATENTSCOPE
SCP, December 13, 2016
Paul Halfpenny
Senior Administrator, Office of the Assistant Director General
Search chemical compounds
Principle:
- Recognize chemical compounds in patent texts and in drawings embedded in those texts
- Standardize all the different representations of chemical structures into InChIKeys and annotate the document
- Implement search functions for InChIKeys that can be used by non-chemists
Common Search Phrases
IUPAC name: N-(4-hydroxyphenyl)acetamide
INN: paracetamol
Other names: acetaminophen, Panadol, Tylenol, …
InChIKey: RZVAJINKPMORJF-UHFFFAOYSA-N
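The point of this slide, that many names resolve to one InChIKey, can be sketched in a few lines of Python. This is only an illustration: the `SYNONYMS` table and the `to_inchikey` helper are hypothetical and do not describe how PATENTSCOPE is implemented. The names and the key are the ones listed on the slide.

```python
import re

# Hypothetical synonym table; the names and the InChIKey come from the slide.
PARACETAMOL_KEY = "RZVAJINKPMORJF-UHFFFAOYSA-N"
SYNONYMS = {
    "n-(4-hydroxyphenyl)acetamide": PARACETAMOL_KEY,  # IUPAC name
    "paracetamol": PARACETAMOL_KEY,                   # INN
    "acetaminophen": PARACETAMOL_KEY,                 # other names
    "panadol": PARACETAMOL_KEY,
    "tylenol": PARACETAMOL_KEY,
}

# A standard InChIKey is 14 letters, a hyphen, 10 letters, a hyphen, 1 letter.
INCHIKEY_RE = re.compile(r"^[A-Z]{14}-[A-Z]{10}-[A-Z]$")


def to_inchikey(term):
    """Return the InChIKey for a name or key, or None if the term is unknown."""
    cleaned = term.strip()
    if INCHIKEY_RE.match(cleaned.upper()):
        # The term already is an InChIKey, so pass it through.
        return cleaned.upper()
    # Otherwise treat the term as a name and look it up case-insensitively.
    return SYNONYMS.get(cleaned.lower())
```

Whichever of the listed names a user types, the search would then run on the same key, which is why a non-chemist does not need to know the IUPAC name.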
Addition of InChIKey Annotation
Original: (…) At the moment the surgical procedure starts, benzodiazepin, e.g. diazepam, is administered in a dose of no more than 5 mg. (…)
Annotated: (…) At the moment the surgical procedure starts, benzodiazepin, e.g. @AAOVKJBEBIDNHE-UHFFFAOYSA-N@, is administered in a dose of no more than 5 mg. (…)
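The annotation step shown on this slide can be approximated as follows. This is a minimal sketch, assuming a small name-to-InChIKey table is already available. `KNOWN_COMPOUNDS` and `annotate` are hypothetical names, and the real recognition pipeline is certainly more involved than a dictionary lookup.

```python
import re

# Hypothetical name-to-InChIKey table with lowercase keys;
# the diazepam key is the one shown on the slide.
KNOWN_COMPOUNDS = {"diazepam": "AAOVKJBEBIDNHE-UHFFFAOYSA-N"}


def annotate(text, compounds=KNOWN_COMPOUNDS):
    """Replace each known compound name with its InChIKey wrapped in '@'."""
    if not compounds:
        return text
    # Try longer names first so a long name wins over a shorter one it contains.
    names = sorted(compounds, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(name) for name in names) + r")\b",
        re.IGNORECASE,
    )
    return pattern.sub(lambda m: "@" + compounds[m.group(1).lower()] + "@", text)


original = (
    "At the moment the surgical procedure starts, benzodiazepin, e.g. "
    "diazepam, is administered in a dose of no more than 5 mg."
)
print(annotate(original))
```

The word boundaries in the pattern keep the sketch from touching "benzodiazepin", which only resembles the compound name.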
PATENTSCOPE Documents
(…) At the moment the surgical procedure starts, benzodiazepin, e.g. diazepam, is administered in a dose of no more than 5 mg. (…)
Enriched PATENTSCOPE Documents
(…) At the moment the surgical procedure starts, benzodiazepin, e.g. @AAOVKJBEBIDNHE-UHFFFAOYSA-N@, is administered in a dose of no more than 5 mg. (…)
AAOVKJBEBIDNHE-UHFFFAOYSA-N
Access only with a PATENTSCOPE account
How does it work?
Example 1: Theobromine
Theobromine is found in the seeds of the plant Theobroma cacao, the well-known source of chocolate and cocoa.
Its chemical formula is C7H8N4O2 and its IUPAC name is 3,7-dimethyl-1H-purine-2,6-dione.
International Nonproprietary Names
WIKIPEDIA:
INNs are official generic and nonproprietary names given to pharmaceutical drugs or active ingredients, issued by the World Health Organization (WHO).
- Growing need to be able to search INNs in patent texts
- PATENTSCOPE supports the search of 6917 INNs by InChIKey
Example 2: Ritonavir
Scope
- Works on complete, exact formulas, not on Markush structures (-R), which are chemical symbols used to indicate a collection of chemicals with similar structures
- Chemical elements, short names (fewer than 4 characters), common solvents and polymers are not annotated, by design
- Collections: PCT and US national collections, for documents with IPC codes related to chemistry
- Languages: English and German
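The exclusion rules on this slide can be sketched as a simple filter. The sets below are short illustrative samples chosen for this example, not the lists PATENTSCOPE actually uses, and `should_annotate` is a hypothetical helper name.

```python
# Illustrative exclusion lists (assumptions, not the lists PATENTSCOPE uses).
ELEMENTS = {"hydrogen", "carbon", "oxygen", "nitrogen", "sodium"}
COMMON_SOLVENTS = {"water", "ethanol", "methanol", "acetone", "toluene"}
COMMON_POLYMERS = {"polyethylene", "polystyrene", "polypropylene"}
EXCLUDED = ELEMENTS | COMMON_SOLVENTS | COMMON_POLYMERS

MIN_NAME_LENGTH = 4  # names with fewer than 4 characters are skipped


def should_annotate(name):
    """Return True if a recognized name falls inside the annotation scope."""
    normalized = name.strip().lower()
    if len(normalized) < MIN_NAME_LENGTH:
        return False
    return normalized not in EXCLUDED
```

Under these assumptions, `should_annotate("diazepam")` is true, while a three-letter abbreviation such as "DMF" or a common solvent such as "ethanol" is left unannotated.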
Limitations
- Based on state-of-the-art, fully automated chemical recognition algorithms
- The technology is NOT 100% accurate
- OCR errors in the available patent full texts make the recognition of chemical compounds even more challenging
Suggested Approach
Use the tool as a guide
- positive identification is a good result
- negative identification is not authoritative