Automation Extraction of Side Effect Information from Consumer drug reviews

AUTOMATIC EXTRACTION OF SIDE EFFECT

INFORMATION FROM CONSUMER DRUG

REVIEWS

SUPERVISED BY:

Assoc Prof Khoo Soo Guan, Christopher

Wee Kim Wee School of Communication and Information

20 April, 2015

PRESENTED BY:

Abdul Rachman(G1400808F)

Paudel Sunil(G1400834A)

Sathasivamoorthy Nirathan(G1301369K)

Introduction

• Text mining and information extraction from the reviews of social media

(www.webmd.com).

• Extracting side effect information of psychotropic drugs.

• Psychotropic drugs alter the chemical levels in the brain and impact the behavior,

emotions and the mood.

• In past, pharmacy used to provide the side effects based on the clinical trials.

• These days, trusted health sites (like www.fda.gov) provide the list of probable side

effects.

• Sometimes, user might experience side effects not mentioned in the label of the medicine.

Reviews from www.webmd.com

Objectives

• Objectives:

• To develop an information extraction method to extract the side effect information from

online drug reviews (www.webmd.com)

• To compare the extracted side effects with the ones listed in www.fda.gov

Information extraction method

• Side effect information : awful headache

• Pattern : the only side effect has been ____________________

• Side effect Information : shaking, restlessness and dizziness

• Pattern : side effects are _______________

• Side effect information : nausea (typo error by the user) – pain area in text mining

• Pattern : _________ is a side effect

• Side effect extracted by the proposed method:

Till full stop for the information after the pattern

From the beginning of the sentence for the information before the pattern.

Overall approach for constructing extraction patterns

• To construct a set of good patterns (accurate and good coverage) – candidate patterns

Good coverage: pattern must occur several times (more than 2)

Accuracy: more than 60%

Overall approach for constructing extraction patterns

• Generation of N-grams: ranging from 3 to 6

• For this study: we investigate only 1 seed word, which is “side effect”

Extraction Method

• Side effect information extracted using the generated patterns

• Patterns are matched with the reviews and side effects are extracted using automation

method

Challenges Faced

• Extraction of negative information

Challenges Faced

• User don’t follow proper structure in writing

Analysis of Extracted information

• Total No of Patterns: 505

• Total No of Reviews: 801

• Total No of Side Effect information Retrieved: 63

• Total No of relevant side effect information retrieved: 50

• Total No of relevant side effect information available: 71

Precision, Recall and F1 measure

• 𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 =Total number of Relevant Side Effects Information Retrieved

Total number of Side Effects Information Retrieved∗ 100

63∗ 100 = = 79.37%

• 𝑅𝑒𝑐𝑎𝑙𝑙 =Total number of Relevant Side Effects Information Retrieved

Total number of Relevant Side Effects Information 𝐴𝑣𝑎𝑖𝑙𝑎𝑏𝑙𝑒∗ 100

71∗ 100 = 70.42%

• F1 = 2 ∗precision .recall

precision+recall

= 2 ∗79.37

70.42∗ 100 = 74.63%

Error Analysis

• 21 relevant side effects were missed

• Reasons:

Use of free writing (1)

Pattern construction not possible (2)

In training data sample, accuracy was less than 60% (3)

Error Analysis

• 13 non-relevant side effect information extracted

• Reason:

Even good patterns might extract few bad information

• All these patterns accuracy was above 60% in training sample

Comparison of Side Effects

• Extracted side effects of 15 drugs compared with those listed in www.fda.gov

• Drugs Selection Criteria:

Minimum 30 reviews in training sample

• Few complained side effects are similar in meaning

Comparison of Side Effects

• Few of the extracted side effects not mentioned in the list at all

Conclusion & Future Work

• Thus, the side effects were extracted using the candidate patterns

• Extracted side effects were compared with those of www.fda.gov and found few of them

are not listed in the site

• The extracted information contains lot of noise; future work to be done to extract only the

side effects leaving the noise behind.

• Use of other seed words like downside, bad news, symptom, ill effect etc. to increase the

accuracy of the end results.

References

• Cheng, V. C., Leung, C. H., Liu, J., & Milani, A. (2014). Probabilistic Aspect Mining Model for

Drug Reviews. Knowledge and Data Engineering, IEEE Transactions on, 26(8), 2002-2013.

• Gaizauskas, R., & Wilks, Y. (1998). Information extraction: Beyond document retrieval. Journal of

documentation, 54(1), 70-105.

• Grishman, R. (1997). Information extraction: Techniques and challenges. InInformation extraction

a multidisciplinary approach to an emerging information technology (pp. 10-27). Springer Berlin

Heidelberg.

• Khoo, C. S. G., Chan, S., Niu, Y., & Ang, A. (1999). A method for extracting causal knowledge

from textual databases.

• Nahm, U. Y., & Mooney, R. J. (2002, March). Text mining with information extraction. In AAAI

2002 Spring Symposium on Mining Answers from Texts and Knowledge Bases (Vol. 1).

Thank You !!!

Automation Extraction of Side Effect Information from Consumer drug reviews

Data & Analytics

Transcript of Automation Extraction of Side Effect Information from Consumer drug reviews

Recommendations for selection process automation in systematic reviews

Aspect Term Extraction with History Attention and ... · Aspect Term Extraction is to automatically extract the aspect term from user reviews. As a natural information extraction

End-to-end Automation of Periodic SAP User Access Reviews · 2020-01-03 · A user-friendly interface for administrators to see the status of all reviews at a glance, create new reviews,

CS 224N Final Project: Automated extraction of … 224N Final Project: Automated extraction of product attributes from reviews Nikhil Gupta, Praveen Kumar, ... Canon G3 Electronics

Windows Azure eBook Australia - Switch Automation€¦ · process automation. ELAP Cloud extends these beneits to the entire market with its web-based data extraction and business

The Need for Distributed Intelligence Automation …c4i.gmu.edu/eventsInfo/reviews/2011/papers/12-Goshorn...The Need for Distributed Intelligence Automation Implemented through Four

DNA Extraction Kit - Werfenuk.werfen.com/~/media/il uk/docs/diasorin/ifu 60902 dna...DNA Extraction Kit Instructions For Use ... The automation of DNA extraction reduces hands-on time,

Autonomy Document Process Automation - NDM Technologies Document Process Automatio… · Intelligent classification, extraction, and process automation Autonomy DPA enables you to

The oKtopureTM and sbeadexTM plant nucleic acid … oKtopureTM and sbeadexTM plant nucleic acid extraction kit Dr Heiko Hauser, ... Grade of automation Full walk away automation Nucleic

Automation, Flow Injection Analysis: a new tool to ... · chemistry field [10]) and liquid-liquid extraction [11] ... Back extraction, which is a multi-stage extraction ... theory.

Solid Phase Extraction: A Century of Chemical Development ... and... · Solid Phase Extraction Benefits of Automation Liquid-Liquid Extraction Solid Phase Extraction Laboratory Benefits:

Intelligence Automation - Brochure Web · Intelligent Automation for Insurance ... Image Recognition Techniques, third-party data extraction and enrichment tools to bring in a holistic

Use of automation to achieve high performance solid phase extraction

Nucleic Acid Extraction Automation Overview (early 2011)

DNA/RNA Extraction Kit - genesig Easy DNA/RNA Extraction Kit Plant Extraction Kit Amount ... nucleic acid molecules, ... genesig Easy Extractions allow easy automation on common liquid

USE OF ROBOTICS AND AUTOMATION FOR MINERAL PROSPECTING AND ... · Use of robotics and automation for mineral prospecting and ... automation for mineral prospecting and extraction

Automated Nucleic Acid Extraction WorkStation.pdf• Automation computer • Six ... The Automated Nucleic Acid Extraction WorkStation is a turn-key solution that provides the lab

Cochrane Diagnostic Test Accuracy Reviews · Diagnostic Test Accuracy Reviews Framing the question Identification and selection of studies Quality assessment Data extraction Data

Improvement of efficiency by automation traag.pdf · Automation of DIOXIN method Automation is needed : ... Calibration Valve 5 ml loop Automated Extraction. Method from 2005 Sample

meta-analytical approaches in systematic reviews of … reviews of prognostic studies 3 ... Critical appraisal Predictive performance of the EuroSCORE. Step 4 Quantitative data extraction