QSAR21: Linkingggy y biology to chemistry through...

34
QSAR21: Linking biology to chemistry through chemical inherency and mode-of-action pathways mode of action pathways Chihae Yang 14 June, 2012

Transcript of QSAR21: Linkingggy y biology to chemistry through...

Page 1: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

QSAR21: Linking biology to chemistry g gy y

through chemical inherency and mode-of-action pathwaysmode of action pathways

Chihae Yang14 June, 2012

Page 2: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

• How chemistry helpsHow chemistry helps• Practical approaches

– QSAR 21QSAR 21• Mode of action QSAR• Weight of evidence

• What regulators may accept

Page 3: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Does chemistry help?

“Bi l i i dibl li t d Wh ?”“Biology is incredibly complicated.  Why?” – Richard Dawkins

Page 4: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Does chemistry help?

4

In vitro to in vivo correlation of ToxCastTM Phase I data before and after chemical structure consideration

Page 5: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Chemical inherency

Exposure

In use•Risk assessment

•Structural alerts •QSAR predictions

•Categoriessupervised

Biology Predictions

•Metabolic potential •Chemical reactivity

Categories

interactionlinking layer

Chemical interactions

• PhysChem propertiesSt t l f t unsupervised

linking layer

Intrinsic chemistry

5

• Structural features unsupervised

In collaboration with Ann Richard, US EPA NCCT

Page 6: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Role of cheminformatics in systems biology

Chemical library serves to probe in vitro biology• need sufficient diversity of structures to sample wide range of t t i t ti th d MOAtarget interactions, pathways, and MOAs• need sufficient representation of structures & features to enable QSAR inferences

In Vitro/HTS

In Vivo

Structures

In Vivo

Existing knowledgein collaboration with Ann Richard (US EPA)

Page 7: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Tox21 inventory - sources

NTP studied compounds of all typesNTP-studied compounds of all types, nominations and related compounds, ICCVAM & NICEATM validation and reference compounds, collaborations

NTP

US EPA

NCGC (3458)

FDA CDER Orangebook, OTC, NDC, Green Book, D @FDA B it i

p

US EPA (3726)

Drugs@FDA, Britain NHS, EMEA, Health Canada, Japan NHI

EPA high interest compounds, pesticidalactives, failed drugs, antimicrobials, green alternatives, HPV, MPV,, endocrine ref cmpds, pcompds with tox data from ToxRefDB and NTP in vivo, FDA CFSAN compds

Page 8: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Chemical roles in Tox21 inventory

Covering chemical spaces of:• Pharmaceuticals

– NCGC, [email protected]&• Food ingredients & additives

– US FDA PAFA, FCS• Biocide• Biocide

– EPA pesticide, FIFRA, PAN.

• Cosmetics– EU COSING, US PCPC

• General or industrialtotal of 8204 compounds

Page 9: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Structural features variationTox21 Structural coverage across the

Inventory sources

Feature

y

enrichment

libr

ary

Feature diversityra

gmen

tuc

tura

l fr

Stru

courtesy of Ann Richard, US EPA NCCT

Page 10: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Structural features across the chemical rolesDrugs Biocides Food Ing. & Add. Cosmetics

Pri-alkyl amines

Benzothiazine, Beta-lactam,,Carbamate

AzepinBenzimidazole

t lib

rary

Benzothiazinebeta-lactam

frag

men

t

Carbohydrate

uctu

ral f

PurinPyrazole

Steroid

Stru Purin

Steroid

Frequency of each fragment in dataset (log scale)

Page 11: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Principal components scores plot using structural featuresstructural features

drugsdrugsbiocidesfoodscosmetics

PC are extracted using a basis set of MOSES f t lib fi i t St t l f tfragment library as fingerprints. Structural features found in more than 5 compounds were included.

Page 12: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Principal components scores plot using physicochemical propertiesp y p p

4

6

8

10 drugfoodCOSMOSbiocide

drugsbiocidesfoodscosmetics

-8

-6

-4

-2

0

2

p 4

cosmetics

-10

-5

0

5

10

-10

-5

0

5

10-10

p2p3

logP, water solubility, dipole moment, complexity, h d b d d d t l lhydrogen bond donors and acceptors, molar volume, Lipinski score violation, diameter, radius of gyration

Page 13: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Structural features and physicochemical propertiesphysicochemical properties

• drugsdrugs• biocides• foods

ti• cosmetics

Page 14: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

2-D Clustering of compounds toxicity endpoints vs. the structural features

7 toxicity endpoints

endpoints vs. the structural features

s

agm

ents

rope

rtie

s

ctur

e fr

a

chem

pr

f t t

stru

c

phys

surfactant

Toxicity endpoints include mutation (salmonella), rodent tumorigenicity, and developmental effects (e.g., cleft palate, microthalmia, renal tube defects)

Page 15: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Structural alertsExample of structural correlation with toxicity endpointswith toxicity endpoints

cyclic-conjugate-dienej di

Drugs Biocides Food Ing. & Add. Cosmetics

conjugate-diene_oxySBI_Pryimidine,oxo

SBI_TriazoleSBI_Dioxolane

St id 6 6 6 5 iSteroids_6,6,6,5-ringPhthalic ester

aromatic amine(NH2)aromatic azo

ti itaromatic nitroepoxide

furan, 2-nitroreactive alkyl halides

di lfiddisulfideshydrazine

aliphatic estersalpha-halocarbonyl

i h l tmichael acceptors

% frequency in log scale

Page 16: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

MoA QSAR

• Applied to endpoints where chemical roles areApplied to endpoints where chemical roles are better understood.– Genetic toxicityGenetic toxicity– Genotoxic tumorigenicity– Skin sensitization– Skin sensitization– Skin irritation

Page 17: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Chemical inherency

Exposure

In use•Risk assessment

•Structural alerts •QSAR predictions

•Categoriessupervised

Biology Predictions

•Metabolic potential •Chemical reactivity

Categories

interactionlinking layer

Chemical interactions

• PhysChem propertiesSt t l f t unsupervised

linking layer

Intrinsic chemistry

17

• Structural features unsupervised

In collaboration with Ann Richard, US EPA NCCT

Page 18: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Biology representation: modes‐of‐action in toxicity & risk evaluationmodes of action in toxicity & risk evaluation

Toxicity pathways

Training setOrgans & Cellular M l l Organs &

OrganismsCellular

ResponseMolecularInitiating

EventOHCH3CH3

CH3CH3

chemical MoA categoryO

CH3

e.g. Retinoic Acid

Modes of action (MoA)

Adverse OutcomesModes of action (MoA)

Page 19: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

MoA categories: genotoxic turmorigenicity

Chemical class Biological events

• Aliphatic halide

• Aromatic amine

genotoxictumorigenicity

• PAH

tumorigenicity

• Flavonoid

• Steroid

• Retinoid

Page 20: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

MoAs: features for biology and chemistry

Biological features

Assays Toxicity

In vitro In vivoMoA

y Toxicity

Chemical features

Page 21: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

From known pathways for cleft palate

• WNT• SHh• Growth factors• …

to Chemistry

Page 22: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Possible Mode-of-Actions for cleft palate

Chemical classes Biological key events  Phenotype 

• Glucocorticoid• Steroid

during embryogenesis effects

• SHh antagonist– Cyclopamin

• Sterol biosynthesis

Wnt signaling SHh signalingGlucocorticoid receptors disruptions in EMT

• Sterol biosynthesis inhibitors– triazinei id l

Retinoic acid receptorsP‐glycoprotein 

Glycogen synthase kinaseNuclear receptor

‐ cell proliferation‐ palate growth

‐ fusion

– imidazole– pyrimidine …

• Retinoids

Nuclear receptorAndrogen receptor

22

• Phthalates…

Page 23: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Data remodeling by tox ontology

Phenotype effects ChemicalPhenotype effects Chemical

Sites

System/Organ

Molecular entity

System/Organ

Region/TissuesChemical groups

Ph h tiC ll

Proteins   

Physchem propertiesCells

Chemical role 

Genes                        Biological role

23

Molecular pathways

Page 24: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Developmental database and ontology

• Tox data (1154 chemicals) merged from public sources*EPA T R fDb FDA CDER (d @fd ) FDA CFSAN (f d t t– EPA ToxRefDb, FDA CDER (drugs@fda), FDA CFSAN (food contact substance), ILSI DevTox, NTP

• Ontology for dose level effects• Ontology for dose‐level effects – Maternal toxicity– Fetal survival and growthFetal survival and growth– Fetal morphology

• e.g., cleft palate, microthalmia, kidney/renal/ureter anomalies

• Dysmorphogenesis data– 644 chemicals 

* Singh, AV et. al. Computational Toxicology, Eds Knudsen T et. al. Elsevier Limited 2010, 307-337.

Page 25: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Cleft palate chemotypes

Steroids, glucocorticolds, cyclopamine

Triazole, imidazole, pyridines, pyrimidine,

*substituent

imparting positive -charge at * carbon atom;

piperazole, dioxolane…

Retinoids, conjugated dienes

O

* charge at carbon atom;x=nitrogen or carbon

Phthalic ester (aliphaticchain) O

O

O

Thiocarbamates

O

O

Thiocarbamates

Organofluorides

Page 26: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Structural features and cleft palate chemotypesp yp

AlcoholCarbamate

3.3 ‐0.8 ‐0.1 0.8 0.1 5.8

‐0.4 2.7 ‐0.5 ‐1 ‐0.2 ‐1Halides_aliphatic

Halides_aliphatic_fluorideHalides_aromatic

d i

‐1.5 0 0.3 1 ‐0.3

‐1.5 ‐0.2 ‐0.5 0.7 1.5 0.3

‐1.8 0 3.2 ‐0.4 ‐1.6 1.9Hydrazine

UreaRetinoid_conjugate‐diene

R ti id j t di

‐0.8 0.1 ‐0.1 1.3 ‐2.3 0.4

‐0.1 0.1 0.7 1.2 0.3 1.3

‐1.1 ‐0.4 ‐0.2 0.1 2.4 0.5Retinoid_conjugate‐diene_oxy

Sterol_BSI_Pryimidine,oxoSterol_BSI_Triazole

Sterol BSI Dioxolane

‐1.2 ‐0.4 0.7 0.7 2.8 0.7

‐0.2 ‐0.8 1.6 ‐0.9 0.7 ‐0.9

0.7 2.3 0.8 0.6 0Sterol_BSI_DioxolaneSteroids_6,6,6,5‐ring

Phthalic ester

2.5 1.8 0.5 0.2 0.8 0.9

2.6 1.5 0.9 1.1 2.6 1.2

0.4 0.1 0.75 0.3 0.6 3.3

Page 27: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Molecular features for modeling

Structural features Calculated properties

• Genericized features 

• FDA Redbook categories*

• Electronic*‐ Dipole moment, Hydrogen bond acceptors/donors

• EPA pesticide MoA classes

• MOSES metabolic reaction l

bond acceptors/donors• Steric‐ Topological complexity*, 

rules

• Reactive chemicals (RECAP)ǂ

Cl ft P l t h t

Shape descriptors• Hydrophobicity*

LogP Water solubility• Cleft Palate chemotypes ‐ LogP, Water solubility, Topological polar surface area

• qHTS assay (ToxCast)ǂ Hann et. al. J. Chem. Inf. Comput. 

‐ TGF1Sci. 1999, 39, 897‐902. 

*Publicly available from Molecular Networks

Page 28: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

MoA/WoE modeling approach for cleft palatep

maternal toxicityQuery

Alerts

not maternal tox dose

alerts (chemotypes)N

MoACategories

N

N

TriazolesRetinoidsGlobal DioxolanesConjug DienesTGF Triazoles…RetinoidsGlobal DioxolanesConjug. DienesTGF

WOE model Cleft palate!28

i ifinal

i

w pp

w

Page 29: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

MoA/WoE models for rat cleft palate

WOE (Bayesian) % Concordance % Sensitivity % Specificity

87 73 94

%  %  % sets (pos/neg)

( y )87 73 94

Concordance Sensitivity Specificity PLS factors Weights

84 71 93 5 0.8

84 92 71 1 1Retinoids (5/14)

Global (81/167)

sets (pos/ eg)

84 92 71 1 1

92 80 96 1 0.5

89 78 94 5 1

( / )

Conjugated alkene (11/27) 

Sterol BSI – N (31/62)

86 82 89 1 1

84 73 90 1 1

Sterol BSI ‐ O (13/16)

Organofluoride (13/18)

29

Page 30: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Summary

• QSAR 21 B i h i t t T 21 d bi l t QSAR– Bring chemistry to Tox21 and biology to QSAR

– MoA and weight of evidence (WoE)

Biological features Chemical

molecular featuresMoA

A li d t d l t l ff t

molecular features

• Applied to developmental effects– Cleft palate (rat, mouse), Eye, Kidney, ureter

Page 31: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Reality of safety/risk assessment

Safety/Risk assessment deals with extremely diverse and disparate data and information!

• Complex data– Experimental data: In vivo, in vitro …– Exposure information

• Lack of data or knowledge• Knowledge from past

– Structural alerts– Read‐acrossTh h ld f i l i l (TTC)– Threshold of toxicological concern (TTC)

– QSAR predictions  31

Page 32: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Regulators needs

• Qualified data at their fingertips– has it been regulated before?– where has it been regulated?g– what is the current status

• Mechanistic rationaleMechanistic rationale– strong desire to bring pathway knowledge into workflowworkflow

• Make their decision easier, not harder

ifti th h th il f i f ti d lt f– sifting through the pile of information and results from numerous methods and models

Page 33: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

What regulators wanted for CFSAN CERES

Chemical evaluation and risk estimation system (CERES)

• Transparent rationale

• Adopt pragmatic approaches reflecting systems biology– e.g., Biological analogs based on MoA

• Quantitative weight of evidence assessment

• Dashboard, War room, Red flags…

Page 34: QSAR21: Linkingggy y biology to chemistry through …nas-sites.org/emergingscience/files/2012/06/ChihaeYang...Role of cheminformatics in systems biology Chemical library serves to

Acknowledgment

• James F Rathman, the Ohio State University• Ann Richard, US EPA NCCT• Andrew Worth, JRC• US FDA CFSAN

– Kirk Arvidson, Annette McCarthy, Jason Aungst, Kristi Jacobs Mary Shackelford Yan Gu Deborah Hansen (NCTR)Jacobs, Mary Shackelford, Yan Gu, Deborah Hansen (NCTR)

• Funding: European Community’s Seventh Framework Program (FP7/2007‐2013) and from Cosmetics Europe (Grant agreement n° 266835) in SEURAT cluster.