Bradley Open Notebook Science ACSfall2012

42
Shining a light on chemical properties with Open Notebook Science and open strategies Jean-Claude Bradley August 20, 2012 Open Notebook Science/Open Chemistry/Electronic Lab Notebook ACS Symposium Associate Professor of Chemistry Drexel University

description

Jean-Claude Bradley presents at the American Chemical Society meeting on August 20, 2012. Examples are first presented to demonstrate how access to Open Notebooks can provide critical information not usually shared in the traditional publication process. The use of Google App Scripts to look up chemical properties allows for the use of Google Spreadsheets as a self-contained dashboard to plan and analyze chemical reactions. The concept of the Open Chemical Property Matrix (OCPM) is introduced and a smartphone app to suggest recrystallization solvents is then presented.

Transcript of Bradley Open Notebook Science ACSfall2012

Page 1: Bradley Open Notebook Science ACSfall2012

Shining a light on chemical properties

with Open Notebook Science and open strategies

Jean-Claude Bradley

August 20, 2012

Open Notebook Science/Open Chemistry/Electronic Lab Notebook

ACS Symposium

Associate Professor of ChemistryDrexel University

Page 2: Bradley Open Notebook Science ACSfall2012

Openness in Chemistry

WHY?

Page 3: Bradley Open Notebook Science ACSfall2012

Dibenzalacetone derivatives docking against tubulin (paclitaxel site)

(Andrew Lang)

Page 4: Bradley Open Notebook Science ACSfall2012

“Simple” aldol condensation synthesis

Top Hit(no reports of synthesis)

In top ten(a few reports of synthesis)(Andrew Lang)

Page 5: Bradley Open Notebook Science ACSfall2012

What is the current standard for “sufficient information” in

communicating organic chemistry?

By definition, all peer-reviewed published documentation has been approved as sufficient by authors, editors and reviewers.

Page 6: Bradley Open Notebook Science ACSfall2012

Searching for aldol condensations of acetone in the Reaction Attempts

database

(Andrew Lang)

Page 7: Bradley Open Notebook Science ACSfall2012

An example of a failed experiment in an Open Notebook with useful information

Page 8: Bradley Open Notebook Science ACSfall2012

A failed experiment reveals the importance of aldehyde solubility

Page 9: Bradley Open Notebook Science ACSfall2012

An example of a successful experiment in an Open Notebook

Page 10: Bradley Open Notebook Science ACSfall2012

Using a Google Spreadsheet for reaction planning and analysis

Page 11: Bradley Open Notebook Science ACSfall2012

Calling Google App Scripts

Page 12: Bradley Open Notebook Science ACSfall2012

Calling Google App Scripts

(Andrew Lang and Rich Apodaca)

Page 13: Bradley Open Notebook Science ACSfall2012

Converting Google Apps Scripts Results to Values

(Andrew Lang and Rich Apodaca)

Page 14: Bradley Open Notebook Science ACSfall2012

Never having to leave the Google Spreadsheet dashboard for access to key info

(Andrew Lang and Rich Apodaca)

Page 15: Bradley Open Notebook Science ACSfall2012

A click away from an interactive NMR display (using JCAMP-DX format and ChemDoodle)

(Andrew Lang)

Page 16: Bradley Open Notebook Science ACSfall2012

A click to all melting point sources contributing to the average

Page 17: Bradley Open Notebook Science ACSfall2012

Chemistry Google App Scripts Resource page

(Andrew Lang and Rich Apodaca)

Page 18: Bradley Open Notebook Science ACSfall2012

Chemistry Google App Scripts description sheet

(Andrew Lang and Rich Apodaca)

Page 19: Bradley Open Notebook Science ACSfall2012

Predicted solubilities (M) for reactant and product

(Andrew Lang)

Page 20: Bradley Open Notebook Science ACSfall2012

Information from the literature on the target synthesis

Page 21: Bradley Open Notebook Science ACSfall2012

Information from the literature on the target synthesis

Page 22: Bradley Open Notebook Science ACSfall2012

Information from the literature on the target synthesis

Page 23: Bradley Open Notebook Science ACSfall2012

A successful synthesis by avoiding water, dramatically increasing NaOH and long reaction

time

Page 24: Bradley Open Notebook Science ACSfall2012

Open Chemical Property Matrix (OCPM)

logP

Solubility (in 1-octanol saturated water @25C)

Solubility (in water saturated 1-octanol @25C)

Page 25: Bradley Open Notebook Science ACSfall2012

Open Chemical Property Matrix (OCPM)

logP

Solubility (in 1-octanol saturated water @25C)

Solubility (in water saturated 1-octanol @25C)

Solubility (in water near 25C) Solubility (in 1-

octanol near 25C)

Page 26: Bradley Open Notebook Science ACSfall2012

Open Chemical Property Matrix (OCPM)

logP

Abraham descriptors

Melting point

Aqueous solubility

Octanol solubility

Vapor pressure

Flash point

Boiling point

Page 27: Bradley Open Notebook Science ACSfall2012

Types of Open Matrix Elements

1. True measurements (from Open Data collections e.g. Open melting point dataset of 27,000)

2. Calculatable descriptors (from OSS e.g. CDK, MOPAC7.1)

3. Predicted properties (from Open models)

Page 28: Bradley Open Notebook Science ACSfall2012

What is the solubility of benzoic acid in boiling benzene?

Page 29: Bradley Open Notebook Science ACSfall2012

Lack of provenance details generates noise in the matrix

What questions do these numbers answer?

Page 30: Bradley Open Notebook Science ACSfall2012

Examples of OCPM relationships

Page 31: Bradley Open Notebook Science ACSfall2012

Examples of OCPM flash point calculations

Page 32: Bradley Open Notebook Science ACSfall2012

Examples of OCPM solubility calculations

Page 33: Bradley Open Notebook Science ACSfall2012

Provenance of the experimental data points

Page 34: Bradley Open Notebook Science ACSfall2012

Practical applications of the OCPM

1. The automatic open evaluation of models from the dark literature to determine where they do and where they don’t work in the chemical space

2. The development of new open models built upon the population of new measurements, descriptors and predictions

3. The identification of compounds with desired properties from virtual libraries

Page 35: Bradley Open Notebook Science ACSfall2012

Finding a good recrystallization solvent

1. Estimate or look up the solubility at boiling

2. Estimate or look up the solubility at a convenient lower temperature (e.g. 25C or 0C)

3. Determine the predicted recovery yield

4. Lower the priority for solvents that boil too high (too hard to dry precipitate)

5. Lower the priority where the solubility at boiling is too low (wastes solvent and makes it harder to crystallize)

Page 36: Bradley Open Notebook Science ACSfall2012

Translate these requirements to desired properties

1. Look up the solvent boiling point

2. Look up the room temperature solubility or predict it via Abraham descriptors predicted from a model using the CDK

3. Look up the solute melting point or predict it via a model using the CDK

4. Use the melting point and the solubility at room temperature to predict the solubility at boiling

5. Calculate the predicted recrystallization yield

Page 37: Bradley Open Notebook Science ACSfall2012

Implement as an App

(Andrew Lang)

Page 38: Bradley Open Notebook Science ACSfall2012

What are good solvents to recrystallize benzoic acid?

(Andrew Lang)

Page 39: Bradley Open Notebook Science ACSfall2012

Click on the solvent to see temp curve

Page 40: Bradley Open Notebook Science ACSfall2012

Deliver melting point data via App

(Andrew Lang)

Page 41: Bradley Open Notebook Science ACSfall2012

Dibenzalacetone libraries are promising for connecting the OCPM with useful applications

Page 42: Bradley Open Notebook Science ACSfall2012

Conclusions

More openness in chemistry can make science more efficient

Provide interfaces that make sense to the end users: Open Data, Open Models and Open Source Software to modelersApps (smartphones, Google App Scripts, etc.) for chemists at the bench

Acknowledgements

Andrew Lang (code, modeling)Bill Acree (modeling, solubility data contribution)Antony Williams (ChemSpider services, mp data curation)Matthew McBride and Rida Atif (recrystallization and synthesis)