ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs ›...

36
1 Adrian Paschke , Freie Universitaet Berlin Stephan Heineke, FIZ CHEMIE ChemCloud - Chemical e-Science Information Cloud

Transcript of ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs ›...

Page 1: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

1

Adrian Paschke, Freie Universitaet Berlin

Stephan Heineke, FIZ CHEMIE

ChemCloud - Chemical e-Science Information Cloud

Page 2: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

About FIZ CHEMIE

� 1830 Founding of Pharmaceutisches Centralblatt

�Reestablished 1981 by the German Government

� German Chemistry Information Center

2

� German Chemistry Information Center

� Focus: Information Competence

Educational Systems

Reaction Databases

Thermophysical Databases

Page 3: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

About Corporate Semantic Web

1. Application of Semantic Web technologies in enterprise

information systems (Semantic Enterprise)

– Collaborative workflows and (business) process management

(e.g. e-Science workflows, Semantic Business Process Management)

– Knowledge Management

3

– Knowledge Management

(e.g. Semantic Knowledge Management, Semantic Corporate Memory)

2. Corporate = Business Context

– Application of Semantic Web technologies under economical considerations

and business conditions (e.g. cost models, return on investment)

Page 4: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

About Corporate Semantic Web at Freie Universität Berlin

4

Page 5: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Agenda

� ChemCloud – Chemical e-Science Information Cloud

� Semantic Model of the ChemCloud

5

� Examplary Use Cases

� Conclusion

Page 6: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Support of Knowledge Intensive eScience Processes

Research &

Development

Knowledge

� Event & Process

Context

KP KP

6

� �Information

Sources:

Knowledge

Management:

Semantics

Information

�Relations

& Interpretation

� Content

KP KP

Literature Colleagues Experts Databases 6

Page 7: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Motivation: Emerging e-Science Infrastructure

Ask Queries / Derive Knowledge

Get Decision Support

Submit Tasks

Trigger Actions

Get Reactions

Experiments

Simulations

Computations

Trigger Actions

Get Reactions

7

GoPubMed

W3C HCLS

Blades, Cluster Load

Balancer

UDDI

Storage with

virt. Images

Network,

Switches

Knowledge Clouds Resource Clouds

Remote Labs

Virtual Labs

Simula-tions

Page 8: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Thermophysical Data

Historical

Data

8

8eLearning Content

FU Berlin DBPedia (Germany)

FU Berlin W3C HCLS KB

ChemCloudLinked Open Data

Cloud

Literature Data

Page 9: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example 1 - Thermophysical Linked Data Cloud

� Type of data

– Experimental thermodynamic and physical properties of mixtures and pure substancesfrom a total of 22,000 compounds

� Knowledge base growth

– About 1,000 tables per month

� Content

9

� Content

– 450,000 tables and diagrams of PVT state values, phase equilibrium data, transportand surface properties, caloric, optical and acoustic properties

� Material types

– 75% organic, 20% inorganic and 5% organometallic compounds with the focus on solvents

� Sources

– 37,900 publications from journals and reports as well as measurement protocols anddata collections in printed and electronic format, from 1919 to present, ~ 800 predefined therms

Page 10: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example 2 - eLearning Linked Data Cloud

�Type of Data

– Curricular Encyclopedia for Chemistry

�Content

10

�Content

– 1,800 learning units with 15,000 content pages, 25,000 media elements, 900 exercises as well as 3,500 glossary and biography entries

�Subjects

– Chemistry, Biochemistry, Pharmacy, Mathematics, Physics

Page 11: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example 3 – Literature Linked Data Cloud

�Type of data

– STM Literature

�Content

– About 500,000 documents

11

– About 500,000 documents

�Subject

– Science, Technology and Medicine

�Sources

– eBooks and papers

Page 12: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example 4 – DBPedia (Germany) Linked Data Cloud

� Type of data

– German part of WikiPedia

� Content

– Wikipedia articles and meta data

� Material Subject

http://de.dbpedia.org

12

� Material Subject

– All subjects included

� Sources

– WikiPedia Germany

Page 13: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Agenda

� ChemCloud – Chemical e-Science Information Cloud

� Semantic Model of the ChemCloud

13

� Examplary Use Cases

� Conclusion

Page 14: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

� General Ontologies

– General concepts commonly used in the domain ontologies,e.g. content, product, term

� Chemistry Domain Ontologies

– Semantic Chemistry Concept, e.g.thermo-physical properties, reactions, compound classes

� Product Domain Ontologies

Top Level Modular Ontology Model

General

Chemistry Meta Product

14

� Product Domain Ontologies

– Product/Services Descriptions

� Meta Domain Ontologies

– e.g. provenance of data for DBPedia Deutschland, eBooks, eLearning; mapping ontologies

� Content-Ontologies

– e.g. content e-Learning, eBooks, InfoTherm

� Standard-Ontologies

– FOAF (Persons), Dublin Core (Content)

FOAF

Content

DC …

Page 15: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Thermo – Physical Domain Ontology

� >800 pre-defined terms in Thesarus

– e.g. category "Molecule classes" approx. 500 molecule classes

– e.g. "metal organic compounds" or "inorganic acids" and their thermophysical properties

� Semantic Web Thesaurus

15

� Semantic Web Thesaurus

Page 16: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

ThermoML:Property

[ a ThermoML:PropertyType ;

ThermoML:PropPhaseID

[ a ThermoML:PropPhaseIDType ;

ThermoML:ePropPhase "Liquid" ] ;

ThermoML:PropRepeatability

[ a ThermoML:PropVarRepeatabilityType ;

ThermoML:eRepeatMethod

"Standard deviation of a single value (unbiased)" ;

ThermoML:sRepeatEvaluator

ThermoML Chemistry Domain Ontology

– Annotate ThermoML:ePropName with terms

from ChemaxiomProp

– Annotate measuring units with terms from

ChemAxiomUnits

– Annotate measuring methods with terms from

ChemAxiomMetrology

16

ThermoML:sRepeatEvaluator

"Author“ ] ;

ThermoML:Property-MethodID

[ a ThermoML:Property-MethodIDType ;

ThermoML:PropertyGroup

[ a ThermoML:PropertyGroupType ;

ThermoML:PhaseTransition

[ a ThermoML:PhaseTransitionType ;

ThermoML:ePropName "Molar enthalpy of vaporization or sublimation, kJ/mol" ;

ThermoML:sMethodName

"Correlation Gas Chromatography"

– Align ThermoML.owl and

ChemaxiomChemDomain for Compound names

and metadata

– Add the roles from Chebi Role Ontology,

structure from Molecular Structure Ontology

and chemistry relationships

Page 17: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

� General Ontologies

– General concepts commonly used in the domain ontologies,e.g. content, product, term

� Chemistry Domain Ontologies

– Semantic Chemistry Concept, e.g.reactions, compound classes

� Product Domain Ontologies

Top Level Modular Ontology Model

General

Chemistry Meta Product

17

– Product/Services Descriptions

� Meta Domain Ontologies

– e.g. provenance of data for DBPedia Deutschland, eBooks, eLearning; mapping ontologies

� Content-Ontologies

– e.g. content e-Learning, eBooks

� Standard-Ontologies

– FOAF (Persons), Dublin Core (Content)

FOAF

Content

DC …

Page 18: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

E-Learning-Ontology: Content

ContentElement

LearningUnit

Exercise

hasContentElement/referencesReading

hasContentElement/

18

ContentElement Exercise

Media Image

Formula

referencesMedia

hasContentElement/referencesReading

Page 19: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example – Linked Data Pubby Interface eLearning Data Cloud

19

Page 20: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example DBPedia Deutschland Triplestore Web View

20

Page 21: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Agenda

� ChemCloud – Chemical e-Science Information Cloud

� Semantic Model of the ChemCloud

21

� Examplary Use Cases

� Conclusion

Page 22: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

E-Learning-Ontology: Learner

D

C

School

Training University

22

B

A Training_B

Page 23: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example 1: User e-learning path recommendation

User

Requirement

Content YRecommendation

Predefined Teaching Path

23

Content DContent CContent BContent A Content FContent E

Content XRecommendation

Question 1

Page 24: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example: Semantic e-learning path recommendation

User

Dynamic, semantic based Teaching Path

RecommenderEngine

e-Learning Information

Cloud

24

Content DContent CContent BContent A Content FContent E

Requirement

Content XRecommendation

Content YRecommendation

Question 1

Page 25: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example: Semantic e-learning path recommendation

User

Process:

RecommenderEngine

e-Learning Information

Cloud

25

Content DContent CContent BContent A Content FContent E

Requirement

Content XRecommendation

Content YRecommendation

Question 1

Page 26: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example: Semantic e-learning path recommendation

User

Process:

RecommenderEngine

e-Learning Information

Cloud

26

Content DContent CContent BContent A Content FContent E

Requirement

Content XRecommendation

Content YRecommendation

Question 1

Page 27: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example: Semantic e-learning path recommendation

User

Other User Parameter

RecommenderEngine

e-Learning Information

Cloud

27

Content DContent CContent BContent A Content FContent E

Requirement

Content XRecommendation

Content YRecommendation

Question 1

Page 28: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example: Semantic e-learning path recommendation

User

Process:

RecommenderEngine

e-Learning Information

Cloud

28

Content DContent CContent BContent A Content FContent E

Requirement

Content XRecommendation

Content YRecommendation

Question 1

Page 29: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example: Semantic e-learning path recommendation

Process:

e-Learning Information

Cloud

User RecommenderEngine

29

Content DContent CContent BContent A Content FContent E

Requirement

Content XRecommendation

Content YRecommendation

Question 1

Page 30: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Organic compounds

Organo-bromine compounds

Organometallic compounds

Organo-silicon

compounds

Ontology

<a onto=…>

Example 2: Thermo-Physical Semantic Search

Ontology-

� Ontology representation of Infotherm

Thesaurus with classifications from

other data sources

– Add Classification from Chebi

Ontology

– Add Synonyms from Chebi, Pubchem

� Enabling Multilingual Search

30

this:DataReportStat a scovo:Item ;

rdf:value 937 ;

scovo:dimension <http://www.fiz-

chemie.de/infotherm/ontologies/Therm

oML.owl#DataReport> ;

scovo:dimension

void:numberOfResources .

Relevant information which conforms to the ontology

<a onto=…> Ontology-

based Search� Enabling Multilingual Search

– DBPedia multilingual terms

� Enabling accurate search with

chemical identifiers

– "4095094" , "C H3 Br3 Si" ;

� Adding Semantic Auto Complete

� Adding Semantic Facets

Page 31: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

� Scientific Question

– Which effects of the substance X are known?

– Who is the expert for this topic?

Example 3: e-Science Infrastructure

News

Semantic e-Science

Knowledge Cloud

31

� Goal: integrated knowledge base

– Efficient discovery in heterogeneous data sources

Literature

Patent Databases

Information Databases

Page 32: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Example:Knowledge Intensive e-Science Processes

e-Science Processes

(event-based rule engines)Services, User Interface

Scientists

32

FIZ Chemie

DBPedia

DeutschlandW3C HCLS

Federated Triplestore

(AllegroGraph)

Semantic Knowledge

(ChemCloud)

Blades,

Cluster Load

Balancer

UDDI

Storage with

virt. Images

Network,

Switches

Services & Tools

Page 33: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Process Makna Wiki

Example: Rule Responder e-Science Infrastructure

"IF a Person has most publications in the field and one or more patents in

the field THEN the Person is an expert in this Field".

33

Rule Responder e-Science Infrastructure

http://responder.ruleml.org

Process Makna Wiki User Interface with Scientific Workflow System

Page 34: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Agenda

� ChemCloud – Chemical e-Science Information Cloud

� Semantic Model of the ChemCloud

34

� Examplary Use Cases

� Conclusion

Page 35: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Summary

� Goals– Improve Data Quality

– Synonyms, identifier, multi-lingual, complements, …

– Improve Search and Knowledge Management

– Semantic context, multi-lingual, faceted search, …

– Enabling new Kinds of Queries and Applications

– Cross database, cross domain, reasoning

� Current Implementation1. Step 1 Conversion

Enabling New Kinds of Queries

Improving Search

Improving Data Quality

35

1. Step 1 Conversion

• Reuse and create ontologies for the ChemCloud Semantic Model

• Convert relational and XML data into RDF

2. Step 2 Data Integration

◦ Linked Data Integration (Chebi, Pubchem)

◦ Ontology Integration (Bio2RDF, Chem2Bio2RDF)

3. Step 3 Applications

◦ Semantic Faceted Search (Semantic Browser; Drupal and Exhibit)

◦ Semantic e-Book Recommender and e-Learning

◦ Semantic e-Science Scientific Workflows (Rule Responder)

� Future Steps– More data and improved data quality in ChemCloud

– Better integration of existing data and ontologies from others (ChemAxiom, Chebi, …)

– Integration into Linked (Open) Data Cloud and tools/applications

Data Enrichment

Inter-

Operability

Flexibility

Page 36: ChemCloud - Chemical e-Science Information Cloudegonw.github.io › acsrdf2010 › pdfs › ACS_CINF_FINAL_PASCHKE.pdfAbout Corporate Semantic Web 1.Application of Semantic Web technologies

Questions

36

http://www.chemistry.de/

http://www.corporate-semantic-web.de