Post on 11-Jan-2016

The Semantic Web

Professor James Hendler
http://www.cs.umd.edu/~hendler

http://owl.mindswap.org

Jim Hendler, 5/13/03

MIND SWAP
• Maryland Information and Network Dynamics Laboratory, Semantic Web and Agents Project
• J. Hendler
• B. Parsia
• Jennifer Golbeck
• Aditya Kalyanpur
• Grecia Lapizco-Encinas
• Katy Newton
• Evren Sirin

• Corporate Research Partners:
  • Fujitsu Laboratory of America, College Park
  • Lockheed Martin Advanced Technology Laboratories
  • NTT Corp
  • SAIC Corp.

• http://owl.mindswap.org: Owl-powered Semantic Web page (under development)

• http://www.mindswap.org: Traditional web page

• Ronald Alford
• Ross Baker
• Amy Alford
• Matt Westhoff
• Michael Grove
• Ronald Reck

Building the Semantic Web

The Changing Web

[Figure: timeline 1990, 2000, 2010; labels: IMAGES AND DOCUMENTS, DATA AND PROGRAMS]

The Semantic Wave

(Berners-Lee, 03)

UMCP has been a key contributor

XML is NOT semantics

<photo>
  <subject> http://www.w3.org/~timbl </subject>
  <name> Tim Berners-Lee </name>
  …
</photo>

XML Schema is DOCUMENT checking:
• photo has multiple subject fields
• photo has one physical location
• etc.

WHICH SAYS NOTHING ABOUT TALKS, SUBJECTS, PEOPLE, EVENTS, etc.
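
A minimal sketch of this point, in Python with only the standard library. It parses a simplified copy of the photo snippet (the elided part is dropped) and runs the kind of document-level checks a schema can express. The function name check_photo and the shape of its result are assumptions made for illustration. Nothing in the output says that the subject is a person or that the name belongs to that subject.

```python
import xml.etree.ElementTree as ET

# Simplified copy of the slide's snippet; the elided "…" part is omitted.
PHOTO_XML = """
<photo>
  <subject>http://www.w3.org/~timbl</subject>
  <name>Tim Berners-Lee</name>
</photo>
"""


def check_photo(xml_text):
    """Run structural checks only: element names, counts, and text values."""
    root = ET.fromstring(xml_text)
    subjects = [(s.text or "").strip() for s in root.findall("subject")]
    names = [(n.text or "").strip() for n in root.findall("name")]
    return {
        "root_is_photo": root.tag == "photo",
        "subject_count": len(subjects),
        "subjects": subjects,
        "names": names,
    }


result = check_photo(PHOTO_XML)
print(result)
```

The parser reports one subject field and one name field, which is all the document structure can tell us.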

Event:title

Event:WebPage

< > rdf:type photo:Photograph, Photo:File http://…/images#image1, Photo:topic :event1#event:speaker.

Event1 a Event:event; date “May 7-11”, speaker http://…#timbl.html Title “WWW 2002…”

TimBL rdf:type w3c-ont:person; name “Tim Berners-Lee” …

<daml:ObjectProperty rdf:ID="photograph">
  <rdfs:domain rdf:resource="#Picture"/>
  <rdfs:range rdf:resource= …#person"/>
</daml:ObjectProperty>

<rdf:Description rdf:about="http://www.w3.org/2001/03/earl/0.95#Person">
  <rdf:type rdf:resource="http://www.w3.org/2000/01/rdf-schema#Class"/>
  <rdfs:subClassOf rdf:resource="http://www.w3.org/2001/03/earl/0.95#Assertor"/>
</rdf:Description>

<s:Class rdf:about="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Conference">
  <s:comment>describes a generic concept about events</s:comment>
  <s:subClassOf rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Event"/>
  <a:disjointFrom rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Workshop"/>
  <a:restrictedBy rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#genid18"/>

The SEMANTICS is in the links (e.g. to ontologies)!
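
One way to picture "semantics in the links" is a tiny in-memory triple set in which following an rdfs:subClassOf link to an ontology yields a fact the instance data never states. This is a rough sketch, not the tooling used in the talk. The ex: identifiers are hypothetical, and typing the person with the EARL Person class is an assumption made so that the subclass link shown on the slide can be followed.

```python
RDF_TYPE = "rdf:type"
SUBCLASS_OF = "rdfs:subClassOf"

# Hypothetical triples loosely modelled on the slide's fragments.
triples = {
    ("ex:photo1", RDF_TYPE, "photo:Photograph"),
    ("ex:photo1", "photo:topic", "ex:timbl"),
    ("ex:timbl", RDF_TYPE, "earl:Person"),
    ("ex:timbl", "name", "Tim Berners-Lee"),
    # The link into the ontology: Person is a subclass of Assertor.
    ("earl:Person", SUBCLASS_OF, "earl:Assertor"),
}


def superclasses(cls, triples):
    """Collect every class reachable from cls over rdfs:subClassOf links."""
    seen = set()
    stack = [cls]
    while stack:
        current = stack.pop()
        for s, p, o in triples:
            if s == current and p == SUBCLASS_OF and o not in seen:
                seen.add(o)
                stack.append(o)
    return seen


def types_of(resource, triples):
    """Return the stated types of a resource plus those implied by subclass links."""
    direct = {o for s, p, o in triples if s == resource and p == RDF_TYPE}
    inferred = set(direct)
    for cls in direct:
        inferred |= superclasses(cls, triples)
    return inferred


print(sorted(types_of("ex:timbl", triples)))
```

The data only says the resource is a Person; the Assertor type comes from the ontology link.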

From the small and structured

<daml:Class rdf:about="#Book"> </daml:Class>

<daml:Class rdf:about="#Author">
  <rdfs:subClassOf rdf:resource="#Person"/>
</daml:Class>

<daml:Property rdf:about="#age">
  <rdfs:domain rdf:resource="#Book"/>
  <rdfs:range rdf:resource="#age_value_id"/>
</daml:Property>

<rdfs:Property rdf:about="#award">
  <rdfs:domain rdf:resource="#Book"/>
  <rdfs:range rdf:resource="#award_value_id"/>
</rdfs:Property>

• Database schema for International Children’s Digital Library metadata repository
• ~40 classes
• ~25 properties (corresponding to database keys)
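
To suggest how a schema-sized ontology like this can be read by a program, here is a small sketch that lists classes and property domain/range pairs with Python's standard XML parser. The enclosing rdf:RDF element and its namespace declarations are assumptions added to make the fragment parseable, and only a subset of the slide's definitions is included.

```python
import xml.etree.ElementTree as ET

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "daml": "http://www.daml.org/2001/03/daml+oil#",
}

# Subset of the slide's definitions, wrapped in an assumed rdf:RDF root.
DOC = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
    xmlns:daml="http://www.daml.org/2001/03/daml+oil#">
  <daml:Class rdf:about="#Book"/>
  <daml:Class rdf:about="#Author">
    <rdfs:subClassOf rdf:resource="#Person"/>
  </daml:Class>
  <daml:Property rdf:about="#age">
    <rdfs:domain rdf:resource="#Book"/>
    <rdfs:range rdf:resource="#age_value_id"/>
  </daml:Property>
</rdf:RDF>"""


def qname(prefix, local):
    """Build the '{namespace}local' form that ElementTree uses for attributes."""
    return "{%s}%s" % (NS[prefix], local)


root = ET.fromstring(DOC)

# Class identifiers, in document order.
classes = [c.get(qname("rdf", "about")) for c in root.findall("daml:Class", NS)]

# Map each property to its (domain, range) pair.
properties = {}
for prop in root.findall("daml:Property", NS):
    domain = prop.find("rdfs:domain", NS).get(qname("rdf", "resource"))
    range_ = prop.find("rdfs:range", NS).get(qname("rdf", "resource"))
    properties[prop.get(qname("rdf", "about"))] = (domain, range_)

print(classes)
print(properties)
```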

To the large and complex

<owl:Class rdf:ID="CD80_Antigen">
  <rdfs:subClassOf rdf:resource="#Cell_Adhesion_Molecule"/>
  <rdfs:label>CD80 Antigen</rdfs:label>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/>
      <owl:someValuesFrom rdf:resource="#T_Cell"/>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/>
      <owl:someValuesFrom rdf:resource="#B_Cell"/>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/>
      <owl:someValuesFrom rdf:resource="#Macrophage"/>
    </owl:Restriction>
  </rdfs:subClassOf>
</owl:Class>

<CD80_Antigen rdf:ID="CD80_AntigenData">
  <Swiss_Prot>P33681</Swiss_Prot>
  <DEFINITION>
    <![CDATA[<def-source>NCI</def-source> <def-definition>A member of the immunoglobulin supergene family. Co-regulator of T cell activation with CD86. Plays a critical role in autoimmune, humoral, and transplant responses. Expressed on activated T and B cells and on macrophages. (from PROW)</def-definition>]]>
  </DEFINITION>

• US National Cancer Institute Ontology
• ~17,000 classes
• >1 million synonyms and definitions
• RDF graph >500,000 triples
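
The owl:someValuesFrom restrictions in the class definition can be read as entailments: anything typed CD80_Antigen is expressed in some T cell, some B cell and some macrophage. A rough sketch of that reading follows, with the restrictions held in plain Python dictionaries. The names SOME_VALUES_FROM, SUBCLASS_OF, ancestors and entailed_fillers are hypothetical, and this is not a description logic reasoner.

```python
# Existential (someValuesFrom) restrictions per class, taken from the slide.
SOME_VALUES_FROM = {
    "CD80_Antigen": [
        ("Protein_Expressed_In_Tissue", "T_Cell"),
        ("Protein_Expressed_In_Tissue", "B_Cell"),
        ("Protein_Expressed_In_Tissue", "Macrophage"),
    ],
}

# Named superclasses per class, taken from the slide.
SUBCLASS_OF = {"CD80_Antigen": ["Cell_Adhesion_Molecule"]}


def ancestors(cls):
    """Return all named superclasses of cls, following links transitively."""
    found = []
    stack = list(SUBCLASS_OF.get(cls, []))
    while stack:
        parent = stack.pop()
        if parent not in found:
            found.append(parent)
            stack.extend(SUBCLASS_OF.get(parent, []))
    return found


def entailed_fillers(cls):
    """Group the fillers entailed for members of cls by property,
    including restrictions inherited from superclasses."""
    result = {}
    for c in [cls] + ancestors(cls):
        for prop, filler in SOME_VALUES_FROM.get(c, []):
            result.setdefault(prop, set()).add(filler)
    return result


print(entailed_fillers("CD80_Antigen"))
```

Note that these are statements about what must exist, not validation rules: under OWL's open-world reading, an instance without a listed tissue is incomplete rather than invalid.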

And many more
• DAML ontologies for:
  • Army Thesaurus: 18,000 concepts
  • CIA World Fact Book in DAML: 1400 classes, 2500 instances
  • UJTLs: 326 classes, all Joint and Naval Task lists as instances
  • Military Geofile: ~50 installation types; 56,700 Geoloc instances
  • NIMA Geonames: 36,750 Afghanistan targets
• And many more (220 in library)
  • WordNet, NCI Cancer Thesaurus, Heads of State, Weather Conditions, CYC, UNSPSC, Genealogy, baseball, Chemistry, …

The Growing Semantic Web
• Jan 1, 03: Crawler finds 5.8M+ DAML statements on 20,000+ web pages
  • Doesn’t include many instance KBs tied to ontologies
  • Doesn’t include many very large RDFS-based KBs that include some OWL
• Ontology library at http://www.daml.org has 209 ontologies (March 2003)
  • Open for anyone to create
  • Open for anyone to use
• OWL is being supported by large corporation labs
  • Web tool developers: IBM, HP, Sun, Intel, Fujitsu
  • Content providers: Daimler-Chrysler, Nokia, Motorola, EDS, Agfa
• OWL is starting to be used by thesaurus developers
  • Cf. National Cancer Institute metathesaurus released in OWL Lite
  • CALL Thesaurus in DAML
  • NASA thesaurus available in DAML
• Use of semantic markup for Web Services beginning to move beyond basic research
  • DAML-S cited as required reading for Web Services Choreography WG
  • DAML-S being demoed to WSDL WG tomorrow!!

BUNCH OF SLIDES DELETED

The Semantic Web “Revolution”
Scientific American Article notes

[Joint starting place:]

I. Semantic Web Vision (TBL)

II. What are the enablers? (in sequence)
  Screen Scraping (Ora and TBL)
  Data on Web (Ora and TBL)
  Zip code link between Data Bases (TBL)
  Ontology Independence (JAH)
  Effect of Scale (TBL)
  “Then, a miracle occurs”

III. What can you do with it? (not necessarily in sequence)
  Self-describing documents (JAH)
  Logic to encode… (TBL)
  Services and Advertising (Ora)
  Devices (Ora)
  Digital Signatures, Authentication, and Trust (TBL)

Tools are critical

Tools for marking up images, figures and other non-textual data… must be built in a way that they tie into the "business processes" of the working scientist -- that is, rather than learning a whole new set of tools, the basic web tools of the scientist must include mechanisms that make it EASIER for the scientist to produce web content (appropriately marked up) while authoring papers, performing experiments, creating and logging data, and the other day to day activities of the working researcher.

Science and the Semantic Web, Hendler, 03

The same is true of the military user

Automatically-organized portals

New Info Management Paradigms

And other “cool stuff”

Tools for visualization and use

A number of tools for querying, browsing and visualizing SW data are being developed

Handle larger size/complexity

Tools are now scaling as well

PhotoSMORE: Marking up (sub)images

And the info flows…

And the info flows… correctly

Putting all this together…

…is starting to work

document

service

database

Are there any mountains in Europe bigger than Mt. Blanc?

Wrapping/Mapping tool

DAML Ontology (DAML-S)

XML Schema (WSDL)

Mappings for attributes

Define transformation functions

Manual editing if necess.
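
The mapping steps named on this slide can be sketched as a table from schema-side field names to ontology properties, each paired with a transformation function. This is only an illustration under stated assumptions: the field names, the ex: properties, and the functions wrap and fahrenheit_to_celsius are hypothetical and are not taken from the actual tool.

```python
def fahrenheit_to_celsius(value):
    """Example transformation function: convert a Fahrenheit string to Celsius."""
    return round((float(value) - 32) * 5 / 9, 1)


# Hypothetical attribute mappings: schema field -> (ontology property, transform).
ATTRIBUTE_MAPPINGS = {
    "zip": ("ex:postalCode", str.strip),
    "temp_f": ("ex:temperatureCelsius", fahrenheit_to_celsius),
}


def wrap(record):
    """Apply the mappings to one record.

    Returns the mapped property values and the list of fields that have
    no mapping, which would be left for manual editing.
    """
    mapped, unmapped = {}, []
    for field, value in record.items():
        if field in ATTRIBUTE_MAPPINGS:
            prop, transform = ATTRIBUTE_MAPPINGS[field]
            mapped[prop] = transform(value)
        else:
            unmapped.append(field)
    return mapped, unmapped


mapped, unmapped = wrap({"zip": " 20742 ", "temp_f": "212", "note": "n/a"})
print(mapped)
print(unmapped)
```

Keeping the transformation beside each mapping means a field with no entry is reported rather than silently dropped.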

“Ad hoc Programming”

Advanced information management capabilities: Discovery, Filtering, Composition

Ad hoc programming + logic