Searching for Knowledge and Data on the Semantic Web

38
UMBC UMBC an Honors University in an Honors University in Maryland Maryland 1 Searching for Knowledge and Data on the Semantic Web Tim Finin University of Maryland, Baltimore County http://ebiquity.umbc.edu/resource/html/id/179/ Joint work with Li Ding, Anupam Joshi, Yun Peng, Cynthia Parr, Pranam Kolari, Pavan Reddivari, Sandor Dornbush, Rong Pan, Akshay Java, Joel Sachs, Scott Cost and Vishal Doshi http://creativecommons.org/licenses/by-nc-sa/2.0/ This work was partially supported by DARPA contract F30602-97-1-0215, NSF grants CCR007080 and IIS9875433 and grants from IBM, Fujitsu and HP.

description

Searching for Knowledge and Data on the Semantic Web. Tim Finin University of Maryland, Baltimore County http://ebiquity.umbc.edu/resource/html/id/179/ - PowerPoint PPT Presentation

Transcript of Searching for Knowledge and Data on the Semantic Web

Page 1: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 1

Searching for Knowledge and Data on the Semantic Web

Tim Finin

University of Maryland, Baltimore County

http://ebiquity.umbc.edu/resource/html/id/179/

Joint work with Li Ding, Anupam Joshi, Yun Peng, Cynthia Parr, Pranam Kolari, Pavan Reddivari, Sandor Dornbush, Rong Pan, Akshay Java, Joel Sachs, Scott Cost and Vishal Doshi

http://creativecommons.org/licenses/by-nc-sa/2.0/ This work was partially supported by DARPA contract F30602-97-1-0215, NSF grants CCR007080 and IIS9875433 and grants from IBM, Fujitsu and

HP.

Page 2: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 3

Google has made us smarter

Page 3: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 4

But what about our agents?

tell

register

Agents still have a very minimal understanding of text and images.

Page 4: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 6

XML helps

“XML is Lisp's bastard nephew, with uglier syntax and no semantics. Yet XML is poised to enable the creation of a Web of data that dwarfs anything since the Library at Alexandria.”

-- Philip Wadler, Et tu XML? The fall of the relational empire, VLDB, Rome, September 2001.

Page 5: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 7

“The Semantic Web will globalize KR*, just as the WWW globalize hypertext”

-- Tim Berners-Lee

Semantic Web adds semantics

* Knowledge Representation

Page 6: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 13

But what about our agents?

A Google for knowledge on the Semantic Web is needed by software agents and programs

SwoogleSwoogle

Swoogle

Swoogle

SwoogleSwoogle

SwoogleSwoogle

Swoogle SwoogleSwoogle

SwoogleSwoogle

SwoogleSwoogle

tell

register

Page 7: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 15

•http://swoogle.umbc.edu/•Running since summer 2004•1.5M RDF documents, 300M RDF triples, 10K

ontologies

Page 8: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 16

Analysis

Index

Discovery

IR Indexer

Search Services

Semantic Webmetadata

Web Service

Web Server

Candidate URLs

Bounded Web CrawlerGoogle Crawler

SwoogleBot

SWD Indexer

Ranking

document cache

SWD classifier

human machine

html rdf/xml

the WebSemantic Web

Information flow Swoogle‘s web interface

Legends

Swoogle Architecture

Page 9: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 21

Applications and use cases

• Supporting Semantic Web developers– Ontology designers, vocabulary discovery, who’s using

my ontologies or data?, use analysis, errors,statistics, etc.

• Searching specialized collections– Spire: aggregating observations and data from biologists

– InferenceWeb: searching over and enhancing proofs

– SemNews: Text Meaning of news stories

• Supporting SW tools– Triple shop: finding data for SPARQL queries

Page 10: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 22

Page 11: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 23

By default, ontologies are ordered by their ‘popularity’, but they can also be ordered by recency or size.

80 ontologies were found that had these three terms

Let’s look at this one

Page 12: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 24

Basic MetadatahasDateDiscovered:  2005-01-17 hasDatePing:  2006-03-21 hasPingState:  PingModified type:  SemanticWebDocument isEmbedded:  false hasGrammar:  RDFXML hasParseState:  ParseSuccess hasDateLastmodified:  2005-04-29 hasDateCache:  2006-03-21 hasEncoding:  ISO-8859-1 hasLength:  18K hasCntTriple:  311.00 hasOntoRatio:  0.98 hasCntSwt:  94.00 hasCntSwtDef:  72.00 hasCntInstance:  8.00

Page 13: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 25

Page 14: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 26

Page 15: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 27

These are the namespaces this ontology uses. Clicking on one

shows all of the documents using the namespace.

All of this is available in RDF form for the

agents among us.

Page 16: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 28

Here’s what the agent sees. Note the swoogle and wob (web of belief) ontologies.

Page 17: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 29

We can also search for terms (classes, properties) like terms for “person”.

Page 18: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 30

10K terms associated with “person”! Ordered by use.

Let’s look at foaf:Person’s metadata

Page 19: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 31

Page 20: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 32

Page 21: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 33

Page 22: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 34

Page 23: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 35

Page 24: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 36

Page 25: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 38

UMBC Triple Shop• http://sparql.cs.umbc.edu/• Online SPARQL RDF query processing based

on HP’s Jena and Joseki with several interesting features

• Selectable level of inference over model• Automatically finds SWDs for give queries using

Swoogle backend database– Provide dataset creation wizard– Dataset can be stored on our server or downloaded– Tag, share and search over saved datasets

Page 26: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 40

Who knows Anupam Joshi?Show me their names, email address and pictures

Page 27: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 41

The UMBC ebiquity site publishes lots of RDF data, including FOAF profiles

Page 28: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 42

No FROM clause!

Constraints on wherethe data comes from

Page 29: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 43

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT DISTINCT ?p2name ?p2mbox ?p2pixWHERE { ?p1 foaf:name "Anupam Joshi" . ?p1 foaf:mbox ?p1mbox . ?p2 foaf:knows ?p3 . ?p3 foaf:mbox ?p1mbox . ?p2 foaf:name ?p2name . ?p2 foaf:mbox ?p2mbox . OPTIONAL { ?p2 foaf:depiction ?p2pix } . }ORDER BY ?p2name

Page 30: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 44

Page 31: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 45

Swoogle found 292 RDF data files that appear relevant to answering our query

Page 32: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 46

Let’s save the dataset before we use it

Page 33: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 47

Page 34: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 48

And tag it so we and others can find it more easily.

Page 35: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 49

Here we are using it to get an answer to “Who knows Anupam Joshi”

Page 36: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 50

He has many friends!

Page 37: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 56

Conclusion• The web will contain the world’s knowledge in

forms accessible to people and computers– We need better ways to discover, index, search and

reason over SW knowledge

• SW search engines address different tasks than html search engines– So they require different techniques and APIs

• Swoogle like systems can help create consensus ontologies and foster best practices– Swoogle is for Semantic Web 1.0– Semantic Web 2.0 will make different demands

Page 38: Searching for Knowledge and Data  on the Semantic Web

UMBCUMBCan Honors University in an Honors University in

MarylandMaryland 57

http://ebiquity.umbc.edu/Annotated

in OWL

For more information