Post on 30-Dec-2015
description
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 1
Searching for Knowledge and Data on the Semantic Web
Tim Finin
University of Maryland, Baltimore County
http://ebiquity.umbc.edu/resource/html/id/179/
Joint work with Li Ding, Anupam Joshi, Yun Peng, Cynthia Parr, Pranam Kolari, Pavan Reddivari, Sandor Dornbush, Rong Pan, Akshay Java, Joel Sachs, Scott Cost and Vishal Doshi
http://creativecommons.org/licenses/by-nc-sa/2.0/ This work was partially supported by DARPA contract F30602-97-1-0215, NSF grants CCR007080 and IIS9875433 and grants from IBM, Fujitsu and
HP.
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 3
Google has made us smarter
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 4
But what about our agents?
tell
register
Agents still have a very minimal understanding of text and images.
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 6
XML helps
“XML is Lisp's bastard nephew, with uglier syntax and no semantics. Yet XML is poised to enable the creation of a Web of data that dwarfs anything since the Library at Alexandria.”
-- Philip Wadler, Et tu XML? The fall of the relational empire, VLDB, Rome, September 2001.
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 7
“The Semantic Web will globalize KR*, just as the WWW globalize hypertext”
-- Tim Berners-Lee
Semantic Web adds semantics
* Knowledge Representation
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 13
But what about our agents?
A Google for knowledge on the Semantic Web is needed by software agents and programs
SwoogleSwoogle
Swoogle
Swoogle
SwoogleSwoogle
SwoogleSwoogle
Swoogle SwoogleSwoogle
SwoogleSwoogle
SwoogleSwoogle
tell
register
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 15
•http://swoogle.umbc.edu/•Running since summer 2004•1.5M RDF documents, 300M RDF triples, 10K
ontologies
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 16
Analysis
Index
Discovery
IR Indexer
Search Services
Semantic Webmetadata
Web Service
Web Server
Candidate URLs
Bounded Web CrawlerGoogle Crawler
SwoogleBot
SWD Indexer
Ranking
document cache
SWD classifier
human machine
html rdf/xml
…
the WebSemantic Web
Information flow Swoogle‘s web interface
Legends
Swoogle Architecture
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 21
Applications and use cases
• Supporting Semantic Web developers– Ontology designers, vocabulary discovery, who’s using
my ontologies or data?, use analysis, errors,statistics, etc.
• Searching specialized collections– Spire: aggregating observations and data from biologists
– InferenceWeb: searching over and enhancing proofs
– SemNews: Text Meaning of news stories
• Supporting SW tools– Triple shop: finding data for SPARQL queries
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 22
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 23
By default, ontologies are ordered by their ‘popularity’, but they can also be ordered by recency or size.
80 ontologies were found that had these three terms
Let’s look at this one
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 24
Basic MetadatahasDateDiscovered: 2005-01-17 hasDatePing: 2006-03-21 hasPingState: PingModified type: SemanticWebDocument isEmbedded: false hasGrammar: RDFXML hasParseState: ParseSuccess hasDateLastmodified: 2005-04-29 hasDateCache: 2006-03-21 hasEncoding: ISO-8859-1 hasLength: 18K hasCntTriple: 311.00 hasOntoRatio: 0.98 hasCntSwt: 94.00 hasCntSwtDef: 72.00 hasCntInstance: 8.00
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 25
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 26
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 27
These are the namespaces this ontology uses. Clicking on one
shows all of the documents using the namespace.
All of this is available in RDF form for the
agents among us.
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 28
Here’s what the agent sees. Note the swoogle and wob (web of belief) ontologies.
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 29
We can also search for terms (classes, properties) like terms for “person”.
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 30
10K terms associated with “person”! Ordered by use.
Let’s look at foaf:Person’s metadata
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 31
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 32
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 33
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 34
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 35
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 36
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 38
UMBC Triple Shop• http://sparql.cs.umbc.edu/• Online SPARQL RDF query processing based
on HP’s Jena and Joseki with several interesting features
• Selectable level of inference over model• Automatically finds SWDs for give queries using
Swoogle backend database– Provide dataset creation wizard– Dataset can be stored on our server or downloaded– Tag, share and search over saved datasets
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 40
Who knows Anupam Joshi?Show me their names, email address and pictures
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 41
The UMBC ebiquity site publishes lots of RDF data, including FOAF profiles
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 42
No FROM clause!
Constraints on wherethe data comes from
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 43
PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT DISTINCT ?p2name ?p2mbox ?p2pixWHERE { ?p1 foaf:name "Anupam Joshi" . ?p1 foaf:mbox ?p1mbox . ?p2 foaf:knows ?p3 . ?p3 foaf:mbox ?p1mbox . ?p2 foaf:name ?p2name . ?p2 foaf:mbox ?p2mbox . OPTIONAL { ?p2 foaf:depiction ?p2pix } . }ORDER BY ?p2name
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 44
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 45
Swoogle found 292 RDF data files that appear relevant to answering our query
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 46
Let’s save the dataset before we use it
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 47
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 48
And tag it so we and others can find it more easily.
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 49
Here we are using it to get an answer to “Who knows Anupam Joshi”
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 50
He has many friends!
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 56
Conclusion• The web will contain the world’s knowledge in
forms accessible to people and computers– We need better ways to discover, index, search and
reason over SW knowledge
• SW search engines address different tasks than html search engines– So they require different techniques and APIs
• Swoogle like systems can help create consensus ontologies and foster best practices– Swoogle is for Semantic Web 1.0– Semantic Web 2.0 will make different demands
UMBCUMBCan Honors University in an Honors University in
MarylandMaryland 57
http://ebiquity.umbc.edu/Annotated
in OWL
For more information