Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

42
Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

description

Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl. PoolParty at a glance. Developed by punkt. netServices Current release: PoolParty 2.8 Main focus on three application areas: SKOS Thesaurus Management Linked Data (publishing & consuming) - PowerPoint PPT Presentation

Transcript of Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Page 1: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Fusing Corporate Thesaurus Management with

Linked Data using PoolParty

Thomas Schandl

Page 2: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

PoolParty at a glance

• Developed by punkt. netServicesCurrent release: PoolParty 2.8

• Main focus on three applicationareas:

– SKOS Thesaurus Management

– Linked Data (publishing & consuming)

– Semantic Search & Semantic Indexing

2

Page 3: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Challenge for Content Management

3

1.Annotation: Add meaning to the content

2.Link content: Bring content together in a meaningful way

3.Make content searchable: Add background knowledge to the content

Page 4: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Traditional approach to annotate content with metadata

4

Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices.

Apple

application

merchandise

iPod touch

iPadiPhone

Page 5: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Semantic Web approach: Concepts & Relations instead of simple text

5

Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices.

http://my.com/AppleApple

Apple Inc.

http://my.com/iPhone

http://my.com/iPhone3G

iPhone

iPhone 3GS

iPhone 3G

http://my.com/smartphone

Page 6: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

in a nutshell

• W3C Semantic Web standards: Management of multi-lingual (corporate) thesauri & taxonomies on top of Semantic Web standards (SKOS, RDF, OWL & SPARQL)

• Usability: easy-to-use, web-based AJAX user interface

• Scalable Semantic Technologies: RDF Triple Store (SAIL), (Lucene) index engine and a phrase-extraction component

• Service oriented: PoolParty Server offers a Java-API & several interfaces: HTTP web services, SPARQL endpoint, Linked Data

6

Page 7: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

PoolParty GUI

7

Page 8: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Full compatibility with SKOS/RDF

8

Page 9: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Some highlights: PoolParty thesaurus management

• Drag & drop , Auto-Complete

• Document analysis: phrase extraction

• Enrich concepts by using linked data

• Publish thesauri as linked data

• Advanced reporting functionality

• Import and validation of thesauriand CSV files

• Thesauris quality checker

• Wiki style collaborative editing of thesauri

• Visual browsing and map navigation

9

Page 10: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Built-in automatic phrase extraction

10

• Supports different formats (html, doc,pdf, ppt, …)

• Thesaurus basedextraction

• Integrable withCMS, CRM etc.

• Supports different formats (html, doc,pdf, ppt, …)

• Thesaurus basedextraction

• Integrable withCMS, CRM etc.

Page 11: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Some Applications on top of PoolParty

• Tag recommendation: support users and content managers when annotating text

• Semantic Indexing: PoolParty TagEvent Store as a basis for a semantic index ( IndexBuilder)

• Similarity search: „Similarity“ is configurable: Certain features of a document can be „boosted“ (example: persons, places / user tags etc.)

• Semantic Search and Navigation: Thesaurus can be used for facetted and moderated search (examples: emteba.at, ecoi.net)

• Search Engine Dictionaries: provide company or domain specific terms for search engine dictionary

11

Page 12: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Similarity search: finding the unexpected…

12

Expert #4532

Senior Product Manager Enterprise Wiki

at MitchelLake Consulting

in Sydney Area………

Project #AZ67

Integration of Confluence which is a web-based

corporate wiki. It is developed and

marketed by Atlassian, Australia.

…..

same topic

near location

Page 13: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

PoolParty DemoZone

• compare thesaurus based approach with traditional approach

• tag recommender

• similar documents

• find images which fit to your document

• browser bookmarklet

13

Page 14: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Wordpress Glossary Plugin

14

• automatic generation of glossaries for Wordpress blogs

• SKOS compatibility

• automatic link detection and linkage with glossary term

Page 15: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Programmatic access via Web Services

• getProposedTagsForDocument

• addTaggingEvent

• getTagFrequencies

• addDocumentToSimilarityIndex

• findSimilarDocuments

• getConceptSuggestions

• …..

15

Page 16: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Programmatic access – Example: emteba.at

16

Page 17: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

PoolParty

Linked DataFeatures in Detail

Page 18: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

SKOS Thesauri + Linked Data

18

Page 19: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Linked Data – Benefits & Application Scenarios

19

Thesaurus Management•Automatic population ofthesauri•(Semi) Automatic categorization of new concepts End User

•Content augmentation•Improved recommender services•Improved navigation elements, e.g. in web-shopsContent Provider

•Improved SEO•Reduced costs of content management•New services and mashups

Page 20: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Publishing Linked Data with PoolParty

20

• using linked data patterns and „Cool URIs“

• Linked Data front-end

Additionally:

• Wiki front-end

• SPARQL-endpoint

Page 21: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Linked Data frontend

21

Page 22: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Consuming Linked Data

22

• advanced linked data look-up services

• expandable number of linked data sources already integrated

• linked data synchronisation mechanisms (beta)

Page 23: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Linked Data Screencast

• Here comes a screencast

23

Page 24: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Using SKOS context to link concepts to LD resources and semi-automatic population of thesaurus

Example: Thesaurus about arts and artists Concept „Painters“ with NT:

Kandinsky, Rembrandt and Berners-Lee

• Using broader and sibling concepts to help disambiguate and suggest the painter Berners-Lee

• Finding mutual categories from Dbpedia or Freebase

• Suggesting more NTs for Painters using LD categories

24

Page 25: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

PoolParty

Semantic Search

Page 26: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

More background knowledge from thesauri and linked data can improve semantic search

• better disambiguation of search terms

• background knowledge of search terms help to „expand queries“

• better similarity search because of more metadata

• content augmentation through linked data

26

Page 27: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Semantic Services provided by PoolParty

27

Search assistants(Auto-Complete, faceted search)

Improve user´s search experience

Moderated Search

Creating complex queries

Tag Recommendation

Identifying the meaning of a document

Similarity Search(Recommender Systems)

Understanding relations

1

2

3

4

Page 28: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Search Assistants

28

• clever auto-complete

• query expansion

• faceted search

• visual search

• Google synonyms

Page 29: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Moderated Search

29

• thesaurus helps to create complex queries

• supports multi-linguality

• helps to explore a domain without deep knowledge

Page 30: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Tag Recommendation

30

• annotation of documents with low effort

• motivation for people to annotate documents

• basis for building a semantic index

Page 31: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Similarity Search

31

• improved similarity detection on top of additional background knowledge

• build recommender systems for web-shops or knowledge management systems

• help people to skim large document collections

• detect hidden relations between documents

Page 32: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Integration of thesauri with Enterprise Search

32

PoolParty ReportingExport parts of thesauri intoindividual XML-formats and synchronize with search engine

Possible integrations with enterprise search engine:•Autocomplete-Server•Entity dictionary•Query rewriting•Moderated search•Enrich semantic index

PoolParty Web-ServicesIntegrate thesauriinto search enginewith real-timequeries

• improved semantic enterprise search

• all metadata can be administrated at one single place

• expandable via linked data mechanisms

Page 33: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

PoolParty

Thesaurus ManagementAdvanced Features

Page 34: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Multilinguality

34

Page 35: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Concept mapping

• skos:exactMatch

• skos:closeMatch

used for linked data mapping

used for concept mapping, e.g. after having imported a thesaurus

35

Page 36: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Associating notes with concepts

36

• skos:historyNote

• skos:changeNote

• skos:editorialNote

used to trace meanings of a concept

used to discuss meanings of a concept

Page 37: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Introduce individual relations between concepts

37

Create your own individual inverse or symmetric relations between concepts

Page 38: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Import / export / reporting

38

• import & export of SKOS using various RDF serializations

• import of CSV

• import of Zthes

• import/export of sub-trees

• custom reports and XML exports based on PoolParty´s template engine

Page 39: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Quality checks and validation service

39

Check thesauri to….

• be complete

• be non-cyclic (e.g. no circularity in the broader/narrower hierarchy).

• have no disjoints between related and hierarchical paths.

Page 40: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Visual browsing

40

Page 41: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Use your favourite theme!

41

Page 42: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl

Contact

Apply for a PoolParty demo accounthttp://poolparty.punkt.at/

Thomas [email protected]+43-1-8974122-27

punkt. netServices GmbHLerchenfelder Guertel 43A—1160 Wien / Austriahttp://www.punkt.at/

42