Post on 07-Jul-2015
description
Talk at MODULE University Vienna Thomas Kurz
!2014/02/06
Vienna, Austria
Semantic Content Management Techniques and Tools
Redlink was founded in 2013/03 and is headquartered in , Austria.SALZBURG
John Pereira
Aingaran Pillai
Andrea Volpini
Sebastian Schaffert
Rupert Westenthaler
Jakob Frank
Thomas Kurz
Sergio Fernàndez
David Riccitelli
Semantic Content Management 02/36
Semantic Content Management 03/36
Outline
• Why we need Semantics in CMS ?
• How can Semantic Web Technologies help ?
• How Redlink makes the integration much easier ?
!
• Excursus: What about Semantic Media Asset
Management Systems
Semantic Content Management 04/36
"We are drowning in information and starved for knowledge." John Naisbitt
• Content is highly available through the Internet
• Information are distributed over people and systems
• Data is available in various media and technical formats
We need an efficient way for working with huge amounts of unstructured content
Semantic Content Management 05/36
Content Management Systems
• CMS are a single point of entry, providing consistency and the
foundations for collaborative work with content
• CMS provide functionalities to handle large amounts of content:
• Creation of new content
• Editing of existing content
• Organisation and management of content
• Presentation of content
• Media-neutral data management (separation of layout and content)
Semantic Content Management 06/36
Semantic Content Management 07/36
State of Play in Content Management
• Current solutions provide efficient ways to manage
content
• Domain-specific requirements, like “multichannel content
distribution” are addressed
• Content can be managed and presented in multi-media
formats
… B U T …
Semantic Content Management 08/36
Problems in current Content Management Systems
• Content is only “understandable” by users and not by machines
• Irrelevant search results
• Aggregation of relevant content needs to be done manually
!
• Inferring Knowledge from Content
• Dependencies, relations and inconsistencies among content items
need to be identified and defined manually
!
• Content is strongly connected to presentation
• works only inside a certain environment
Semantic Content Management 09/36
The GOAL
DATA
INFORMATION
KNOWLEDGE
WISDOM
+ Context
+ Meaning
+ Insight
John Smith
John Smith is a name
John Smith is a potential customer for your products
It would be right/wrong to sell the product to
John Smith.
Semantic Content Management 10/36
Slide by Nova Spivack, Radar Networks
Semantic Content Management 11/36
How Semantic Web Technologies can help
http://upload.wikimedia.org/wikipedia/commons/f/f3/Semantic_Web_Stack.png
Semantic Content Management 12/36
(Open) Linked Data
1. Use URIs as names for things
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful information, using
the standards (RDF*, SPARQL)
4. Include links to other URIs. so that they can discover more things.
Semantic Content Management 13/36
Semantic Lifting via Natural Language Processing
Semantic Content Management 14/36
How should we handle this?
Semantic Content Management 01/02
The Redlink Platform
Semantic Content Management 15/36
Semantic Content Management 16/36
• Read-Write Linked Data
• Triple store with transactions, versioning and reasoning
• SPARQL and LDPath query languages
• Transparent Linked Data Caching
The Open Platform for Linked Data
http://marmotta.apache.org/
Semantic Content Management 17/36
• Semantic Enhancement process chaining
• Several Natural Language processing facilities
• Multi-language support
• Classification and Sentiment Analysis
http://stanbol.apache.org
The Toolbox for Semantic Lifting
Semantic Content Management 18/36
• Based on Apache Lucene
• Many language specific processing procedures
• Highly scalable (Solr cloud) and ultra fast
• Highly configurable
http://lucene.apache.org/solr/
The highgly scalable Search Server
Semantic Content Management 19/36
DEV.REDLINK.IO
PART II !
Media Asset Management Bridging the Semantic Gap
Semantic Content Management 01/02
Semantic Media Asset Management Systems
• Multimedia Content is enormously growing within the last
decade (Web 2.0)
• Multimedia Content must be prepared for automatic processing
• for multimedia retrieval
• for reuse across platforms, contexts, locations, languages
• Multimedia Content Management Systems heavily rely on high
quality metadata (meaning is hidden Semantic Gap)
Semantic Content Management 21/36
Semantic Web Technologies can bridge the gap
Semantic Content Management 01/02
Where we use Semantics
• Controlled Vocabularies
• Domain specific Thesauri using standard representations
• Reuse of external data
• Create Knowledge by linking
• (Semi-) Automatic Metadata enrichment and classification
• Semantic Search (Facetting, Synonymes, Multilingual)
Semantic Content Management 22/36
What do we need to bring Media Objects in the Web of Data
Semantic Content Management 01/02
Media Fragments
„ … a media-format independent, standard means of addressing
media fragments on the Web using Uniform Resource Identifiers. “ [W3C Recommendation: Media Fragments URI 1.0 (basic)]
!
temporal t=10,20
spacial xywh=0,0,20,20
track track=audio
id id=chapter2
!
http://www.w3.org/TR/media-frags/
Semantic Content Management 23/36
Semantic Content Management 01/02
Media Resource Description
Ontology for Media Resources 1.0
„ … to bridge the different descriptions of media resources, and
provide a core set of descriptive properties.“ [W3C Recommendation: Ontology for Media Resources 1.0]
!
Open Annotation Collaboration
!
!
!
Semantic Content Management 24/36
Semantic Content Management 24/36
Semantic Content Management 26/36
Hello, my name is Tom!
Last summer I was in Paris in France for vacation. It was really amazing. I love Paris!
Semantic Content Management 27/36
RDFize Tom's statement
Tom likes Paris, France. -‐> Tom likes Paris. -‐> ( Tom, likes, Paris ) -‐> Paris is a part of France. -‐> ( Paris, partOf, France )
Semantic Content Management 28/36
RDFize Tom's statement
Tom likes Paris, France. -‐> Tom likes Paris. -‐> ( Tom, likes, Paris ) -‐> Paris is a part of France. -‐> ( Paris, partOf, France )
Semantic Content Management 29/36
Link to external resources
Semantic Content Management 30/36
But what about this?
Title:! „Me and the big thing“!Album:! „A vacation in Paris“!Author:!„Tom Tester“!
Semantic Content Management 31/36
Extract Information
Title:! „Me and the big thing“!Album:! „A vacation in Paris“!Author:!„Tom Tester“!
Semantic Content Management 32/36
Link Information
Title:!„Me and the big thing“!Album:!„A vacation in <http://dbpedia.org/resource/Paris>“!Author:! „<http://tom-tester.org/me>“
:image!:hasFragment!:image#xywh=...!!:image#xywh=...!:subject foaf:Person!!:image !:hasFragment !:image#xywh=..!!:image#xywh=.. !:subject !dbpedia:EiffelTower!(50%)
Semantic Content Management 35/36
Create new facts by using Contextual Semantics
:image :hasFragment :image#xywh=...!:image#xywh=... :subject foaf:Person!:image :author tom:me!!:image :hasFragment :image#xywh=..!!:image :location dbpedia:Paris!:image :location geonames:France!!:image#xywh=.. :subject dbpedia:EiffelTower!(+90%)!!:image :showsOnTheLeft tom:me (50%)!:image :showsOnTheRight dbpedia:EiffelTower!:image :showsOnTheRight dbpedia:VisitorAtraction!
Semantic Content Management
Thanks for your attention! Any Questions?
thomas.kurz@redlink.co