ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a...
-
Upload
antidot -
Category
Technology
-
view
881 -
download
0
description
Transcript of ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a...
![Page 1: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/1.jpg)
Copyright Antidot™ 1
Linked Enterprise Data
LEVERAGING THE SEMANTIC WEB STACKIN A CORPORATE ENVIRONMENT
ISWC 2012 – BOSTONFABRICE LACROIX – [email protected]
![Page 2: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/2.jpg)
Copyright Antidot™ 2
Antidot – who we are
French-based Software Vendor Since 1999 | Paris, Lyon, Aix-en-Provence Information access | Data
management
Mission: Provide our customers with innovative customizable solutions that help them create value with their data, and make their employees more aware and efficient.
![Page 3: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/3.jpg)
Copyright Antidot™ 3
ClientsPublishing
Healthcare
Enterprises E-commerce
![Page 4: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/4.jpg)
Copyright Antidot™ 4
Unstructured documents
files, ECM, collaborative spacesintranet, extranet, Web sitese-mails, instant messaging
![Page 5: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/5.jpg)
Copyright Antidot™ 5
Structured data
CRM, ERP, directoryknowledge basesbusiness applications (production, support)
![Page 6: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/6.jpg)
Copyright Antidot™ 6
IS are bloated
1 practice => 1 need => 1 application => 1 siloInformation system is driven by the processData are numerous, various and scattered
![Page 7: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/7.jpg)
Copyright Antidot™ 7
Solutions or workarounds?
BI MDM
SOA Search
![Page 8: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/8.jpg)
Copyright Antidot™ 8
Solutions and workarounds
Enterprise Search brings little value to users Document oriented Does not solve real business problems
Google like Verity like
![Page 9: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/9.jpg)
Copyright Antidot™ 9
What we want
![Page 10: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/10.jpg)
Copyright Antidot™ 10
What we want
LDAP
CRM
Production
ERP
ECM
FilesSupport
![Page 11: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/11.jpg)
Copyright Antidot™ 11
Changing the paradigm
Switching from an application view to a data centric way of thinking.
![Page 12: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/12.jpg)
Copyright Antidot™ 12
Bring out the implicit
Build the Giant Enterprise Graph
![Page 13: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/13.jpg)
Copyright Antidot™ 13
LED
Linked Enterprise Data application of the Semantic Web
technologies and Linked Data principles to the enterprise infrastructure
![Page 14: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/14.jpg)
Copyright Antidot™ 14
What works for the Web…
Federating silos on the Web
http://www.w3.org/People/Ivan/CorePresentations/RDFTutorial/Slides.html#(102)
![Page 15: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/15.jpg)
Copyright Antidot™ 15
…can’t always be used
in corporate IS Legacy apps can’t be "Sparql’ed" 80% un- or semi- structured data don’t fit in the
model as such Defining vocabularies/ontologies for silos is too
complex and expensive Don’t want RDF per se but valuable information External data is available in XML/JSON through
Web Services Staff trained for RDB, XML, Web apps. No Risk and stability strategy: SemWeb
technology considered as new and immature
![Page 16: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/16.jpg)
Copyright Antidot™ 16
The RDF/storage approach
Setting up a global RDF repository does not work either ITs are afraid by the "RDF everywhere"
activists
![Page 17: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/17.jpg)
Copyright Antidot™ 17
Semantic Web technology
still is the right solution
in corporate environmentBUT it is not an aim
JUST use it
as a means
![Page 18: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/18.jpg)
Copyright Antidot™ 18
Just do it
Think of it as a stream paradigm build new objects using existing data without interfering with the existing
infrastructure with SemWeb somewhere under the hood
![Page 19: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/19.jpg)
Copyright Antidot™ 19
Enterprise Graph HowTo
Construct the graph generate triples from data create triples from documents
Leverage the graph enrich infer
Browse the graph select resources build objects
Trash the graph
![Page 20: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/20.jpg)
Copyright Antidot™ 20
How: extract & normalize
Harvest and normalize as in an ETL fetch, clean, transform… normalize records (names, IDs) to
prepare the linking step
For databases db2triples : an RDB2RDF
implementation by Antidot (open source, W3C validated)
![Page 21: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/21.jpg)
Copyright Antidot™ 21
How: semantize
Don’t transform everything in RDF cherry-pick a subset of interesting fields
for each object and create their RDF triples counterpart
interesting == needed for linking or inferring
Semantize
![Page 22: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/22.jpg)
Copyright Antidot™ 22
How: semantize
Triples generation Be smart: avoid upfront ontology design,
use small vocabularies Be pragmatic: transform XML tags and
field names to predicates Be agile: only insert what you need. And
when you need more, add more.
Semantic Web fuels the modeling, linking and information building process
![Page 23: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/23.jpg)
Copyright Antidot™ 23
Enterprise Graph HowTo
Construct the graph generate triples from data create triples from documents
Leverage the graph enrich infer
Browse the graph select resources build objects
Trash the graph
![Page 24: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/24.jpg)
Copyright Antidot™ 24
How: semantize
Unstructured documents Extract metadata and transform them as
needed to RDF.➡ Ex: author => dc:creator
Use of text-mining to extract named entities: people, organizations, products…➡ generate those entities list using the data
sources: directory for employees, CRM for companies and people, ERP for products
➡ create triples like doc_URI quotes entity_URI
![Page 25: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/25.jpg)
Copyright Antidot™ 25
How: semantize
Unstructured documents Compare documents using various and
dedicated algorithms➡ is the same➡ is included➡ is similar➡ is related
Generates new triples➡ create triples like<docA> is_sub_version_of <docB>
![Page 26: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/26.jpg)
Copyright Antidot™ 26
Enterprise Graph HowTo
Construct the graph generate triples from data create triples from documents
Leverage the graph enrich infer
Browse the graph select resources build objects
Trash the graph
![Page 27: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/27.jpg)
Copyright Antidot™ 27
How: enrich
Enrich the graph run specific algorithms to generate more
links and triples (classifiers, topic detection, …)
insert external data gathered from the LOD or other external datasets or APIs
![Page 28: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/28.jpg)
Copyright Antidot™ 28
How: infer
Create new knowledge add rules according to your needs
IF a coworker is quoted in documents
THEN the business unit is bound to the documents
AND this coworker belongs to a business unit
![Page 29: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/29.jpg)
Copyright Antidot™ 29
Enterprise Graph HowTo
Construct the graph generate triples from data create triples from documents
Leverage the graph enrich infer
Browse the graph select resources build objects
Trash the graph
![Page 30: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/30.jpg)
Copyright Antidot™ 30
How: build
Build select resources corresponding to
objects seeds (using Sparql queries) for each seed, follow links smartly in
order to create basic objects
Build
![Page 31: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/31.jpg)
Copyright Antidot™ 31
How: build
Finalize decorate the new knowledge objects
with data set apart (not loaded in the triplestore)
now we have rich user-actionable objects
Build Finalize
![Page 32: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/32.jpg)
Copyright Antidot™ 32
Enterprise Graph HowTo
Construct the graph generate triples from data create triples from documents
Leverage the graph enrich infer
Browse the graph select resources build objects
Trash the graph
![Page 33: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/33.jpg)
Copyright Antidot™ 33
How: expose
Make the new information available to users and to the entire IS
EnrichHarvest
Classify
Semantize
NormalizeAnnotate
Indexation AFS search engine
RDF Triplestore (Linked Data)
Relational DB
![Page 34: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/34.jpg)
Copyright Antidot™ 34
Conclusion
It works! The triples we create and the inference
rules we add are dictated by the goal / application➡ usage and value oriented
We benefit from the lazy-flexible-dynamic modeling of RDF-RDFS-OWL➡ we are agile
What matters is the graph. But the graph is not the triplestore➡ storage independent
![Page 35: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/35.jpg)
Copyright Antidot™ 35
There’s an app for that
Antidot Information Factory a software solution designed specifically
to leverage structured and unstructured data
enable large-scale processing of existing data
automate publishing of enriched or newly created information.
Harvest Normalize Semantize Enrich Build Expose
![Page 36: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/36.jpg)
Copyright Antidot™ 36
The Giant Enterprise Graph
Now we have a path to let SemWeb enter the enterprise
![Page 37: ISWC 2012 - Industry Track - Linked Enterprise Data: leveraging the Semantic Web stack in a corporate IS environment.](https://reader036.fdocuments.in/reader036/viewer/2022062615/548b2b86b47959142c8b470e/html5/thumbnails/37.jpg)
Copyright Antidot™ 37
THANKS FOR YOUR ATTENTIONQUESTIONS?
DiscussUnderstandLearnExchange