Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.
-
date post
15-Jan-2016 -
Category
Documents
-
view
219 -
download
1
Transcript of Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.
![Page 1: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/1.jpg)
Principles and Principles and pragmatics of a pragmatics of a
Semantic Culture WebSemantic Culture Web
Tearing down wallsand
Building bridges
![Page 2: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/2.jpg)
Overview
• Virtual collections and Semantic Web• Semantic collection-search demonstrator
– For cultural heritage objects
• Metadata & vocabulary representation and enrichment
• Principles for knowledge engineering on the Web
![Page 3: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/3.jpg)
• Part of large Dutch knowledge-economy project MultimediaN
• Partners: VU, CWI, UvA, DEN,ICN
• People: Alia Amin, Lora Aroyo, Mark van
Assem, Victor de Boer, Lynda Hardman, Michiel Hildebrand, Laura Hollink, Marco de Niet, Borys Omelayenko, Marie-France van Orsouw, Jacco van Ossenbruggen, Guus Schreiber Jos Taekema, Annemiek Teesing, Anna Tordai, Jan Wielemaker, Bob Wielinga
• Artchive.com, Rijksmuseum Amsterdam, Dutch ethnology musea (Amsterdam, Leiden), National Library (Bibliopolis)
Acknowledgements
![Page 4: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/4.jpg)
Hypothesis
• Semantic Web technology is in particular useful in knowledge-rich domains
or formulated differently
• If we cannot show added value in knowledge-rich domains, then it may have no value at all
![Page 5: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/5.jpg)
The Web: resources and links
URL URL
Web link
![Page 6: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/6.jpg)
The Semantic Web: typed resources and links
URL URL
Web link
ULAN
Henri Matisse
Dublin Core
creator
Painting“Woman with hat
SFMOMA
![Page 7: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/7.jpg)
![Page 8: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/8.jpg)
![Page 9: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/9.jpg)
![Page 10: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/10.jpg)
Principle 1: semantic annotation
• Description of web objects with “concepts” from a shared vocabulary
![Page 11: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/11.jpg)
Principle 2: semantic search
• Search for objects which are linked via concepts (semantic link)
• Use the type of semantic link to provide meaningful presentation of the search results
Paris
Montmartre
PartOf
Query“Paris”
![Page 12: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/12.jpg)
The myth of a unified vocabulary
• In large virtual collections there are always multiple vocabularies – In multiple languages
• Every vocabulary has its own perspective– You can’t just merge them
• But you can use vocabularies jointly by defining a limited set of links– “Vocabulary alignment”
• It is surprising what you can do with just a few links
![Page 13: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/13.jpg)
Principle 3: vocabulary alignment
“Tokugawa”
SVCN period Edo
SVCN is local in-house ethnology thesaurus
AAT style/period Edo (Japanese period) Tokugawa
AAT is Getty’s Art & Architecture Thesaurus
![Page 14: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/14.jpg)
A link between two thesauri
![Page 15: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/15.jpg)
Levels of interoperability
• Syntactic interoperability– using data formats that you can share– XML family is the preferred option
• Semantic interoperability– How to share meaning / concepts– Technology for finding and representing semantic
links
![Page 16: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/16.jpg)
![Page 17: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/17.jpg)
Distributed vs. centralized collection data
• Minimal requirement: collection object has image URI
• Preference for external metadata, accessed through protocol such as OAI
• In practice, external metadata access is still cumbersome
![Page 18: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/18.jpg)
http://e-culture.multimedian.nl/demo/search
![Page 19: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/19.jpg)
Search strategies
• Basic search: keyword-oriented• Advanced search:
– Tweaking default search parameters– Time-related queries
• Faceted search• Relation search
– How are two URIs related?
![Page 20: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/20.jpg)
![Page 21: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/21.jpg)
Keyword search with semantic clustering
1. Btree of literals plus Porter stem and metaphone index
2. Find resources with matching labels• Default resources are “Work”s
3. Find related resources by one-way graph traversal• owl:inverseOf is used• Threshold used for constraining search
4. Cluster results (group instances)
![Page 22: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/22.jpg)
![Page 23: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/23.jpg)
![Page 24: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/24.jpg)
![Page 25: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/25.jpg)
Search: WordNet patterns that increase recall without sacrificing precisions
![Page 26: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/26.jpg)
Term disambiguation is key issue in semantic search
• Post-query– Sort search results based on different meanings
of the search term– Mimics Google-type search
• Pre-query– Ask user to disambiguate by displaying list of
possible meanings– Interface is more complex, but more search
functionality can be offered
![Page 27: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/27.jpg)
Faceted search
• Use Dublin Core scheme to formulate complex queries
• Navigate through relevant metadata
![Page 28: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/28.jpg)
Faceted search Faceted search
![Page 29: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/29.jpg)
What do you need to do to make your collection part of a Semantic Culture Web?
Four activities
![Page 30: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/30.jpg)
From metadata to semantic metadata
1. Make vocabularyinteroperable
2. Align metadata schema
3. Enrich metadata
4. Align vocabulary
![Page 31: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/31.jpg)
Activity 1: syntactic vocabulary interoperability
• Making vocabularies available in the Web standard RDF
• Many organizations already do this• W3C provides the SKOS template to make
this almost straightforward• Effort required: at most a few days
![Page 32: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/32.jpg)
![Page 33: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/33.jpg)
33
Multi-lingual labels for concepts
![Page 34: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/34.jpg)
34
Semantic relation:broader and narrower
• No subclass semantics assumed!
![Page 35: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/35.jpg)
![Page 36: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/36.jpg)
Activity 2: aligning the metadata schema
• Specify your collection metadata scheme as a specialization of Dublin Core
• With RDF/OWL this is easy/trivial!• Cf. DC Application Profiles
![Page 37: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/37.jpg)
Aligning VRA with Dublin Core
• VRA is specialization of Dublin Core for visual resources
• VRA properties “material.medium” and “material.support” are specializations of Dublin Core property “format”
vra:material.medium rdfs:subPropertyOf dc:fotmat .
vra:material.medium rdfs:subPropertyOf dc:format .
![Page 38: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/38.jpg)
Activity 3: enriching the metadata
• Extracting additional concepts from an annotation– Matching the string “Paris” to a vocabulary term
• Information-extraction techniques exists (and continue to be developed)
• Effort required can be up to a few weeks– The more concepts, the better, but no need to be
perfect!
![Page 39: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/39.jpg)
Example textual annotation
![Page 40: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/40.jpg)
Resulting semantic annotation (rendered as HTML with RDFa)
![Page 41: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/41.jpg)
41
RDFa: embedding RDF in (X)HTML
Regular HTML
Resulting RDF statements
HTML with RDFa
![Page 42: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/42.jpg)
Activity 4: aligning the vocabulary
• Find semantic links between vocabulary links– Derain (ULAN) related-to Fauve (AAT))
• Automatic techniques exists, but performance varies• Often combination of automatic and manual
alignment• Effort strongly dependent on vocabularies
– But “a little semantic goes a long way” (Hendler)
![Page 43: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/43.jpg)
Learning alignments
• Learning relations between art styles in AAT and artists in ULAN through NLP of art historic texts– “Who are Impressionist painters?”
![Page 44: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/44.jpg)
Extracting additional knowledge from scope notes
![Page 45: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/45.jpg)
Principles for knowledge engineering
on the Web
![Page 46: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/46.jpg)
Principle 1: Be modest!
• Ontology engineers should refrain from developing their own idiosyncratic ontologies
• Instead, they should make the available rich vocabularies, thesauri and databases available in web format
• Initially, only add the originally intended semantics
![Page 47: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/47.jpg)
Principle 2: Think large!
"Once you have a truly massive amount of information integrated as knowledge, then the
human-software system will be superhuman, in the same sense that mankind with writing is superhuman compared to mankind before
writing."
Doug Lenat
![Page 48: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/48.jpg)
Principle 3: Develop and use patterns!
• Don’t try to be (too) creative• Ontology engineering should not be an art
but a discipline• Patterns play a key role in methodology for
ontology engineering• See for example patterns developed by the
W3C Semantic Web Best Practices group
http://www.w3.org/2001/sw/BestPractices/• SKOS can also be considered a pattern
![Page 49: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/49.jpg)
Principle 4: Don’t recreate, but enrich and align
• Techniques:– Learning ontology relations/mappings– Semantic analysis, e.g. OntoClean– Processing of scope notes in thesauri
![Page 50: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/50.jpg)
Principle 5: Beware of ontologicalover-commitment!
![Page 51: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/51.jpg)
Principle 6: Specifying a data model in OWL does ot make it an ontology!
• Papers about your own idiosyncratic “university ontology” should be rejected at SW conferences
• The qality of an ontology does not depend on the number of OWL constrcts sed
![Page 52: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/52.jpg)
Principle 7: Required level of formal semantics depends on the domain!
• In our semantic search we use three OWL constructs:– owl:sameAs, owl:TransitiveProperty,
owl:SymmetricProperty
• But cultural heritage has is very different from medicine and bioinformatics– Don’t over-generalize on requirements for e.g.
OWL
![Page 53: Principles and pragmatics of a Semantic Culture Web Tearing down walls and Building bridges.](https://reader038.fdocuments.in/reader038/viewer/2022110103/56649d5e5503460f94a3e11a/html5/thumbnails/53.jpg)
Perspectives
• Basic Semantic Web technology is ready for deployment
• Research themes:– Scalability, vocabulary alignment, metadata
extraction
• Web 2.0 facilities fit well:– Involving community experts in annotation– Personalization
• Social barriers have to be overcome!