Terminology Management in DITA

Terminology Management: Using DITA to squeeze maximum value from your Terminology. Jim Tivy, October 2013

Transcript of Terminology Management in DITA

Page 1: Terminology Management in DITA

Terminology Management
Using DITA to squeeze maximum value from your Terminology

Jim Tivy
October 2013

Page 2: Terminology Management in DITA

Jim Tivy:

• Documentation systems since the 80s

• Data modeling, database system specialist

• XQuery Working Group early 90s

• CTO, Bluestream

• Member, OASIS DITA TC

Page 3: Terminology Management in DITA

@JimTVanc @LavaCon Oct 2013

Contents

1. Introduction
2. DITA Glossary/Term Mechanism
3. Thesaurus Index
4. Browse and Search

Page 4: Terminology Management in DITA

Introduction

Page 5: Terminology Management in DITA

Terminology Management

• Identification, organization and use of terms
• The promise:
  – accuracy
  – coherence
  – findability
• Terminology
  – Set of preferred terms
  – Alternative terms – synonyms
  – Acronyms

Page 6: Terminology Management in DITA

Content Project Phases

Page 7: Terminology Management in DITA

DITA Glossary/Term Mechanism

Page 8: Terminology Management in DITA

DITA Glossary Mechanism

• <glossentry>
• <term>
• <glossref>
• Concept (sense) oriented
• Synonyms
• Acronyms

Page 9: Terminology Management in DITA

<glossentry>

<glossentry>
  …
  <glossterm>voice mail</glossterm>
  <glossdef>Allows callers to leave voice messages.</glossdef>
  <glossBody>
    <glossAlt>
      <glossSynonym>answering machine</glossSynonym>
      <glossUsage>Older term</glossUsage>
    </glossAlt>
  </glossBody>
</glossentry>
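As a rough illustration of how a tool might read such an entry, the sketch below parses a glossentry with Python's standard library and separates the preferred term from its alternative labels. The `id` value and the function name `read_glossentry` are assumptions made for this example; they are not taken from the slides or from any DITA toolkit.

```python
import xml.etree.ElementTree as ET

# Sample entry modeled on the slide; the id value is a placeholder assumption.
GLOSSENTRY = """
<glossentry id="voice_mail">
  <glossterm>voice mail</glossterm>
  <glossdef>Allows callers to leave voice messages.</glossdef>
  <glossBody>
    <glossAlt>
      <glossSynonym>answering machine</glossSynonym>
      <glossUsage>Older term</glossUsage>
    </glossAlt>
  </glossBody>
</glossentry>
"""

# Element names that carry an alternative label inside <glossAlt>.
ALT_LABEL_TAGS = ("glossSynonym", "glossAcronym", "glossAbbreviation", "glossShortForm")


def read_glossentry(xml_text):
    """Return the preferred term, definition, and alternative labels."""
    entry = ET.fromstring(xml_text)
    alternates = []
    for alt in entry.iter("glossAlt"):
        for child in alt:
            if child.tag in ALT_LABEL_TAGS and child.text:
                alternates.append(child.text.strip())
    return {
        "preferred": entry.findtext("glossterm", default="").strip(),
        "definition": entry.findtext("glossdef", default="").strip(),
        "alternates": alternates,
    }


entry = read_glossentry(GLOSSENTRY)
print(entry["preferred"], entry["alternates"])
```

The usage note (`glossUsage`) is deliberately skipped here; a fuller reader would probably keep it alongside each alternate.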

Page 10: Terminology Management in DITA

<term>

<task>
  <taskbody>
    <prereq>
      <p>Your <term>voice mail</term> must be set up with a service provider, a password, and a greeting.</p>
    </prereq>
    …
  </taskbody>
</task>

Page 11: Terminology Management in DITA

<glossref>

• <p>Your <term keyref="voice_mail"/> must be set up with a service provider, a password, and a greeting.</p>
• Needs a key definition.
• <map>
    <topicref href="/Content/Retrieve_voicemail_xi10005.xml"/>
    ...
    <topichead navtitle="Glossary">
      <glossref href="/Content/VoiceMail_xi10019.xml" keys="voice_mail" linking="normal" toc="no" print="yes"/>
      <glossref href="/Content/SIMcard_xi10017.xml" keys="SIMcard" linking="normal" toc="no" print="yes"/>
      ...
    </topichead>
  </map>
• As shown in HTML…
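One way to picture the key definition step is as a lookup table from key name to target file. The sketch below builds that table from a map using only Python's standard library; the function name `build_key_table` is hypothetical, and a real DITA processor handles much more (submaps, scopes, conref), so treat this as a simplified model.

```python
import xml.etree.ElementTree as ET

# Map content copied from the slide, with the elided parts left out.
MAP_XML = """
<map>
  <topicref href="/Content/Retrieve_voicemail_xi10005.xml"/>
  <topichead navtitle="Glossary">
    <glossref href="/Content/VoiceMail_xi10019.xml" keys="voice_mail"
              linking="normal" toc="no" print="yes"/>
    <glossref href="/Content/SIMcard_xi10017.xml" keys="SIMcard"
              linking="normal" toc="no" print="yes"/>
  </topichead>
</map>
"""


def build_key_table(map_text):
    """Map each key name to the href of the element that defines it."""
    table = {}
    for elem in ET.fromstring(map_text).iter():
        href = elem.get("href")
        # The keys attribute may hold several space-separated key names.
        for key in elem.get("keys", "").split():
            # Keep the first definition found in document order.
            table.setdefault(key, href)
    return table


keys = build_key_table(MAP_XML)
print(keys["voice_mail"])
```

With this table, a reference such as `<term keyref="voice_mail"/>` can be resolved to the glossary topic that supplies the term text.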

Page 12: Terminology Management in DITA

Abbreviated Form

Added to <glossentry>:

  <glossBody>
    …
    <glossSurfaceForm>phone screen (SCR)</glossSurfaceForm>
    <glossAlt>
      <glossAcronym>SCR</glossAcronym>
    </glossAlt>
    …

Used instead of <term>:

  <abbreviated-form keyref="phone_screen"/>

As shown in HTML…

Page 13: Terminology Management in DITA

Thesaurus Index

Page 14: Terminology Management in DITA

Thesauri

• A kind of taxonomy
• Hierarchy of concepts – narrower, broader
• Concepts are controlled and organized
• Concepts have labels
  – Some labels are preferred
  – Some labels are synonyms
  – There are links to related concepts
  – There can be labels for each language
• Getty Center example

Page 15: Terminology Management in DITA

Getty Example

Page 16: Terminology Management in DITA

Example SKOS thesaurus entry

prefLabel (en-US): in calls
altLabel: getting called
definition: In calls are those calls you receive when someone dials your number.
broader: phone
narrower: voice mail
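The entry above can be modeled as a small record with one field per SKOS property. The following is only a sketch: the class name `Concept` and its field names are assumptions chosen to mirror the slide, not part of SKOS or of any library.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """A thesaurus concept with SKOS-like properties (illustrative only)."""
    pref_label: dict                                  # language tag -> preferred label
    alt_labels: list = field(default_factory=list)    # synonyms and other variants
    definition: str = ""
    broader: list = field(default_factory=list)       # labels of broader concepts
    narrower: list = field(default_factory=list)      # labels of narrower concepts

    def all_labels(self):
        """Preferred labels first, then alternative labels."""
        return list(self.pref_label.values()) + self.alt_labels


in_calls = Concept(
    pref_label={"en-US": "in calls"},
    alt_labels=["getting called"],
    definition="In calls are those calls you receive when someone dials your number.",
    broader=["phone"],
    narrower=["voice mail"],
)
print(in_calls.all_labels())
```

Keeping the preferred label keyed by language tag leaves room for the per-language labels that thesauri allow.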

Page 17: Terminology Management in DITA

Thesaurus Index

• A thesaurus with links to content

[Diagram: a thesaurus tree (Phones; Parts, Calling; Voice mail, Screen) linked to files (vmail.xml, music.xml, calls.xml, photos.xml, screen.xml)]
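A thesaurus index can be sketched as two tables: one for the concept hierarchy and one for the links from concepts to files. The parent/child pairs and the concept-to-file links below are assumptions, since the original diagram did not survive in the transcript; only the concept and file names come from the slide.

```python
# Hypothetical hierarchy: concept -> narrower concepts.
NARROWER = {
    "Phones": ["Parts", "Calling"],
    "Parts": ["Screen"],
    "Calling": ["Voice mail"],
}

# Hypothetical links: concept -> files about that concept.
FILES = {
    "Voice mail": ["vmail.xml"],
    "Calling": ["calls.xml"],
    "Screen": ["screen.xml", "photos.xml"],
}


def files_for(concept):
    """Collect files linked to a concept or to any concept below it."""
    found = list(FILES.get(concept, []))
    for child in NARROWER.get(concept, []):
        for name in files_for(child):
            if name not in found:
                found.append(name)
    return found


print(files_for("Calling"))
```

Walking down the narrower links is what lets a browse on a broad concept such as "Phones" surface content that was only tagged with a narrower one.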

Page 18: Terminology Management in DITA

Harvest Time!

• “As you sow, so shall you reap.”
• Harvest Demo

Page 19: Terminology Management in DITA

Managing the Thesaurus Index

• Managing concepts
  – Manual Harvest
• Managing concept links
  – Manual Harvest (where used?)
  – Manual Link to images (only for authors?)
• Consistency of thesaurus index and repository
  – Manual or fully synchronized?
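The harvest step can be approximated as a scan over topics that records which files use each `<term>`. This is a minimal sketch, not the demo shown in the talk: the in-memory `TOPICS` repository, the topic contents, and the `harvest` function are all invented for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical in-memory "repository": file name -> topic XML.
TOPICS = {
    "vmail.xml": (
        "<task id='t1'><title>Retrieve voice mail</title><taskbody><prereq>"
        "<p>Your <term>voice mail</term> must be set up.</p>"
        "</prereq></taskbody></task>"
    ),
    "calls.xml": (
        "<concept id='c1'><title>Calls</title><conbody>"
        "<p><term>In calls</term> may go to <term>voice mail</term>.</p>"
        "</conbody></concept>"
    ),
}


def harvest(topics):
    """Build an index from normalized term text to the files that use it."""
    index = defaultdict(set)
    for filename, xml_text in topics.items():
        for term in ET.fromstring(xml_text).iter("term"):
            # Join nested text, collapse whitespace, and lowercase the label.
            label = " ".join("".join(term.itertext()).split()).lower()
            if label:
                index[label].add(filename)
    return index


index = harvest(TOPICS)
print(sorted(index["voice mail"]))
```

Looking up a term in the resulting index answers the "where used?" question; rerunning the harvest after edits is one simple way to keep the index and the repository consistent.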

Page 20: Terminology Management in DITA

Thesaurus Standards

• W3C SKOS 2009: Simple Knowledge Organization System
  – Schema plus triples – in OWL
• ISO 25964-1 2011: Thesauri for information retrieval
  – Entity Relationship Model – works well for RDBs

Page 21: Terminology Management in DITA

Thesaurus Index as links

• RDF or OWL2 statements: “File vmail.xml is aboutTerm voice mail.”

[Diagram: File vmail.xml, linked by aboutTerm to Term voice mail]
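A statement of this kind is a subject, predicate, object triple, and an index is a set of them. The sketch below stores triples as plain tuples and queries them with wildcards; the `calls.xml` statements and the `match` helper are assumptions added for the example, and a real system would more likely use an RDF store.

```python
# Each statement is a (subject, predicate, object) triple.
TRIPLES = {
    ("vmail.xml", "aboutTerm", "voice mail"),   # from the slide
    ("calls.xml", "aboutTerm", "in calls"),     # hypothetical
    ("calls.xml", "aboutTerm", "voice mail"),   # hypothetical
}


def match(subject=None, predicate=None, obj=None):
    """Return triples matching the given parts; None acts as a wildcard."""
    return sorted(
        t for t in TRIPLES
        if (subject is None or t[0] == subject)
        and (predicate is None or t[1] == predicate)
        and (obj is None or t[2] == obj)
    )


files_about_voice_mail = [s for s, _, _ in match(predicate="aboutTerm", obj="voice mail")]
print(files_about_voice_mail)
```

The same query shape works in both directions: fixing the subject instead of the object lists every term a given file is about.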

Page 22: Terminology Management in DITA

What about Subject scheme?

<subjectScheme>
  <hasNarrower>
    <subjectdef keys="PhoneFeatures" navtitle="Phone Features">
      <subjectdef keys="VoiceMail" navtitle="Voice Mail" href="GE_VoiceMail.xml"/>
    </subjectdef>
  </hasNarrower>
</subjectScheme>

Page 23: Terminology Management in DITA

Subject to topic link

<map>
  <!-- ... -->
  <topicref href="RetrievingVoiceMail.xml">
    <topicsubject>
      <subjectref keyref="VoiceMail"/>
    </topicsubject>
    <!-- ... -->
  </topicref>
  <!-- ... -->
</map>

Page 24: Terminology Management in DITA

Browse and Search

Page 25: Terminology Management in DITA

Browse

• Authors browse by term (subject)
  – Single or multi facet
• Readers browse by term
  – Needs well known hierarchy
  – Or full text, then facets
• Dynamic generation of relationships

Page 26: Terminology Management in DITA

Search

• Search the Thesaurus only
  – As in the Getty example
• Use Thesaurus in Full Text Search
  – Generate Synonym Ring (SynSet)
• Rank terms higher
• Increases precision
  – Maximize relevant/returned
  – Fewer false positives
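To make the synonym ring and the relevant/returned ratio concrete, the sketch below expands a query into its ring, runs a naive substring search, and computes precision. The sample documents, the relevance judgments, and all function names are assumptions for illustration; only the "voice mail" and "answering machine" pair comes from the glossary slide.

```python
# Preferred term -> alternative labels (pair taken from the glossary slide).
THESAURUS = {"voice mail": ["answering machine"]}

# Hypothetical document texts keyed by file name.
DOCUMENTS = {
    "vmail.xml": "Set up your voice mail greeting.",
    "legacy.xml": "Older phones used an answering machine.",
    "photos.xml": "Send a photo by mail.",
}


def synonym_ring(term, thesaurus):
    """Return every label that shares a concept with the given term."""
    term = term.lower()
    for preferred, alternates in thesaurus.items():
        ring = {preferred, *alternates}
        if term in ring:
            return ring
    return {term}


def search(query, documents, thesaurus):
    """Return files whose text contains any label in the query's ring."""
    ring = synonym_ring(query, thesaurus)
    return sorted(
        name for name, text in documents.items()
        if any(label in text.lower() for label in ring)
    )


def precision(returned, relevant):
    """Fraction of returned results that are relevant (relevant/returned)."""
    if not returned:
        return 0.0
    return len(set(returned) & set(relevant)) / len(returned)


hits = search("answering machine", DOCUMENTS, THESAURUS)
print(hits, precision(hits, {"vmail.xml", "legacy.xml"}))
```

In this toy data, a bare search for "mail" would also return photos.xml and score two relevant out of three returned, which is the kind of false positive that matching on controlled terms is meant to reduce.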

Page 27: Terminology Management in DITA

Demo Browse and Search

Author Browse: XDocs demo
Reader Browse: http://www.kayak.com/

Page 28: Terminology Management in DITA

In Summary

• Identify and organize.
• Use terms in DITA
• Harvest into an index
• Manage for accuracy.
• Authors use index for findability
• Readers use index for findability
• Iterate as necessary

Page 29: Terminology Management in DITA

Breaking the Silo – Linked Data

Page 30: Terminology Management in DITA

References (title; author; reference)

• SKOS Reference, August 18, 2009; W3C; SKOS Reference
• Information Standards Quarterly, Winter 2012, from ISO 2788 to ISO 25964; ISO; Evolution of Thesaurus Standards
• DITA 1.2 Glossary and Terminology Specialization Feature Description; Kara Warburton; https://www.oasis-open.org/committees/download.php/34831/GlossarySpecializationBestPractice_Final.pdf
• DITA 1.2 Specification; OASIS; http://docs.oasis-open.org/dita/v1.2/spec/DITA1.2-spec.html
• Visual Thesaurus; http://www.visualthesaurus.com
• Getty Thesaurus of Geographic Names® Online; http://www.getty.edu/research/tools/vocabularies/tgn/
• Online Thesauri and Authority Files (note: only some of these work); http://www.asindexing.org/i4a/pages/index.cfm?pageid=3625
• UNBIS Thesaurus, United Nations Bibliographic Information System (UNBIS); https://lib-thesaurus.un.org/LIB/DHLUNBISThesaurus.nsf
• American Society for Indexing; http://www.taxonomies-sig.org/about.htm