Maja Žumer: Library catalogues of the future: realising the old vision with new tools
-
Upload
uisk-ff-uk -
Category
Education
-
view
104 -
download
2
Transcript of Maja Žumer: Library catalogues of the future: realising the old vision with new tools
Libraries of the future: realising the
old vision with new tools
Maja Žumer
University of Ljubljana
Slovenia
Prague, March 3, 2014
Library catalogue
• librarian‘s tool
• user‘s tool
– point of entrance to the library collection
– finding relevant materials
– getting an overview of the collection
2
3
Theoretical studies
• Panizzi (1841)
• Cutter (1876)
• Paris Principles (1961)
• IME-ICC (2009)
4
Charles Ammi Cutter: Rules for a Printed Dictionary Catalog(1876) The objectives of a catalogue are: 1. to enable a person to find a book of which either (Identifying
objective) • the author • the title • the subject is known. 2. to show what the library has (Collocating objective) • by a given author • on a given subject • in a given kind of literature 3. to assist in the choice of a book (Evaluating objective) • as to its edition (bibliographically) • as to its character (literary or topical)
5
Paris Principles (1961)
The catalogue should be an efficient instrument for ascertaining
2.1 whether the library contains a particular book specified by (a) its author and title, or
(b) if the author is not named in the book, its title alone, or
(c) if author and title are inappropriate or insufficient for identification, a suitable substitute for the title; and
2.2 (a) which works by a particular author and
(b) which editions of a particular work are in the library.
6
Why the need for change?
• New developments (computers, databases, new materials)
• New user needs, expectations and demands
What is different?
• Libraries are facing competition for the first time
• Library catalogues are not perceived as intuitive – compared to other tools and services
• Users actively avoid using the catalogue even when they want to borrow a book
• „Everything is on the Web“
• Users expect simple tools which do not require specific training
7
Are libraries aware of the changes?
• Not completely
– Libraries are not questioning (enough) their tools
– They are relying on tradition
• But there are discussions and developments:
– New models
– Awareness of new tools and services (e.g. Semantic Web)
– Assuming new roles or performing them in a new way (e-learning, data curation)
8
Development of formats
9
NUL 1774-1900
10
NUL 1900-1947
11
ISBD format
12
MARC
13
DC in XML
14
Changes in OPAC interfaces
• Replica of card catalogue
• Web OPAC
• Faceted OPAC
• Catalogue 2.0
15
Traditional OPAC
16
New generation catalogue
17
18
Why the need for change?
• New user needs, expectations and demands
• Library metadata is not (should not be) isolated
• Libraries should expose and fully utilise the quality of their metadata
What is wrong with current catalogues?
• Long displays of similar records
• No meaningful clustering
• No (or not enough) relationships
• Zero hits
• Authority data not exposed
• Exploration not supported
19
• New systems are needed
• We need thorough understanding of the bibliographic universe
• Conceptual models as the necessary foundation
20
21
Functional Requirements for Bibliographic Records
• Approved by IFLA in 1997
• Published in 1998
• Conceptual model of the ‘bibliographic universe’
22
User functions/tasks
• using the data to FIND materials that correspond to the user's stated search criteria
• using the data retrieved to IDENTIFY an entity (e.g., to confirm that the document described corresponds to the document sought by the user, or to distinguish between two similar documents)
• using the data to SELECT an entity that is appropriate to the user's needs (e.g., to select a text in a language the user understands, or to choose a version of a computer program that is compatible with the hardware and operating system available to the user)
• using the data in order to acquire or OBTAIN access to the entity described
23
User functions/tasks
• FIND: reduce the set to a manageable size • IDENTIFY: understand exactly the nature of the
resource described by metadata • SELECT: among the relevant resources choose the
ones which are suitable • OBTAIN: get access to the resource
• EXPLORE: browse the collection to get acquainted
with ot and/or locate interesting resources
24
FRBR Entities
Group 1:
(products of intellectual & artistic endeavor)
• Work
• Expression
• Manifestation
• Item
WORK: a distinct intellectual or artistic creation
EXPRESSION: the intellectual or artistic realization
of a work in the form of alpha-numeric, musical, or choreographic notation, sound, image, object, movement, etc., or any combination of such forms
MANIFESTATION: the physical embodiment of an
expression of a work.
ITEM: a single exemplar of a manifestation.
25
26
Work
Expression
Manifestation
Item
is realized through
is embodied in
is exemplified by one
many
Group 1 Conceptual/content
Physical/recording
27
Group 2:
(agents related to Group 1 entities)
•Person
•Corporate body
•Family (added by FRAD)
28
Work
Expression
Manifestation
Item
Group 2
is owned by
is produced by
is realized by
is created by
Person
Corporate Body
29
Subjects of works:
• Group1 and Group 2
• Group 3
• Concept
• Object
• Event
• Place
30
31
32
LC Control No. : 47023612
LCCN Permalink : http://lccn.loc.gov/47023612
Type of Material : Book (Print, Microform, Electronic, etc.)
Personal Name : Shakespeare, William, 1564-1616.
Main Title : ... Hamlet, traduit par Andre Gide.
Published/Created : [Paris] Gallimard [1946]
Description : 2 p. l., 7-237, [2] p. 17 cm.
CALL NUMBER : PR2779.H3 G5Copy 1
-- Request in : Jefferson or Adams Bldg General or
Area Studies Reading Rms
Shakespeare, William, 1564-1616. Hamlet. French.
33
LC Control No. : 47023612
LCCN Permalink : http://lccn.loc.gov/47023612
Type of Material : Book (Print, Microform, Electronic, etc.)
Personal Name : Shakespeare, William, 1564-1616.
Main Title : ... Hamlet, traduit par Andre Gide.
Published/Created : [Paris] Gallimard [1946]
Description : 2 p. l., 7-237, [2] p. 17 cm.
CALL NUMBER : PR2779.H3 G5Copy 1
-- Request in : Jefferson or Adams Bldg General or
Area Studies Reading Rms
Shakespeare, William, 1564-1616. Hamlet. French.
Work
34
LC Control No. : 47023612
LCCN Permalink : http://lccn.loc.gov/47023612
Type of Material : Book (Print, Microform, Electronic, etc.)
Personal Name : Shakespeare, William, 1564-1616.
Main Title : ... Hamlet, traduit par Andre Gide.
Published/Created : [Paris] Gallimard [1946]
Description : 2 p. l., 7-237, [2] p. 17 cm.
CALL NUMBER : PR2779.H3 G5Copy 1
-- Request in : Jefferson or Adams Bldg General or
Area Studies Reading Rms
Shakespeare, William, 1564-1616. Hamlet. French.
Expression
35
LC Control No. : 47023612
LCCN Permalink : http://lccn.loc.gov/47023612
Type of Material : Book (Print, Microform, Electronic, etc.)
Personal Name : Shakespeare, William, 1564-1616.
Main Title : ... Hamlet, traduit par Andre Gide.
Published/Created : [Paris] Gallimard [1946]
Description : 2 p. l., 7-237, [2] p. 17 cm.
CALL NUMBER : PR2779.H3 G5Copy 1
-- Request in : Jefferson or Adams Bldg General or
Area Studies Reading Rms
Shakespeare, William, 1564-1616. Hamlet. French.
Manifestation
36
LC Control No. : 47023612
LCCN Permalink : http://lccn.loc.gov/47023612
Type of Material : Book (Print, Microform, Electronic, etc.)
Personal Name : Shakespeare, William, 1564-1616.
Main Title : ... Hamlet, traduit par Andre Gide.
Published/Created : [Paris] Gallimard [1946]
Description : 2 p. l., 7-237, [2] p. 17 cm.
CALL NUMBER : PR2779.H3 G5Copy 1
-- Request in : Jefferson or Adams Bldg General or
Area Studies Reading Rms
Shakespeare, William, 1564-1616. Hamlet. French.
Item
37
“The FRBR revolution”
• User-oriented view of bibliographic data
• Catalogue as a database and not a replica of a card catalogue
• Re-thinking of current cataloguing theory and practice
• potential for interoperability within ALM, rights management, Semantic Web
38
The “FRBR family”
• FRBR: the original framework – All entities, focus on Group 1
• FRAD: Functional Requirements for Authority Data – Focus on Group 2
– Accepted in 2009, published in 2010
• FRSAD: Functional Requirements for Subject Authority Data – Focus on ‘aboutness’
– Accepted in 2010, published in 2011
39
FRAD
• Family added in Group 2
• Name as a separate entity
• Justify and Contextualise added
40
FRAD
41
FRSAD: established in April 2005
Terms of reference:
• to build a conceptual model of Group 3 entities within the FRBR framework as they relate to the aboutness of works,
• to provide a clearly defined, structured frame of reference for relating the data that are recorded in subject authority records to the needs of the users of those records, and
• to assist in an assessment of the potential for international sharing and use of subject authority data both within the library sector and beyond.
42
Challenges of subject access
• Is it possible to objectively determine the topic(s) of each work?
• In what context will the users need a particular work in the future?
• Granularity of topics
• Specificity of topics
• How to represent the subject of a work?
• How will the user formulate the query?
• Different knowledge organisation systems
• Subject searching is difficult for users
43
FRSAR WG Activities
• User surveys on the Use of Subject
Authority Data
a. Pilot with Non Library & Information
Professionals
b. Library & Information Professionals
• Investigation of user tasks
• Entities, attributes and relationships
User tasks FRSAD:
Find
Identify
Select
Explore
FRBR :
Find
Identify
Select
Obtain
FRAD:
Find
Identify
Contextualize
Justify
44
Products of
intellectual or
artistic
endeavours
An additional
set of entities
that serve as
the subjects of
works
Agents related
to Group 1
.
Extension of FRBR Figure 3.3 "Group 3 entities and 'subject' relationships"
Family added by FRAD
45
46
FRBR Group 3 entities….
This part of the model has been criticized,
because it does not include time and does not
cover well activities and processes (e.g.,
Heaney, 1997; Delsey, T. 2005)
47
Study and Discussions
Different scenarios discussed:
• Keep FRBR Group 3 entities and only
analyse attributes and relationships.
• Add time to the FRBR list.
• Take Ranganathan’s facets as the basis.
• Take <indecs> as the basis.
• Make a pragmatic list of entities. One
example is the one by Buizza and Guerrini
• Propose something new
48
Two small tests:
Four students and faculty members at Kent State Library school classified existing subject terms used by the NSDL (National Science Digital Library) contributors. These include 3 thousand terms assigned based on a variety of subject vocabularies and free keywords.
Professor Lois Chan classified the subject headings from LCSH she included in her books.
They classified terms into six categories: concrete stuff, abstract stuff, event, time, place, and others.
49
Test Results
• Blurred distinction between concrete and abstract
• Confusion about proper names
• Terms are put into ‘other’ category
• This categorisation is not generally applicable or useful
• There is no generally applicable categorisation
50
3.2 Models discussed Kent & Dublin meetings, 2006-12
51
FRSAD– generalisation of FRBR
FRSAD
52
Nomen: any alpha numeric, sound, visual etc. symbol or combination of symbols by which a thema is known, referred to or addressed as
Thema: anything that can be subject of a work
53
Nomens
1-8
Nomen 9
Attributes of thema
• Type
• Note (description)
54
55
Types of thema
Depending on the implementation (domain
and/or KOS), thema is categorised
• FRBR: object, concept, event, place, +…
• Class and instance
• Medicine: medical condition, symptom,
treatment, substance…
• …
56
Nomen attributes (include but not limited to)
– Type of nomen (note: see next slide)
– Scheme: system/vocabulary (LCSH, UDC,…)
– Reference source
– Representation (alphanumeric, sound, visual,...)
– Language (English, Japanese, Slovenian,…)
– Script (Cyrillic, Korean, Chinese-simplified,…)
– Transcription/transliteration
– Encoding (Latin-1, UTF-8,…)
– Form (long, short, formula…)
– Time of validity
– Audience: Community, for which this is the preferred form
– Status (provisional, accepted, official,...)
57
Types of nomen
• Identifier (= name, assigned to an entity,
which is persistent and unique within a
domain)
• Controlled name (= name constructed in
authority control/vocabulary maintenance
process, which usually serves as access point) (note: called Controlled access point in FRAD)
• Implementation-specific types, e.g.:
– Defined by originating system
– Defined by language
– ….
58
Thema relationships General relationships between themas
(applicable to all types)
• Hierarchical
– Partitive
– Generic
– Instance
– Polyhierarchical
• Associative (=other)
Other thema-to-thema relationships are
implementation-dependent
59
Nomen relationships
• Partitive
• Equivalence (nomens of the same thema)
Equivalence can be specified further, e.g.:
• Replaces/is replaced by
• Has variant form/is variant form
• Has derivation/is derived from
– Has acronym/is acronym
– Has abbreviation/is abbreviation
– Has transliterated form/is transliteration
thema –thema relations
place as
thema
nomens
thema types (place-specific)
nomen –nomen
relations
60
61
The importance of the THEMA-NOMEN
model to the subject authority data
• to separate what are usually called concepts (or topics, subjects, classes [of concepts]) from what they are known by, referred to, or addressed as.
• A general abstract model, not limited to any particular domain or implementation
• Potential for interoperability within the library field and beyond
Functional Requirements for
Subject Authority Data (FRSAD)
- A Conceptual Model
by Marcia Lei Zeng, Maja Žumer
& Athena Salaba (Eds.)
Series: IFLA Series on
Bibliographic Control 43
Publisher: De Gruyter Saur
62
Why FRBR?
• Not a replica of a card catalogue
• Bibliographic universe presented as a network - relationships
• Supports exploration
• Is intuitive
63
64
FRBR: intuitive?
• Declaratively user-oriented
• No user studies
• No completely FRBR-based implementation
• One way to find out: mental model elicitation
65
Study
• Do mental models resemble the conceptual model?
• 30 participants
• Ljubljana
• July 2007 – February 2008
• We only looked at Group 1 entities
Tasks
• Card sorting
• Concept mapping
• Comparison of examples
66
67
1. Card sorting
• Abstract/concrete nature of the things described
• Cards: plain descriptions of instances of FRBR entitites
• No expression/manifestation groupings (Work – Editions – Copies)
• Original expressions with works, other expressions with manifestations
68
2. Concept mapping
• “What comes out of what?”
69
9
7
11 14
8 12
1 13
2 3
4
5
9
7 10
8
11
12
14
13
5
2
3
1
4
11 13 2 12
8 5 3
1
10
7
9
11
4
12
11
1 13 7 10
2
4
3
5 8
14
12
12
8
9
7
14
13 1
5 2 3
4 11
10
Examples of individual graphs
70
71
Task 2: Results
• Most common connections were FRBR-like
• Core group of mental models close to FRBR
72
3. Comparison task
• 1. Interviews
• 2. Rankings
• 11 pairs of similar objects (mostly books)
73
74
Rankings
• Ranking pairs according to their perceived substitutability from the most substitutable to the least substitutable
• Pairs could be on the same level of substitutability
75
Pair Average Rank
Parma 2
Koča 2
Kačič 3.8
Bulgakov 3.9
Mystery 4.8
Kam 7.2
Skrivnost 7.4
Economics 7.7
Africa 7.8
Poirot 9.6
Room 9.7
76
Conclusions of the study
• There is no single mental model
• The more people think about bibliographic universe and the more they interact with it, the more FRBR-like their mental models are
• Results of user study indicate that FRBR can be used as conceptual basis for catalogs
• Positioning of the original expression in the model (often seen as surrogate of work)
Published in
• PISANSKI, Jan, ŽUMER, Maja. Mental models of the bibliographic universe : part 1 : mental models of description. J. Doc., 2010, vol. 66, no. 5, str. 643-667.
• PISANSKI, Jan, ŽUMER, Maja. Mental models of the bibliographic universe : part 2 : comparison task and conclusions. J. Doc., 2010, vol. 66, no. 5, str. 668-680.
77
Continuation
• Based on Task 2
• List of descriptions+six graphs (including FRBR graph)
• 6 groups of 10 students – for each of two examples
• „Which graph is the best representation of the relationships between entities listed?“
78
No. Description
1 The novel The Da Vinci Code by Dan Brown.
2 Da Vincijeva šifra translated by Nataša Müller, published in 2004 by Mladinska knjiga in Ljubljana. It has 473
pages. 3 Signed copy of The Da Vinci Code from 2003, owned by Lojze B., a book collector.
4 The first edition of The Da Vinci Code, published in 2003 by Doubleday in New York. It has 455 pages.
5 Copy of Da Vincijeva šifra from 2004, located in the local library in Spodnja vas with call number 234.
6 Translation of the novel The Da Vinci Code into Slovene by Nataša Müller.
7 Copy of Da Vincijeva šifra from 2006, located in the local library in Spodnja vas. The call number is 567.
8 Da Vincijeva šifra translated by Nataša Müller, published in 2006 by Mladinska knjiga in Ljubljana. It has 511
pages. 9 Copy of The Da Vinci Code from 2006, located in the local library in Spodnja vas. The call number is 123.
10 Edition of The Da Vinci Code, published in 2006 by Anchor Books in New York. It has 489 pages.
11 Brown’s original English text of the novel The Da Vinci Code.
12 Copy of Il Codice da Vinci from 2003 with the call number DAV2, which Lojze B. borrowed from library in
Koper 13 Copy of Il codice da Vinci from 2003, located in library in Koper. It‘s call number is DAV1.
14 Edition of Il codice da Vinci translated by Riccardo Valla, published in 2003 by Mondadori publishing house in
Milan. It has 523 pages. 15 Translation of The Da Vinci Code in Italian by Riccardo Valla. 79
Graph 6, example 1
80
Results Major Ex. 1 Ex. 2 Ex. 3 Ex. 4 Ex. 5 Ex. 6 Own
Computer
Studies
2 + 2 0 + 1 0 + 1 1 + 1 6 + 4 1 + 1 0 + 0
Design 0 + 1 0 + 2 1 + 1 1 + 0 3 + 4 3 + 2 2 + 0
Economics 0 + 0 0 + 1 0 + 1 0 + 2 8 + 4 2 + 2 0 + 0
History 1 + 1 1 + 3 1 + 1 0 + 2 5 + 3 0 + 0 2 + 0
Medicine 2 + 1 1 + 0 0 + 0 1 + 1 4 + 6 2 + 2 0 + 0
Social
Studies
1 + 0 0 + 0 0 + 1 1 + 0 7 + 9 1 + 0 0 + 0
Total 6 + 5 2 + 7 2 + 5 4 + 6 33 + 30 9 + 7 4 + 0
81
• PISANSKI, Jan, ŽUMER, Maja. User verification of the FRBR conceptual model. J. Doc., 2012, vol. 68, no. 4, str. 582-592.
82
If FRBR is the model, why not implement it immediately?
• Harmonisation of the FRBR family
• Frbrisation
• Presentation of search results
• Semantic web (formats, identifiers)
83
Presentation of search results
Currently
• Lists of manifestations
• No meaningful order
• Relationships missing or not evident
• Exploration not supported
Visualisation as a possible scenario
84
M
M
M
M
M
M
M
M
M
E E E E E E E E
W W W W W W
85
W
E E E E E E E E
W W W W W W
M M M M M M M M M
W
W
W
W W
W W
W
W W W
W
W W
W
W
W
W
W W
W
W
W
W W
W W
novel
motion picture
musical
picture book
play
illustrations
literary criticism
novel
novel
literary criticism
TV
documentary
86
PROTOTYPE FrbrVis
87
indented hierarchy radial
treemap sunburst
4 hierarchical scenarios
88
User study
• 4 visualisations + baseline traditional
• Visualisations superior in performance
• Visualisations preferred
• Indented hierarchy and sunburst chosen as best
• Very positive feedback
89
Published in
• MERČUN, Tanja, ŽUMER, Maja, AALBERG, Trond. Presenting and exploring the complexity of bibliographic relationships. Lect. notes comput. sci., str. 63-66. http://link.springer.com/chapter/10.1007/978-3-642-34752-8_8#page-1, doi: 10.1007/978-3-642-34752-8_8.
• MERČUN, Tanja, ŽUMER, Maja, AALBERG, Trond. FrbrVis : an information visualization approach to presenting FRBR work families. Lect. notes comput. sci., str. 504-507. http://www.springerlink.com/content/978-3-642-33289-0/#section=1132939&page=1&locus=16, doi: 10.1007/978-3-642-33290-6_60.
• More to come… 90
The future of libraries
• Competition with other information providers and their tools
• Accepting change
• Important contribution to the Semantic Web
91
Utilising the technology to finally realise the old goal:
Reconfirm the library catalogue (and bibliographic information in general) as an important building block of the global information infrastructure
92
Shouldn‘t this be our motto, too?
OUTSTANDING PAST
BRIGHT FUTURE
(borrowed from University of Malta)
93