OAI-2003 1 Science & Culture Developing a Knowledge Site in Distributed Information Environments...

Post on 27-Mar-2015

212 views 0 download

Tags:

Transcript of OAI-2003 1 Science & Culture Developing a Knowledge Site in Distributed Information Environments...

OAI-2003 1

Science & CultureDeveloping a Knowledge Site in Distributed

Information Environments

OAI-ForumBath, September 5th, 2003

Ann BordaAlpay BelerNick Wyatt

Science MuseumLondon UK

OAI-2003 2

Science & Culture 1 Large scale internet project funded by the New Opportunities

Fund (Lottery funding) National Museum of Science & Industry (NMSI)

– the Science Museum (London)– Science and Society Picture Library (London)– National Railway Museum (York)– National Museum of Photography, Film & Television

(Bradford) Audience – life long learners

OAI-2003 3

Science & Culture 2 Aim of the website:

– to make a rich quantity of material & collections accessible, – to contextualise through intelligent display, searching

(resource discovery) and relational linking. – to develop user-focused activities and personalisation tools

that are supported by these resources

OAI-2003 4

Science & Culture 3

Sourced content:– 40,000 digitised images and accompanying text

records– 30,000 library records– 10,000 object records – 50 narrative topics

OAI-2003 5

Issues Different types/functions of legacy systems

Different data standards and platforms in use Diffuse coordination of 'repurposable' content Mix of non-networked and networked systems Varied database connectivity Costs and time in upgrading and configuration

OAI-2003 6

Source Systems OverviewLocation Source

Database Function Type of System

National Railway Museum (York)

iBase Image management Microsoft Data Engine (MSDE) v. 7.

National Museum of Photography, Film & Television (Bradford)

iBase Image management Microsoft Data Engine (MSDE) v. 7.

Science Museum – Science and Society Picture Library (London)

Capture Picture library database File Maker Pro + C++

Science Museum (London)

MultiMIMSY 2000

Collections management Oracle 8.0 running on a Microsoft Windows 2000 server.

Science Museum Library (London)

Unicorn Library management C/ISAM (Informix) with BRS-Search running on SunOS5.7 (Solaris 7).

OAI-2003 7

Primary Records

Library (Unicorn) – AACR2, Marc21Object (MultiMimsy) – SpectrumImage records

– Capture (SSPL) – local guidelines– Ibase (NRM) – local guidelines– Ibase (NMPFT) – local guidelines

Images

OAI-2003 8

Cataloguing

Agreed definitions of DC elements Agreed list of qualifiers Core fields for export NMSI-wide image cataloguing

guidelines Procedures for cataloguing objects Cataloguing standards document

OAI-2003 9

Authority Control

Importance Authority fields People – authorised name, normalised name, dates,

biography etc. Authority lists for

– people – organisations – places – events/periods

On-going process

OAI-2003 10

Data Mapping

Agreed mappings to DC elements Need for concatenation, e.g. names Matching with authority files Linking to other record types Mapping table Need for vigilance & checking

OAI-2003 11

‘Interim’ Database The logical design would focus on:

Simplicity and efficiency of retrieval of informationOptimisation and consistency of information

NOT a cataloguing database a collection point for data DC fields as primary data structure enable data ‘normalisation’ export as DC fields in XML wrapper end-purpose to populate the web CMS

OAI-2003 12

Export Formats

Location Source Database Export

National Railway Museum(York)

iBase XML

National Museum ofPhotography, Film &Television (Bradford)

iBase XML

Science Museum – Scienceand Society Picture Library(London)

Capture CSV

Science Museum (London) MultiMIMSY 2000 XMLScience Museum Library(London)

Unicorn Tab delimited

OAI-2003 13

XML RECORD- <dcschema xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="D:\NOF\dcschema.xsd">- <dc_record> <dc_identifier>1980-108</dc_identifier>- <dc_creator1> <name>Coster, Salomon, d. 1659</name> <role>maker</role> </dc_creator1> <dc_relation relationType="Object">T/1980-108</dc_relation> <dc_relation relationType="Image">TIM100282</dc_relation> <dc_relation relationType="Image">10326522</dc_relation> <dc_subject subjectType="Controlled">pendulum clock</dc_subject> <dc_type>physical object | text</dc_type> <dc_format>text | xml</dc_format> <dc_title>’Haagsche Klokje’, pendulum clock, c. 1657.</dc_title> <dc_description>This clock by Salomon Coster (d. 1659) of the Netherlands is one of theearliest pendulum clocks ever made. The Dutch scientist Christiaan Huygens (1629-1693)designed the first successful pendulum clock in 1656. He worked with Coster, anexperienced clockmaker, to apply his new invention to commercial use. In 1657 Huygenshad the patent protecting his invention assigned to Coster, but sadly Coster died suddenlyin 1659 after producing only a few pendulum clocks. This is one of only seven pendulumclocks made by Coster known to have survived. The application of the pendulum clock totimekeeping during the scientific revolution of the 17

th century was arguably the most

fundamental advance in the history of time measurement.</dc_description> <dc_date_created>c. 1657</dc_date_created> <dc_subject_broad2>Time Measurement</dc_subject_broad2> <dc_cover_spatial>World > Europe > Netherlands > The Hague</dc_cover_spatial> <dc_language>en</dc_language> <audit_ok_for_nof>true</audit_ok_for_nof> </dc_record>

OAI-2003 14

Image management 3 image management systems need to reference image files with the text records text records to retain ‘mapped’ reference to image

files upon export ability to process TIFF images to JPEGs for web

Solution: Interim database back-end is an iBase system to

handle image relationships & processes

OAI-2003 158

Metadata & resource discovery

Common metadata set

Common syntax

Consistency of content

Dublin Core

XML/DOM/Common validation

Rules for contentCommon approachCataloguing rules

OAI-2003 16

Digitisation Infrastructure

OAI-2003 17

Web Content Management System (CMS)

COM database MS based architecture & server platform specified to handle XML and DC elements ability to handle metadata from the interim database. ability to manage relationships

ability to handle image references and image files

OAI-2003 18

Ingenious Site

Subject relatedResources

Subject relatedDebate

T OPIC 1See separate

structure

T OPIC 2

SUBJECT S(Editorial)

READ

UserRegistration

DebateDebate List

DEBAT E

Im agesLibrary records

ObjectsResources

SEARCH(Subject

Categories)

SEE

UserRegistration

M y Lightbox M y Gallery M y Links

CREATE

HOM E

OAI-2003 19

Topic Structure

Core Topic Text

Activity

“Relevance Toolbar”

Biography

Voices

Unusual Takes

Glossary

Debate

Related Images

Related Stories

Related Library

Records

 

OAI-2003 20

Collection Level Description

DC & RSLP CLD elements used Allows consistency for searching Overall CLD for whole site Subject Level CLD Topic Level & Topic Part CLD

OAI-2003 21

Site CLD<META NAME="DC.Title" CONTENT="Ingenious">

<META NAME="DC.Description" CONTENT="This site makes connections between people, innovations and ideas. It contains images and other resources illustrating human endeavour and development from the Science Museum, National Railway Museum and National Museum of Photography, Film & Television. Subjects and topics put these images in context, giving historical and cultural insights on current issues in science, technology and medicine.">

<META NAME="DC.Rights" CONTENT="http://www.sciencemuseum.org.uk/copyright/copyright.asp">

<META NAME="DC.Creator" CONTENT="Science Museum | National Railway Museum | National Museum of Photography, Film & Television">

<META NAME="DC.Publisher" CONTENT="National Museum of Science & Industry">

<META NAME="DC.Language" CONTENT="en-uk">

<META NAME="DC.Type.category" CONTENT="collection">

<META NAME="DC.Format" CONTENT="text/html">

<META NAME="DC.Date.created" CONTENT="2003”>

<META NAME="DC.Identifier" CONTENT="http://193.71.79.113/">

<META NAME="DC.Subject.LCSH" CONTENT="Culture | Science | Technology | Medicine | Photography | Transport | Railroads | Industries | History">

<META NAME="DC.Relation.HasPart" CONTENT="sacsub01 | sacsub02 | sacsub03 | sacsub04 | sacsub05 | sacsub06 | sacsub07 | sacsub08 | sacsub09 | sacsub10 | sacsub11 | sacsub12">

OAI-2003 22

SEARCH - Subject Metadata

OAI-2003 23

Topic metadata<!-- DC begin --><META NAME="DC.Title" CONTENT="Home and away; ">

<META NAME="DC.Identifier" CONTENT="sacsub04; ">

<META NAME="DC.Description" CONTENT=" ; ">

<META NAME="DC.Format" CONTENT="text/html">

<META NAME="Robots" CONTENT="all">

<META NAME="DC.Language" CONTENT="en-uk">

<META NAME="DC.Type.category" CONTENT="collection">

<META NAME="DC.Creator" CONTENT="; ">

<META NAME="DC.Date.Created" CONTENT="2003">

<META NAME="CLD:hasLocation" CONTENT="; ">

<META NAME="DC:Subject.SAC" CONTENT="Home and away; ">

<META NAME="DC:Subject.Broad2" CONTENT="Cinematography & Film | Entertainment | Photography | Photography: Equipment | Radio | Sound Reproduction & Acoustics | Television | Computing & Data Processing | Clothing | Domestic Life & Household Management | Firemaking | Lighting | Social & Economic Life | Sports & Pastimes | Building Construction & Architecture | Civil Engineering | Firefighting | Plastics ; ">

<META NAME="DC:Subject.Keywords" CONTENT="; ">

<META NAME="DC.Rights" CONTENT="http://www.sciencemuseum.org.uk/copyright/copyright.asp">

<META NAME="CLD:Owner" CONTENT="National Museum of Science & Industry">

<META NAME="DCQ:hasPublication" CONTENT="sactopic19 | sactopic20 | sactopic21 | sactopic22; ">

<META NAME="DCQ:isPartOf" CONTENT="Ingenious; ">

<META NAME="CLD:isDescriptionOf" CONTENT="NMSI Digitised resources; "><!-- DC end -->

OAI-2003 24

Sustainability - Present Conform to future standards (e.g. Dublin Core) Repurposable data in an XML wrapper: ‘Create

once and use many’ Customisable according to needs Ability to share data across organisations Multiple platform delivery: different channels Community building

OAI-2003 25

Science & Culture - Repurposing

OAI-2003 26

Future - Semantic Web

Search Engines

Knowledge ManagementSystems & Applications

HTML WebPages

Agents

MetadataDublin CoreRDF SchemaXML

Resource DiscoveryServices

Information Retrieval

Taxonomies

OAI-2003 27

Future – Creating Communities Interest groups linked to subject hierarchies Groups generate knowledge Knowledge added to existing “formal data” Taps into “informal” knowledge

– unwritten– oral– practical

The nature of knowledge on the site changes

OAI-2003 28

Presented by

Ann Borda, NOF Project ManagerA.Borda@nmsi.ac.uk

Alpay Beler, NOF IS ArchitectA.Beler@nmsi.ac.uk

Nick Wyatt, NOF Metadata ArchitectN.Wyatt@nmsi.ac.uk

Science Museum, London