DSpace Basic Tutorial Stuart Lewis & Chris Yates [email protected].

43
DSpace Basic Tutorial Stuart Lewis & Chris Yates [email protected]

Transcript of DSpace Basic Tutorial Stuart Lewis & Chris Yates [email protected].

Page 1: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace Basic Tutorial

Stuart Lewis & Chris [email protected]

Page 2: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Information

• Details:– Requires tutorial CD

• DSpace 1.4.2• CD includes DSpace 1.5 alpha

• CD and workbook created by:– Chris Yates

[email protected]

– Stuart Lewis• [email protected]

Page 3: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Information

• Tutorial created by:– Repositories Support Project– http://www.rsp.ac.uk/– [email protected]

• Funded by:– JISC as part of the RepositoryNet

Page 4: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Contents

1. Introduction to DSpace

2. The tutorial CD

3. DSpace technical architecture

4. Users and groups

5. Item structure

6. Metadata and item input, workflows

7. Search and browse

8. Import / export / harvest

Page 5: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Introduction to DSpace

• “DSpace captures your data in any format in text, video, audio, and data. It distributes it over the web. It indexes your work, so users can search and retrieve your items. It preserves your digital work over the long term. DSpace provides a way to manage your research materials and publications in a professionally maintained repository to give them greater visibility and accessibility over time.”– www.dspace.org

Page 6: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Introduction to the tutorial CD

• Intended to be used with tutorial• DSpace version 1.4.2 (includes 1.5 alpha)• Shouldn’t affect your PC• Installs no software• Can be reused• DOES NOT SAVE DATA!• Disclaimer…

Page 7: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace Technical Architecture

• Written in Java– Can be run on any platform that supports

Java• Most installation on Unix (Linux* / Solaris)• Runs on Windows / Mac OS X

– Sun JDK (not GNU)– 1.4 for <= version 1.4.2– 1.5* for >= version 1.5

Page 8: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace Technical Architecture

• Database:– Same machine, or database server

• Postgres*• Oracle

• Web application server– Tomcat*– Jetty– Other

Page 9: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace file layout

• Download– [dspace-src]

• Edit config/dspace.cfg• Build

• Installed– [dspace] (often /dspace/)

• [dspace]/assetstore/• [dspace]/upload/• [dspace]/logs/• [dspace]/bin/• [dspace]/search/

Page 10: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Create the database

• Create a database user– Who will own the database– Called dspace

• Create the database– Called dspace– UNICODE encoding– Owned by the dspace database user

Page 11: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Create the database

• Create a database user– Double click on the ‘Terminal icon’– ‘su - postgres’ password is ‘postgres’– ‘createuser -U postgres -d -P dspace’– password is ‘postgres’

• Create the database– ‘createdb -U dspace -E UNICODE dspace’

Page 12: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Build DSpace

• DSpace needs to be compiled• Uses ‘ant’ build system• Inserts default data into the database

– Table structures– Dublin Core metadata schema– Bitstream formats

• Builds package for the web server• Configuration can be changed

Page 13: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Build DSpace

• ‘cd /dspace142-src/’• ‘gedit config/dspace.cfg’

– Change dspace.name to your name– Save and quit

• ‘ant fresh_install’

• ‘chmod 777 /dspace142/upload’• ‘chmod 777 /dspace142/assetstore’

Page 14: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Deploy to the web server

• .war files are packaged applications

• Tomcat is a Java web application server– Tomcat automatically unpacks .war files

• Two web applications– DSpace & DSpace OAI interface

• Copy .war files to Tomcat directory

• Start Tomcat

Page 15: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Deploy to the web server

• ‘cp build/*.war /var/lib/tomcat5.5/’

• ‘sudo /etc/init.d/tomcat5.5 start’

• Load Firefox– Go to http://localhost:8080/dspace/

Page 16: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace users and groups

• Administrator– bin/create-administrator

• Create more users:– Web user interface– Administrator

• Other authentication methods:– LDAP (LDAP or Active Directory)– Plugable and stackable authentication

Page 17: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace users and groups

• Groups– Members can be users of other groups– E.g. Dept group made up of research

group groups

• User defined or

• Automatically generated for collections

Page 18: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Create DSpace users

• Create an administrator

• Log in

• Log out

• Create a normal user

• Modify a group

Page 19: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Create DSpace users

• Create first administrator– ‘bin/create-administrator’– Answer questions

• Create another user– Administrator pages, ‘E-People’– ‘Add EPerson’

• Promote new user to administrator– Administrator pages, ‘Groups’– Edit ‘Administrator’ group

Page 20: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Communities & collections

• Communities– Often used to represent organisational

units– Can have sub-communities and collections– Can be branded (logo) and have own

policies

• Collections– Holds items

Page 21: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Communities & collections

• ‘Communities / Collections’ link

• ‘Create Top-Level Community’– Enter name, short description– Press ‘Create’ button

• Create collection– Enter name, short description– Add the Administrator group to submitters

Page 22: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace items

• Metadata– One or more metadata schemas– User-entered– System-generated (e.g. accessioned date)

• Files– In bundles– Special bundles (e.g. extracted text,

licences)

Page 23: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace Items

• Items can be mapped across collections– E.g. appear in central and departmental e-

theses collections– Same as a file system symbolic link

• Submissions controlled by input forms– config/input-forms.xml– Input forms

• Controlled vocabularies

Page 24: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

DSpace Items

• input-forms.xml<form-map>

<name-map collection-handle="default" form-name="traditional" />

<name-map collection-handle=”2160/56” form-name=”etheses" />

<name-map collection-handle=”2160/802" form-name=”lawdepartment" />

</form-map>

Page 25: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

input-forms.xml

<form name="traditional"><page number="1">

<field>…

</field><field>

…</field>

</page><page number="…">

…</page>

</form>

Page 26: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

input-forms.xml

<field><dc-schema>schema</dc-schema><dc-element>element</dc-element><dc-qualifier>qualifier</dc-qualifier><repeatable>true/false</repeatable><label>Text label</label><input-type>name/onebox/date/twobox/textarea/dropdown/

qualdrop_value</input-type><hint>Expanded hint</hint><required>Warning to show if not entered</required>

</field>

Page 27: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

input-forms.xml

<value-pairs value-pairs-name=”name" dc-term=”element">

<pair>

<displayed-value>English</displayed-value>

<stored-value>en</stored-value>

</pair>

<pair>

<displayed-value>Welsh</displayed-value>

<stored-value>cy</stored-value>

</pair>

<value-pairs>

Page 28: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

input-forms.xml

<field><dc-schema>dc</dc-schema><dc-element>contributor</dc-element><dc-qualifier>author</dc-qualifier><repeatable>true</repeatable><label>Authors</label><input-type>name</input-type><hint>Enter the names of the authors of this item below.</hint><required></required>

</field>

Page 29: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

input-forms.xml

<field><dc-schema>dc</dc-schema><dc-element>title</dc-element><dc-qualifier></dc-qualifier><repeatable>false</repeatable><label>Title</label><input-type>name</input-type><hint>Enter the main title of the item.</hint><required>You must enter a main title for this item.</required>

</field>

Page 30: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Deposit an item

• Choose collection

• Enter metadata

• Upload file

• Confirm details

• Agree to the licence

Page 31: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Deposit an item

• Enter the collection you created– Tick ‘The item has been published or

publicly distributed before’ - asks extra questions about the publishing (i.e. date / publisher)

– Enter metadata– Upload file (‘/dspace-docs/RSP.pdf’)– Agree to licence– Submit item

Page 32: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Create a workflow

• Three workflows– Accept/reject step

• E.g. Head of research• “Should item be included in the repository?”

– Accept/reject/edit metadata step• E.g. Repository manager

– Edit metadata step• E.g. Librarian

Page 33: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Create a workflow

• Create new collection– Tick ‘This submission will include and

accept/reject/edit metadata step’– Enter name and short description– Add ‘Administrator’ group to workflow

• Submit to the new collection• Go to ‘My DSpace’ to enter the workflow

– ‘Edit Metadata’– ‘Approve’

Page 34: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Search and browse

• Browse– By:

• Author / title / date

– Database driven– Always up to date

• Search– Lucene search engine– Define fields to index in dspace.cfg– Full texts– Not always up to date

Page 35: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Search system initalisation

• Build indexes– Index metadata

• Extract from database

– Index full-texts• Extract from PDF/Doc files• Extra MediaFilters can be written

Page 36: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Search system initalisation

• Search ‘Aberystwyth’– No results

• Run:– ‘bin/filter-media’

• Extract full texts• (create thumbnails)• Build indexes

• Search ‘Aberystwyth’– See results!

Page 37: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*Scheduled background jobs

• filter-media– Extract texts and build indexes

• sub-daily– Email subscription emails

• checker– checks bitstream checksums

• stat-* – statistics

Page 38: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

*RSS feeds and thumbnails

• Configured in dspace.cfg

• RSS feeds:– webui.feed.enable = [false|true]– webui.feed.localresolve = [false|true]

• Thumbnails:– webui.item.thumbnail.show = [true|false]– webui.browse.thumbnail.show = [false|true]

Page 39: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

RSS feeds and thumbnails

• ‘gedit /dspace142/config/dspace.cfg’– Set webui.feed.enable to true– Set webui.feed.localresolve to true

• ‘sudo /etc/init.d/tomcat5.5 restart’• Upload new item with PNG file

– Upload png from /home/dspace/examples/– ‘/dspace142/bin/filter-media’– See thumbnail

Page 40: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Import / export

• See docs

• Bulk import command line tool– Imports one item per directory– Multiple files / metadata file / contents file

• Bulk exporter– Writes same file format– Adds file containing handle (for re-import)

Page 41: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Import / export

archive_directory/item_000/

dublin_core.xml -- qualified DC metadata

contents -- one line per filename file_1.doc -- files to be added file_2.pdf

[dspace]/bin/dsrun org.dspace.app.itemimport.ItemImport --add

--eperson= [email protected] --collection=collectionID --source=items_dir --mapfile=mapfile

Page 42: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

Harvesting / OAI-PMH

• OAI-PMH interface– Separate web application– /dspace-oai/– /dspace-oai/request?verb=– /dspace-oai/request?verb=Identify– /dspace-oai/request?verb=ListSets– /dspace-oai/request?verb=GetRecord– /dspace-oai/request?verb=ListIdentifiers– /dspace-oai/request?verb=ListMetadataFormats– /dspace-oai/request?verb=ListRecords

Page 43: DSpace Basic Tutorial Stuart Lewis & Chris Yates support@rsp.ac.uk.

The end…

• Incomplete– Lots lots more!– Email [email protected]– Email dspace-tech email list

• Advanced tutorial this afternoon– Or surgery / open discussion / demos