Wikisource - Where we are, where we want to go

22
Wikisource Where we are Where we want to go Andrea Zanni Wikimedia Italia Wikimania 2012

description

These slide were presented at Wikimania 2012, in Washington DC. the talk regarded Wikisource as a digital library, his strenghts and weaknesses, and above all his possible future.

Transcript of Wikisource - Where we are, where we want to go

Page 1: Wikisource  - Where we are, where we want to go

Wikisource

Where we areWhere we want to go

Andrea ZanniWikimedia Italia

Wikimania 2012

Page 2: Wikisource  - Where we are, where we want to go

The library of Babel

“The universe (which others call the library)...”

J. L. Borgeshttp://en.wikipedia.org/wiki/The_Library_of_Babel

Page 3: Wikisource  - Where we are, where we want to go

What are digital libraries?

Nobody really knows, but many agree on requirements:

1. Collection2. Metadata3. Services4. People

Page 4: Wikisource  - Where we are, where we want to go

Collection

● Reliability● Readability ● Curation ● Quality

Page 5: Wikisource  - Where we are, where we want to go

It's a kind of magic..Metadata are used for ● cataloging ● indexing● retrieving ● archiving ● communicating ● preserving information.

If we want to deal with information, we need metadata.

Page 6: Wikisource  - Where we are, where we want to go

It's a kind of magic..Metadata are used for ● cataloging ● indexing● retrieving ● archiving ● communicating ● preserving information.

If we want to deal with information, we need metadata.

Page 7: Wikisource  - Where we are, where we want to go

Metadata

On Wikisource metadata contains information about books and authors

● in simple text● human-readable ● no standard● not interoperable

… no magic :-(

Page 8: Wikisource  - Where we are, where we want to go

Services

Everything that is beyond books:

● reference● (all kind of) categories ● lists● links● context● disambiguation ● redirects

Page 9: Wikisource  - Where we are, where we want to go

People

Librarians (and users) form the community (we are not Google books!)

● curation → books, project, policies ● empowerment → from users to librarians

Page 10: Wikisource  - Where we are, where we want to go

5th law of Library Science

“The library is a growing organism”

S. R. Ranganathanhttp://en.wikipedia.org/wiki/Five_laws_of_library_science

Page 11: Wikisource  - Where we are, where we want to go

Hyperlibrary: Xanadu 0.1

“Can you imagine that they used to have libraries where the books didn't talk to each other?”

Marvin Minsky[citation needed]

Page 12: Wikisource  - Where we are, where we want to go

“Collaboratory”

read write

laboratory

● Tools → framework → other tools ● MediaWiki, js, templates, python, bot, API,

toolserver, ...

library

Page 13: Wikisource  - Where we are, where we want to go

The future

Page 14: Wikisource  - Where we are, where we want to go

Interoperability

● Bibliographic data from OCLC, Open Library, catalogs.

● Disseminate metadata and full text (OAI-PMH)

● Wikisource API

Page 15: Wikisource  - Where we are, where we want to go

ePub

Fresh generated ePub on the rocks (via ePub converter)

● outreach● eReader apps

Page 16: Wikisource  - Where we are, where we want to go

Classification

Potential of MediaWiki categories:● Colon classification● subjects from National Libraries● thesauri and ontologies

Page 17: Wikisource  - Where we are, where we want to go

Microcontribution

“the more simple and small task is, the wider audience you get”

● Citizen science (Galaxy Zoo, Ancient Lives)● from page unit to word unit

More: WikiCaptcha (next presentation same room!)

Page 18: Wikisource  - Where we are, where we want to go

New architecture on djvu

● in-line transcription● high granularity● save text directly on djvu● multiple layers

Page 19: Wikisource  - Where we are, where we want to go
Page 20: Wikisource  - Where we are, where we want to go

Xanadu 0.2

Systematic use of transclusion● Interwiki● Wikiquote● Wikipedia● Blogs, websites, etc.

Page 21: Wikisource  - Where we are, where we want to go

Born-digital documents processNo specific process: must pass through the whole process for digitized files

1) Djvu2) OCR3) Commons4) Transcription

Collaboration with repositories and digital libraries (scientific articles, thesis, free documents).

Page 22: Wikisource  - Where we are, where we want to go

● Email: [email protected]● Nickname: Aubrey● Skype: aubreymcfato

Feedbacks?