ForgetIT Project TYPO3Camp Milano 2014

78
http://www.forgetit-project.eu

description

ForgetIT – Some store to remember, some store to forget With growing storage capacities and sinking storage prices, the paradigm of keeping everything is prevailing. However, keeping information accessible, useable and useful goes far beyond purely keeping things, especially in the long run, and entails expenses much larger than just the storage costs. This issue especially applies to content in Content Management Systems where we increasingly face the situation of creating, managing and storing (preserving) multimedia content, which we might never access again due to the pure volume of content. To overcome these issues, we envision the concept of flexible managed forgetting for information that progressively ceases in importance and finally becomes obsolete as well as for redundant information. We will extend TYPO3 with preservation and forgetting. The forgetting will also reduce the user’s cognitive burden for past activities and information in TYPO3 but still allows access if needed. The same as our brain will retrieve details of our past when remembering and getting associations, the approach will provide such means. Within the Seventh Framework Programme for Research (FP7) of the European Union the "ForgetIT" project strives to build a solution for the mentioned problems. The project has a scope of 3 years and TYPO3 has been selected as CMS to build upon as it is Open Source Software and has an open and active community. An overview of the project can be found on the projects website (of course made with TYPO3): http://www.forgetit-project.eu/

Transcript of ForgetIT Project TYPO3Camp Milano 2014

Page 2: ForgetIT Project TYPO3Camp Milano 2014

Some store to remember, some store to forget

Page 3: ForgetIT Project TYPO3Camp Milano 2014

Olivier DobberkauCEO of dkd Internet Service GmbHFrankfurt, Germany

About me

Page 4: ForgetIT Project TYPO3Camp Milano 2014
Page 5: ForgetIT Project TYPO3Camp Milano 2014

What this is all about

The problem

Page 6: ForgetIT Project TYPO3Camp Milano 2014

Storage capacity is ever increasingPrices for storage are falling

Page 7: ForgetIT Project TYPO3Camp Milano 2014

How large is large?

Page 8: ForgetIT Project TYPO3Camp Milano 2014

Size references

A simple text: an average Wikipedia article ≈ 3.78 kB (no markup)

Lots of text: complete Wikipedia ≈ 13.5 GB (text only, no markup)

An average image (12MP) ≈ 1.3 MB (JPG 90% quality; 24bit/pixel)

An average movie stored on Blu-ray Disc ≈ 25.48 GB

Page 9: ForgetIT Project TYPO3Camp Milano 2014

1955 – The IBM 355

Capacity: 12 MB

Cost: 6,233.33 USD/MB

3,250 90

✘0

✘0.16 kB

Page 10: ForgetIT Project TYPO3Camp Milano 2014

1970 – The IBM 3330

Capacity: 100 MB

Cost: 259.70 USD/MB

3.94 kB27,089 76 0

✘0

Page 11: ForgetIT Project TYPO3Camp Milano 2014

1988 – Seagate ST-238

Capacity: 30 MB

Cost: 9.97 USD/MB

102.71 kB8,126 23 0

✘0

Page 12: ForgetIT Project TYPO3Camp Milano 2014

2000 – Western Digital WD600AB

Capacity: 60 GB

Cost: 0.00275 USD/MB

16,644,063 4 47,261 2 363.64 MB

Page 13: ForgetIT Project TYPO3Camp Milano 2014

2010 – Seagate ST32000542AS

Capacity: 2 TB

Cost: 0.0000450 USD/MB≈ 5 cent/GB

541,798,941 148 1,538,461 76 21.7 GB

Page 14: ForgetIT Project TYPO3Camp Milano 2014

2013 – NSA

Capacity: ∞

Cost: free

∞ ∞ ∞ ∞ it’s free :)

Page 15: ForgetIT Project TYPO3Camp Milano 2014

Let’s store everything, then!Cool!

Page 16: ForgetIT Project TYPO3Camp Milano 2014

Or, maybe not...

There’s a lot more costs

Retrieval

Maintenance

Indexing

Updates

Page 17: ForgetIT Project TYPO3Camp Milano 2014

We need to keep our information

Accessible

Usable

Useful

Page 18: ForgetIT Project TYPO3Camp Milano 2014

The concept of Memory Buoyancy

Let’s start to forget!

Page 19: ForgetIT Project TYPO3Camp Milano 2014

Memory Buoyancy

time

memory

Page 20: ForgetIT Project TYPO3Camp Milano 2014

Memory Buoyancy

Page 21: ForgetIT Project TYPO3Camp Milano 2014

Memory Buoyancy

Page 22: ForgetIT Project TYPO3Camp Milano 2014

A short overview

The ForgetIT Project

Page 23: ForgetIT Project TYPO3Camp Milano 2014

ForgetIT project overview

Consortium of 11 partners

Project start was in February 2013

3 years of research & development

http://www.forgetit-project.eu

The ForgetIT project is funded by the EC within the 7th Framework Programme under the objective "Digital Preservation"(GA 600826).

Page 24: ForgetIT Project TYPO3Camp Milano 2014

Project Partners 1/2

Centre for Research and Technology Hellas

dkd Internet Service GmbH

Deutsches Forschungszentrum für Künstliche Intelligenz GmbH

Eurix Srl

Gottfried Wilhelm Leibniz Universität Hannover

Page 25: ForgetIT Project TYPO3Camp Milano 2014

Project Partners 2/2

IBM Israel - Science and Technology Ltd

Luleå Tekniska Universitet

The Chancellor, Masters and Scholars of the University of Oxford

The University of Edinburgh

The University of Sheffield

Turk Telekomunikasyon AS

Page 26: ForgetIT Project TYPO3Camp Milano 2014

Inspiring people to share!

TYPO3 is the CMS used for the organisational use cases

TYPO3 was chosen because it’s Open Source

We want to raise awareness on the matter of preservation

We will publish our modules under open source licenses

Page 27: ForgetIT Project TYPO3Camp Milano 2014

ForgetIT core concepts

Page 28: ForgetIT Project TYPO3Camp Milano 2014

Managed Forgetting

Page 29: ForgetIT Project TYPO3Camp Milano 2014

Synergetic Preservation

Page 30: ForgetIT Project TYPO3Camp Milano 2014

Contextualised Remembering

Page 31: ForgetIT Project TYPO3Camp Milano 2014

“meta-data is a love note to the future” (Jason Scott)

Do you preserve?

Page 32: ForgetIT Project TYPO3Camp Milano 2014

What is preservation?

“Preservation — The protection of cultural

property through activities that minimize

chemical and physical deterioration and

damage and that prevent loss of informational

content. The primary goal of preservation is to

prolong the existence of cultural property.”Preservation 101

Page 33: ForgetIT Project TYPO3Camp Milano 2014

Problems are caused by

storage medium (disks, tapes, DVD, etc.)

format of the data

availability of the software or operating system

possible encryption

Page 34: ForgetIT Project TYPO3Camp Milano 2014

“The digital dark age is a possible future

situation where it will be difficult or impossible

to read historical electronic documents and

multimedia, because they have been stored in

an obsolete and obscure file format.” WikipediaDigital Dark Age

Page 35: ForgetIT Project TYPO3Camp Milano 2014

Preserving a website is not trivial

What do want you preserve?

Content only?

Content and Design?

How often? Stock prices vs. Company History page

How do you deal with browser differences?

How do you preserve functionality? E.g. insurance fee calculator

Page 36: ForgetIT Project TYPO3Camp Milano 2014

Preservation Value

~ 5,000 €~ 200,000 €

Page 37: ForgetIT Project TYPO3Camp Milano 2014

PrivateOrganisational

The ForgetIT Use Cases

Page 38: ForgetIT Project TYPO3Camp Milano 2014

A personal use case:How to organise an ever growing picture collection

Personal Preservation

Page 39: ForgetIT Project TYPO3Camp Milano 2014

Typical use cases in the daily work with TYPO3-driven company websites.

Organisational Preservation

Page 40: ForgetIT Project TYPO3Camp Milano 2014

Organisational Use Cases

Digital Asset Management

Versioning

Archiving a complete Website

Individual genres and their specific requirements

Example: Press Release

Page 41: ForgetIT Project TYPO3Camp Milano 2014

An organisational use case

Press Release Example

Page 42: ForgetIT Project TYPO3Camp Milano 2014

Elements of a Press Release

text

image

links

documents

Page 43: ForgetIT Project TYPO3Camp Milano 2014

Meta information

Presseinformationen Spielwarenmesse

Global Toy Conference Now on Saturday at the Spielwarenmesse

* Customised programme for retailers: “How to get your customer into the shop”* Conference will take place for the 5th time in Nuremberg on 1 February 2014

All around the world, retailers are wondering how they can still get their customers in their shops in the age of the Internet – because competition for the sale of consumer goods online is growing dramatically. With the topic “How to Get Customers into Your Shop – Successful Pricing, Presentation and Selling” the Global Toy Conference of the Spielwarenmesse demonstrates what parameters business owners can adjust for the future. The conference will take place for the first time in the St Petersburg hall in the NCC East on Saturday. The new earlier date means that more international retailers can take advantage of the knowledge on offer at the toy industry's leading trade fair – from 9 a.m. to 4 p.m. on 1 February 2014.

...

Page 44: ForgetIT Project TYPO3Camp Milano 2014

Translations

German English

Page 45: ForgetIT Project TYPO3Camp Milano 2014

Page 47: ForgetIT Project TYPO3Camp Milano 2014

media

meta info

Page 48: ForgetIT Project TYPO3Camp Milano 2014

media

meta info

Page 49: ForgetIT Project TYPO3Camp Milano 2014

Content Management Systemmedia

meta info

copy

move

refer

Page 50: ForgetIT Project TYPO3Camp Milano 2014

media

meta info

media asset

meta info

media asset

meta info

etc.

meta info

editablecontent

meta info

structure(code, users,

plugins, extensions,

etc.)

meta info

externalDigital Asset (DAM)

internal

Page 51: ForgetIT Project TYPO3Camp Milano 2014

Archive 1

Info Level 2

Info Level 3

media

meta info

media asset

meta info

media asset

meta info

etc.

meta info

editablecontent

meta info

structure(code, users,

plugins, extensions,

etc.

meta info

Info Level 1

(semi)automatic

static

dynamic

Info Level 4, etc.

Output

Archive 2Delete

Page 52: ForgetIT Project TYPO3Camp Milano 2014

Archive 1

Info Level 2

Info Level 3

media

meta info

media asset

meta info

media asset

meta info

etc.

meta info

editablecontent

meta info

structure(code, users,

plugins, extensions,

etc.

meta info

Info Level 1

(semi)automatic

static

dynamic

Info Level 4, etc.

Output

Archive 2Delete

Page 53: ForgetIT Project TYPO3Camp Milano 2014

Archive 1 Archive 2Delete

L2

L1

L3

L4

L2

L1

L3

L4

T-CM (Todays Content Management) F-CM (Future Content Management)

Retrieve Service

Page 54: ForgetIT Project TYPO3Camp Milano 2014

Information Lifecycle

Collect Create Process Publish Analyse Archive

Page 55: ForgetIT Project TYPO3Camp Milano 2014

Collect

Page 56: ForgetIT Project TYPO3Camp Milano 2014

Create

Page 57: ForgetIT Project TYPO3Camp Milano 2014

Process

Page 58: ForgetIT Project TYPO3Camp Milano 2014

Publish

Page 59: ForgetIT Project TYPO3Camp Milano 2014

Analyse

Page 60: ForgetIT Project TYPO3Camp Milano 2014

Archive

Page 61: ForgetIT Project TYPO3Camp Milano 2014

Information Lifecycle

Collect Create Process Publish Analyse ArchiveProcess

Annotations

Page 62: ForgetIT Project TYPO3Camp Milano 2014

Example Press Release

Annotation (text) Annotation (image)

global toy conference,conference, podium, speaker, lights

Page 63: ForgetIT Project TYPO3Camp Milano 2014

A game about forgetting.

Do you remember?

Page 64: ForgetIT Project TYPO3Camp Milano 2014

or how you can participate

Next steps

Page 65: ForgetIT Project TYPO3Camp Milano 2014

We’d love to see you participate!

Reflect your thoughts with us

Take our short survey: http://tinyurl.com/forgetit-webarchiving

Tell us your use cases

Join the development of TYPO3 features

Page 66: ForgetIT Project TYPO3Camp Milano 2014

Thank you for your attention!

Page 67: ForgetIT Project TYPO3Camp Milano 2014

Sources, Books, Images

References

Page 68: ForgetIT Project TYPO3Camp Milano 2014

References (Sources) 1/2

Size of Wikipedia (as of 2013-10-04): https://en.wikipedia.org/wiki/Wikipedia:Size_comparisons

Average JPG size: http://web.forret.com/tools/megapixel.asp?title=12+Megapixel+camera&width=4000&height=3000

Average movie size: http://answers.yahoo.com/question/index?qid=20110807095141AABGQm8

Storage Prices: http://www.jcmit.com/diskprice.htm

Page 69: ForgetIT Project TYPO3Camp Milano 2014

References (Sources) 2/2

Forget IT Website: http://www.forgetit-project.eu

Preservation: http://unfacilitated.preservation101.org/session1/expl_whatis-definitions.asp

Digital Dark Age: https://en.wikipedia.org/wiki/Digital_dark_age

Page 70: ForgetIT Project TYPO3Camp Milano 2014

References (Books)

Delete: The Virtue of Forgetting in the Digital Age, Viktor Mayer-Schönberger

Page 71: ForgetIT Project TYPO3Camp Milano 2014

References (Images) 1/8

“About me”: all images by Søren Schaffstein

“ForgetIT Team” by Søren Schaffstein

“The Problem/Knot”: http://www.istockphoto.com/stock-photo-8933647-rope-with-knot.php

“1 Dollar”: http://www.istockphoto.com/stock-photo-17830696-fan-dollars-isolated-on-white.php

Starbucks Cups: http://5feetonagoodday.files.wordpress.com/2012/01/starbucks-coffee-cups-sizes-tall-grande-venti-trenta.jpg

Page 72: ForgetIT Project TYPO3Camp Milano 2014

References (Images) 2/8

IBM 355: http://www-03.ibm.com/ibm/history/exhibits/storage/storage_355.html

IBM 3330: http://www-03.ibm.com/ibm/history/exhibits/storage/storage_3330.html

Seagate ST-238: http://www.redlop.de/bilder/produkte/gross/Seagate-WREN-5-ST4702N-702-MB-.png

Western Digital WD600AB: http://www.junek.de/thomas/bilder/WD600AB.jpg

Page 73: ForgetIT Project TYPO3Camp Milano 2014

References (Images) 3/8

Seagate ST32000542AS: http://bilder.afterbuy.de/images/ZZNLZ/seagatesata.jpg

Finger “Forget”: http://www.istockphoto.com/stock-photo-7252836-string-finger-reminder-on-white.php

Memory Buoyancy: http://www.istockphoto.com/stock-photo-16244755-fishing-hook-underwater.php?st=0320b45

Fish: http://www.istockphoto.com/stock-photo-14623368-gold-fish-and-piranha.php

Page 74: ForgetIT Project TYPO3Camp Milano 2014

References (Images) 4/8

Game pieces by Søren Schaffstein

Managed Forgetting: http://www.istockphoto.com/stock-photo-3533508-colorful-memos.php?st=0320b45

Synergetic Preservation: http://www.istockphoto.com/stock-photo-13301920-goldfish-jump.php

Contextualised Remembering: http://www.istockphoto.com/stock-photo-14370511-shoebox-of-old-photos-too.php

Page 75: ForgetIT Project TYPO3Camp Milano 2014

References (Images) 5/8

Cans: http://www.istockphoto.com/stock-photo-16948268-three-metallic-goods-can-with-key.php

5 1/4” Disk: https://secure.flickr.com/photos/twicepix/4330813840/sizes/z/in/photostream/

5 1/4” Disk Drawing: https://secure.flickr.com/photos/flattop341/2094771560/sizes/z/in/photostream/

Ami Pro: http://www.os2museum.com/wp/?attachment_id=99

Digital Dark Age by Søren Schaffstein

Page 76: ForgetIT Project TYPO3Camp Milano 2014

References (Images) 6/8

Gauges: http://www.istockphoto.com/stock-photo-9059088-old-gauges.php

Golf Car: http://www.netzeitung.de/default/337276.html#

Golf Car Papers: http://www.motor-talk.de/news/das-heilige-blech-wieder-unterm-hammer-t4421282.html

Create: http://hdwallsize.com/wp-content/uploads/2013/04/Abstract-Art-Wallpaper-Dekstop.jpg

Page 77: ForgetIT Project TYPO3Camp Milano 2014

References (Images) 7/8

Process by Søren Schaffstein

Publish: http://www.istockphoto.com/stock-photo-25712828-british-dog-reading.php?st=e5bf164

Analyse: http://www.istockphoto.com/stock-photo-28297160-laboratory-experimental-testing.php?st=239c76e

Archive: http://www.istockphoto.com/stock-photo-18865341-old-wooden-card-catalogue-with-one-opened-drawer.php

Page 78: ForgetIT Project TYPO3Camp Milano 2014

References (Images) 8/8

Shoes: http://www.istockphoto.com/stock-photo-2457744-what-s-your-walking-style.php?st=e12d3d2

Questions: http://www.istockphoto.com/stock-photo-17686236-decision-making.php