Towards Concise Preservation by Managed Forgetting: Research Issues and Case Study
-
Upload
nattiya-kanhabua -
Category
Presentations & Public Speaking
-
view
375 -
download
0
Transcript of Towards Concise Preservation by Managed Forgetting: Research Issues and Case Study
ForgetIT Project, GA 600826
Towards Concise Preservationby Managed Forgetting
iPRES-2013 ConferenceLisbon, Portugal5 September 2013
Nattiya Kanhabua, Claudia Niederée, and Wolf SiberskiL3S Research Center / Leibniz Universität Hannover
Hannover, Germany
2
Partners in the ForgetIT project
An interdisciplinary team of experts in:– Preservation, information management, information extraction– Multimedia analysis, storage computing, cognitive psychology
3
Outline
Motivation & VisionApproaches: First IdeasIntegration FrameworkPilot Applications: Overview
ForgetIT Project, GA 600826
4
Inspiration
A Computer that forgets ?Intentionally ??
And in context of preservation???
5
Inspiration
However we are facing– dramatic increase in content creation (e.g. digital photography)– information overload and changing professional + private lives– increasing storage costs for long-term storage (>10 years)– increasing use of mobile devices with restricted capacity– inadvertent forgetting in lack of systematic preservation
A Computer that forgets ?Intentionally ??
And in context of preservation???
6
Inspiration
However we are facing– dramatic increase in content creation (e.g. digital photography)– information overload and changing professional + private lives– increasing storage costs for long-term storage (>10 years)– increasing use of mobile devices with restricted capacity– inadvertent forgetting in lack of systematic preservation
And: Forgetting plays a crucial role for human remembering and life in general (focus, stress on important information, forgetting of details)
A Computer that forgets ?Intentionally ??
And in context of preservation???
7
Inspiration
However we are facing– dramatic increase in content creation (e.g. digital photography)– information overload and changing professional + private lives– increasing storage costs for long-term storage (>10 years)– increasing use of mobile devices with restricted capacity– inadvertent forgetting in lack of systematic preservation
And: Forgetting plays a crucial role for human remembering and life in general (focus, stress on important information, forgetting of details)
A Computer that forgets ?Intentionally ??
And in context of preservation???
So: “Shouldn’t there be something like forgetting in digital memories as well?
ForgetIT
8
Complementing Human Memory
V. Mayer-Schönberger. Delete - The Virtue of Forgetting in the Digital Age. Morgan Kaufmann Publishers, 2009.
9
Motivation
major progress in preservation technology
maturing Information extractiontechnology
storage as service (e.g. clouds)
Opportunities increasing amount of digital contenthandled over decades
more or less systematic backup strategies used
non-paper practices for long-term perspective required
Needs
10
Motivation
major progress in preservation technology
maturing Information extractiontechnology
storage as service (e.g. clouds)
Opportunities increasing amount of digital contenthandled over decades
more or less systematic backup strategies used
non-paper practices for long-term perspective required
Needs
large gap for adoption high-up front cost no established
practices lack of understanding
of benefit reluctance to invest
Major Obstacles
11
Vision: Building a Bridge
major progress in preservation technology
maturing Information extractiontechnology
storage as service (e.g. clouds)
Opportunities increasing amount of
digital contenthandled over decades
more or less systematic backup strategies used
non-paper practices for long-term perspective required
Needs
large gap for adoption high-up front cost no established
practices lack of understanding
of benefit reluctance to invest
Major Obstacles
12
Vision: Building a Bridge
major progress in preservation technology
maturing Information extractiontechnology
storage as service (e.g. clouds)
Opportunities increasing amount of
digital contenthandled over decades
more or less systematic backup strategies used
non-paper practices for long-term perspective required
Needs
Enabling smooth transition to preservation
large gap for adoption high-up front cost no established
practices lack of understanding
of benefit reluctance to invest
Major Obstacles
13
Vision: Building a Bridge
major progress in preservation technology
maturing Information extractiontechnology
storage as service (e.g. clouds)
Opportunities increasing amount of
digital contenthandled over decades
more or less systematic backup strategies used
non-paper practices for long-term perspective required
Needs
Enabling smooth transition to preservation
Creating immediate benefit + reducing effort
large gap for adoption high-up front cost no established
practices lack of understanding
of benefit reluctance to invest
Major Obstacles
14
Vision: Building a Bridge
major progress in preservation technology
maturing Information extractiontechnology
storage as service (e.g. clouds)
Opportunities increasing amount of
digital contenthandled over decades
more or less systematic backup strategies used
non-paper practices for long-term perspective required
Needs
ForgetIT
Enabling smooth transition to preservation
Creating immediate benefit + reducing effort
Opening alternatives to “keep it all” and “forgetting by accident”
large gap for adoption high-up front cost no established
practices lack of understanding
of benefit reluctance to invest
Major Obstacles
15
Vision: Building a Bridge
major progress in preservation technology
maturing Information extractiontechnology
storage as service (e.g. clouds)
Opportunities increasing amount of
digital contenthandled over decades
more or less systematic backup strategies used
non-paper practices for long-term perspective required
Needs
ForgetIT
Enabling smooth transition to preservation
Creating immediate benefit + reducing effort
Opening alternatives to “keep it all” and “forgetting by accident”
Easing interpretation in the long run
large gap for adoption high-up front cost no established
practices lack of understanding
of benefit reluctance to invest
Major Obstacles
16
Vision: Building a Bridge
major progress in preservation technology
maturing Information extractiontechnology
storage as service (e.g. clouds)
Opportunities increasing amount of
digital contenthandled over decades
more or less systematic backup strategies used
non-paper practices for long-term perspective required
Needs
ForgetIT
Enabling smooth transition to preservation
Creating immediate benefit + reducing effort
Opening alternatives to “keep it all” and “forgetting by accident”
Easing interpretation in the long run
taking inspiration from and complementing human memory
large gap for adoption high-up front cost no established
practices lack of understanding
of benefit reluctance to invest
Major Obstacles
17
Building the Bridge
Managed Forgetting
Synergetic Preservation
Contextualized
Remembering
18
Building the Bridge
Managed Forgetting
Synergetic Preservation
Contextualized
Remembering
• as opposed to the current “forgetting by accident”
• inspired by human forgetting
19
Building the Bridge
Managed Forgetting
Synergetic Preservation
Contextualized
Remembering
• bringing back information into active use in a meaningful way
• as opposed to the current “forgetting by accident”
• inspired by human forgetting
20
Building the Bridge
Managed Forgetting
Synergetic Preservation
Contextualized
Remembering
• bringing back information into active use in a meaningful way
• as opposed to the current “forgetting by accident”
• inspired by human forgetting
• couples information management and preservation management
21
Simple Example: Holidays
+20 Years+5-10 Years+1 Yearsafter trip +1 month
• Trip to Paris with Friends
• Thousands of picures
22
• High awareness of trip details
• Showing of pictures
• Sorting out redundant pictures
• Sub-grouping and sorting
Simple Example: Holidays
+20 Years+5-10 Years+1 Yearsafter trip +1 month
• Trip to Paris with Friends
• Thousands of picures
23
• High awareness of trip details
• Showing of pictures
• Sorting out redundant pictures
• Sub-grouping and sorting
Simple Example: Holidays
+20 Years+5-10 Years+1 Yearsafter trip +1 month
• Trip to Paris with Friends
• Thousands of picures
• Life goes on• Pictures go
out of focus• Creation of a
small diverse subset for showing occasionally
24
• High awareness of trip details
• Showing of pictures
• Sorting out redundant pictures
• Sub-grouping and sorting
Simple Example: Holidays
+20 Years+5-10 Years+1 Yearsafter trip +1 month
• Trip to Paris with Friends
• Thousands of picures
• Life goes on• Pictures go
out of focus• Creation of a
small diverse subset for showing occasionally
• Creation of summary page
• Addition of context info
• Further reduction of redundancy
• Rest of pictures into archiveFebruary 2015ParisTeam: Me, Mary Christine, Tom
25
• High awareness of trip details
• Showing of pictures
• Sorting out redundant pictures
• Sub-grouping and sorting
Simple Example: Holidays
+20 Years+5-10 Years+1 Yearsafter trip +1 month
• Trip to Paris with Friends
• Thousands of picures
• Life goes on• Pictures go
out of focus• Creation of a
small diverse subset for showing occasionally
• Creation of summary page
• Addition of context info
• Further reduction of redundancy
• Rest of pictures into archiveFebruary 2015ParisTeam: Me, Mary Christine, Tom
• Changes in life (e.g. marriage)
• Addition/update of context information
• Dealing with preservation issues
girlfriend
26
• High awareness of trip details
• Showing of pictures
• Sorting out redundant pictures
• Sub-grouping and sorting
Simple Example: Holidays
+20 Years+5-10 Years+1 Yearsafter trip +1 month
• Trip to Paris with Friends
• Thousands of picures
• Life goes on• Pictures go
out of focus• Creation of a
small diverse subset for showing occasionally
• Creation of summary page
• Addition of context info
• Further reduction of redundancy
• Rest of pictures into archiveFebruary 2015ParisTeam: Me, Mary Christine, Tom
• Changes in life (e.g. marriage)
• Addition/update of context information
• Dealing with preservation issues
girlfriendGirlfriendwife
27
• High awareness of trip details
• Showing of pictures
• Sorting out redundant pictures
• Sub-grouping and sorting
Simple Example: Holidays
+20 Years+5-10 Years+1 Yearsafter trip +1 month
• Trip to Paris with Friends
• Thousands of picures
• Life goes on• Pictures go
out of focus• Creation of a
small diverse subset for showing occasionally
• Creation of summary page
• Addition of context info
• Further reduction of redundancy
• Rest of pictures into archiveFebruary 2015ParisTeam: Me, Mary Christine, Tom
• Changes in life (e.g. marriage)
• Addition/update of context information
• Dealing with preservation issues
girlfriendGirlfriendwife
• Revisiting of Photo of trip photos
• Re-integration into overall photo collection (link into context)
Managed Forgetting
28
Automatic Deletion?
Managed Forgetting
inspired by central role of human forgetting
Aim: – help in identifying and focus on relevant information– supporting preservation content selection
will replace inadvertent forgetting
managed forgetting ≠ automatic deletion
instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy
29
Managed Forgetting
inspired by central role of human forgetting
Aim: – help in identifying and focus on relevant information– supporting preservation content selection
will replace inadvertent forgetting
managed forgetting ≠ automatic deletion
instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy
Based on:
30
Managed Forgetting
inspired by central role of human forgetting
Aim: – help in identifying and focus on relevant information– supporting preservation content selection
will replace inadvertent forgetting
managed forgetting ≠ automatic deletion
instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy
Based on:
careful information value assessment
31
decreasing memory buoyancy
Managed Forgetting
inspired by central role of human forgetting
Aim: – help in identifying and focus on relevant information– supporting preservation content selection
will replace inadvertent forgetting
managed forgetting ≠ automatic deletion
instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy
Based on:
careful information value assessment
forgetting strategies via policies
32
decreasing memory buoyancy
Managed Forgetting
inspired by central role of human forgetting
Aim: – help in identifying and focus on relevant information– supporting preservation content selection
will replace inadvertent forgetting
managed forgetting ≠ automatic deletion
instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy
Based on:
careful information value assessment
forgetting strategies via policies
forgetting options to integrate final manual checking before deletion
33
decreasing memory buoyancy
Managed Forgetting
inspired by central role of human forgetting
Aim: – help in identifying and focus on relevant information– supporting preservation content selection
will replace inadvertent forgetting
managed forgetting ≠ automatic deletion
instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy
Based on:
careful information value assessment
forgetting strategies via policies
forgetting options to integrate final manual checking before deletion
combination with multi-tier storage solution possible
34
decreasing memory buoyancy
Use of tiers
35
Contextualized Remembering
Aim: – bringing back information into active use in a meaningful
way even if a lot of time has passed– aiming for semantic level of preservation
Based on:
taking into account relevant parts of context when moving to archiveincreasing contextualization of preserved contentconsidering context evolution over time (evolution-aware contextualization)
Evolution-aware Contextualization & Re-contextualization
36
Context of Interpretation
t
C
Archival InformationSystem
Information System
D
Evolution-aware Contextualization & Re-contextualization
37
Context of Interpretation
t
C C‘
Archival InformationSystem
Information System
Human ForgettingChange in focusStructural changes
D
Evolution-aware Contextualization & Re-contextualization
38
Context of Interpretation
t
C C‘
Archival InformationSystem
Information System
Human ForgettingChange in focusStructural changes
Contextualization
DD
Evolution-aware Contextualization & Re-contextualization
39
Context of Interpretation
t
C C‘
Archival InformationSystem
Pres(D‘)
Pres(C‘)
Information System
Human ForgettingChange in focusStructural changes
Contextualization
D
Context-awarePreservation
DD
Evolution-aware Contextualization & Re-contextualization
40
Context of Interpretation
t
C C‘
Archival InformationSystem
Pres(D‘)
Pres(C‘)
Information System
Human ForgettingChange in focusStructural changes
C‘‘
Semantic evolutionStructural evolutionTerminology evolution
Contextualization
D
Context-awarePreservation
DD
Evolution-aware Contextualization & Re-contextualization
41
Context of Interpretation
t
C C‘
Archival InformationSystem
Pres(D‘)
Pres(C‘)
Information System
Human ForgettingChange in focusStructural changes
C‘‘
Semantic evolutionStructural evolutionTerminology evolution
Contextualization
D
Context-awarePreservation
Semantic Evolution Detection
DD
Evolution-aware Contextualization & Re-contextualization
42
Context of Interpretation
t
C C‘
Archival InformationSystem
Pres(D‘)
Pres(C‘)
Information System
Human ForgettingChange in focusStructural changes
C‘‘
Evolution-awareContextualization
Pres(D‘)
Pres(C‘‘)
Semantic evolutionStructural evolutionTerminology evolution
Contextualization
D
Context-awarePreservation
Semantic Evolution Detection
DD
Evolution-aware Contextualization & Re-contextualization
43
Context of Interpretation
t
C C‘
Archival InformationSystem
Pres(D‘)
Pres(C‘)
Information System
Human ForgettingChange in focusStructural changes
C‘‘
Evolution-awareContextualization
Re-contextualization
Pres(D‘)
Pres(C‘‘)
Semantic evolutionStructural evolutionTerminology evolution
Pres(D‘)
Pres(C‘‘)
D
Contextualization
C‘‘‘
D
Context-awarePreservation
Semantic Evolution Detection
DD
ForgetIT Project, GA600826 - Kickoff Meeting, Hannover, February 2013
44
Synergetic Preservation
smooth and step-wise transition between active information use and preservation enables rich information flow in both directionssupports more informed preservation decisionseases preservation adoption
Data Management
Descr. Info.
Archival Storage
AIPs
Access
Ingest
Administration
Preservation Planning
Preserve-or-Forget Framework
Synergetic Preservation
Extraction & Contextualization
Re-Contextualization
Content Management
Access
Authoring
Administration
Adapter Layer
Managed Forgetting
Information Assessment Condensation
Arc
hiv
al In
form
atio
n S
yste
m
Info
rmat
ion
Man
agem
ent
Sys
tem
Integration Framework
45
Information Management System
• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)
Forgettor
Assessorcalculates:+ Memory Buoyancy+ Perservation Value
Analyzer1. Classification of resources
w.r.t. startegies2. Triggers forgetting actions
Strategies
ValuesStatistics
Resources Meta-Info
Resources Values + Decisions
Integration Framework
46
Information Management System
• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)
Forgettor
Assessorcalculates:+ Memory Buoyancy+ Perservation Value
Analyzer1. Classification of resources
w.r.t. startegies2. Triggers forgetting actions
Strategies
ValuesStatistics
Resources Meta-Info
Resources Values + Decisions
Input: strategy meta-infomation (content, context,
neigbours )previous values
Integration Framework
47
Information Management System
• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)
Forgettor
Assessorcalculates:+ Memory Buoyancy+ Perservation Value
Analyzer1. Classification of resources
w.r.t. startegies2. Triggers forgetting actions
Strategies
ValuesStatistics
Forgetting strategies for
different types of resources
Resources Meta-Info
Resources Values + Decisions
Input: strategy meta-infomation (content, context,
neigbours )previous values
Integration Framework
48
Information Management System
• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)
Forgettor
Assessorcalculates:+ Memory Buoyancy+ Perservation Value
Analyzer1. Classification of resources
w.r.t. startegies2. Triggers forgetting actions
Strategies
ValuesStatistics
Forgetting strategies for
different types of resources
Resources Meta-Info
Resources Values + Decisions
Input: strategy meta-infomation (content, context,
neigbours )previous values
Processing Resources based on stategies and
information values
Integration Framework
49
Information Management System
• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)
Forgettor
Assessorcalculates:+ Memory Buoyancy+ Perservation Value
Analyzer1. Classification of resources
w.r.t. startegies2. Triggers forgetting actions
Strategies
ValuesStatistics
Forgetting strategies for
different types of resources
Resources Meta-Info
Resources Values + Decisions
Input: strategy meta-infomation (content, context,
neigbours )previous values
Processing Resources based on stategies and
information values
Storing the new values and sending them back to IMS
Integration Framework
50
Information Management System
• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)
Forgettor
Assessorcalculates:+ Memory Buoyancy+ Perservation Value
Analyzer1. Classification of resources
w.r.t. startegies2. Triggers forgetting actions
Strategies
ValuesStatistics
Forgetting strategies for
different types of resources
Resources Meta-Info
Resources Values + Decisions
Input: strategy meta-infomation (content, context,
neigbours )previous values
Processing Resources based on stategies and
information values
Storing the new values and sending them back to IMSArchives
Acce
ss
Stor
e
Store &access data
51
Application: Organizational Preservation
Starting point: existing and popular CMS (TYPO3)Sophisticated workflows for content creation and publicationBut: Separation of publication and preservation/archival Access to archived content is difficult and costly obsolete and even outdated information stays online
ForgetIT approach:Preservation as integral part (binary model gradual managed forgetting)
Bolder attitude towards removing content possibleAutomated support of cleaning up processesSupport of many stages of archiving, e.g. offline but still in index, aggregates online/ content in archive, only aggregates kept, etc.
Dissemination/Exploitation: Involvement of TYPO3 community, TYPO3 with preservation extension as open source project to TYPO3 community
52
Application: Personal Preservation
Starting point:tremendous growth of information in personal sphereDiversity and fast evolution of devices, platforms and formatsKeeping info sustainably available: Only ad hoc solutions for mid-term, long-term solutions
ForgetIT approach: Preservation solution for personal information spaceBased on concept of Semantic DesktopConsideration of social web content, multimedia content, other types of personal content, knowledge structuresAdditional short/mid-term benefit: de-cluttering information space by managed forgettingConsideration of multi-level infrastructures (e.g. mobile, PC, cloud)
Dissemination/Exploitation: Personal Preservation as a service (e.g. to customers of a telco company)
53
Variables & Dimensions
Personal Organization
Scenarios • Personal events (years at school, holidays, social events, graduations, marriage, etc)
• Public events
• Work-related events (project starts/closing, business trips, new products, etc.)
Data Type • Local: photos, mobile contacts, sms• Online: user-generated content• Feature:
1. documents2. user behaviors3. social context
• Local: textual documents• Online: web pages• Feature:
1. documents2. user roles3. policies
Interaction(user vs. system)
• search/retrieve, re-find• organize• explore• preserve
Action summarization, aggregation, delete
54
Information Value Assessment
Memory Buoyancy Preservation Value
Short-term relevance/interestsE.g., current meeting documents
Long-term interestsE.g. important life events
Subjective metrics+ usage logs (views, edits, modifies)+ social context, influences
Objective metrics+ diversity, coverage, quality
55
Thank you
http://ForgetIT-Project.eu/
Enter EventForgetIT Project, GA 600826