Content Migrations: Getting from A to B

73
Content Migrations: A Field Guide

description

Deck fro

Transcript of Content Migrations: Getting from A to B

Page 1: Content Migrations: Getting from A to B

Content Migrations:A Field Guide

Page 2: Content Migrations: Getting from A to B

• Author of Website Migration Handbook v2

• First large migration: World Bank (1,000+ subsites)

• Consults to large and medium organizations

• David guides complex website transformations.

Page 3: Content Migrations: Getting from A to B

Deane Barker

• Working in content management since 1996

• Founding partner in Blend Interactive• Board member of Content

Management Professionals

Page 4: Content Migrations: Getting from A to B

Planning vs. Technical

• The planning process encompasses the entire scope of your migration effort

• The technical process is just one very critical part of this process

Page 5: Content Migrations: Getting from A to B

Agenda

• David will discuss the larger planning process– Break

• Deane will follow with a discussion about the specific technical challenges– End at 4:00 p.m.– Deane and David will be available for

discussion until 5:00 p.m.

Page 6: Content Migrations: Getting from A to B

Ask Questions

Page 7: Content Migrations: Getting from A to B

Getting from A to B

Page 8: Content Migrations: Getting from A to B

It’s painful.

[The End]

Page 9: Content Migrations: Getting from A to B

Requirements for Transfer

• You know–…what is being moved–…how it has to change on the way over–…how it fits back together on the other

side

Page 10: Content Migrations: Getting from A to B

Agenda

• Original Content vs. Derived Content• Content Geography• The Four Tasks of Content Transfer• Automated vs. Manual Import• The Automated Import Process• QA Automation

Page 11: Content Migrations: Getting from A to B

Original Content vs. Derived Content

Page 12: Content Migrations: Getting from A to B

Some HTML has to be moved.

Some HTML will be generated by your new system as content is imported.

Page 13: Content Migrations: Getting from A to B
Page 14: Content Migrations: Getting from A to B

Index Pages vs. Content Pages

Page 15: Content Migrations: Getting from A to B
Page 16: Content Migrations: Getting from A to B
Page 17: Content Migrations: Getting from A to B

Many pages on your new site are not rendered via content, but via

development.

Page 18: Content Migrations: Getting from A to B

Before you begin transfer, make sure you know which pages are derived and you have made plans to generate those in the new system.

Page 19: Content Migrations: Getting from A to B

Content Geography

Page 20: Content Migrations: Getting from A to B

Content has different levels of “geography”

Some content is very specifically placed, while other content is automatically organized.

Page 21: Content Migrations: Getting from A to B

Home

Products

Product A

Product B

About

History

Page 22: Content Migrations: Getting from A to B

Press Release

Page 23: Content Migrations: Getting from A to B

Highly-geographical content is much harder to migrate.

You have to migrate both the content and the placement.

Page 24: Content Migrations: Getting from A to B

Pop Quiz:Why are blogs so easy to migrate?No geography.Lots of derived index pages.

Page 25: Content Migrations: Getting from A to B

Hierarchical content requires you to determine and transfer structure

Page 26: Content Migrations: Getting from A to B

Home

Products

Product A

Product B

About

History

Stub Mapping

Existing Home

Products

Product A

Product B

About

History

New

Page 27: Content Migrations: Getting from A to B

The Path to Stub Mapping

• “We need to codify the new website structure…”

• “…let’s just store this in the new CMS…”

• “…and let’s store the old URL, just for reference…”

• “…and…can we just use that old URL to transfer the content?”

Page 28: Content Migrations: Getting from A to B

The Four Tasks of Content Transfer

Page 29: Content Migrations: Getting from A to B

The Four Tasks

• Extract• Transform• Import• Normalize

• We can generalize about the first two– Extract and transform are platform-

agnostic

Page 30: Content Migrations: Getting from A to B

#1: Extract

• Get content out of the existing system

• Break content into its necessary components

• Store in a neutral format– XML, usually

Page 31: Content Migrations: Getting from A to B
Page 32: Content Migrations: Getting from A to B

Migrating out of a CMS is a lot easier than the alternative.

CMS enforces at least some consistency.

Page 33: Content Migrations: Getting from A to B

Are you going to extract from the repository level or the publication

level?

Page 34: Content Migrations: Getting from A to B

Repository vs. Publication Extraction

RepositoryHTML

Processing

Page 35: Content Migrations: Getting from A to B

You may need to make changes to your old site to make

extraction easier or more complete.

Page 36: Content Migrations: Getting from A to B

You do not have to wait for anything to do this.

You can start extraction on the very day you decide to migrate your website.

Page 37: Content Migrations: Getting from A to B

#2: Transform

• Modify extracted content• Fix legacy problems with the content• Adapt content to fit the new

architecture• Neutralize idiosyncrasies in the

content

Page 38: Content Migrations: Getting from A to B

Content Transformation

Page 39: Content Migrations: Getting from A to B

Common Transformations

Page 40: Content Migrations: Getting from A to B

Common Transformations

Page 41: Content Migrations: Getting from A to B

#3: Import

• Move post-transformed content from a neutral format into the new system

• This is different for every CMS• This capability should be part of the

evaluation process

Page 42: Content Migrations: Getting from A to B

#4: Normalize

• Fix problems that are only “fixable” once content is in its new home

• Ex:– Relationship reconstruction– URL resolution– Navigation reconstruction

Page 43: Content Migrations: Getting from A to B
Page 44: Content Migrations: Getting from A to B

Content relationships can introduce chicken-egg

problems.

Page 45: Content Migrations: Getting from A to B

How will URLs change on the new platform?

If you content is interlinked how are you going to keep all those links valid?

Page 46: Content Migrations: Getting from A to B

Embedded URLs

Page 47: Content Migrations: Getting from A to B

Embedded URL Resolution

• If you have embedded URLs, they are now broken.

• How do you “re-connect” these URLs to the correct content?

• Usually performed as some kind of batch job.– You rarely get 100% accuracy.– Prepare to catch the remainder in QA.

Page 48: Content Migrations: Getting from A to B

Always store the old URL for a migrated page of content.

Page 49: Content Migrations: Getting from A to B

How it Works

• Iterate over every piece of content…• …then iterate over every single

property looking for anything that might contain links…

• …then iterate over all those links looking for the new content holding that old link…

• …then correct the link.

Page 50: Content Migrations: Getting from A to B

Once migrated, use the old URL to do a lookup in your 404 handler.

Page 51: Content Migrations: Getting from A to B

If you can preserve binary file URLs, do so. Your new CMS will likely make

this easier.

Page 52: Content Migrations: Getting from A to B

Depending on volume, menu reconstruction might be a manual process.

Page 53: Content Migrations: Getting from A to B

Automated vs. Manual Import

Page 54: Content Migrations: Getting from A to B

What is the actual mechanism of movement?

Copy-and-paste?Automated?

Page 55: Content Migrations: Getting from A to B

When Copy-and-Paste Works

• When you don’t have a lot of content• When you have access to cheap

labor• When your content is highly

geographic• When you cannot automate

transformation• When you have enough resources for

sufficient QA

Page 56: Content Migrations: Getting from A to B

When Automated Migration Works

• When you have large volumes of content

• When your content is not highly-geographic

• When you have sufficient technology and/or development resources

Page 57: Content Migrations: Getting from A to B

You don’t have to use the same method for your entire project.

Page 58: Content Migrations: Getting from A to B

The Automated Migration Process

Page 59: Content Migrations: Getting from A to B

Automated Migration Tools

• Great answer to the Transfer phase• Less of an answer to everything else• They still have to be configured and

tested

Page 60: Content Migrations: Getting from A to B

The Promise:

You will be able to develop a script that will reduce your migration to a button-click.

Page 61: Content Migrations: Getting from A to B

The Promise:

You will run this script, need to do nothing else, then launch your new website.

Page 62: Content Migrations: Getting from A to B

The Value-Add

• A scripting environment• Tested tools for:– Extraction– Transformation– Import (maybe…)

• Professional services

$$$$

Page 63: Content Migrations: Getting from A to B

Automated Migration Process

• Develop automated migration script– Configure– Execute– Evaluate– (Repeat)

• Accept a cycle “as good as is reasonable”• Perform necessary manual editing• Re-do changes during content freeze• Launch

Page 64: Content Migrations: Getting from A to B

Automated migrations are highly iterative.

Configure-Execute-Evaluate

Page 65: Content Migrations: Getting from A to B

Automated Migration Cycle

Configure Execute Evaluat

eManual Editing

Iterate again…

Launch

Weeks? Months? Days? Minutes?

“As good as is reasonable…”

Page 66: Content Migrations: Getting from A to B

Once you accept the output of a migration cycle, you are in a content

freeze

Page 67: Content Migrations: Getting from A to B

Handling a Content Freeze

• Don’t change any content on the existing site

• Track changes so they can be re-changed on the new site

Page 68: Content Migrations: Getting from A to B

QA Automations

Page 69: Content Migrations: Getting from A to B

Ideally, track the QA process inside the CMS

itself.

Page 70: Content Migrations: Getting from A to B
Page 71: Content Migrations: Getting from A to B
Page 72: Content Migrations: Getting from A to B
Page 73: Content Migrations: Getting from A to B

• WEBhttp://gadgetopia.com

• TWITTER@gadgetopia

[email protected]