AWTW - Automated Web Translation Workflow

31
Automated Web Translation Workflow (AWTW) Automated Web Translation Workflow (AWTW) Fosdem 2009 February 8, 2009 - Brussels

description

This presentation has been given at FOSDEM 2009, Brussels, on 8 February, by Stany van Gelder, Marketing Director of Connexion Corporate Communications. It explains how you can optimise the translation process of websites built with Drupal, by automating all the technical part of the job, while still calling upon professional human translators.

Transcript of AWTW - Automated Web Translation Workflow

Page 1: AWTW - Automated Web Translation Workflow

Automated Web Translation Workflow (AWTW)Automated Web Translation Workflow (AWTW)

Fosdem 2009

February 8, 2009 - Brussels

Page 2: AWTW - Automated Web Translation Workflow

Automated Website Translation:: Agenda

• Who is Connexion?• Machine translation > < CAT tools• Why not English only?• Traditional > < Automated Website

translation workflow• Which tools?• Demo• Technical approach• Conclusion• Q&A

Page 3: AWTW - Automated Web Translation Workflow
Page 4: AWTW - Automated Web Translation Workflow
Page 5: AWTW - Automated Web Translation Workflow

AWTW:: Machine Translation > < CAT tools

• Machine Translation• Substitution of words in one natural language

for words in another • Rough translation of web page or message• Google is the best

(NIST 2006 – National Institute of Standards and Technology: Machine Translation Evaluation)

Drupal google translate moduleWhy maintain a multi-lingual site while Google Translate can do that for you?

• Try out Babel Fish, Systran and Worldlingo• Sample

Page 6: AWTW - Automated Web Translation Workflow

AWTW:: Machine Translation:: sample

Page 7: AWTW - Automated Web Translation Workflow

AWTW:: Machine Translation:: sample

Page 8: AWTW - Automated Web Translation Workflow

AWTW :: Machine Translation:: sample

Client 2007 premier

Livre maintenant votre table!

Plus de 60 ateliers à commande manuelle…

Customer First 2007

Book now your table!

More than 60 hands-on workshops…

Page 9: AWTW - Automated Web Translation Workflow

AWTW :: Machine Translation > < CAT tools

• Computer Assisted Translation tools• Human translator remains key• Faster and more reliable with CAT tools• Translation memories (TM’s)

text segments in a source language validated translations in one or more target languages

• Trados/SDL, DéjàVu, WordFast,…

Page 10: AWTW - Automated Web Translation Workflow

AWTW :: Current Trends

• Globalization >> Localization

• Do you speak (web) English?

• Why not communicating in the language of your target group ?• Time-to-publish• Cost

• Drupal 6 roadmap: integration of i18n• Dries also finds i18n key

Page 11: AWTW - Automated Web Translation Workflow

AWTW:: Trends :: the issues

• Translation process = not very efficient.

• Time waste:• manually and repetitively

cutting and pasting, emailing, FTPing and project managing

the same translation tasks over and over

• 2 types of web site translations• Full mirror• Partly localised information

Page 12: AWTW - Automated Web Translation Workflow

AWTW:: Trends :: the issues

• Gap between • people and processes that create and manage

content, using content management systems, • people who use translation software to

translate that content.

• The technologies don't talk to each other very easily, if at all.

Page 13: AWTW - Automated Web Translation Workflow

AWTW:: Trends :: the issues

• Another process gap:• when translators send content back for

uploading into the CMS• time consuming, • technically challenging • the risk of additional errors.

Page 14: AWTW - Automated Web Translation Workflow

AWTW:: the solution

Page 15: AWTW - Automated Web Translation Workflow

Workflow for a 50 page web site in 1 + 5 languages

Page 16: AWTW - Automated Web Translation Workflow

AWTW:: Auto Workflow:: Xtra Benefits

Next to time and cost savings…• SEO

– Page headers, titles,… also translated– Good for search in local languages by Googlers

• User-friendliness / usability– No technical people needed to do the job !

• Integration with WCMS– Workflow unchanged– Any editing

• Also in (creative) print– Time-to-publish considerably reduced

Page 17: AWTW - Automated Web Translation Workflow

AWTW:: Autom. Workflow:: Which Tools?

• WCMS– Open Source – Award-winning – Any other valuable WCMS

• TMS (Translation Memory System)– DéjàVu– Trados / SDL– Wordfast– Other xml compatible TMS’s

Page 18: AWTW - Automated Web Translation Workflow

Our Customers

Page 19: AWTW - Automated Web Translation Workflow

Website Translations:: Automated Workflow:: Demo

Open your eyes…

WCMS TMS

Page 20: AWTW - Automated Web Translation Workflow

AWTW:: technical steps :: EXPORT

1. Load all selected nodes

2. For each node:1. Create an <item> in the XML

2. Add the node id to <nid>

3. Add the title to <title>

4. Add the body to <body>

5. Add each CCK text fields to <cck_fieldname>

Page 21: AWTW - Automated Web Translation Workflow

AWTW:: technical steps :: EXPORT

<?xml version="1.0" encoding="UTF-8" ?>

<export>

<elements>

<item>

<nid>10</nid>

<title>Hello world</title>

<body>

<![CDATA[<p>Hello world!</p>]]>

</body>

<textfield_1>

<![CDATA[<p>Drupal is the best CMS.</p>]]>

</textfield_1>

<textfield_2>

<![CDATA[<p>Forget the rest!</p>]]>

</textfield_1>

</item>

</elements>

</export>

Page 22: AWTW - Automated Web Translation Workflow

AWTW:: technical steps :: IMPORT

1. Parse the uploaded XML

2. For each <item>:1. Load the original node

2. Reset the node id

3. Update the created and author fields

4. Set the language of the new node

5. Replace all translated values

6. Update all links to other nodes• Nodereference

• Book outline

7. Save the new node

8. Link the translation to the original node

Page 23: AWTW - Automated Web Translation Workflow

AWTW:: technical steps :: IMPORT

<?xml version="1.0" encoding="UTF-8" ?>

<export>

<elements>

<item>

<nid>10</nid>

<title>Hallo wereld</title>

<body>

<![CDATA[<p>Hallo wereld!</p>]]>

</body>

<textfield_1>

<![CDATA[<p>Drupal is het beste CMS.</p>]]>

</textfield_1>

<textfield_2>

<![CDATA[<p>Vergeet de rest!</p>]]>

</textfield_1>

</item>

</elements>

</export>

node_load

Hallo wereldHallo wereld

Hallo wereld!Hallo wereld!

Drupal is het ...Drupal is het ...

Vergeet de rest!Vergeet de rest!

now()now()

current usercurrent user

import languageimport language

Page 24: AWTW - Automated Web Translation Workflow

AWTW:: technical steps :: IMPORT

Hallo wereldHallo wereld

Hallo wereld!Hallo wereld!

Drupal is het ...Drupal is het ...

Vergeet de rest!Vergeet de rest!

now()now()

current usercurrent user

import languageimport language

translation of nodetranslation of node

Search translation of the referenced node

Page 25: AWTW - Automated Web Translation Workflow

AWTW:: included content

• Content that can be translated: – Title– Body– CCK Text fields

• Content that is updated automatically:– CCK Node reference fields– Book outline– Taxonomy

Page 26: AWTW - Automated Web Translation Workflow

AWTW:: content updates

• The AWTW module can also be used if the translation of the node already exists– In that case the existing translation is

loaded in the import step– The updated translation is saved as a

revision

Page 27: AWTW - Automated Web Translation Workflow

AWTW:: roadmap

• Drupal 6 version– Make AWTW available for Drupal 6

• Automatic mapping of internal links– All links to internal pages should be

redirected to the translation of these pages

• CCK Filefield and Imagefield– Allow translation of all text fields

(description, title, alt tag) linked to CCK uploads

Page 28: AWTW - Automated Web Translation Workflow

AWTW:: roadmap

• Translation of meta tags– Integration with Meta Tags module to

translate the meta tags

• Translation of menus– More difficult because there are

several alternatives for managing multilingual menus

• On demand

Page 29: AWTW - Automated Web Translation Workflow

AWTW:: and what about XLIFF?

Gabor Hojtsy

Page 30: AWTW - Automated Web Translation Workflow

AWTW:: Benefits

• Easy• Faster• Lower cost• No IT expert needed• Search Engine Friendly

Page 31: AWTW - Automated Web Translation Workflow

Thank you!

Questions ?General

Stany van [email protected]

Drupal.org name = svg

TechnicalKoen Van Isterdael Rachid

[email protected] [email protected] Drupal.org = koenvi rashendi

More information?www.connexion.eu