Best practices on the design of translation
David Chan, Pau Giner, Santhosh Thottingal
Knowledge is better if you can understand it
https://secure.flickr.com/photos/adam_jones/5793940771/
“Surprisingly small amount of content overlap between languages of Wikipedia”
The English Wikipedia contains only 51% of the articles in the second-largest edition, German.
Based on the analysis of one month of all edits to the top 46 language editions of Wikipedia by Scott A. Hale, University of Oxford. More info at http://arxiv.org/abs/1312.0976
Big opportunities for translation
[Figure labels: English, German]
Potential users are active
Over 15% of users edit multiple language editions.
On average, these multilingual users are 2.3 times more active than their monolingual counterparts.
Multilingual users made 30% of all edits.
Based on the analysis of one month of all edits to the top 46 language editions of Wikipedia by Scott A. Hale, University of Oxford. More info at http://arxiv.org/abs/1312.0976
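As a rough consistency check of the three figures above, the share of edits made by multilingual users can be estimated from the other two numbers. This is only a sketch: it assumes that "2.3 times more active" means 2.3 times as many edits per user, and it takes the multilingual share as exactly 15% although the slide says "over 15%". The function name is illustrative.

```python
def multilingual_edit_share(user_share: float, activity_ratio: float) -> float:
    """Estimate the share of all edits made by multilingual users.

    user_share: fraction of users who edit several language editions (0..1).
    activity_ratio: average edits per multilingual user divided by average
        edits per monolingual user (an assumed reading of "more active").
    """
    multilingual_edits = user_share * activity_ratio
    monolingual_edits = (1.0 - user_share) * 1.0
    return multilingual_edits / (multilingual_edits + monolingual_edits)


share = multilingual_edit_share(user_share=0.15, activity_ratio=2.3)
print(f"{share:.1%}")  # 28.9%
```

Under these assumptions the estimate is about 29%, which is in line with the reported 30% given that the real multilingual share is somewhat above 15%.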
Main language pairs
Workflow
Discover article lacking translation
Create translation draft
Publish as new article
Follow the current process, but avoid manual steps.
Translation view: where translations are made
Entry points: ways to make users aware of the tool
Translating content
Select a paragraph and improve the initial automatic translation. The user is not forced to translate the whole article.
A simple workflow
One paragraph at a time. Provide enough freedom to rearrange sentences, but don’t force users to translate the whole document.
Provide context. Visually aligning source and translations communicates what is translated and what is lacking.
Editing freedom. Don’t impose a strict workflow. Let users edit the document as freely as possible.
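The paragraph-at-a-time idea can be sketched as a small data structure. This is a minimal illustration, not the tool's actual data model: the class and method names (`ParagraphUnit`, `TranslationDraft`, `missing`, `publishable_text`) are hypothetical. Each source paragraph is paired with an optional translation, so the draft always knows what is translated and what is lacking, and paragraphs can be filled in any order.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ParagraphUnit:
    source: str                        # paragraph from the source article
    translation: Optional[str] = None  # None until the user translates it


@dataclass
class TranslationDraft:
    units: List[ParagraphUnit] = field(default_factory=list)

    def translate(self, index: int, text: str) -> None:
        """Fill in (or overwrite) the translation of one paragraph."""
        self.units[index].translation = text

    def missing(self) -> List[int]:
        """Indexes of source paragraphs that have no translation yet."""
        return [i for i, u in enumerate(self.units) if u.translation is None]

    def publishable_text(self) -> str:
        """Join only the translated paragraphs; untranslated ones are skipped."""
        return "\n\n".join(
            u.translation for u in self.units if u.translation is not None
        )


draft = TranslationDraft([
    ParagraphUnit("First paragraph."),
    ParagraphUnit("Second paragraph."),
    ParagraphUnit("Third paragraph."),
])
draft.translate(2, "Dritter Absatz.")  # any order, any subset
draft.translate(0, "Erster Absatz.")
print(draft.missing())  # [1]
```

Keeping source and translation in the same unit is what makes the side-by-side alignment cheap to render: an empty slot next to a source paragraph is the visual cue that content is still lacking.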
Definitions and alternative translations are provided at hand
Provide information at hand
Context-relevant information. Integrate information from different sources (dictionaries, glossaries, translation services).
Avoid information overload. Make the information compact.
Technology components
● Content Translation MediaWiki Extension
● Content Translation Server
● Entry points
● Node.js Server
● Translation Dashboard
● Translation Interface
● Machine Translation
● Translation Memory
● Dictionary
● Link adaptation
● Parsoid
● Cache
Links are adapted automatically
A link pointing to the Signal article is adapted to point to the 식호를 article (in Korean Wikipedia).
Anticipate user needs
Avoid repetitive steps. Repetitive tasks, such as finding the equivalent target for a link, can be automated thanks to Wikidata.
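Link adaptation can be sketched as a lookup over per-item site links. This is a simplified illustration, not the Content Translation implementation: the `SITELINKS` table is a small in-memory stand-in for data that a real system would obtain from Wikidata, and `adapt_link` is a hypothetical helper name.

```python
from typing import Dict, Optional

# Illustrative stand-in for Wikidata site links: item id -> {wiki: title}.
# "Q_EXAMPLE" is a made-up item that exists only in the source wiki.
SITELINKS: Dict[str, Dict[str, str]] = {
    "Q64": {"enwiki": "Berlin", "dewiki": "Berlin"},
    "Q183": {"enwiki": "Germany", "dewiki": "Deutschland"},
    "Q_EXAMPLE": {"enwiki": "Some English-only article"},
}


def adapt_link(
    title: str,
    source_wiki: str,
    target_wiki: str,
    sitelinks: Dict[str, Dict[str, str]] = SITELINKS,
) -> Optional[str]:
    """Return the target-wiki title for a source-wiki link target.

    Returns None when the source title is unknown or when the item has
    no article in the target wiki, so the caller can leave the link
    unadapted or mark it as missing.
    """
    for links in sitelinks.values():
        if links.get(source_wiki) == title:
            return links.get(target_wiki)
    return None


print(adapt_link("Germany", "enwiki", "dewiki"))  # Deutschland
```

Returning `None` instead of guessing keeps the automation safe: the translator only has to intervene for links that have no equivalent yet.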
We can detect whether users have modified the default automatic translation, and encourage them to do so.
Quality matters
Educate. Convey that the focus is more on quality than on quantity.
Warn. Detect potential patterns that lead to low quality (unmodified automatic translation or pasted text).
Inform the community. Allow other users to easily find potentially problematic content.
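One simple way to approximate the "Warn" step is to compare the text a user is about to publish with the machine translation they started from. The sketch below uses Python's standard `difflib.SequenceMatcher`; the slides do not say how the tool actually measures modification, and the threshold value and function names are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical cut-off: at or above this similarity the paragraph is
# treated as essentially unmodified machine translation.
UNMODIFIED_THRESHOLD = 0.95


def similarity(machine_translation: str, user_text: str) -> float:
    """Similarity in [0, 1] between the initial MT output and the user's text."""
    matcher = SequenceMatcher(None, machine_translation, user_text, autojunk=False)
    return matcher.ratio()


def needs_warning(
    machine_translation: str,
    user_text: str,
    threshold: float = UNMODIFIED_THRESHOLD,
) -> bool:
    """True when the user's text is (almost) identical to the MT output."""
    return similarity(machine_translation.strip(), user_text.strip()) >= threshold


mt_output = "Signal processing is a subfield of engineering."
print(needs_warning(mt_output, mt_output))  # True
print(needs_warning(mt_output, "Completely rewritten by a human translator."))  # False
```

A character-level ratio is cheap and language-agnostic, but it is only a heuristic: a good machine translation may legitimately need few changes, so a warning of this kind should prompt review rather than block publishing.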
Styling is not the focus of the tool. Users can edit the resulting articles later.
More details
http://www.mediawiki.org/wiki/Content_translation
Finding what to translate
Allow translating where the lack of content is detected
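Detecting where content is lacking can be sketched as a comparison of which language editions cover each topic. The example below is an assumption-laden illustration: the `SITELINKS` data, the item ids, and the `missing_in_target` helper are all made up, and a real system would read this information from a source such as Wikidata.

```python
from typing import Dict, List

# Hypothetical data: item id -> {wiki: article title}.
SITELINKS: Dict[str, Dict[str, str]] = {
    "item-1": {"enwiki": "Article A", "dewiki": "Artikel A"},
    "item-2": {"enwiki": "Article B"},
    "item-3": {"dewiki": "Artikel C"},
    "item-4": {"enwiki": "Article D"},
}


def missing_in_target(
    sitelinks: Dict[str, Dict[str, str]],
    source_wiki: str,
    target_wiki: str,
) -> List[str]:
    """Source-wiki titles whose topic has no article in the target wiki."""
    return sorted(
        links[source_wiki]
        for links in sitelinks.values()
        if source_wiki in links and target_wiki not in links
    )


print(missing_in_target(SITELINKS, "enwiki", "dewiki"))  # ['Article B', 'Article D']
```

Each title in the result is a natural place to offer a "translate this article" entry point to a user who reads both languages.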
Selecting languages
Language selection becomes a problem when you support more than 300 languages
Instead of showing all options, provide the most relevant ones and access to the rest.
Most relevant languages
Access to all languages with flexible search
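The two-part selector described above (a short list of relevant languages plus search over everything else) can be sketched as follows. This is not the jquery.uls API: the language table is a tiny illustrative subset, and `relevant_languages` and `search_languages` are hypothetical names. "Flexible search" is approximated here by prefix matching on the language code, the English name, and the autonym.

```python
from typing import Dict, List, Tuple

# Illustrative subset: language code -> (English name, autonym).
LANGUAGES: Dict[str, Tuple[str, str]] = {
    "de": ("German", "Deutsch"),
    "en": ("English", "English"),
    "es": ("Spanish", "español"),
    "ko": ("Korean", "한국어"),
    "ml": ("Malayalam", "മലയാളം"),
}


def relevant_languages(
    preferred: List[str], previously_used: List[str], limit: int = 3
) -> List[str]:
    """Short list shown first: earlier choices, then browser preferences."""
    ordered: List[str] = []
    for code in previously_used + preferred:
        if code in LANGUAGES and code not in ordered:
            ordered.append(code)
    return ordered[:limit]


def search_languages(query: str) -> List[str]:
    """Codes whose code, English name, or autonym starts with the query."""
    q = query.strip().casefold()
    if not q:
        return []
    return sorted(
        code
        for code, names in LANGUAGES.items()
        if code.startswith(q) or any(n.casefold().startswith(q) for n in names)
    )


print(relevant_languages(preferred=["es", "en"], previously_used=["ko"]))  # ['ko', 'es', 'en']
print(search_languages("deu"))  # ['de']
```

Matching on autonyms as well as English names matters at this scale: a user looking for their own language will often type its native name, not the English one.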
Technical details
● Universal Language Selector
● A jQuery plug-in
● Supports all languages defined by Unicode
● Open source
http://github.com/wikimedia/jquery.uls