Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

Post on 06-May-2022

6 views 0 download

Transcript of Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Mul(lingualWeb-­‐LT  Execu(ve  Summary  

Felix  Sasaki  DFKI  /  W3C  Fellow  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Project  goals  •  Provide  reference  implementa(ons  of  metadata  for  mul(lingual  processes  – Content  crea(on,  (human  or  machine)  transla(on,  localiza(on  workflows,  ...  

•  Define  a  metadata  standard  based  on  implementa(ons  and  exis(ng  work  – From  Interna(onaliza(on  Tag  Set  (ITS)  1.0  >  ITS  2.0  

•  Con(nue  and  enlarge  a  community  around  the  Mul(lingualWeb  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Groups  involved  

MLW-­‐LT  consor(um  (Reference  Implementa(ons)  

W3C  MLW-­‐LT  Working  Group  

Members  (Standardiza(on)  

MLW  PC  members  (Community  building)  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Requirements  Gathering  •  Workshop  June  2012,  Dublin  – 71  a^endees  –   New  stakeholders:  linked  open  data  community  –   New  implementers:  Adobe,  ]init[,  Logrus,  Tilde  

•  Requirements  gathering  document  – W3C  public  working  drab  – Wiki  version  21.000+  access  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Standardiza(on  Process  ...  •  ITS  2.0  drab  development  June  –  December  2012  – 40+  individuals  par(cipa(ng  – 2100+  emails,  aggressive  standardiza(on  progress  – Engaging  “invited  experts”  and  further  par(cipants,  including  higher-­‐level  decision  makers:  

 Adobe,  CNR,  DERI,  Ecole  Mohammadia  

d'Ingenieurs  Rabat,  ]init[,  Logrus,  NCSR,  Opera,  SAP,  Tilde  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

...  driven  by  implementa(ons  •  Test  suite  development  star(ng  August  2012,  driven  by  TCD  –  Input:  Files  with  ITS  2.0  metadata  – Output:  metadata  overview  –  Current  state:  223  input  files,  839  implementer  output  files,  80%  coverage  

<!DOCTYPE  html>  ...        <p>Everything  started  when  Zebulon  discovered  that  he  had  a  <span  translate="NO">doppelgänger</span>  ...  </html>  

...  /html/body[1]/p[1]  translate="yes"  /html/body[1]/p[1]/span[1]  translate="no"  ...  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

“Metadata  for  the  Mul(lingual  Web”  •  Summarizing  usage  scenarios  and  implementa(ons  

•  Aligned  with  implementa(on  development  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Usage  scenarios  and  implementa(on  highlights  

•  XLIFF  transla(on  package  crea(on  driven  by  ITS  2.0  metadata  

•  Quality  check  driven  by  metadata  constraints  •  Installa(on  of  workflow  from  CMS  to  TMS  system  •  CMS  implementa(on  of  metadata  authoring  support  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Usage  scenarios  and  implementa(on  highlights  

•  Text-­‐processing  component  interconnected  with  Drupal  

•  Cocomore  –  Linguaserve:  showcase  “localiza(on  workflow  with  VDMA”  

•  Linguaserve:  “real  (me  MT  with  Spanish  Tax  Agency”  

•  Volunteer  implementer  Shaun  McCance  –  ITS  Tool:  XML  to  PO  and  back  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Not  covered  during  this  review  •  Valida(on  of  HTML5+ITS  (UEP)  – Available  at  h^p://validator.nu/    – Staged  for  integra(on  in  W3C  validator  

•  ITS  Libre  Office  Writer  Extension  -­‐  ]init[  •  ITS  2.0  Enriched  Terminology  Annota(on  –  Tilde  •  Visual  designs  to  render  "ITS  for  HTML5”  –  Logrus  •  Localisa(on  Workflows  Using  ITS  2.0  with  Adobe  CQ  and  Apache  JackRabbit  –  Adobe    

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Deliverables  for  year  one  •  D1.1  Detailed  Overall  Management  and  Bodies  Management,  including  the  Quality  Assurance  Plan  

•  D1.2.1  Report  on  Internal  and  External  Communica(on  Tools  

•  D1.2.2  LT-­‐Web  -­‐  W3C  Coordina(on  Yearly  Report  •  D1.2.3  Contact  Database  •  D2.1  Requirements  and  Use  Case  Document  •  D2.2  LT-­‐Web  Metadata  Drab  Documents  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Deliverables  for  year  one  •  D4.1.1  Lucy  Modifica(on  •  D4.1.2  MaTrEx  Modifica(on  •  D4.1.3  Linguaserve  Online  System  Modifica(on  •  D4.1.4  Report  on  Modifica(ons  in  MT  Systems  •  D6.1.1  Workshop  1  •  D6.1.2  Summary  Report  1  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

WP5  “Deep  Web  Informa(on  and    MT  Training”  

•  Deliverables  – D5.1.1  MT  Training  Module  – D5.1.2  XLIFF  Deep  Web  MT  Training  Exporter  – D5.2  Metadata-­‐Aware  MT  Training  

•  Delivery  date  will  be  delayed  to  be  able  to  benefit  from  Cocomore  training  data  

•  Overall  WP  will  be  in  (me  (conclusion  by  M21)  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Communica(on  •  W3C  infrastructure  +  telephone  conference  tool  – Mailing  lists  –  IRC  – Ac(on  /  issue  tracker  –  ...  see  D1.2.1  

•  Separate  channels  for  – Working  Group  (standardiza(on)  – Workshop  planning  (MLW  PC)  – Public  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Management  

MLW-­‐LT  consor(um  (Reference  Implementa(ons)  

W3C  MLW-­‐LT  Working  Group  

Members  (Standardiza(on)  

MLW  PC  members  (Community  building)  Communica(on  

infrastructure  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Conclusion  •  Community  building  via  suppor(ng  ...  – Reference  implementa(on  – Standardiza(on  – Outreach  

•  ...  pays  of!  •  Similar  projects  could  be  useful  in  the  future  

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Q/A