Xml Work Flow
-
Upload
heyday-software-solutions -
Category
Technology
-
view
1.239 -
download
4
Transcript of Xml Work Flow
Current Projects
E-Publishing– IMF– Wiley UK– VST
PDF to XML Work Flow
Data Capture Coding Validation E-Deployment
Data Capture
Capture Text,Box-Text and Box-FootNotes from Source PDF - only Chapters
Capture Chapter/Article-FootNotes from Source PDF- only Chapters
Capture Images & Tables as JPG and Image Related Text as HTML from Source PDF
Capture Table Content from the Source PDF as Text and add IMF TAGS
Coding
Merging of all the Data capture tasks as per IMF specification
Creating Front Matter from source PDF parts ( TOC,Preface,Abbrevations,Main Messages)
Creating Back Matter from source PDF parts(appendixes,Glossary,References)
Image Editing as per IMF specification Merging of all the above tasks as per IMF
specification
Validation
QC With Epsilon QC With Browsers for desired View QC With Oxygen
E-Deployment
Deploy in Customer Desired Format
TASK 2 TASK 3 TASK 8TASK 7TASK 4 TASK 6
TASK 12 TASK 11 TASK 10
TASK 9
QC DEPARTTASK 13
DELIVERABLE (XML)
INPUT (PDF)
TASK 1
TASK 5
TASK 1Capture Text, Box-Text and Box-Footnotes
from Source PDF-Chapters
TASK 2Capture Footnotes of Chapter/Article
from Source PDF
TASK 3Capture Images & Tables as JPG
from Source PDF-Chapters
TASK 4Capture Table Data as Text from Source PDF
and Add IMF-Table Tags
TASK 6Capture Front Matter from Source PDF
(TOC,Preface,Abbrevations,Main Messages)
TASK 7Capture Back Matter from Source PDF
(Appendixes, Glossaries and References)
TASK 5Merge all previous Tasks output into one
and add Required IMF Tags
TASK 8Edit all Images to set required resolution and Size
TASK 9Merge Tasks (from 5 to 8) to get final output
Val
idat
ion
Thr
ough
Eps
ilon
Val
idat
ion
Thr
ough
B
row
ser
for
Des
ired
Vie
w
Val
idat
ion
agai
nst o
f IM
F-
DT
D u
sing
Oxy
gen
Team Members
Team Leaders
Quality Analyst
Abbyy FineReader
Epsilon Editor
EpsilonDTDXSL
Oxygen
Task 1, Task 2, Task 3, Task 4
Task 5, Task 6, Task 7, Task 8, Task 9
Task 10, Task 11, Task 12, Task 13
Do
Do
DoUsing
Using
Using
Tasks Distribution and Methodology
Capturing Various Type of Data
Code around the Data
Validate the Code and Data
TASK 1 : SAMPLE
Description : Capture Text from Source PDF (Only Chapters) Using OCR Tool
Input : Source PDF
TASK 1 : SAMPLE
Output : One HTML file for each Chapter/Article
TASK 2 : SAMPLE
Description : Capture Chapter/Article-Foot Notes from Source PDF- Only Chapters
Input : Source PDF
TASK 2 : SAMPLEOutput : One html or multiple html when footnote repeats its ID for each Chapter/Article
TASK 3 : SAMPLEDescription : Capture Images & Tables as JPG and Image Related Text as HTML from Source PDF
Input : Source PDF
TASK 3 : SAMPLE
Output : Multiple JPG’s & One HTML
TASK 4 : SAMPLE
Description : Capture Table Content from the Source PDF as Text and add IMF TAGS
Input : Source PDF
TASK 4 : SAMPLE
Output : HTML
TASK 5 : SAMPLE
Description : Merging of all the above Tasks(1 to 4) as per IMF specification
Input : Task 1 to Task 4
Output: HTML
TASK 6 : SAMPLE Input : Source PDFDescription : Capture Front Matter from source PDF parts ( TOC, Preface,
Abbreviations, Main Messages)
TASK 6 : SAMPLE
Output : HTML
TASK 7 : SAMPLE
Description : Capture Back Matter from source PDF parts (Appendixes, Glossary, References)
Input : Source PDF
TASK 7 : SAMPLE
Output :HTML
TASK 8 : SAMPLE
Description : Image Editing as per IMF specification
Output : Final JPG’s
Input : Source PDF
TASK 9 : SAMPLE
Description : Merging of all the above tasks(5,6,7,8) as per IMF specification
Output : Final XML without Validation
Input : Task 5 to Task 8
TASK 10 : SAMPLE
Description : First Level Validation With Epsilon
Output : XML
Input : Task 9 - XML
TASK 11 : SAMPLEDescription : Validation With Browsers for desired View
Output : Final XML Validation- Second Level
TASK 12 : SAMPLE
Description : Validation With Oxygen against of IMF-DTD
Output : Final XML Validation- Third Level
TASK 13 : SAMPLE
Description : Packing Process in Desired Manner
Output : Deliverable Product