Transcript of "Autonomous Cleaning of Corrupted Scanned Documents: A Generative Modeling Approach" by Zhenwen Dai and Jörg Lücke
- Slide 1
- Autonomous Cleaning of Corrupted Scanned Documents: A Generative
Modeling Approach. Zhenwen Dai, Jörg Lücke. Frankfurt Institute for
Advanced Studies & Dept. of Physics, Goethe-University Frankfurt
- Slide 2
- A document cleaning problem
- Slide 3
- What method can save us? Optical Character Recognition (OCR)
- Slide 4
- OCR Software (FineReader 11): input, then character segmentation,
then character classification. (figure: ambiguous classifications on
the corrupted input)
- Slide 5
- What method can save us? Optical Character Recognition (OCR)
Automatic Image Inpainting
- Slide 6
- Slide 7
- Unable to identify the defects, because corruption and characters
consist of the same low-level features; a solution requires knowledge
of explicit character representations.
- Slide 8
- What else? Optical Character Recognition (OCR)? Automatic Image
Inpainting? Image Denoising? The problem requires a new solution!
- Slide 9
- Our Approach: the training data is only the page of the corrupted
document itself; no label information; a limited alphabet (currently).
(figure: input vs. our approach)
- Slide 10
- How does it work without supervision? Characters are salient,
self-repeating patterns, while corruptions are more irregular. Related
to Sparse Coding. (figure: input vs. our approach)
- Slide 11
- The Flow of Our Approach: cut the page into image patches, learn a
character model on the image patches, then perform character detection
& recognition. (figure: scattered letters b, a, y, e, s)
- Slide 12
- A Probabilistic Generative Model: illustrates the character
generation process. A character representation (the parameters)
consists of mask parameters and feature vectors (RGB color).
- Slide 13
- A Tour of Generation: 1. Select a character (prior prob. 0.2).
2. Translate it to its position (e.g. translation by [12, 10]^T).
3. Generate a background (pixel-wise background distribution).
4. Overlap the character with the background according to the mask.
Learning estimates the masks and features.
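The four generation steps above can be sketched as follows, assuming for simplicity a single grayscale feature per pixel (the slides use RGB color / Gabor features) and a Gaussian background; all names and values are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

def generate_patch(char_features, char_mask, patch_shape, shift,
                   bg_mean, bg_std):
    """Sketch of the generative story for one patch:
    1. a character (features + mask) has been selected by the caller,
    2. it is translated by `shift`,
    3. a pixel-wise Gaussian background is sampled,
    4. character and background are blended according to the mask."""
    H, W = patch_shape
    background = rng.normal(bg_mean, bg_std, size=(H, W))  # step 3
    mask_full = np.zeros((H, W))
    feat_full = np.zeros((H, W))
    h, w = char_mask.shape
    dy, dx = shift                                          # step 2
    mask_full[dy:dy + h, dx:dx + w] = char_mask
    feat_full[dy:dy + h, dx:dx + w] = char_features
    # step 4: mask = 1 shows the character, mask = 0 the background
    return mask_full * feat_full + (1.0 - mask_full) * background

char_feats = np.full((4, 4), 0.9)   # toy character features
char_mask = np.ones((4, 4))         # fully opaque toy mask
patch = generate_patch(char_feats, char_mask, (16, 16),
                       shift=(5, 6), bg_mean=0.2, bg_std=0.05)
```

With a soft (fractional) mask the same blending formula produces smooth character boundaries over the background.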
- Slide 14
- Maximum Likelihood: iterative parameter update rules from EM for the
prior probabilities, the std, and the parameter set. A posterior
distribution is needed for every image patch in the update rules.
(figure: posterior over translations t_0, t_1, t_2, ..., t_n)
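The structure of the EM update rules can be sketched on a simplified mixture-of-templates model (translations and masks omitted for brevity; this is not the paper's exact model): the E-step computes a posterior over characters for each patch, and the M-step re-estimates all parameters as posterior-weighted averages.

```python
import numpy as np

def em_step(patches, templates, priors, sigma):
    """One EM iteration for a Gaussian mixture of templates.
    patches: (N, D), templates: (K, D), priors: (K,)."""
    # E-step: posterior over which character generated each patch
    sq_dist = ((patches[:, None, :] - templates[None, :, :]) ** 2).sum(-1)
    log_post = np.log(priors)[None, :] - sq_dist / (2 * sigma ** 2)
    log_post -= log_post.max(axis=1, keepdims=True)   # numerical stability
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)           # (N, K)
    # M-step: posterior-weighted parameter updates
    Nk = post.sum(axis=0)                             # effective counts
    new_templates = (post.T @ patches) / Nk[:, None]
    new_priors = Nk / len(patches)
    resid = ((patches[:, None, :] - new_templates[None]) ** 2).sum(-1)
    new_sigma = np.sqrt((post * resid).sum()
                        / (len(patches) * patches.shape[1]))
    return new_templates, new_priors, new_sigma

patches = np.array([[0., 0.], [0., 0.], [1., 1.], [1., 1.]])
templates = np.array([[0.1, 0.1], [0.9, 0.9]])
t, p, s = em_step(patches, templates, np.array([0.5, 0.5]), sigma=0.5)
```

Even this toy version shows why the posterior is the bottleneck: it must be evaluated for every patch and every hidden configuration, which motivates the pre-selection approximation on the next slide.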
- Slide 15
- Posterior Computation Problem: a posterior distribution is needed
for every image patch in the update rules. Inference must decide which
character (A, B, C, D, E) and where, over a large hidden space.
Similar to template matching: a pre-selection approximation
(truncated variational EM; Lücke & Eggert, JMLR 2010; Yuille &
Kersten, TiCS 2006)
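The pre-selection idea can be sketched as follows: score each character with a cheap feature match and keep only the best few candidates, so the expensive posterior is computed over this truncated hidden space only. Scoring by inner product is an illustrative assumption, not the exact criterion from the slides.

```python
import numpy as np

def preselect(patch_feats, char_feats, n_candidates=3):
    """Return the indices of the best-matching candidate characters
    for one patch (sketch of the pre-selection step)."""
    scores = char_feats @ patch_feats       # cheap match score per character
    order = np.argsort(scores)[::-1]        # best-matching characters first
    return order[:n_candidates]

char_feats = np.eye(5)                      # five toy "characters"
patch = np.array([0.1, 0.9, 0.0, 0.8, 0.2])
cands = preselect(patch, char_feats, n_candidates=2)
```

The full posterior is then evaluated only for the returned candidates, which turns an exhaustive search over all characters and positions into a short list of good guesses.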
- Slide 16
- An Intuitive Illustration of Pre-selection: select a few local
features in the image patches according to the parameters; very few
features already yield a small number of good guesses (e.g. B and D
out of A-E). (Lücke & Eggert, JMLR 2010; Yuille & Kersten, TiCS 2006)
- Slide 17
- Learn the Character Representations. Input: image patches (Gabor
wavelets). A learning course of about 25 minutes. (figure: mask,
feature, std, and chars over iterations 1-6; heat map)
- Slide 18
- Slide 19
- Document Cleaning: how to recognize characters against noise?
Character segmentation fails, and our model assumes one character per
patch. It is a non-trivial task; we try to extract as much as possible
from the model.
- Slide 20
- Document Cleaning Procedure: run inference on every patch with the
learned model. 1. Paint a clean character at the detected position.
2. Erase the character from the original document. Accept a
reconstruction when the character is fully visible (fully visible = 1).
(figure: original vs. reconstructed; clean characters from the
corrupted document)
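The paint-and-erase steps can be sketched as below; all names (`clean_step`, `detections`) and the background estimate are illustrative assumptions, not the paper's exact procedure:

```python
import numpy as np

def clean_step(document, canvas, detections, templates, masks):
    """One cleaning pass (sketch): for each accepted detection
    (character id k at position (y, x)), paint the clean template onto
    the output canvas and erase it from the working document by filling
    the masked pixels with a local background estimate."""
    for k, (y, x) in detections:
        h, w = masks[k].shape
        region = document[y:y + h, x:x + w]
        outside = masks[k] < 0.5
        bg = np.median(region[outside]) if outside.any() else 0.0
        # 1. paint the clean character at the detected position
        canvas[y:y + h, x:x + w] = np.where(
            masks[k] >= 0.5, templates[k], canvas[y:y + h, x:x + w])
        # 2. erase it from the original so later iterations can find
        #    overlapping characters
        document[y:y + h, x:x + w] = np.where(masks[k] >= 0.5, bg, region)
    return document, canvas

doc = np.full((8, 8), 0.2)          # toy page, background value 0.2
doc[3:5, 3:5] = 1.0                 # toy character pixels
canvas = np.zeros((8, 8))           # clean output page
templates = {0: np.full((2, 2), 0.7)}
masks = {0: np.ones((2, 2))}
doc, canvas = clean_step(doc, canvas, [(0, (3, 3))], templates, masks)
```

Erasing accepted characters is what lets the iteration on the next slide recover further characters hidden behind them.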
- Slide 21
- Document Cleaning Procedure: run inference on every patch with the
learned model and iterate until no more characters are reconstructed
(about 1 min per iteration); a patch may contain more than one
character. Accept a reconstruction when the character is fully visible
(fully visible = 1) and reject it otherwise (fully visible = 0).
(figure: iterations 1 and 2, original vs. reconstructed)
- Slide 22
- Before Cleaning
- Slide 23
- After Iteration 1
- Slide 24
- After Iteration 2
- Slide 25
- After Iteration 3
- Slide 26
- More Experiments: more characters (9 chars); an unusual character
set (Klingon); irregular placement (randomly placed, rotated);
occlusion by spilled ink. (figure: original vs. reconstructed for
9 chars, Klingon, rotated/randomly placed, and occluded)
- Slide 27
- Recognition Rates
- Slide 28
- False Positives
- Slide 29
- Not only a Character Model: detect and count cells in microscopy
image data (in collaboration with Thilo Figge and Carl Svensson).
- Slide 30
- Summary: addressed the corrupted-document cleaning problem; followed
a probabilistic generative approach; autonomous cleaning of a document
is possible; demonstrated efficiency and robustness. The dataset will
be available online soon. Future directions: extend to large alphabets
by incorporating prior knowledge of documents; extend to various other
applications.
- Slide 31
- Acknowledgement http://fias.uni-frankfurt.de/cnml
- Slide 32
- Thanks for your attention!
- Slide 33
- Learned Character Representations: cut the document into small
patches and run the learning algorithm.
- Slide 34
- Performance (datasets: bayes, 9 chars, Klingon, Randomly placed,
Occluded). Recognition rates: OCR 56.5%, 75.4%, 0%, 0.8%, 41.6%;
our algorithm 100%, 97.4%. False positives: OCR 29728523186413;
our algorithm 0, 0, 0, 3, 6.
- Slide 35
- Document Cleaning Procedure: character vs. noise? MAP inference can
only choose among the learned characters, so: 3. Define a novel
quality measure (threshold: 0.5) based on the difference between the
mask parameters of the MAP character and the mask posterior.
(figure: examples "y" and "a")
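The quality measure above can be sketched as a mask-agreement test: a detection is accepted only if the posterior mask inferred from the patch agrees with the learned mask parameters of the MAP character. The exact measure is not specified on the slide; mean absolute agreement is an illustrative stand-in.

```python
import numpy as np

def quality(mask_param, mask_posterior):
    """Agreement between the learned mask of the MAP character and
    the mask posterior inferred from the patch (1 = perfect match)."""
    return 1.0 - np.abs(mask_param - mask_posterior).mean()

def accept(mask_param, mask_posterior, threshold=0.5):
    """Accept a detection as a real character, reject it as noise."""
    return quality(mask_param, mask_posterior) >= threshold

ok = accept(np.ones((4, 4)), np.ones((4, 4)))    # masks agree: character
bad = accept(np.ones((4, 4)), np.zeros((4, 4)))  # disagreement: noise
```

This gives the rejection mechanism that MAP inference alone cannot provide, since MAP always returns some learned character even for pure noise.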
- Slide 36