Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line...
Transcript of Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line...
![Page 1: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/1.jpg)
Versatile Searchof Scanned Arabic Handwriting
Sargur N. Srihari, Gregory R. Ball, and Harish Srinivasan
Center of Excellence for Document Analysis and Recognition (CEDAR)
Department of Computer Science and EngineeringUniversity at Buffalo, State University of New York
Email: [email protected]
![Page 2: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/2.jpg)
Outline• CEDAR Handwriting Analysis Systems
– CEDAR-FOX and CEDARABIC• Versatile Search
– Query Types: Image and Text• Word Spotting Algorithms
– Word Segmentation Based• Holistic (Word Shape) Algorithm• Analytic (Character Shape) Algorithm
– Word Segmentation Free • Performance: Precision and Recall• Conclusion
![Page 3: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/3.jpg)
End-to-End Systems Developed (1983-Present)Including corpuses
1. Hand-Written Address Interpretation HWAI USPS, Australia Post, UK
2. Name and Address Block ReaderNABR IRS
3. Handwriting Segmentation and RecognitionPenman NSA
4. Japanese Character RecognitionCherry Blossom NSA
5. Writer Identification and SearchCedar-Fox English NIJ
CEDARABIC Arabic
![Page 4: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/4.jpg)
CEDAR-FOX vs. CEDARABIC
• English documents• Forensic applications• Search
– Keyword, Image• Recognition
– Character, Word
• Arabic documents• XML representation, Truthing Tools
• Search– Keyword (English)
– Image (Arabic)
• Verification and Identification– Writer– Signature
• Image Enhancement and Noise Removal– Two types of thresholding– Rule line and Underline removal
• Transcript Mapping for Creating Corpuses• Database for Document Metadata
Developed over 5 years in consultation with law enforcement agencies and professional QDEs
![Page 5: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/5.jpg)
Search Problem
• Searching electronic documents for information related to a query is ubiquitous
• Searching scanned printed documents is a recent application
• Searching scanned handwritten images is a research frontier– Given a query and a repository of scanned
handwritten documents, retrieve most relevant subset of documents
![Page 6: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/6.jpg)
Approaches
• CBIR – Content-based information retrieval –broad topic in IR and data mining– Image based approaches—direct CBIR based on image
retrieval (word spotting)– Text based approach—transcribe document to text and
search electronic representation• Both methods are error prone (grand challenge in
computer vision)• Combining both may achieve better performance
than either alone
![Page 7: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/7.jpg)
Versatile Search1. Versatile query
1. Typed Arabic (eg.بمرآز)UNICODE string of Arabic text
2. Typed English (eg. at, center)corresponding to idea that should appear in Arabic document
3. Arabic Image (eg. )of Arabic word or words
2. Versatile search (combine search methods)1. Image query (word spotting)
If query is image, preserve it throughout searchIf query is text, extract or generate image query
2. Text query (needs recognition)If query is image, convert to textIf query is text, preserve it throughout search
![Page 8: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/8.jpg)
Word Spotting using Image QueryImage Query
Database(pre-segmented)
Words Spotted
![Page 9: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/9.jpg)
Search Based on Text QueryCEDARABIC User Interface
EnglishTextQuery
Results
![Page 10: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/10.jpg)
Word Search User Interface1. Query (English Text)2. Retrieved Style Choices3. Chosen Styles4. Results
![Page 11: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/11.jpg)
CEDARABIC Document RepresentationPre Processed “.arb” file XML Representation
![Page 12: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/12.jpg)
Handwritten Arabic Recognition Overview
Preprocessing
Data Normalization
Encoding
Segmentation
Recognition
Convert toBinary
Slant Angle SmoothingNoiseReduction
Chain Code Generation
Page Line Word
Word ShapeRecognition
CharacterBased Word Recognition
Holistic LineRecognition
HandwrittenArabic Text
Preprocessed Text
Segmented Text
الرياض في االجتماعي سلمان االمير بمرآزat, center
the, prince
Salmansocialin, Alryad(capital of Saudi)
Recognized TextUnicode
Englishequivalent
![Page 13: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/13.jpg)
Recognition
Word Shape Recognition
Character Based Word Recognition
Holistic Line Recognition
Detail of Recognition ModuleCharacter Based Word RecognitionOversegment Words
Dynamic Programming(Maximization)
Find NearestPrototype
Prototype Clusters
Word Library Combine WordLibrary Features
Search forClosest Match
Holistic Approach Operates on Lines(No Word Segmentation)
Maximize WordScores (for each line) “Segmentation Free”
Feature Vector
Library Images and VectorsWord Shape Recognition
Holistic Line Recognition (Sliding Window)
Word Spotting
Noon Yeh Sad Lam-Hah Alef
Recognition
الملك الفكرذلكاليوم
![Page 14: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/14.jpg)
Query
Final Search Query
User Query
Versatile Search Framework
Sample Lookup Handwriting Recognition
Arabic Text Arabic Handwriting English Text
Text/Image Lookup
Image Query Text (UNICODE) Query
Search
Neural Network
Word Shape Matching Transcription Search
Result
![Page 15: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/15.jpg)
Segmentation• Line – separating page into component lines
• Most critical – new method achieves extremely successful line segmentation
• Word – separating line into component words• Developed automatic segmentation method; • Segmentation-free methods avoid need for word segmentation
• Character – separating word into component characters• Holistic approaches avoid character segmentation issues• Character based methods use prototypes to avoid need for complete
character segmentation
Search depends on successful segmentation
![Page 16: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/16.jpg)
Line Segmentation
Algorithm– Creates statistical
models of adjacent lines
– In combination with top-down approaches
– To be presented at SPIE, San JoseJanuary 2006
![Page 17: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/17.jpg)
Word Segmentation
Not word gap
To determine whether a gap is a true word gap
Word gap
![Page 18: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/18.jpg)
Arabic Word Segmentation Algorithm• Improved over method for Latin script segmentation• Clustering of components• Convex hulls of clusters• Convex hull of pair of clusters• Features(9)
– Minimum distance between convex hulls– Ratio of area of pair to sum of individual areas– Heights of clusters– Alef Flag (words tend to begin with alef)
Height / width ofComponents used
![Page 19: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/19.jpg)
Word Segmentation PerformanceTruthAuto-segmentation
![Page 20: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/20.jpg)
CEDARABIC Word SegmentationAutomatic mode Manual Mode
Useful for creating a corpus
![Page 21: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/21.jpg)
Holistic Word Shape Features (Language Independent)
Candidate Wordwi in Database
Chosen styles
s1
s2
s3
s4
),(1)(1
ji
n
ji swd
nwscore ∑
=
=
⎟⎟⎠
⎞⎜⎜⎝
⎛++++
−−= 2/1
1000011100011110
01100011
)])()()([(1
21),(
ssssssssssssYXd
Feature Vectors
![Page 22: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/22.jpg)
Spotting Based on Word Image QueriesUser Interface
Devanagari Script-PrintedLatin Script-Handwriting
Word Image Query in English and Sanskrit
![Page 23: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/23.jpg)
Analytic (Character Based): Presegmentation using ligature points
• Query: UNICODE text of word • UNICODE text mapped to positional
variations of characters (initial(i), medial(m), final(f), separate positions)Alef|Lam|Teh|Qaf|Alef maksura|
toAlefi|Lami|Tehm|Qafm|Alef maksuraf|
• Candidate word is pre-segmented, based upon ligature points
Pre-segmentation
Alef|Lam|Teh|Qaf|Alef maksura
Ligature based segmentation of a candidate word
![Page 24: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/24.jpg)
Analytic (with char segmentation and recognition)
• Pre-segments reassembled into super-segments
• Candidate structures are measured against 2000 prototype chars (34 classes, 4 of each), WMR features, nearest-neighbor
• Scores of best candidate super-segments are combined into word-score
• Even with small prototype set, word to be spotted is in top 5 choices > 90% cases
• Advantage of not requiring any prototype word images
Best matching set of character super-segments
![Page 25: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/25.jpg)
Character Based Spotting (with compound characters)
• Vertically oriented character combinations– Somewhat unique problem to Arabic– Dealt with by making compound character
classes– Compound character classes dramatically
improve recognition
Lam-ha Ha Lam
![Page 26: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/26.jpg)
Word-Segmentation Free Method• Uses query to evaluate each potential word grouping• Utilizes sliding window
– Recognition and segmentation performed concurrently– Entire line acts as input– Splits line into connected component groups– Ligature based segmentation can further split components– Considers all realistic combinations of adjacent connected components
CandidateSegmentations
![Page 27: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/27.jpg)
Segmentation Free Method
• Top 1 scoring regions for following text:– Alef|Lam|Teh|Qaf|Alef maksura|– Reh|Yeh+hamza|Yeh|Seen|– Alef|Lam|Lam|Qaf|Alef|Hamza|– Alef|Lam|Sheen|Yeh|Khah|
![Page 28: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/28.jpg)
Combining Results
• After parallel image and text search, results combined with neural network
• Input: Output from each of the searches; optionally a set of features of the images
• Output: A combined score
![Page 29: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/29.jpg)
CEDARABIC Word Spotting Performance• Averaged over 150 Queries chosen randomly among: advancing, african, aims, algeria, algerian, allah-
god ,am ,america, american ,ar, arabian, asian, atalanta, barcelona, because we, brescia, building,built, established, copeam, cagliari, cairo, chievo,country, department, developing, different views, european, existence, fiorentina,france, french, friday,gmt,gaza, germany, getting worse, gunmen,history ,influenced, intellectual, iran, iranian, iraq, islam, islamic, israel, italian, japanese, juventus, ke, khan younis (city), khartoum, lazio, lecce, legates, etc
Performance increases with more stylesStyles = 3, Testing on 7 Writers
![Page 30: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/30.jpg)
Results• Higher performance
than either method alone• 91% raw classification
accuracy• At 50% recall, 55%
precision was obtained in the word shape method, 75% precision for character based method
• Combined method about 80%
![Page 31: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/31.jpg)
Word Spotting Precision-Recall150 queries (king, nation, Friday,..)
0 10 20 30 40 50 60 70 80 90 1000
10
20
30
40
50
60
70
80
90
100precision recall
Recall
Pre
cisi
on
0 10 20 30 40 50 60 70 80 90 1000
10
20
30
40
50
60
70
80
90
100precision recall
Recall
Pre
cisi
on
0 10 20 30 40 50 60 70 80 90 1000
10
20
30
40
50
60
70
80
90
100precision recall
Recall
Pre
cisi
on
3 writers 4 writers 5 writers
0 10 20 30 40 50 60 70 80 90 1000
10
20
30
40
50
60
70
80
90
100precision recall
Recall
Pre
cisi
on
0 10 20 30 40 50 60 70 80 90 1000
10
20
30
40
50
60
70
80
90
100precision recall
Recall
Pre
cisi
on
0 10 20 30 40 50 60 70 80 90 1000
10
20
30
40
50
60
70
80
90
100precision recall
Recall
Pre
cisi
on
6 writers 7 writers 8 writers
![Page 32: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/32.jpg)
Performance as No of Styles Increase
3 4 5 6 7 845
50
55
60
65
70Precision at 50% Recall vs. Number of writers
Number of writers
Pre
cisi
on a
t 50%
Rec
all
Precision at 50% Recallvs Number of Writer Styles
![Page 33: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/33.jpg)
Character Based versus Word Based
compound character
character
word
![Page 34: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/34.jpg)
Performance of Segmentation Free –Character Based Method
• Comparison of manual, automatic, and segmentation free methods
• All use character based recognition; manual segmentation represents “ideal” recognition
• Segmentation free method offers significant performance increase over automatic segmentation
• Additional performance available by combining automatic/segmentation free method
Manual Segmentation
SegmentationFree
AutomaticSegmentation
![Page 35: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/35.jpg)
Time comparison
• Methods compared on 200 word document, times in seconds on Pentium 4 (2.8 GHz)
• Overhead can be cached or preprocessed/stored before executing queries.
Method Overhead Per QueryWord Shape based 4 0.5Character based 1 0.6
Word Segmentation Free 1 1.2 - 4
![Page 36: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/36.jpg)
Summary• CEDAR systems and corpuses
– Developed over 25 years– Postal, IRS, Penman, Japanese, Indic, Forensic, Arabic
• CEDARABIC is an end-to-end system with user interfaces for:– Search based on keywords, writership, database functionality– Image enhancement, ROI selection, Transcript mapping
![Page 37: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/37.jpg)
Summary• Two methods for dealing with unsegmented lines
– New method of automated word segmentation introduced for Arabic
• Improved performance over Latin script segmentation
– Segmentation free method
• Three methods of word spotting– Word based
• Performance increases with no of styles chosen in search query
– Character based– Character based with compound characters
![Page 38: Versatile Search of Scanned Arabic Handwritingsrihari/talks/SACH-06.pdf · Holistic Line Recognition Handwritten Arabic Text ... Recognized Text Unicode English equivalent. Recognition](https://reader033.fdocuments.in/reader033/viewer/2022050201/5f553309814cc35b0b5b14a4/html5/thumbnails/38.jpg)
Conclusions/Future Directions
• Processing image and text based queries in parallel can result in higher performance than either alone
• Versatile search framework can be applied to many search problems
• Using improved image or text-based search algorithms can push overall performance higher