grp7

Second Intermediate Presentation on Malayalam Text Recognition and Translation from Digitized Images
Government Engineering College, Sreekrishnapuram
June 7, 2015


Transcript of grp7

Introducing Vaani
  About Vaani
  Vaani is an Android application for Malayalam text extraction and translation from digitized images.

Functional Requirements
  What should Vaani do?
  Allow users to capture an image or upload it from device memory
  Select the text to be translated
  Recognize the selected text
  Translate it
  Figure: Use Cases

Non-Functional Requirements
  Description
  depends on how good the picture is
  very high
  offline
  easy to navigate and use

Subsystem I: User Interface
  Description
  Camera and Upload Interface
  Text Selection Interface
  Language Selection Interface
  Output Display

Subsystem I: User Interface
  Hello World!

Subsystem I: User Interface
  Implementation
  Implemented using Eclipse (Version 3.8) and Android 5.0.1 (API 21).
  OpenJDK 7 (Open Java Development Kit)
  Developed on Ubuntu 14.04 (Trusty Tahr).

Subsystem I: User Interface
  Coding and Naming Standards
  Programming language used: Java
  Variables and methods follow the camelCase naming standard.

Subsystem I: User Interface
  Work done so far
  Successfully completed 80% of the user interface.
  The user interface includes 4 activities, one for each of the functionalities to be implemented.
  Three of the above-mentioned activities have been completed.
  The final activity, which includes integrating Tesseract, is under development.

Subsystem II: OCR Engine
  Description
  Tesseract is an OCR engine
  Used to extract text from images
  Initially developed to recognize English
  Needs to be trained to recognize Malayalam
  Integrated into Android using the tess-two library

Subsystem II: OCR Engine
  Training Tesseract
  Input: Binary image containing Malayalam text.
  Box the image.
  Edit boxes manually.
  Extract Unicode character set.
  Shape clustering
  Output is the trained data for the language.

Subsystem II: OCR Engine
  Implementation
  Tesseract (version 3.02.02)
  Leptonica 1.70 (Image Processing Library)
  jTessBoxEditor
  moshpytt.py
  Swanalekha transliteration tool
  autotrain.py

Subsystem II: OCR Engine
  Work done so far
  Created a corpus for training.
  Decided to use the font Anjali (old Malayalam Lipi).
  Created sample training data of 760 characters.

Time Schedule
  Serial Number   Phase                     Expected Date Of Completion
  1               Problem Identification    10th January
  2               Literature Survey         2nd February
  3               Feasibility Study         9th February
  4               Design                    24th February
  5               Training and Coding       16th March
  6               Testing and Debugging     25th March
  7               Documentation             30th March
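
The "Training Tesseract" steps above (box the image, edit boxes manually, extract the Unicode character set) all revolve around box files. As a rough illustration only, the Java sketch below reads box-file lines and collects the distinct symbols they contain. It assumes the plain-text layout used by Tesseract 3.x box files, one entry per line in the form "symbol left bottom right top page". The class name BoxFileSketch, its methods, and the sample coordinates are hypothetical and are not part of the project or of Tesseract. It does not replace Tesseract's own training tools; it only shows the kind of data those steps operate on.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper for illustration; not part of Vaani or Tesseract.
class BoxFileSketch {

    /** One box-file entry: a symbol and its bounding box on a page. */
    static final class Box {
        final String symbol;
        final int left, bottom, right, top, page;

        Box(String symbol, int left, int bottom, int right, int top, int page) {
            this.symbol = symbol;
            this.left = left;
            this.bottom = bottom;
            this.right = right;
            this.top = top;
            this.page = page;
        }
    }

    /** Parses one line, assumed to be "symbol left bottom right top page". */
    static Box parseLine(String line) {
        String[] parts = line.trim().split("\\s+");
        if (parts.length != 6) {
            throw new IllegalArgumentException("Expected 6 fields: " + line);
        }
        return new Box(parts[0],
                Integer.parseInt(parts[1]), Integer.parseInt(parts[2]),
                Integer.parseInt(parts[3]), Integer.parseInt(parts[4]),
                Integer.parseInt(parts[5]));
    }

    /** Collects the distinct symbols in first-seen order, skipping blank lines. */
    static Set<String> uniqueSymbols(List<String> lines) {
        Set<String> symbols = new LinkedHashSet<String>();
        for (String line : lines) {
            if (line.trim().isEmpty()) {
                continue;
            }
            symbols.add(parseLine(line).symbol);
        }
        return symbols;
    }

    public static void main(String[] args) {
        // Made-up sample entries using Malayalam letters U+0D2E and U+0D32.
        List<String> lines = Arrays.asList(
                "\u0D2E 10 20 30 40 0",
                "\u0D32 35 20 55 40 0",
                "\u0D2E 60 20 80 40 0");
        System.out.println("Distinct symbols: " + uniqueSymbols(lines).size());
    }
}
```

A check like this could be run on a manually edited box file to confirm that every character of the chosen font appears at least once before training continues.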