grp7
-
Upload
rekha-narain -
Category
Documents
-
view
212 -
download
0
description
Transcript of grp7
SecondIntermediatePresentationonMalayalamTextRecognitionAndTranslationFromDigitizedImagesGovernmentEngineeringCollege,SreekrishnapuramJune7,2015(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 1/14IntroducingVaaniAboutVaaniVaaniisanandroidapplicationforMalayalamtextextractionandtranslationfromdigitizedimages.(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 2/14Functional RequirementsWhatshouldVaanido?AllowuserstocaptureimageoruploaditfromdevicememorySelecttexttobetranslatedRecognizethetextselectedTranslateitFigure: UseCases(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 3/14Non-Functional Requirementsdescriptiondependsonhowgoodthepictureisveryhighoineeasytonavigateanduse(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 4/14SubsystemI UserInterfaceDescriptionCameraandUploadInterfaceTextSelectioninterfaceLanguageSelectionInterfaceOutputDisplay(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 5/14SubsystemI UserInterfaceHelloWorld!(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 6/14SubsystemI UserInterfaceImplementationImplementedusingEclipse(Version3.8)andAndroid5.0.1(API21).OpenJDK7(OpenJavaDevelopmentKit)DevelopedonUbuntu14.04(TrustyTahr).(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 7/14SubsystemI UserInterfaceCodingandNamingStandardsProgrammingLanguageusedJavaVariablesandmethodsfollowcamelCasenamingstandard.(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 8/14SubsystemI UserInterfaceWorkdonesofarSuccessfullyCompleted80%oftheUserInterface.Theuserinterfaceincludes4activtiesforeachofthefunctionalitiestobeimplemented.Threeoftheabovementionedactivitieshavebeencompleted.ThenalactivitywhichincludesintegratingTesseractisunderdevelopment.(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 9/14SubsystemII OCREngineDescriptionTesseractisanOCRengineUsedtoextracttextfromimagesInitiallydevelopedtorecognizeEnglishNeedtobetrainedtorecognizeMalayalamIntegratedintoandroidusingtesstwolibrary(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 10/14SubsystemII OCREngineTrainingTesseractInput: BinaryimagecontainingMalayalamtext.Boxtheimage.Editboxesmanually.Extractunicodecharacterset.ShapeclusteringOutputisthetraineddataforthelanguage.(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 11/14SubsystemII OCREngineImplementationTesseract(version3.02.02)Leptonica1.70(ImageProcessingLibrary)jTessBoxEditormoshpytt.pySwanalekhatransliterationtoolautotrain.py(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 12/14SubsystemII OCREngineWorkdonesofarCreatedacorpusfortraining.DecidedtousefontAnjali(oldMalayalamLipi).Createdsampletrainingdataof760characters.(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 13/14TimeScheduleSerial Number Phase ExpectedDateOfCompletion1 ProblemIdentication 10thJanuary2 LiteratureSurvey 2ndFebruary3 FeasibilityStudy 9thFebruary4 Design 24thFebruary5 TrainingandCoding 16thMarch6 TestingandDebugging 25thMarch7 Documentation 30thMarch(Govt. Engg. College,Sreekrishnapuram) MalayalamTextRecongition June7,2015 14/14