Ei13 Abstracts L

www.electronicimaging.org • TEL:+17036429090 • [email protected] 1

Conferences & Courses3–7February2013

LocationHyatt Regency San Francisco Airport Hotel Burlingame, California, USA

www.electronicimaging.org

Technical Summaries

2 www.electronicimaging.org • TEL:+17036429090 • [email protected]

Contents

3DImaging,Interaction,andMetrology8648 Stereoscopic Displays and Applications XXIV

(Woods, Holliman, Favalora) . . . . . . . . . . . . . . . . . . . . . . . .38649 The Engineering Reality of Virtual Reality 2013

(Dolinsky, McDowall) . . . . . . . . . . . . . . . . . . . . . . . . . . . .188650 3D Image Processing (3DIP) and Applications 2013

(Baskurt, Sitnik). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .23

Visualization,Perception,andColor8651 Human Vision and Electronic Imaging XVIII

(Rogowitz, Pappas, de Ridder) . . . . . . . . . . . . . . . . . . . . .308652 Color Imaging XVIII: Displaying, Processing,

Hardcopy, and Applications(Eschbach, Marcu, Rizzi) .418653 Image Quality and System Performance X

(Burns, Triantaphillidou). . . . . . . . . . . . . . . . . . . . . . . . . .498654 Visualization and Data Analysis 2013(Wong,

Kao, Hao, Chen) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .55

ImageProcessing8655 Image Processing: Algorithms and Systems XI

(Egiazarian, Agaian, Gotchev) . . . . . . . . . . . . . . . . . . . . . .638656 Real-Time Image and Video Processing 2013

(Kehtarnavaz, Carlsohn). . . . . . . . . . . . . . . . . . . . . . . . . .728657 Computational Imaging XI(Bouman, Pollak, Wolfe) . . . . .778658 Document Recognition and Retrieval XX

(Zanibbi, Coüasnon) . . . . . . . . . . . . . . . . . . . . . . . . . . . . .83

ImageCapture8659 Sensors, Cameras, and Systems for Industrial/

Scientific Applications XIV(Widenhorn, Dupret). . . . . . .918660 Digital Photography IX(Sampat, Battiato) . . . . . . . . . . .97

ComputerVision8661 Image Processing: Machine Vision Applications VI

(Bingham, Lam) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1028662 Intelligent Robots and Computer Vision XXX:

Algorithms and Techniques(Röning, Casasent) . . . . . . .1108663 Video Surveillance and Transportation Imaging

Applications(Loce, Saber) . . . . . . . . . . . . . . . . . . . . . .116

MediaProcessingandCommunication8664 Imaging and Printing in a Web 2.0 World IV

(Lin, Allebach, Fan) . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1238665 Media Watermarking, Security, and Forensics

2013(Alattar, Memon, Heitzenrater). . . . . . . . . . . . . . . .1278666 Visual Information Processing and

Communication IV(Said, Guleryuz, Stevenson) . . . . . .1328667AMultimedia Content Access: Algorithms and

Systems VII(Snoek, Sebe, Kennedy) . . . . . . . . . . . . .136

MobileImaging8667BMultimedia on Mobile Devices 2013

(Creutzburg, Akopian) . . . . . . . . . . . . . . . . . . . . . . . . . .1388667CMobile Imaging System Design and Image

Quality(Wüller, Matherson) . . . . . . . . . . . . . . . . . . . . .1428667DMobile Computational Photography

(Georgiev, Lumsdaine) . . . . . . . . . . . . . . . . . . . . . . . . .147

ElectronicImaging

IS&T/SPIE

3–7 February 2013 HyattRegencySanFranciscoAirportHotelBurlingame,California,USA

Symposium Steering Committee

Gaurav Sharma, SymposiumChair,Univ.ofRochester(UnitedStates)

Sergio R. Goma,SymposiumCo-Chair,QualcommInc.(UnitedStates)

Choon-Woo Kim,ShortCourseChair,InhaUniv.(Korea,Republicof)

Andrew J. Woods, TechnicalAdvisor,CurtinUniv.(Australia)

Majid Rabbani, PastSymposiumChair,EastmanKodakCo.(UnitedStates)

Suzanne E. Grinnan, IS&TExecutiveDirector

Rob Whitner, SPIEEventManager

Symposium ChairGaurav SharmaUniversityofRochester

Symposium Co-ChairSergio GomaQualcommInc.

Short Course ChairChoon-Woo KimInhaUniversity


Conference 8648: Stereoscopic Displays and Applications XXIVMonday-Wednesday4–6February2013

Part of Proceedings of SPIE Vol. 8648 Stereoscopic Displays and Applications XXIV

8648-1, Session 1

Simulator sickness analysis of 3D video viewing on passive 3DTVKjellE.Brunnström,KunWang,AcreoAB(Sweden)andMidSwedenUniv.(Sweden);BörjeAndrén,AcreoAB(Sweden)

NoAbstractAvailable

8648-2, Session 1

Investigation of visual fatigue/discomfort generated by S3D video using eye-tracking dataIanaIatsun,Mohamed-ChakerLarabi,ChristineFernandez-Maloigne,XLIM-SIC(France)

Duringthelastyearsexpansionof3Dvideogrownsignificantly.Stereoscopicimagingisdistributedinmovieproduction,advertising,interactivegames,andothers.Amongtheadvantagesof3Darethestronginvolvementofviewersandtheincreasedfeelingofpresence.However,oneofthemaindisadvantagesispossibilitytoaffecthumanhealth.

Inthiswork,weproposetostudythelinkbetweenpupildiameterreflectingtheactivityoftheautomoticnervoussystemandeyemovementsrepresentingacharacteristicofthevisualmotorsystemespeciallytheciliarymuscle,andthevisualfatigue.

8648-80, Session K1

History of polarized image stereoscopic display (Keynote Presentation)VivianK.Walworth,StereoJet,Inc.(UnitedStates)

Thispresentationwillrecountthehistoricworkofmanyinvestigatorswhocontributedtothedevelopmentofstereoscopicdisplaythroughutilizationoflightpolarization.Thetalkwillincludepersonalrecollectionsofexperienceswithkeyinventorsandcolleagues,aswellasearlyadventuresinexploitationofthetechnology.Earlycinema3Ddemonstrationsutilizedpolarizingfiltersoverpairedprojectors,andviewersworecomplementarypolarizingglasses.Millionssawthe3DChryslerfilmattheNewYorkWorld’sFairin1938and1939.3Dcinemahasalsoutilizedthistechnology.TheVectographprocess,whichprovidedback-to-backimagepairshavingoppositelyorientedpolarization,facilitatedsingle-passprintingofstereoscopicimages.Black-and-whiteVectographstereoscopicimageswereusedextensivelyduringWorldWarIIforbothaerialsurveillanceandmilitaryinstructionmanuals.Formanyyearsbinocularvisiontestingandtrainingkitshaveincludedblack-and-whiteVectographtargetimages.InthepostwaryearsexcellentcolorVectographsweremadeinthePolaroidlaboratories,usingdyetransfertechnology.PolaroidalsocollaboratedwithTechnicolorin3Dcinematestsusingthatcompany’s“imbibition”transfertechnology.Theintroductionofinkjetprintingtechnologyinthe1980sgaveimpetustoexplorationofsingle-pass3DcolorprintingatPolaroid.Inthe1990stheRowlandInstituteforScienceproducedexcellentfull-colorStereoJet3DImagesonVectographsubstrate,whichisnolongermanufactured.StereoJet,Inc.ispresentlydevelopingcircularlypolarizedfull-colorStereoJetimagesystemsusingconventionalpolarizersubstrate.

8648-4, Session 2

Optical modeling of lenticular array for autostereoscopic displaysSung-MinJung,Jin-HyukJang,Hong-YunKang,Keong-JinLee,Ji-NaKang,Seung-ChulLee,Kyoung-MoonLim,Sang-DeogYeo,LGDisplay(Korea,Republicof)

Inthisstudy,wesuggestmathematicalmodeldescribingopticalphenomenaatthesurfacesoflenticulararrayforautostereoscopicdisplays.RelationsbetweenincidentandrefractedlightsarederivedfromSnell’slawinvectorform.Bysimulatingthemathematicalmodelwiththeactualdesignconditionofautostereoscopicdisplays,weobtainangulardistributionoflightandcomparetheresultswithexperiment.Fromtheresults,itisclarifiedthatoursimulationmodelcoincidewellwiththeexperimentalresults.Webelievethatoursimulationmodelisveryusefulfordesignandoptimizationoflenticulartypeautostereoscopicdisplays.

8648-6, Session 2

Simple measurement of lenticular lens quality for autostereoscopic displaysStuartGray,RobertA.Boudreau,CorningIncorporated(UnitedStates)

Autostereoscopic3Ddisplaysutilizinglenticularlensesrequireprecisealignmentbetweenthedisplaypixelsandthelensarraytocreateahighqualityviewingexperience.Thispaperdescribesasimpletechniqueforestimatingthealignmenterrorsbetweenthelensarrayandpixelsacrosstheentiredisplay.

Whenamulti-viewautosteroscopicdisplayisobservedinsidetheoptimumviewingdistancethenportionsofseveralviewsareobservedinoneeye.Differentviewscanbeidentifiedbycreatingatestpatternwhereeachviewisasinglecolorwithdifferentcolorsinadjacentviews.Awellalignedlensarraywillshowuniformandparallelbandsofthedifferentcolors.Deviationsinthewidthorspacingofthebandsrepresentlenticularmisalignment.

Thiswasdemonstratedona9-view,55-inchdisplaywithatestpatternshowingredinviews1,4and7,greeninviews2,5and8andblueinviews3,6and9.Thepatternwasviewed1.7metersfromthescreenratherthanthespecifiedoptimumof4.8meters.Theimageshoweddiagonalcolorbandswithdeviationsfromparallelcorrespondingtomisalignmentofupto400uminsomelocations.

8648-8, Session 2

Visibility of crosstalk for high-dynamic range displaysMarcLambooij,MartinHammer,TPVision(Netherlands)

Crosstalkisoneofthemaindisplay-relatedperceptualfactorsdegradingimagequalityandcausingvisualdiscomfort.Inthisexperimentweinvestigatetheimpactofhighdisplaycontrastandhighdisplayluminanceontheperceptionofcrosstalk.

Ahigh-dynamicrangeLCDincombinationwithaWheatstoneviewerisused.Toindependentlyvaryluminance,contrast,andbinocularcrosstalk(definedas(BW-BB)/(WW-BB))m,thedisplayopto-electricalresponsewascharacterizedandmodeled.Luminancevariedfrommoderatetoveryhigh(125,500,and1,500cd/m^2)andcontrastfromlowtohigh(1:100,1:1,000,and1:2,500).

Resultsshowthatincreaseinluminanceleadstoareducedcrosstalkvisibilitythresholdresultinginaminimalvisibilitythresholdof0.20%


at1,500cd/m^2.Thevisibilitythresholdwasindependentofthetestedcontrastlevels,indicatingthatcontrastlevelsabove1:100donotaffectvisibilitythresholds.

Inconclusion,withtheintroductionofOLEDsthesystemcrosstalkwillrise,buttheincreaseinvisibilityofstereoscopiccrosstalkisabsent.IfpeakluminanceisconsiderablyhigherthanexistingTVs,suchasinhigh-dynamicrangedisplays,thecrosstalkvisibilitythresholdwillbelower.Astheabsolutethresholdlevelsareverylow(atorbelow0.3%)thiswillresultinlittleperceptualeffect.

8648-10, Session 2

Method to test and quantify 3D active shutter glassesKurtHoffmeister,MechdyneCorp.(UnitedStates)

LiquidCrystalShutter(LCS)3Dglasses(alsoknownas3Dshutterglasses,activeglasses,orjustshutterglasses)aretheselectiondevicecommonlyusedtoview3Dstereoscopiccontentontime-sequential3Ddisplays.TherearemanyavailablemodelsofLCSglassestochoosefrom;although,unfortunately,therecanbewideperformancedifferencesbetweentheseglasses.Ultimatelytheperceivedqualityof3Dimageviewingdependsonthedisplayviewedaswellastheperformanceoftheshutterglassesused.Theobjectiveofthispaperistodefinekeyglassesperformanceparametersandtopresentatestingmethodtohelpquantifyglassesandselectbetweenmodelsavailable.Differencesinshutteropeningspeedandopentransparency(shutteringperformance)areshownbetweentestedmodelsusingthemethodpresentedhere.Additionaldifferencesarenotedforotherperformanceparametersandfeaturesillustratingthattherecanbemanyproductdifferencestoconsiderthatdonotdirectlyaffectshutteringperformance.

8648-12, Session 2

Towards standardized 3DTV QoE assessment: cross-lab study on display technology and viewing environment parametersMarcusBarkowsky,JingLi,Polytech’Nantes(France);TaehwanHan,SungwookYoun,JiheonOk,ChulheeLee,YonseiUniv.(Korea,Republicof);ChristerHedberg,InirajithV.Ananth,AcreoAB(Sweden);KunWang,KjellE.Brunnström,AcreoAB(Sweden)andMidSwedenUniv.(Sweden);PatrickLeCallet,Polytech’Nantes(France)

Theperceptionofdegradationsmeasuredinsubjectiveassessmentsofimagesandvideosequencesisinfluencedbytheviewingconditions.Instereoscopic3DTV,theinfluenceoftheviewingenvironment,suchasillumination,viewingdistance,votinginterface,observerscreening,trainingandintroductiontotheexperiment,isexpectedtodiffersignificantlyfromtheinfluencethatwasperceivedwhen2Drecommendations,suchasITU-RBT.500,wereestablished.ThesubjectivemeasurementofQualityofExperience(QoE)in3DTVextendsoverseveralpsychophysicaldimensionssuchaspicturequality,depthsensation,andvisualcomfortwhichmaybecombinedtohigherlevelindicationssuchasnaturalness,presenceandvisualexperience.Thisstudyfocusesontheinfluenceofdisplaytechnologyandviewingconditionsonthesubjectivescoresobtainedinassessingdegradationsof3Dpicturequalityintroducedbyvideocodingandspatialresolutionreduction.ThreelaboratoriesinSweden,SouthKoreaandFranceperformedsubjectiveexperimentsusingtheAbsoluteCategoryRatingwithHiddenReference(ACR-HR)methodonthesamevideosequences.Differencesinviewingdistance,displaytechnology(activeorpassive),andillumination(darkorITUconformingbackgroundillumination)havebeenincludedonpurpose.Additionalobserverdatasuchas3Dviewingexperienceandscreeningresultswereobtained.Thevotesobtainedinthestudiesareanalyzedandcomparedusingparametricandnon-parametricstatisticalmethodssuchasAnova,PearsonCorrelation,RMSE,andWilcoxonandStudentteststoobtaininformationaboutthediscriminationpowerforthevideoconditions.

8648-3, Session 5

Optical axes misalignment compensation method for stereo camera zoomingJinmoKang,JunyongLee,ChangseobPark,KoreanBroadcastingSystem(Korea,Republicof)

Zoomingofstereocamerasuffersfromopticalaxesmisalignmentproblem.Astheopticalaxisofazoomlensisrarelyalignedwiththecenterofcamera’simagesensor,thepositionoftheopticalaxisontheimagesensorweaveswithacircularmotionasthezoomispulled.

Tocorrectthisopticalaxeserrorforstereocamera,weneedtocalibratethestereorigforeveryzoomvalueusingnearandfarchart.Usuallythisisdonemanuallyandittakeslongtimetocalibratetheentirezoomrange.

Inthispaper,weproposeanewmethodwhichcompensatesopticalaxesmisalignmentforstereocamerazoomingautomatically.Opticalaxesmisalignmentismodeledasimagesensorcentertranslationfromthelenscenter.Tomatchtheimagepointofleftandrightimage,rightcameratranslationandrotationequationisdevised.Byusingtheseequationswecaneasilygettherigcalibrationvalue.Then,wecancompensateopticalaxesmisalignmentbysavingandapplyingthisrigcalibrationinformationwhenwechangethezoomofstereocamera.

Theexplanationoftheproposedmethodanddevisedequationwillbeprovided.Andexperimentalverificationresultsusingstereocamerarigwillbepresentedinthepaper.

8648-5, Session 5

Stereo rendering of rain in real-timeSyedA.Hussain,DavidF.McAllister,NorthCarolinaStateUniv.(UnitedStates)

Therenderingofphotorealisticrainhasbeenpreviouslystudiedformonoscopicviewing.Weextendthemonoscopicstatisticalrainmodelstosimulatethebehavioranddistributionoffallingrainforstereoviewing.Ourgoalistobeabletorenderinreal-timeframerates.Inthisinvestigationweignorethecomplexissuesofsceneilluminationandconcentrateontheparametersthatproducearealisticraindistribution.Usingtheconceptofretinalpersistencewerenderavisiblefallingraindropasalinearstreak.Tospeedrenderingweusepre-computedimagesofsuchrainstreaks.Rainstreakpositionsfortheleftandrighteyeviewsarecreatedbygeneratingrandomnumbersthatdependontheviewvolumeofthescene.Wepermitinteractivebutcontrolledmodificationofrainparameterssuchasdensityandwindgusts.Wecompareourapproachtotheuseofexisting2D-3Dconversionmethods.Theresultsdemonstratethatusingcommercial2D-3Dconvertersarenotsufficientinproducingrealisticstereoraineffects.Futureresearchwillconcentrateonincludingcomplexlightinginteractions.

8648-7, Session 5

A new 3D video format for multiview/freeview depth image based renderingLinDu,YanXu,GangCheng,TechnicolorBeijing(China)

Thispaperdescribesanovel3Dmultiview/freeviewvideoformat,2DplusDeltaandEmbedOcclusionandTransparencyinformation(2D+DEOT),whichembedstheadditionalocclusioninformationintoaframe-compatible2D+deltaformatusingdigitalwatermarkingtechniques.Themultipleocclusionandtransparency(OT)layersareweighted,encodedandmultiplexedusingspreadspectrumwatermarkingandthenembeddedintoaframe-compatible2D+depth/deltabaselayerusingleastsignificantbits(LSB)watermarking.Themultiview/freeview3Dvideocanberenderedattheplayerusingdepthimagebasedrenderingmethodsaftertheocclusioninformationisretrievedaswatermarksandtheoriginal2D+deltabaselayerisrecoveredbysubtractingthere-producedwatermarkfromtheLSBofeachframe.Therefore,thisvideoformatcanprovideOTinformation

Conference 8648


formultiview/freeview3Dvideosynthesiswhileonlyintroducesminornoiseattheleastsignificantbitsforconventional2D+delta3Dvideorenderingsystems.Severalencodingandembeddingschemesofthis3DvideoformataretestedusingtheBreakdancingandBalletsequencesfromMicrosoftResearch.Thetestresultsshowthatusingthisformatcanbothgeneratesgoodqualitymultiview/freeview3Dvideoandkeepsthecompatibilitytocurrent2Dencoding/decodingandtransmissionsystems.

8648-9, Session 5

Subjective evaluation of an edge-based depth image compression schemeYunLi,MårtenSjöström,UlfJennehag,RogerOlsson,SylvainTourancheau,MidSwedenUniv.(Sweden)

Wehavepreviouslyproposedadepthimagecodingschemethatpreservessignificantedgesandencodessmoothareasbetweenthese.Anobjectiveevaluationconsideringthestructuralsimilarity(SSIM)indexforsynthesizedviewsdemonstratedanadvantagetotheproposedschemeoverthehighefficiencyvideocoding(HEVC)intramodeincertaincases.However,somediscrepanciesexistbetweentheobjectiveevaluationandourvisualinspection,whichmotivatedthisstudyofsubjectivetests.ThetestwasconductedaccordingtoITU-RBT.500-13recommendationwithStimulus-comparisonmethod.ThesynthesizedimagesofthetestsequenceswerepairedsuchthattheyusedencodeddepthmapofsimilarbitratesproducedfromtheproposedschemeandHEVCrespectively.TheresultsfromthesubjectivetestshowedthattheproposedschemeperformsslightlybetterthanHEVCwithstatisticalsignificanceatalmostalltestedbitratesforthegivencontents.Thepaperrevealsthatpreservingsignificantdepthcontinuitiesinadepthimagecanresultinanoverallbetterqualityofexperienceforsynthesizedviews.ItalsoillustratesthattheobjectivemetricSSIMfailstopredicttheimportanceofpreservededgesforperceivedqualityinasynthesizedview.

8648-11, Session 5

Cooperative ARQ for 3D video transmissionYanXu,LinDu,Technicolor(China)TechnologyCo.,Ltd.(China)

Errorcontrolisimportantfor3Dvideostreamingespeciallyoverwirelessnetworkswhichhavehighlyvariablenature.Variousschemesexistthatmitigatetheeffectsoferrorsduringtransmissionof2Dvideodata.Existingerrorcontrolmethodscanbeappliedtosome3Dformat,suchasframecompatibleformat,inwhichthetwoviewsaremultiplexedintoasingleframe.However,for2Dplusmetadataformatbeingtransmittedindifferentchannels,therehasnoerrorcontrolmethodtodealwithit.AutomaticRepeatQuery(ARQ)iswidelyusedtomitigatetheeffectsoferrors,whichcanbetailoredto3Derrorcontrolinconsiderationof3Dtransmissioncharacteristics.Weproposeanerrorcontrolmethodfortransmitting2Dplusmetadataintwodifferentchannels.ItappliesselectiverepeatARQwithconsiderationoftwofactors.Oneistheinterdependentrelationshipbetween2Dvideoanditsmetadatainformation;theotheristimeconstraintsforcontinuousvideoplayback.SimulationresultsshowthatitallowsthesendertocontrolARQstrengthadaptivelyonaper-packetbasis.Anditmakesagoodtradeoffbetweenplaybackdelayandpacketlossrate,whichprovidesaguidelineoferrorcontrolsystemdesignfor3Dvideostreaming.

8648-14, Session 3

Natural 3D Content on Glasses-free Light-field 3D CinemaTiborBalogh,ZsoltNagy,PéterTamásKovács,VamsiKiranAdhikarla,HolografikaKft.(Hungary)

Thispaperpresentsahardware-softwaresystemtovisualizesyntheticornatural3Dlight-fieldvideostoalargeraudienceonacinema-sized

screensizethenwhatwaspreviouslypossiblewithautostereoscopicdisplays.Wepresentthehardwaresystemincludingopticalandmechanicaldesignconsiderations,theelectricalsystemandrenderingcluster,andthesoftwaredrivingthewholesystem,andfeedingitwith3Ddata.Thedisplayisthefirstfront-projectedlight-field3DdisplayHoloViziosystem,controlling63MPixels,hasalltheadvantagesofpreviouslight-fielddisplays,butallowsforamoreflexiblearrangement,largerscreensize,andsimplerset-up.Thesoftwaresystemmakesitpossibletoshow3Dapplicationsinreal-timeaswellasnaturalcontentcapturedwithdenselight-fieldcameraarrangementsaswellascontentcapturedwithwidebaselinefew-camerarigs.Thesoftwaresystemisabletovisualizepre-recordedMultiviewPlusDepth(MVD)videosonthislight-fieldglasses-freecinemasystem,interpolatingandextrapolatingmissinginformationnotcapturedbythecameras.

8648-15, Session 3

Large-scale autostereoscopic outdoor displayJoergReitterer,FranzFidler,FerdinandSaintJulien-Wallsee,TriLiteTechnologiesGmbH(Austria);GerhardSchmid,WolfgangGartner,WalterLeeb,UlrichSchmid,TechnischeUniv.Wien(Austria)

State-of-the-artautostereoscopicdisplaysareoftenlimitedinsize,effectivebrightness,numberof3Dviewingzones,andmaximum3Dviewingdistances,allofwhicharemandatoryrequirementsforapplyingautostereoscopic3Dtolarge-scaleoutdoordisplays.Conventionalautostereoscopicindoorconceptslikelenticularlensesorparallaxbarrierscannotsimplybeadaptedforthesescreensduetotheinherentlossofeffectiveresolutionandbrightness,whichwouldhaveasignificantlydisadvantageousimpactonbothimagequalityandsunlightreadability.Wehavedevelopedamodularautostereoscopicmulti-viewlaserdisplaywithsunlightreadableeffectivebrightness,severalthousandof3Dviewingzones,andamaximum3Dviewingdistanceofupto50meters.Forproofofconceptpurposesaprototypedisplaywith4x5pixelsisrealized.Duetovariousmanufacturingtoleranceseachindividualpixelofthedisplayhasslightlydifferentopticalpropertiesand,hence,theperceived3Dimagequalityofthedisplayhastobedescribedstochastically.Inthispaperwepresentthecorrespondingstochasticmodelandevaluatethesimulationandmeasurementresultsoftheprototypedisplay.

8648-20, Session 3

Multi-view display module using MEMS projectors for an ultra-large screen autostereoscopic displayKenjiHirabayashi,TokyoUniv.ofAgricultureandTechnology(Japan);HiromitsuTakenaka,OsamuKonuma,YasuhiroMorimoto,SamsungYokohamaResearchInstituteCo.,Ltd.(Japan);YasuhiroTakaki,TokyoUniv.ofAgricultureandTechnology(Japan)

Amulti-viewdisplaymoduleusingMEMSprojectorsisproposedtorealizeanultra-largescreenautostereoscopicdisplay.ThemoduleconsistsofanarrayofMEMSprojectors,averticaldiffuser,andalenticularlens.AllMEMSprojectorshavingdifferenthorizontalpositionsprojectimagesthataresuperimposedontheverticaldiffuser.Eachcylindricallensconstitutingthelenticularlensgeneratesmultiple3Dpixelsatdifferenthorizontalpositionsnearitsfocalplane.Becausethe3Dpixelisanimageofamicro-mirroroftheMEMSprojector,thenumberof3Dpixelsineachlensisequaltothenumberofprojectors.Therefore,thehorizontalresolutionofthemodulecanbeincreasedusingmoreprojectors.ByproperlymodulatinglasersintheMEMSprojector,thehorizontalpositionsofdotsconstitutingaprojectedimagecanbealteredatdifferenthorizontalscanlines.Whenseveralscanlinescorrespondtoone3Dpixel,anumberofviewscanbeincreased.Becausethemodulehasaframelessscreen,thenumberofmodulescanbearrangedtwo-dimensionallytoobtainalargescreen.TheprototypemodulewasconstructedusingfourMEMSprojectors.Themodulehada3Dresolutionof160x120,andprovided64views.Thescreensizewas14.4inches.

Conference 8648


8648-13, Session 6

Real-time handling of existing content sources on a multi-layer displayDarrylSingh,JungShin,PureDepthInc.(NewZealand)

Multi-LayerDisplays(MLD)consistsoftwoormoreimagingplanesseparatedbyphysicaldepthwherethedepthisakeycomponentincreatingaglasses-free3Deffect.It’scorebenefitsincludebeingviewablefrommultipleangles,havingfullpanelresolutionfor3Deffectswithnosideeffectsofnauseaoreye-strain,however,typicallycontentmustbedesignedforit’sopticalconfigurationinforegroundandbackgroundimagepairs.Aprocesswasdesignedtogiveaconsistent3Deffectina2-layerMLDfromexistingstereovideocontentinreal-time.Optimizationofthestereomatchingalgorithmstogeneratedepthmapsinreal-timespecificallytailoredfortheopticalcharacteristicsandimageprocessingalgorithmsofaMLDwasrequired.Theend-to-endprocessincludedimprovementstotheHierarchicalBeliefPropagationHBPstereomatchingalgorithm,improvementstoopticalflowandtemporalconsistency.ImagingalgorithmsdesignedfortheopticalcharacteristicsofaMLDprovidedsomevisualcompensationfordepthmapinaccuracies.TheresultcanbedemonstratedinaPCenvironment,displayedona22”MLD,usedinthecasinoslotmarket,with8mmofpanelseperation.Priortothisdevelopment,stereocontenthadnotbeenusedtoachieveadepth-based3DeffectonaMLDinreal-time.

8648-16, Session 6

Switching dual layer display with dynamic LCD maskQuinnY.J.Smithwick,MarkReichow,WaltDisneyImagineering(UnitedStates)

Duallayerdisplaysaresimpleeffectivemeansofcreatingdimensionalimages.AcommonimplementationisastackedspacedfrontlayertransparentLCDpanelandbacklayermonitor.Foregroundcontentdoesn’tappearoverdarkbackgrounds,andappearslowcontrastandsemi-transparentoverbrightbackgrounds.Thefrontscreenpreventsaccesstotheimagevolumewhenusedininteractivegestural,haptic,ortouchapplications.

TheSwitchingDualLayerDisplaywithaDynamicMaskproduceshigh-contrastfloatingfrontcontentwithopacityindependentofthebackground.Thedisplayconsistsofopticallystackedfronttransparent120HzLCDpanelandback120Hzmonitor,andarelaymirror.Inthefirstmode,thebackdisplayshowsforegroundcontentandthefrontdisplayisclear.Themirrorreimagesthebackdisplaythroughthetransparentfrontdisplay,sotheforegroundfloatsinfrontofthedisplay.Inthesecondmode,thebackdisplayshowswhitewithamaskoftheforeground,reflectsoffthemirror,backlightingthefrontdisplayshowingbackgroundcontent.Rapidlyalternatingbetweenthemodes,theviewerperceivesopaquehigh-contrastforegroundcontentfloatingabovethebackground.

TheSwitchedDualLayerDisplayisavolumetricdisplay,exhibitingcoupledaccommodation/vergencecues,occlusion,andrealimagery.

8648-18, Session 6

Design of time-multiplexed autostereoscopic displays based on virtual stacking of multi-layer panelsHironobuGotoda,NationalInstituteofInformatics(Japan)

Theterm,“slice-stackingdisplays”,referstoaparticularkindofvolumetricdisplays,whichpresentalargenumberofvolumetricelementsorvoxelstoviewers.Voxelsareoftenpackedina2Darray,called“slice”or“depthplane”,whichsequentiallyfillsa3Dvolume.Recently,“multi-layerdisplays”havebeenproposedandimplementedbyseveralresearchers.Unlikeslice-stackingdisplays,multi-layerdisplayscanpresentmultipleslicesatthesametime.Moreover,slices

inmulti-layerdisplaysalwaysremainstationary,i.e.,therearenomovingorrotatingslices,whichcanbetypicallyfoundinslice-stackingdisplays.

Inthispaper,weconsideramixtureofslice-stackingandmulti-layerdisplays.Morespecifically,wepresentanewdesignofmulti-layerdisplays,whereslicescanmovebackandforth,changingperiodicallytherelativedistancefromtheviewers.Byallowingtheslicestomovearound,wecan,ineffect,increasethenumberoflayersinthedisplays.Thisresultsinfinerresolutionofthelightfieldobservablefromthedisplay,andimprovementintheprecisionoftheaccommodationcues.Twoconfigurations,implementingtheconceptof“movingslices”,arepresentedusingeitheramovingmirrororavarifocallens,andcomparedwitheachother.

8648-19, Session 6

A new method to enlarge a range of continuously perceived depth in DFD (Depth-fused 3D) displayAtsuhiroTsunakawa,TomokiSoumiya,HirotsuguYamamoto,ShiroSuyama,Univ.ofTokushima(Japan)

InDFD(Depth-fused3-D)displaywhichcanpresentacontinuous3-Dimagebyusingonlylayeredtwotransparentdisplays,weclarifythattheperceiveddepthdependenceonluminanceratiooffrontandrearimagesiswidelychangedasthedepthdifferenceoftwoplanesisincreased.Whenthedepthdifferenceoftwoplanesissmall,theperceiveddepthischangedalmostlinearlyastheluminanceratioischanged.Ontheotherhand,whenthedepthdifferenceislarge,theperceiveddepthisaroundfrontplaneoraroundrearplaneatluminanceratioofabout0-40%orofabout60-100%,resultinginnon-linearchangeofperceiveddepth.Thisindicatesthatthislargedepthdifferenceresultsinoutofrangeincontinuouslyperceiveddepth.Inordertoenlargethisrangeofcontinuouslyperceiveddepth,wechangedthespatial-frequencydistributionsoffrontandrearimages.Evenwhenthedepthdifferenceoftwoplanesislargeenough,almostlinearchangeofperceiveddepthcanbesuccessfullyobtainedbyincreasinglowspatial-frequencypartoftwoimages,resultingindeeper3-DimageinDFDdisplay.

8648-22, Session 4

Subjective and objective measurements of visual fatigue induced by excessive disparities in stereoscopic imagesYongJuJung,DongchanKim,SohnHosik,Seong-ilLee,HyunWookPark,YongManRo,KAIST(Korea,Republicof)

Asstereoscopicdisplayshavespread,itisimportanttofindevidencesofwhatreallycausesfatigueanddiscomfortandwhathappensinthevisualsystemunderanuncomfortablestereoscopicviewingexperience.Tothebestofourknowledge,nosinglesubjectiveorobjectivemeasurementisathoroughmethodtofindtheseevidences,sincevisualfatigueanddiscomfortarerelatedtomanyaspectsofthevisualsystem.Fewstudieshavebeendoneforthecombinedsubjectiveandobjectivemeasurementsofvisualfatigueanddiscomfort.

Inthispaper,weemploybothsubjectiveandobjectivemeasurementsforvisualfatigueinstereoscopicdisplays.Inparticular,weusefunctionalmagneticresonanceimaging(fMRI)asanobjectivemeasurementofvisualfatigueinstereoscopicdisplays.Ourexperimentconsistedofatwo-passmeasurementofvisualfatigue:asubjectivemeasurementandanobjectivemeasurement.Usingthesubjectivemeasurementresults,weselectedcomfortablevideosanduncomfortablevideosinourdataset.Then,fMRIexperimentswereconductedtoobservewhathappenedinthevisualsystemunderanuncomfortablestereoscopicviewingexperience.Fromthedoublestimuluscomparisonanalysis,theresultsindicatedthattherewerestatisticaldifferencesinbrainactivationregionsbetweencomfortableanduncomfortablevideostimuliwithdifferentamountsofdisparity.

Conference 8648


8648-23, Session 4

Depth distortion in color-interlaced stereoscopic 3D displaysJoohwanKim,PaulV.Johnson,MartinS.Banks,Univ.ofCalifornia,Berkeley(UnitedStates)

Wemeasuredwhethercolor-interlacedstereoscopic3D(S3D)displayscanreducethedepthdistortionsthatariseinconventionaltemporallyinterlacedS3Ddisplays.Weusedahaploscopetoemulatecolor-interlacedS3DdisplaysandconventionaltemporallyinterlacedS3Ddisplays.Ourexperimentalresultsshowthatthehumanvisualsystemcalculatesdisparityusingprimarilyluminanceinformationratherthancolorinformation.ThissuggeststhatwecanreducedepthdistortionintemporallyinterlacedS3Ddisplaysbyarrangingtheorderofcolorpresentationtokeepoverallluminancetemporallyconstant.Intherecentlyproposedcolor-interlacedS3Ddisplaymethod,wepresentgreentothelefteyeandredandbluetotherighteyeatonemomentintimewhileweexchangecolorchannelsinthenextmomentintime.Assumingthattheluminancesignalistheprimarysourceofinformationfordisparitycalculation,thisistheoptimalorderingtoreducethedepthdistortionin3-chipprojectors.

8648-26, Session 4

Depth perception from stationary and moving stereoscopic three-dimensional imagesYu-ChiTai,SowjanyaGowrisankaran,ShunnanYang,JamesE.Sheedy,JohnHayes,PacificUniv.(UnitedStates);AudreyCYounkin,PhilipJCorriveaub,UserExperienceResearchGroup,IntelCorporation(UnitedStates)

Thestudyevaluatedtheaccuracyofdepthperceptionaffordedbystaticanddynamicstereoscopicthree-dimensional(S3D)imageswithproportional(scaledtodisparity)andconstantsizecues.Adultparticipantswithgoodbinocularvisionparticipatedinthestudy.ForstaticS3Dtrials,participantswereaskedtoindicatethedepthofstationaryS3Dimagesrenderedwithdifferentmagnitudesofcrosseddisparity,andwitheitherproportionaloraconstantsize.FordynamicS3Dtrials,participantswereaskedtoindicatethetimewhenmovingS3Dimagesmatchedthedepthofareferenceimageclosertotheviewer.ResultsshowthatviewersperceivedS3Dimagesasbeingcloserthanwouldbepredictedbythemagnitudeofimagedisparity,andcorrespondinglytheyoverestimatedthemotionspeedindepth.Theresultantdepthperceptionandestimateofmotionspeedweremoreaccurateforcertainconditionswithproportionalandlargerimagesize,slowermotion-in-depthandlargerimagedisparity.ThesefindingspossiblyexplainwhyeffectssuchasloomingareoverstimulatinginS3Dviewing.Toincreasetheaccuracyofdepthperception,S3Dcontentshouldmatchimagesizetoitsdisparitylevel,utilizelargerdepthseparation(withoutinducingexcessivediscomfort)andrenderslowermotionindepth.

8648-27, Session 4

Immersion, tangibility, and realism: explaining the qualitative experience of stereopsisDhanrajVishwanath,Univ.ofSt.Andrews(UnitedKingdom)

Thefundamentalpropertythatdrives3Dtechnologyisthecompellingqualitativeexperienceoftangiblesolidobjects,immersivespaceandrealismthatislackinginconventional2Ddisplays.Thisqualitativeperceptualphenomenon,referredtoas‘stereopsis’,iswidelyassumedtobeaby-productofbinocularvisionorparallax.However,itsunderlyingcause,variationandfunctionalroleremainlargelyunexplained.Iwillpresentevidencethatsuggeststhatstereopsisisnotsimplyaproductofbinoculardisparitiesorparallaxbutisamorebasicvisualproperty.Ipresentanalternativetheorythatlinkstheimpressionofstereopsistotheprecisionofabsolute(scaled)depth.I

presentperceptualevidence(qualitativeandquantitative)insupportofthetheoryanddiscusstheimplicationsforsomeimportantproblemsinstereoscopicdisplaytechnology.

8648-21, Session 7

Aerial 3D LED display by use of retroreflective sheetingHirotsuguYamamoto,ShiroSuyama,Univ.ofTokushima(Japan)

Weproposeanewopticalconfigurationtoformaerial3DLEDsbyuseofretroreflectivesheeting.TheproposedconfigurationiscomposedofLEDs,ahalfmirror,andretroreflectivesheeting.AhalfoftheLEDlightsarereflectedbythehalfmirrorandimpingeontheretroreflectivesheeting.Theretroreflectivesheetingreflectsthelightsbacktotheirsources.Onthewaytothesources,ahalfofthelightstransmitthroughthehalfmirrorandformtherealimagesofLEDs.Althoughlessthan25%oftheoutputlightsarecontributedtotheaerialimage,recentLEDpanelshaveenoughluminancetoenjoytheaerialimagewithaquarteroftheluminance.WehavemadeaprototypeoftheproposedaerialLEDdisplay.AnaerialimageoftheLEDpanelhasbeensuccessfullyformedinfreespace.Itsviewinganglewassignificantlyimprovedcomparedtotheaerialdisplaybyuseofcrossedmirrors,whichlimittheviewinganglebyaperturesizeandheightofmirrorwalls.Theviewingangleintheproposedconfigurationismainlylimitedbythesizeoftheretroreflectivesheeting.Furthermore,byusingLEDsindifferentdepths,werealizedanaerial3Ddisplayinfreespace.

8648-24, Session 7

Implementation of shading effect for reconstruction of smooth layer-based 3D holographic imagesJhen-SiChen,UnivofCambridge(UnitedKingdom);QuinnY.J.Smithwick,WaltDisneyImagineering(UnitedStates);DapingChu,UnivofCambridge(UnitedKingdom)

A3Dimagecanbecomposedbyanumberof2Dslicedimages.Thehologramsforeachofthemcanbecalculatedveryquicklyandthencombinedtogetherforreconstructingthe3Dimage.Meanwhile,a3Dimagecanbeseparatedintoviewsfromdifferentanglesusingaso-calledangulartilingapproachwitheachviewbeinggeneratedbyasub-hologram.Thealgorithmdevelopedinthisworkisbasedonboththesetwomethods,thateachviewiscalculatedbyalayer-baseddecompositionwhilethewhole3Dimageiscomposedofdifferentviews.

Theindividualityofangulartiledviewsallowstheintroductionofshadingtoalayer-basedimagebecauseitcanbevariedfromviewtoview.Theangulartilingcanalsoreducesthedepthgapscausedbylayerslicingduetothesuperpositionoflayersfromdifferentviews.Inthisway,toreplayasmooth3Dimagewithcorrectshadingswillrequirelessmemoryincomputationthanusingconventionalmethod,andthewholecalculationwillbefastandeasilyadaptableforparallelcomputations.

Inconclusion,apracticalalgorithmtocomputehologramsfor3Dimagesispresented.Itcanincorporateallthenecessarydiffractiveinformationforimagereconstructionathighspeed.

8648-28, Session 7

Hologram synthesis using integral imaging camera with synthetic aperture technique

Sung-Keun Lee, Yong-Soo Kim, Sung-In Hong, Jae-Hyeung Park, Chungbuk National Univ. (Korea, Republic of)Inthispaper,weproposeanovelincoherenthologramcapturemethod.Intheproposedmethod,theintegralimagingcapture

Conference 8648


systemisusedtocaptureanarrayofdifferentperspectiveimagesofthethree-dimensional(3D)object.Thecapturedperspectivearrayisthentransformedtoanarrayoftheimagesrefocusedatvariousdistances,formingthefocalimagestack.Thisrefocusingprocessatarbitrarydistancecanbeperformedusingacomputationalintegralimagingreconstruction(CIIR)techniquewhichiswell-knowninintegralimagingliteratures.Usingthegeneratedfocalimagestack,thehologramissynthesizedbyconsideringapointspreadfunctionoftheintegralimagingsystem.Sincethefocalimagestackisnotcapturedsequentiallybutgeneratedfromtheperspectivearraywhichiscapturedsimultaneously,theproposedmethodcancapturetheholograminreal-time.Inintegralimaging,thesynthesisofthefocalimagestackpreserveshighresolution,enablinghighresolutionhologramsynthesis.Inthepresentation,wewillexplaintheprinciplewithexperimentalresults.

8648-81, Session K2

Coverage of the London 2012 Olympic Games in 3D (Keynote Presentation)JimDeFilippis,Consultant(UnitedStates)

TheLondon2012Olympicsprovidedtheopportunitytobroadcastafull-time3Dchannel.Inthepast,3Dcoveragewasonlyavailableonaclosedcircuitbasisoflimitedevents.Duringthe2012Olympics,thehostbroadcaster(OlympicBroadcastServices)operateda3Dchannelconsistingoftheopeningandclosingceremonies,multiplesports,bothliveandnearlivecoverage,andproducedover275hoursof3Dstereoscopicprogramming.TheOlympic3Dchannelwasavailabletoalltherightsholdingbroadcastersincluding:BBC,Eurovision,SkyItalia,NineNetworkAustralia,JapanPool,NBCandothers.Thecoreofthe3Dcoveragewasprovidedfromthreemobileproductionunitsandsixsinglecamerafieldproductionunits.AvarietyofstereoscopicrigswereusedineachoffourvenuesalongsidethePanasonicENG/EFPP23DCamcorder.Somespecialstereocameraswerealsousedincluding:polecameras,railcameras,RFcamerasandunderwatercameras.Iwillpresenttheuniquechallengesinproviding3Dcoverage,fromorganizingthe3Dchannelaswellasthetechnicalchallengeofcoveringsportsin3Dwhileaccommodatingthefull-up2Dproductionwithanemphasisonwhatworkedandwhatdidnot.

8648-29, Session 8

The psychology of the 3D experienceSophieH.Janicke,AndrewEllis,FloridaStateUniv.(UnitedStates)

Withtheswiftarrivalof3Dtechnologyinourlivingrooms,contentcreatorsarepressuredtokeepupwithincreaseddemandbyconsumers.Experiencesuggeststousthatsimplyapplying2Dstorytellingconventionsto3Disrarelythebestoptiontomeetthisdemand.So,developingnewformsandapproachestostorytellingwithina3Denvironmentisamust.Buthowdowegoaboutthis?Ourstudywillcontributetotheexplorationofthisquestionfromtheperspectiveofmediapsychology.Ourprojectinvestigatesseveralvariablesunderlyingapositive3Dentertainmentexperience,aswelltheprocessthroughwhichentertainmentnarrativescanimpactaviewer’sattitudesaboutparticularissues(i.e.,narrativepersuasion).Wespecificallylookattheprocessoftransportationintothenarrativeworld,anexperiencethathasbeenshowntounderliebothofeffects.Wecontendthat3Dtheoreticallylendsitselftothecreationofagreaterfeelingoftransportationfortheviewer,whichultimatelyshouldimpacttheirenjoymentandnarrativepersuasion.Ourprojectinvolvestwoexperimentalstudies,usingavarietyof2Dand3Dcontent.Ourworktakesafirststepinbetterunderstandingthe3Dentertainmentexperiencefromamedia-psychologicalperspective,offeringamodelofhowacademicresearchcanassistinthedevelopmentofthisindustry.

8648-30, Session 8

A semi-automatic 2D to stereoscopic 3D image and video conversion system in a semi-automated segmentation perspectiveRaymondPhan,DimitriosAndroutsos,RyersonUniv.(Canada)

Wecreateasystemforsemi-automaticallyconvertingunconstrained2Dimagesandvideosintostereoscopic3D.Currenteffortsaredoneautomaticallyormanuallybyrotoscopers.Theformerprohibitsuserintervention,orerrorcorrection,whilethelatteristimeconsuming,requiringalargestaff.Semi-automaticmixesthetwo,allowingforfasterandaccurateconversion,whiledecreasingtimetorelease3Dcontent.User-definedstrokesfortheimage,oroverseveralkeyframes,correspondingtoaroughestimateofthescenedepthsaredefined.After,therestofthedepthsarefound,creatingdepthmapstogeneratestereoscopic3Dcontent,andDepthImageBasedRenderingisemployedtogeneratetheartificialviews.Here,depthmapestimationcanbeconsideredasamulti-labelsegmentationproblem,whereeachclassisadepthvalue.Optionally,forvideo,onlythefirstframecanbelabelled,andthestrokesarepropagatedoverallframesthroughamodifiedrobusttrackingalgorithm.Ourworkcombinesthemeritsoftworespectedsegmentationalgorithms:GraphCutsandRandomWalks.ThediffusionofdepthsfromRandomWalks,combinedwiththeedgepreservingpropertiesfromGraphCutsisemployedtocreatethebestresultspossible.Resultsdemonstrategoodqualitystereoscopicimagesandvideoswithminimaleffort.

8648-31, Session 8

Temporally consistent disparity estimation using PCA dual-cross-bilateral gridJinZhu,TechnischeUniv.Eindhoven(Netherlands);PatrickVandewalle,PhilipsResearch(Netherlands);GerarddeHaan,PhilipsResearchNederlandB.V.(Netherlands)

Disparityestimationhasbeenextensivelyinvestigatedinrecentyears.Thoughseveralalgorithmshavebeenreportedtoachieveexcellentperformanceonacommonreferencedataset,fewofthemreachasatisfyingbalancebetweenaccuracyandefficiencyforreal-timeapplications,andfewofthemconsidertheproblemoftemporalcoherence.Inthispaper,weintroduceanoveldisparityestimationapproach,whichimprovestheaccuracyforstaticimagesandthetemporalcoherenceforvideos.Forstaticimages,theproposedapproachisinspiredbytheadaptivesupportweightmethodproposedbyYoonetal.andthedualcross-bilateralgridintroducedbyRichardtetal.Principalcomponentanalysis(PCA)isusedtoreducethecolordimensionalityinthecostaggregationstep.Thissimple,butefficienttechniquehelpstheproposedmethodtobecomparabletothebestlocalalgorithmsonthereferencedataset,anditisproventohavethepotentialtobeimplementedinreal-time.Acomputationallyefficientmethodfortemporallyconsistentbehaviorisalsoproposed.Moreover,intheuserevaluationexperiment,theproposedtemporalapproachachievesthebestoveralluserexperienceamongtheselectedcomparisonalgorithms.

8648-32, Session 8

Automatic depth grading tool to successfully adapt stereoscopic 3D content to digital cinema and home viewing environmentsCédricThébault,DidierDoyen,TechnicolorS.A.(France);PierreRouthier,TechnicolorS.A.(Canada);ThierryBorel,TechnicolorS.A.(France)

Aftertheenthusiasminitiallygeneratedbythedigitalcinemastereoexperience,weareclearlynowinasecondphaseoftheadoptionofstereoscopic3D(S3D)content.Thespectatorisnownotonlywaitingforanovelexperiencebutforonethatwillbringhimsomethingmore.ToensurelookingatS3Dcontentwillbeagoodexperience,an

Conference 8648


adaptationofthecontentshouldbeappliedaccordingtothetargeteddisplayanditsenvironment.WehavepresentedanautomatictooltoadaptS3Dcontenttodigitalcinemaandhomeviewingenvironments.Thestereopointofinterestineachshotisautomaticallydefinedandthentheappropriateparallaxshiftisappliedtoensureproperformat.Basedondisparityestimation,thistoolalsohelpstoautomaticallysmoothtransitionbetweenshotsaccordingtothetargetedapplication.Thedevelopmentisnotyetfinalizedsincetemporalvariationoftheprocessinghastobedevelopedtobetteroptimizetheadaptationalloverasequence.

8648-33, Session 8

Disparity remapping to ameliorate visual comfort of stereoscopic videoSohnHosik,YongJuJung,Seong-ilLee,KAIST(Korea,Republicof);FilippoSperanza,CommunicationsResearchCtr.Canada(Canada);YongManRo,KAIST(Korea,Republicof)

Disparityremappingisoneofthesolutionsproposedtoimprovevisualcomfortofstereoscopic3Dcontent.Onecommonapproachisalinearmappingofdisparitythatretargetsanentirescenedisparityrangeintoacomfortablerange.However,retargetingtheoverallscenedisparityrangemightnotalwaysachievethedesiredlevelofvisualcomfort.Infact,visualdiscomfortcouldstillbeinducedbyfastchangingdisparities,e.g.,disparitiesresultingfromobjectsmovingindepth.Moreimportantly,suchdiscomfortmightpersistevenwhentheoverallscenedisparityrangeiswithinanominallycomfortablerange.

Inthispaper,weproposeanoveldisparityremappingmethodaimedatreducingthevisualdiscomfortinducedbyfastchangesindisparity.Theproposedremappingapproachselectivelyscalesthedisparitiesofthediscomfortregionswherefastspatialandtemporalchangesindisparityoccur.Inthisapproach,itisalsoassumedthattheoveralldegreeofvisualdiscomfortisdominantlyinfluencedbyvisuallyimportantregionsinthescene.Theresultsofthesubjectiveassessmentofvisualcomfortshowthattheproposedlocalremappingapproachiscapableofreducingthevisualdiscomfortinducedbyfastchangesofdisparityinstereoscopic3Dvideo.

8648-34, Session 9

Disparity analysis of 3D movies and emotional representationsTakashiKawai,MasahiroHirahara,YuyaTomiyama,DaikiAtsuta,WasedaUniv.(Japan);JukkaHäkkinen,AaltoUniv.(Finland)

Inthefieldofstereoscopic(3D)images,industryassociationsforhardwareandsoftware,consistingmainlyofdisplaymanufacturers,havebeenestablishedtoundertakeinitiativeswiththeaimofpromoting3D.Recentlytherehavebeenmanyinitiativesfromtheviewpointof3Dcontentcreators.Ithasbeenpointedoutthattheestablishmentofknow-howforcreating3DcontentinHollywoodisforwardcomparedwiththeothercountries.Here“know-how”meansproducingadepthsensationmainlybyimposingabinoculardisparity.Theauthorshavecarriedoutmultifacetedandcontinuousanalysisofthedisparityinfamous3DmoviesthatwerecreatedinHollywoodandthathavebeenhighlyevaluatedthroughouttheworld.Theobjectivewastoprovideusefulknowledgeforcreating3Dcontentinfuture,byquantitativelydeterminingthecharacteristicsandtrendsinthedisparityinthesemovies.

Thispaperintroducesexamplesoftheresultsforanalysisofmovies.Inaddition,thedetailedanalysisisdescribedforfeaturescenesthatareconsideredtoexpressemotionwithinthemovies.

8648-35, Session 9

Methodology for stereoscopic motion-picture quality assessmentAlexanderVoronov,DmitriyVatolin,DenisSumin,VyacheslavNapadovsky,AlexeyBorisov,MoscowStateUniv.(RussianFederation)

Creatingandprocessingstereoscopicvideoplacesadditionalqualityrequirementsrelatingtoviewsynchronization.Inthisworkweproposeasetofalgorithmsfordetectingtypicalstereoscopic-videoproblems,whichappearowingtoimprecisesetupofcaptureequipmentorincorrectpostprocessing.WedevelopedamethodologyforanalyzingthequalityofS3Dmotionpicturesandforrevealingtheirmostproblematicscenes.Wethenprocessed10modernstereofilms,includingAvatar,ResidentEvil:

Afterlife,andHugo,andanalyzedchangesinS3D-filmqualityovertheyears.Thisworkpresentsrealexamplesofcommonartifacts(colorandsharpnessmismatch,verticalparallax,andexcessivehorizontalparallax)intheprocessedmotionpictures,aswellaspossiblesolutionsforeachproblem.Ourresultsenableimprovedqualityassessmentduringthefilmingandpostproductionstages.

8648-36, Session 9

Critical alignment methods for stereoscopic production and post-production image registrationChristopherA.Mayhew,CraigM.Mayhew,VisionIIIImaging,Inc.(UnitedStates)

Thequalityofthethree-dimensionalperceptionofastereoscopicproductionisdirectlydependentupontheprecisionofitsleft/rightimageregistration.Testsshowthatpixel-preciseregistrationimprovesthree-dimensionalperceptionandviewingcomfort.Currentsoftwaretoolsthatrelyontechniqueslike50/50viewsforimagealignmenthavedifficultyachievingpixel-accurateregistration.

TheAuthorshavedevelopeda“CriticalAlignment”(CA)stereoscopicsoftwareplug-intoolsetthatallowsforaccurateregistrationofcamerasystemsandleft/rightimagepairs.Thetoolssimplifyandincreasetheprecisionoftheregistrationprocess.TheCAtoolsetiscompatiblewithstandardvideoeditingsoftwareandcanbeviewedinthefieldonnormalunaided2Ddisplays.

CAtakesadvantageofthehumaneye’shighcomparativecapabilities.Humansevokeanautostereoscopic(automaticthree-dimensionalperception)responsetoparallaximagerypresentedalternatelyatbetween3to6Hz.Transformationsmadeonalternatingimageryallowahighlevelofregistrationinallvectorsatasub-pixellevel.

ApplyingCAtothecamerasystemalignmentduringproductionandtoleft/rightimageregistrationinpost-productioncansignificantlyimprovestereoscopicperceptionandreduceviewerfatigue.Further,testshaveshownthattheCAtoolsetalsobenefitsscene-to-scenetransitionsandeditsbyprecisemanipulationofconvergencepoints.

8648-37, Session 9

Towards a metric of antialiasing sufficiency for stereoscopic displaysCharlesJ.Lloyd,VisualPerformance,LLC(UnitedStates)

Thispaperdescribesthedevelopment,measurement,computation,andvalidationofametricofantialiasingsufficiencyforstereoscopicdisplaysystems.Asummaryisprovidedoftwopreviousevaluationsthatdemonstratedstereoscopicdisparitythresholdsintherangeof3to10arcsecareattainableusingelectronicdisplayswithapixelpitchascoarseas2.5arcmin,however,onlyifsufficientantialiasingisperformed.Anequationisprovidedthatdescribesthecriticallevelofantialiasingrequiredasafunctionofpixelpitch.Theproposed

Conference 8648


metricusesaradialtestpatternthatcanbephotographedfromtheusereyepointusingahandheldconsumercolorcamera.Severalcandidateunitarymetricsthatquantifythespatialsamplingnoiseinthemeasuredtestpatternaredescribed.The(R-squared)correlationobtainedbetweenthebestcandidatemetricandstereoscopicdisparitythresholdswas0.98.Thevarianceintroducedduetomakingrepeatedmeasurementswithahandheldcamerawaslessthan1%ofthevarianceduetotheantialiasingfunction.Theproposedmethodisdisplaytechnologyindependentandrequiresnoknowledgeofhowtheantialiasingisaccomplished.

8648-38, Session 10

Stereoscopic game design and evaluationNicolasS.Holliman,DurhamUniv.(UnitedKingdom)

DuringthelasttenyearsattheDurhamVisualisationLaboratorywehavedesignedanumberofgamestoinvestigatewhetherhighscorescanbeimprovedbyplayinginS3Dcomparedto2D.Theresultshavevariedfromnoimprovementina3Darcherystylegame,toasmallimprovementof8%ina3Dselectiontask,to28%improvementina3Dtetris-styletask,to68%improvementina3Dasteroidsgame.ClearlythetypeofgameandtheuseofdepthcuesinthegamemakeasignificantdifferencetothepotentialbenefitofS3D.

Followingthisexperiencewewerechallengedtocreateagamethatwouldbeimpossibletoplayona2Ddisplay.Wedesignedagamewheretheusermustsuccessfullyflythroughaseriesofhoopstoscorepoints,thislooked3Dbutvaryingthehoopsizemadetheperspectivecueanunreliableguidetoobjectdepth.Wefoundthat,whilenotimpossible,theplayer’shighscoreswereonaverage300%higherwhenthegamewasplayedinS3Dcomparedto2D.WeconcludetherearegameswherehighscoreswillsignificantlybenefitfromS3D,andbringrealbenefittocompetitivegamesplayers.

8648-39, Session 10

Impact of floating windows on the accuracy of depth perception in gamesBrodieM.Stanfield,ChrisR.Zerebecki,AndrewHogue,BillKapralos,Univ.ofOntarioInstituteofTechnology(Canada);KarenCollins,Univ.ofWaterloo(Canada)

Withtherecentgrowthofstereoscopic3Dwithintheentertainmentindustryithascreatedtheimpetusforstereoscopic3Dinvideogames.Withthemassconsumersinthevideogameindustrybeingrelativelynewtotheuseofstereoscopic3Dithascreatedtheneedforguidelinesandstandardsforcreationanduseofstereoscopic3Dingames.Thequalityofthesestereoscopic3Dparameterswilldeterminethelevelofengagementandperformancethatauserwillreceiveshouldadeveloperchoosetoaddoromitsuchguidelinesandstandards.InthispaperweusetheFloatingWindowTechniquetodetermineitseffectonhowaccuratelyanindividualcanperceiveinscreenandoutofscreendepthofanobjectinavirtualenvironment.ThroughthistechniquewehopetodeterminehowtheFloatingWindowcanbeusedtoaffectauser’sdepthperception.Thus,allowingforgamedesignerstoknowifandhowtheFloatingWindowTechniquecanbeeffectivelyusedforgamingtoenhancethestereoscopic3Dexperience.

8648-41, Session 10

The effects of 5.1 sound presentations on the perception of stereoscopic imagery in video gamesBrianCullen,Univ.ofOntarioInstituteofTechnology(Canada);DanielGalperin,KarenCollins,Univ.ofWaterloo(Canada);AndrewHogue,BillKapralos,Univ.ofOntarioInstituteofTechnology(Canada)

Stereoscopic3D(S3D)contentingames,filmandotheraudio-visualmediahasbeenincreasingoverthepastfewyears.However,beforeS3Dbecomesmorewidespread,therearestillopen,fundamentalquestionsregardingitsimplementation,particularlyasitrelatestoamulti-modalexperiencethatinvolvessound.Researchhasshownthatsoundhasconsiderableimpactonourperceptionof2Dphenomena,butverylittleresearchhasconsideredhowsoundmayinfluenceS3D.

Herewepresenttheresultsofanexperimentthatexaminedtheeffectsof5.1surroundsoundandstereoloudspeakersetupsondepthperceptioninrelationtoS3Dimagerywithinavideogameenvironment.Ouraimwastoanswerthequestion:“can5.1surroundsoundenhancetheparticipant’sperceptionofdepthinthestereoscopicfieldwhencomparedtostereosoundpresentations?”Inaddition,ourexperimentexaminedhowthepresenceorabsenceofDopplershiftandfrequencyfall-offaudioeffectscaninfluencedepthjudgementundertheseconditions.Resultssuggestthat5.1surroundsoundpresentationsenhancetheapparentdepthofS3Dimagerywhencomparedtostereopresentations.Resultsalsosuggestthattheadditionofaudioeffects.suchasDopplershiftandfrequencyfall-offfilters,canalsoinfluencetheapparentdepthofS3Dobjects.

8648-42, Session 10

Depth perception of audio sources in stereo 3D environmentsDavidCorrigan,MarcinGorzel,JohnSquires,FrankBoland,TrinityCollegeDublin(Ireland)

Inthispaperweundertakeperceptualexperimentstodeterminethealloweddifferencesindepthbetweenaudioandvisualstimuliinstereo-3denvironmentswhilebeingperceivedascongruent.Wealsoinvestigatewhetherthenatureoftheenvironmentandstimuliaffectstheperceptionofcongruence.Thiswasachievedbycreatinganaudio-visualenvironmentconsistingofaphotorealisticvisualenvironmentcapturedbyacameraunderorthostereoscopicconditionsandavirtualaudioenvironmentgeneratedbymeasuringtheacousticpropertiesoftherealenvironment.Thevisualenvironmentconsistedofaroomwithaloudspeakerandpersonformingthevisualstimulusandwaspresentedtotheviewerusingapassivestereoscopicdisplay.Bothpinknoisesamplesandvoicedphraseswereusedasaudiostimuli.The3Daudiostimuliarepresentedoverheadphonesusingbinauralrecordings.Thestimuliweregeneratedatdifferentdepthsfromtheviewerandtheviewerwasaskedtodeterminewhethertheaudiostimuluswasnearer,furtherawayoratthesamedepthasthevisualstimulus.Fromourexperimentsitisshownthatthereisasignificantrangeofdepthdifferencesforwhichaudioandvisualstimuliareperceivedascongruent.Furthermore,thisrangeincreasesasthedepthofthevisualstimulusincreases.

8648-73, Session 10

An interactive in-game approach to user adjustment of stereoscopic 3D settingsMinaTawadrous,AndrewHogue,BillKapralos,Univ.ofOntarioInstituteofTechnology(Canada);KarenCollins,UnivofWaterloo(Canada)

Sincemovietheatreshavepropagated3Dtechnology,contentdevelopershavebeenworkinghardatcreatingcustomizableStereoscopic3D(S3D)experiencesfortheusertotakehome.S3Dgamedevelopersoftenoffera‘whitebox’approachinwhichtoomanycontrolsandsettingsareexposedtotheaverageconsumer.Thiscanleadtousersbeinguncomfortableorunimpressedwiththeirownuser-definedS3Dsettings.Ourgoalistoinvestigateinteractiveapproachestosettingtheseparametersandcomparethefinalstereoscopicsettingswiththemethodsprovidedbythecontentdevelopersanddeterminetheeffectsthisstrategyhasonengagement.Inspiredbystandardtestingmethodologiesexperiencedattheoptometrist,we’vecreatedasplit-screengamewiththesamegameplayrunninginbothscreens,butwithdifferentinteraxialdistances.Wethencomparethoseresultswiththesametestrunonasinglescreenandallowingtheusertomoveasliderchangingtheinteraxialdistanceuntiltheyaremost

Conference 8648


comfortable.Weexpectthattheinteractivenatureofthecalibrationwillimpactthefinalgameengagementscoressignificantlyprovidinguswithanindicationofwhetherin-game,interactive,S3Dparametercalibrationisamechanismthatgamedevelopersshouldadopt.

8648-44, Session 11

Accommodation responses to horizontal-parallax-only super multiview displayJunyaNakamura,KosukeTanaka,YasuhiroTakaki,TokyoUniv.ofAgricultureandTechnology(Japan)

Supermulti-view(SMV)displayshavebeendevelopedtosolvethevergence-accommodationconflictthatcausesvisualfatiguewithconventional3Ddisplays.AnidealSMVdisplayshouldgeneratedenseviewpointsinboththehorizontalandverticaldirections.However,theSMVdisplaysdevelopedsofargeneratedenseviewpointsonlyinthehorizontaldirectionbecauseofthesystemcomplexityrequiredforfull-parallaxsystems.Therefore,theaccommodationresponsestoSMVdisplaysweremeasuredmostlyfor3Dimagesthatcontaindistinctiveverticaledges.Inthisstudy,weinvestigatedtheinfluencesoftheedgepropertiescontainedin3Dimagesupontheaccommodationresponsesevokedbyhorizontal-parallax-only(HPO)SMVdisplays.Weusedtherecentlydevelopedreduced-viewSMVdisplay,whoseintervalofviewpointswas2.6mm,fortheaccommodationmeasurements.Twotestimageswereused:a“Maltesecross,”whichcontainsseveralsharpedges,andastandardtestimageof“Lenna,”whichcontainsvarioustypesofedges.WefoundthattheHPO–SMVdisplaystillevokedtheaccommodationresponseswhenLennawasdisplayed.ThereweretwotypesofaccommodationresponsesforLenna;theamountsoftheevokedaccommodationweresmallerthanorsimilartothosefortheMaltesecross,dependingonwheretheviewersgazed.

8648-45, Session 11

Lightweight spatial-multiplexed dual focal-plane near to eye display using two freeform prismsDewenCheng,YongtianWang,BeijingInstituteofTechnology(China);GuofanJin,TsinghuaUniv.(China)

Accommodationandconvergenceplayimportantrolesinthenaturalviewingof3-Dscenesandneedtobeaccuratelymatchedtoavoidvisualfatigue.However,conventionalstereoscopicdisplayslacktheabilitytoadjustaccommodationcuesbecausetheyonlyhaveasingle,fixedimageplane,butthe3-Dvirtualobjectsgeneratedbytwostereoscopicimagesaredisplayedatdifferentdepths,eitherinfrontorbehindthefocalplane.Theeyesarethereforeforcedtoconvergeonthoseobjectswhilemaintainingaccommodationfixedontheimageplanetoviewobjectsclearly.Thecontributionofthepaperistousingfreeformopticalsurfacesinthedesignofalightweightandwearablespatial-multiplexeddualfocal-planenear-to-eyedisplaythatcapableofadjustingtheaccommodationcueinaccordancewiththeconvergencecueandofgeneratingtheretinalblurcue.Thesystemhasgreatpotentialapplicationsinbothscientificresearchandcommercialmarket.

8648-47, Session 11

An optical see-through, multi-focal-plane stereoscopic display prototype enabling nearly-correct focus cuesXindaHu,HongHua,CollegeofOpticalSciences,TheUniv.ofArizona(UnitedStates)

Conventionalstereoscopicdisplaysrender3Dscenesfromasinglepairof2Dimagesatafixeddistancetotheviewer.Thereforeconventionalstereoscopicdisplaysforceanunnaturaldecouplingoftheaccommodationandconvergencecues,whichmaycontributeto

variousvisualartifactsandhavesignificanteffectsondepthperceptionaccuracy.Inthispaper,wepresentthedesignandimplementationofanopticalsee-throughdepth-fusedmulti-focal-planestereoscopicdisplay.Theprototypeiscapableofrenderingnearly-correctfocuscuesforalargevolumeof3Dobjectsextendingintoadepthrangefrom0to3dioptersandadiagonalfieldofviewof40degreesataflicker-freespeed.Withtheoptimizedopticalsystemincludingafreeformprismeyepieceandasee-throughcompensator,theprototypesystemdemonstrateshighimagequalitywhilehavingminimaldegradationtothesee-throughview.Ourmulti-focal-planedisplaypotentiallyprovideshigherdepthperceptionaccuracy,higherstereoacuityandlowerfatigue;itcanalsobeanecessarytooltoinvestigatetheeffectsoffocuscuesonhumandepthperceptionandvisualfatigue,thereforeithasgreatpotentialsforboth3Ddisplayapplicationsaswellasbasicvisionresearch.

8648-43, Session 13

S3D depth-axis interaction for video games: performance and engagementChrisR.Zerebecki,BrodieStanfield,AndrewHogue,BillKapralos,Univ.ofOntarioInstituteofTechnology(Canada);KarenCollins,Univ.ofWaterloo(Canada)

ThefilmindustryhasembracedtheStereoscopic3D(S3D)format,andexperimentedwithhowtheinclusionmightmodifyorenhancetheirproducts.Thegamesindustryhasalsoincludedtheformatintotheirproducts,butrarelyhaveinteractivegamemechanicsbeendevelopedtotakeadvantageandenhancethestereoscopic3Dexperience.Ingames,interactionswiththeplayeraremoreimportantthanonlythevisualexperience.Stereoscopicdisplaysallowplayerstoperceiveobjectsatdifferentdepth,anditisourhypothesisthatdesigningacoremechanicaroundthiswouldcreatecompellingcontent.Inthisstudywedevelopagamewherethecoremechanicrequirestheplayertointeractdirectlyalongthestereoscopicdepthaxis.Auser-basedexperimentisthenusedtoinvestigatetheperformance,perceptionandenjoymentofthisgameinstereoscopic3Dvs.traditional2Dviewing.

8648-48, Session 13

Discernible difference and change in object depth afforded by stereoscopic three-dimensional contentShunnanYang,SowjanyaGowrisankaran,PacificUniv.(UnitedStates);AudreyCYounkin,PhilipJCorriveau,UserExperienceResearchGroup,IntelCorporation(UnitedStates);JamesE.Sheedy,JohnHayes,PacificUniv.(UnitedStates)

Stereoscopicthree-dimensional(S3D)viewingenhancesdepthperceptionwithdifferentialimagedisparitybutexcessivedisparityincurssignificantviewingdiscomfort.Thepresentstudymeasuredthenecessarydifferenceandchangeinimagedisparitytoperceivedifferentdepth.Sixtyyoungadultsbetweenage18to40yearsandwithgoodbinocularvisionwererecruitedtodetectdepthdifference,andtheirvergenceeyemovementsweremeasuredwhenperformingthetasks.Theywasaskedtodetectanobjectlocatedatadepthdifferentfromotheridenticalobjectsatthesamedepthandtodiscernthedirectionthistargetobjectwassubsequentlyshiftedindepth.Smallerbaselinedisparity(<2.48%screendepth)resultedinhigheraccuracyandshorterresponsetimeinreportingthetargetobject,whichwasaccompaniedbymoreaccurateconvergenceangle.Changeindisparitywasmoreaccuratelydiscernedwhentheoriginaldisparitydifferencewaslarger(>=4pixels)andwhenthechangeincreasedcrosseddisparity.ThesesuggestthatinconstructingS3Dcontent,depthdifferenceisbetterperceivedwithasmalleroveralldisparityandalargerdisparitydifferencebetweenobjects.Depthchangeisbestrenderedwithincreasedratherthandecreasedcrosseddisparity.

Conference 8648


8648-64, Session 13

Comfortable stereo viewing on mobile devicesTakashiShibata,TokyoUniv.ofSocialWelfare(Japan)andWasedaUniv.(Japan);FumiyaMuneyuki,KeisukeOshima,JunkiYoshitake,TakashiKawai,WasedaUniv.(Japan)

Thesituationofviewingstereo3Dimageryisbecomingdiverseas3Dtechnologyspreads.Nowadays,3Dimagesareviewedontelevisionandmobiledevicesaswellasincinema.Theviewingofmobiledevicesespeciallyhasauniquefeaturethatallowsobserverstoviewimagesatashortdistance.Weconductedanexperimentthatexaminedarelationshipbetweenvisualcomfortandindividualobserver’snearphoriaandinterpupillarydistance.Theresultsshowedthatobserverswithlargeexophoriatendedtopreferstereoimagesbehindthescreen.Ontheotherhand,observerswithsmallexophoriaoresophoriatendedtopreferstereoimagesinfrontofthescreen.Inaddition,theresultsalsoshowedthatobserverswithlargerinterpupillarydistancetendedtopreferstereoimagesbehindthescreen.Thesefindingssuggestthatuserscanadjustthedepthofstereoimagesinadvancebasedontheirphoriaorinterpupillarydistanceforacomfortablestereoviewingonmobiledevices.

8648-50, Session 12

Is the use of 55” LCD 3D screen practicable in large seminar to lecture-hall size audiences?JustusF.IlgnerM.D.,IngoSparrerM.D.,MartinWesthofenM.D.,Univ.HospitalAachen(Germany)

Background:Thepresentationofsurgicalcontentstoundergraduatemedicalstudentscanbechallenging,asthesurgicalapproachisoftendifferentfromtheanatomist’sperspectivethatisreproducedintextbooks.Althoughtherearemanyoptionstorecordendoscopic,microscopicaswellas“open”surgicalprocedures,presentationofcontentsstillcanbecostlyandentailalossinpicturequalityincludingdepthimpression.

MaterialandMethods:

Wepresentedsevenstereoscopicclipsof30secondstominuteand20secondseachto64medicalstudents(43female/21male)aspartofthe“sensoryorgans”courseblockin4thyear;usingone55”LCD3Dscreenwithline-alternating,circularpolarization.Studentswereaskedfortheirsubjectiveviewingimpressionandabouttheiropinionontheusefulnessof3Dpresentationsinmedicallectures.

Results:

63%ofstudentsreturnedtheirquestionnairescompleted.Themainresults(multipleanswersallowed)were:70%notedthat3Dpresentationsmadecomplexanatomyeasiertocomprehendfromanunknownperspective,48%wouldfeelbettermotivatedtolearnsurgicalprocedures,38%wouldgenerallyprefera3Dlecturetoa2Dlecture,while23%wouldnotseeanyadvantageof3Dpresentationswhatsoever.

Discussion:

Whilethescreensizecomparedtoaudiencesizewasfarfromideal,itgavemedicalstudents,whohadnotbeenexposedtosurgicalproceduresintheoperatingtheatreyet,animpressionofgeneralapproachtomicrosurgeryandhowthechoiceofsurgicalapproachinrelationofvitalstructurescanminimizetraumaandunwantedeffectstothepatient.Theavailabilityoflargerscreens,however,maynecessitatechangesinproductionof3Dmaterialfromthemicroscopecameraonward.

8648-52, Session 12

Immersive stereoscopic panoramasJohnS.Toeppen,HoloGraphics(UnitedStates);JasonBuchheim,OdysseyExpeditions(UnitedStates)

ImmersiveStereoscopicPanoramas192words

CONTEXT;Immersivestereoscopicimagingrequiressharpwidefieldimages,specialsoftware,andhighresolutiondisplays.

OBJECTIVE;Examplesofsomesuccessfulmagecapture,splicing,viewing,hosting,andpostingtechniquesusedindigitalstereoscopicpanoramicphotographyaregiven.

METHODS

Imagecaptureusescameramovementsthatapproximatenaturaleyepositionsreasonablywellbyusingmanualormotorizedgimbalmountedsystemsdesignedforthepurpose.Processingrequiresseamlesslystitchingdozensorhundredsofimagesintostereopairs.Creatingstereoscopicimagesover50megapixelsbenefitsfromprogrammablemotorizedequipment.The2giglimitofTIFFsisoftenexceededandrequirestheuseofGigaPan.orgviewingandhostingtechnologies.Gigapixelstereoscopicimagesareviewedasasinglewholewhilesmallfilesarequicklyuploadedtoimprovethesharpnessoftheareasviewed.

RESULTS

Immersivestereocontent,activescrollinganddeepzoomcapabilitiesarepresentlywebaccessibleandformattedfordifferentdisplays.

CONCLUSSION

Immersivecontent,activescrollinganddeepzoomcapabilitiestakestereoscopicphotographyfromsnapshotsintotherealmofimmersivevirtualpresencewhencombinedwithmoderndisplaytechnologies.Scientific,artistic,andcommercialapplicationscanmakeeffectiveuseofexistingstereoscopicdisplayssystemsbyusingtheseextendedcapabilities.

http://gigapan.com/galleries/7454/gigapans

8648-55, Session 12

Experimental verification of conditions for a bubble-projection three-dimensional displayTakumaUchida,HitoshiOzaki,HiroshiKawakami,JippeiSuzuki,MieUniv.(Japan);KoichiNakayama,SagaUniv.(Japan)

Ourgoalistodevelopabubble-projectionthree-dimensionaldisplay(BP3DD).BP3DDdoesnotcauseacontradictionbetweenthebinocularconvergenceandthefocalaccommodationofoureyes.A3DimageonaBP3DDcanbeseenfromalldirectionswithoutrequiringtheuseof3Deyeglasses.

Whenaninfraredlaserbeamiscondensedunderwater,waterboilslocallyandbubblesaregenerated.Eachbubblecorrespondstoapixelofanunderwaterimage.Bycontrollingthepositionatwhichthelaserbeamiscondensed,anarbitrary-shaped3Dscreencanbegeneratedwithmanybubbles.Wecanseeacolorful3Dunderwaterimagewhenvisiblelightisprojectedonthis3Dscreen.

WecreatedbubblesundervariousconditionsinordertodeterminetheconditionsthatweremostsuitableforBP3DD.Whenthewaterpressurewaslower,theenergyrequireddecreasedlinearly.ThisresultshowedthattheenergyrequiredtogeneratebubblesdecreasedwhenthewaterpressureinBP3DDcameclosetothewatervaporpressure.

Becausetheoutputenergyofalaserislimited,ahigh-resolutionimagecannotbegenerated.Weplantocreatehigh-resolutionmotionvideobydevelopingamethodfordecreasingtheenergyrequiredbythelaser.

8648-63, Session 12

Extended depth-of-field in Integral Imaging by depth-dependent deconvolution

Conference 8648


HectorNavarro,GenaroSaavedra-Tortosa,ManuelMartinez-Corral,Univ.deValència(Spain);MårtenSjöström,RogerOlsson,MidSwedenUniv.(Sweden)

OBJECTIVE:

Theaimofthisworkisdesigninganewmethodtosignificantlyextendthedepthoffieldinthereconstructionstage.

METHOD:

Theideaisbasedinatwo-stepmethod.Wefirstproposeobtainingthedepthmapforthedifferentcapturedperspectives.Now,thedepthinformationofthe3Dsceneisavailableandwecanfiltereachelementalimageselectingthepixelsthatareassociatedwithagivendistancetothecameraobjective.Theaxialrangespanningthesceneisdividedinintervals.ItispossibletodefineingoodapproximationaneffectivePSFovereachintervalofdistances.OnlythepixelsbelongingtoacertainintervalaredeconvolvedwiththeeffectivePSFcalculatedforthatdistance.Thefinalimageisthesumofthepixelsofeveryintervalafterbeingfilteredanddeconvolved.WehaveusedaniterativealgorithmbasedintheRichardson-Lucydeconvolution.Toshowthefeasibilityofourmethod,wehaveconductedopticalexperimentstoobtainanintegralimageofa3Dscene.Overanopticaltable,wepreparedascenecomposedbythreeresolutionchartslocatedatthreedifferentaxialpositionsfromthein-focusplane.Asacapturesystem,insteadofamicrolensarray,weusedthesocalledSyntheticApertureIntegralImagingmethod,whereadigitalcameraistranslatedmechanicallytothepositionswherethemicrolenseswouldbelocatedoverthemicrolensarray.Thismethodletushavingalargeamountofpixelsineverycapturedimageandhenceobtaininganaccuratedepthmap.Withthisconfigurationwegotasetof10x10differentperspectivesofthescene,focusingthecamerainaplanelocated650mmawayfromthecamera.Thethreetestswerelocated200,370and500mmawayfromthecamera.Thecamerafocallengthwassetto35mmandthef#=9.Theabove-proposedmethodtoextendthedepthoffieldhasbeenappliedtoeachcapturedperspective.

RESULTS:

Computationalreconstructionsofthe3Dimagehavebeenperformedusingbackprojectionalgorithmsinordertocomparetheresultsobtainedwiththeoriginalcapturedintegralimageandthoseobtainedafterapplyingourmethod.Theanalysisofthesereconstructionsintheplaneswhereeachoneoftheresolutionchartswerelocatedshowsthat,withtheproposedmethod,itispossibletorecoverallthefrequenciesthatwerelostduetothelimiteddepthoffieldofthecamera,evenfortheresolutionchartwhichwasfarthestfromthein-focusplane.

CONCLUSIONS:

Wehaveproposedanewmethodtosignificantlyextendthedepthoffieldfor3Dimagereconstruction.Wehaveillustratedourmethodwithanexperimentinwhichwehaverecoveredthehigh-frequencyinformationofa3Dscene,evenforthemostdefocusedplanes.Theproposedmethodhasbroadapplicationsin3Ddisplay,and3Drecognition.

8648-49, Session 14

Depth enhanced multi-layered display using polarization dependent double reflectionNa-YoungJo,Hee-SeungKim,ChungbukNationalUniv.(Korea,Republicof);Hong-GiLim,ChungbukNationalUniversity(Korea,Republicof);Jae-HyeungPark,ChungbukNationalUniv.(Korea,Republicof)

Recently,multi-layereddisplaysattractgrowingattention.Multi-layereddisplayseffectivelycomposelightrayfieldbyusingseveralliquid-crystaldisplay(LCD)panels.Itcandisplayhighqualitythree-dimensional(3D)imageswithhorizontalandverticalparallaxinavolumearoundthedisplaysystem.Inmulti-layereddisplays,thedepthrangewherein3Dimagescanbedisplayeddependsonthenumberoflayers.Themorelayerscanhavemoreinformationandthusincreasethedepthrange.However,thereisapracticallimitationonthenumberofthelayerssinceeachLCDhasalimitedtransmittance.

Inthispaper,weproposeadepthenhancementofthemulti-layereddisplayusingpolarizationdependentdoublereflection.Theproposed

systemconsistsofamulti-layereddisplay,apolarizationcontrolpanel,twopiecesofquarter-wave-plates,ahalf-mirrorandareflectivepolarizer.Theraysfromthemulti-layereddisplaypassthroughthepolarizationcontrolpanelwhichalternatesthepolarizationbetweentwoorthogonalstates.Inonepolarizationstate,thelightrayspassthroughtwoquarter-wave-plates,ahalfmirror,andareflectivepolarizerdirectly.Inanotherpolarizationstate,thelightraysundergoadditionalreflectionbetweenthereflectivepolarizerandthehalfmirrorbeforeexitingfromthereflectivepolarizer,increasingtheapparentdistanceofthemulti-layereddisplayfromtheobserver.Byalternatingthesetwostates,itispossibletodoubletheeffectivenumberofthelayersoftheconventionalmulti-layereddisplay,achievingtheenhancementofthedepthrange.Inthepresentation,wewillexplaintheprinciplewithexperimentalverifications.

8648-51, Session 14

Polarization analysis of the light emitted by liquid-crystal-based 3D displaysPierreM.Boher,ThierryLeroux,ThibaultBignon,VéroniqueCollomb-Patton,ELDIM(France)

Liquidcrystaltechnologyispresentlymoreorlessusedforalltypesofcommercialapplicationsinvolvingdisplaysandinparticular3Ddisplays.Forauto-stereoscopic3Ddisplays,liquidcrystalsareevenusedtobuilddrivenlensestocontroltheangularemission.Liquidcrystalcellisessentiallyapolarizationstatemodulatorandthepolarizationstateofthelightisofcourseatoolofchoicetomeasurepreciselytheperformancesofsuchdevices.ELDIMhasdevelopedviewingangleandimaginginstrumentcapabletomeasurethepolarizationstateofthelightinadditiontostandardvaluessuchasluminance,colororradiance.Intheproposedpaper,weusetheseinstrumentstocharacterizedifferentauto-stereoscopic,passiveandactiveglass3Ddisplaysandweshowhowtheircanhelpunderstandingbetterthelimitandthedefectsofeachtypeoftechnology.

8648-53, Session 14

accurate depth estimation using spatiotemporal consistency in arbitrary camera arrayWoo-SeokJang,Yo-SungHo,GwangjuInstituteofScienceandTechnology(Korea,Republicof)

Depthestimationisessentialtaskfornatural3Dimagegeneration.Inthispaper,weproposeanaccuratedepthimagesequenceusingstereovideocapturedbyanarbitrarycameraarray.Ingeneral,stereo-video-baseddepthestimationiscarriedoutwithtwoinputvideosthatareobtainedfromtheparallelcameraarray.Recently,thearccameraarrayhasbeenactivelyusedtocreateimmersive3Dvideos.Inthearccameraarrays,itisdifficulttoadoptimagerectificationmethodsforcorrespondencepointmatchingduetoseriousimagedistortion.Inthiswork,weestimatedepthdatawithoutimagerectification.Consecutively,wedefinethepotentialenergyfunctionformismatchingdepthdetectionbasedonspatialconsistency.Energyoptimizationconfirmsmismatchingdepthpixels.Reasonabledepthvaluesareassignedtotheobtainedmismatchingpixelsusingdistancesandintensitydifferencesbetweenthemismatchedpixelanditsneighbors.Finally,wepursueimprovementoftemporalconsistencytoreducevisualdiscomforttoviewer’seyes.Experimentalresultsshowthattheproposedmethodprovidesmorestableresultsthantheimagerectification-basedmethod.

8648-56, Session 14

Stereoscopic 3D video coding quality evaluation with 2D objective metricsKunWang,KjellE.Brunnström,AcreoAB(Sweden)andMid

Conference 8648


SwedenUniv.(Sweden);MarcusBarkowsky,Univ.deNantes(France);MatthieuUrvoy,UniversityofNantes(France);MårtenSjöström,MidSwedenUniv.(Sweden);PatrickLeCallet,Univ.deNantes(France);SylvainTourancheau,MidSwedenUniv.(Sweden);BörjeAndrén,AcreoAB(Sweden)

TheaimofthisworkistocompareandevaluatethequalityofStereoscopic3Dvideoswithexisting2Dqualitymetrics,andexaminingthemetricsperformancebycomparingitwithconductedsubjectivemeasurementresultsonthesameprocessedvideoset.Theperformanceofthe2DobjectivemetricsisassessedaccordingtoevaluationcriteriafromVQEGHDTVphase1procedureandtheS3DgeneralqualityisevaluatedbythecorrelationcoefficientoftheobjectivescoresandthesubjectiveDMOS.Variouspoolingfunctionsareinvestigatedtoapplyonthestereo2Dimagequalityanddepthqualityscores.PreliminaryresultsshowtheVIFPobjectivemetricshaveahighcorrelationwithsubjectiveDMOS.

8648-57, Session 15

Case study: The Avengers 3D: cinematic techniques and digitally created 3DGrahamD.Clark,StereoD(UnitedStates)

Inthedigitalcreationofa3Dfilmfroma2Dimagecapture,recommendationstothefilmmakerscinematictechniquesareofferedateachstepfrompre-productiononwards,throughset,intopost.

Asthefootagearrivesatourfacilitywerespondindepthtothecinematicqualitiesoftheimageryincontextoftheeditandstory,withtheguidanceoftheDirectorsandStudio,creatingstereoscopicimagery.

Thesecinematicconsiderationsthatwerecommendandfollow,inthedigitalcreationof3Dimagery,includethetraditionalstandardVisualDepthCuesthathavelongbeeninuseinthestereoscopicfield.

TraditionalCinematicTechniqueshavebeenfurtherdevelopedthatsupportandutilizethevariousVisualDepthCues,inmostcasestheyareaccentuatedbyStereoscopic3Dimagery.

OurinvolvementinTheAvengerswasearly,afterreadingthescriptwehadtheopportunityandhonortomeettheDirectorJossWhedon,andDPSeamusMcGarveyonset.

Wepresentedwhatcouldonlyseemobvioustosuchgreatfilmmakersinthewaysofcinematictechniquesastheyrelatedtothestandarddepthcues,inparticularStereoscopic.

Ourhopewasanycinematichabitsthatsupportedbetter3Dwouldbeemphasized.Andtheywere!

8648-25, Session IPI

An autostereoscopic display system with four viewpoints in full resolution using active anaglyph parallax barrierQuZhang,HidekiKakeya,Univ.ofTsukuba(Japan)

Parallaxbarrierisoneofthemostwidelyacknowledgedautostereoscopysolutions.However,itsuffersfromthelimitednumberofviewersatthesametime.Weapplyanaglyphintoactiveparallaxbarrierandcreateadisplaypatterntoshowfourviewpointsinfullresolutionsothattwoviewerscanenjoyatthesametime.Inoneframe,stripeswitha“magenta-black-green-black”patternareusedastheparallaxbarrier,whileimagestripesfromfourdifferentviews(A,B,CandD)areshownina“AG/CM-BG/DM-CG/AM-DG/BM”pattern,whereAG/CMmeansthegreencomponentofviewAoverlappedwiththemagentacomponentofviewC.Withparallaxbarrierproperlypositioned,fourdifferentviewswithhalfresolutionandhalfanaglyphcomponentateachpixelcanbeshowninoneframe.Bychangingthepatternsofbarrierandimageseveryframe,fourfull-colorviewsinfullresolutioncanbeachievedfromfourviewpointseveryfourframe.Wemadeasystembasedonthismethodwithtwo120Hzdisplays.Althoughitshouldhaverequired240Hzdisplaystoshowstable

stereoscopyforthissystem,theresultshowedthatanaglyphwithgreenandmagentahelpedtoreduceflickersandfourviewpointsweresuccessfullycreated.


Wide viewing angle three-dimensional display using curved HOE lens arrayYoshiakiOshima,HideyaTakahashi,OsakaCityUniv.(Japan);KenjiYamada,OsakaUniv.(Japan)

Weproposeanomnidirectional3Ddisplaysystem.Thisisatoolforcommunicationarounda3Dimagebetweenasmallnumberofpeople.This3Ddisplayconsistsofmultiplebasic3Ddisplayunits.Eachbasic3DdisplayunitconsistsofaprojectorandacurvedHOElensarraysheets.TheprojectorislocatedonthecenterofthecurvatureofthecurvedHOElensarray,anditprojectslightraysonthecurvedHOElensarraywithascendingverticalangle45-degree.Projectedlightraysarereflectedandreconstructthe3DimageoverthecenterofthecurvaturebythecurvedHOElensarray.Inthismethod,eventhoughtheviewingangleofeachHOElensdoesnotincrease,theviewingangleofthe3DimagebythecurvedHOElensarraycanbeincreased.Toverifytheeffectivenessoftheproposedmethod,weconstructedtheprototypebasic3Dunit.Theviewingangleofa3Dimagewas44-degree.Therefore,theviewinganglewasincreasedbytheproposedmethodcomparedto17-degreebytheconventionalflatHOElensarray.Thispaperdescribestheprincipleofproposed3Ddisplaysystem,andalsodescribestheexperimentalresults.


Volumetric display based on optical scanning of an inclined image plane by an image rotator and imaging by a dihedral corner reflector arrayYukiMaeda,DiasukeMiyazaki,TakaakiMukai,OsakaCityUniv.(Japan);SatoshiMaekawa,NationalInstituteofInformationandCommunicationsTechnology(Japan)

Weproposetousearotationalopticalscanningmethodforavolumetricthree-dimensional(3D)displaybasedonopticalscanningofaninclinedimageplane.Theimageplaneisscannedbyanimagerotatorandformedbyadihedralcornerreflectorarray,whichformsarealimageataplane-symmetricalpoint.Stackofthescannedandformedimageplanecreatesdisplayablespaceofa3Dimage,whichsatisfiesallthecriteriaofstereoscopicvisionandcanbeseenbythenakedeyes.

Theimagerotatorusedinthisstudyconsistsoftwoprismsheetsandaplanarmirror.Theimageplaneisscannedrotationallybyrotatingtheproposedimagerotatoralonganaxisparalleltothemirrorplane.Sincejustextendingadistancebetweentheimageplaneandthemirrormakesthedisplayablespacelarge,thisscanningmethodiseffectivetodisplayalarge3Dimageeasily.

Wemadeaprototypedisplayandobservedscannedimageplanetomeasurethedisplayablespace.Thesizeofthespacewas1500cubiccentimetersthatisapproximatelytentimesaslargeasourpreviousdisplaythatusesatranslationalscanningmethod.


Resolution-enhanced display of three-dimensional objects by using the dynamic mask pattern in a depth-priority integral-imaging systemSuk-PyoHong,Eun-SeongKim,KwangwoonUniv.(Korea,Republicof);DonghakShin,DongseoUniv.(Korea,Republicof);Eun-SooKim,KwangwoonUniv.(Korea,Republicof)

Conference 8648


Inthispaper,weproposeanovelapproachforresolution-enhanceddisplayofthree-dimensional(3-D)objectsbyemployingadynamicmaskpattern(DMP)inthedepth-priorityintegral-imaging(DPII)system.Basically,themaximumresolutionofaDPIIsystemmightbelimitedbythenumberofpicked-upelementalimages(EIs),sothatseveralapproachesincludingthemovingarraylenslettechnique(MALT)andtheintermediate-viewreconstructiontechnique(IVRT)havebeensuggestedtoimprovetheresolutionoftheDPIIsystem.But,thesemethodsrequireamechanicalmovementofthelensletarrayorsufferfromincorrectimageformation.Toovercomethesedrawbacks,weproposeanovelDMP-basedDPIIsystem,inwhichthelocationofthemaskpatterndisplayedonaLCDpanelcanbeelectronicallycontrolledandthemaskpatternexpectstobemovedfastenoughtomakeanafter-imageeffect.Accordingly,intheproposedsystem,high-resolutionEIscanbepickedupjustbytime-multiplexedmovementoftheelectronicmaskpatterndisplayedonaLCDpanel,sothattheupperresolutionlimitimposedbytheNyquistsamplingtheoremcouldbeovercome,whichresultsinaremarkableimprovementoftheresolutionoftheconventionalDPIIsystem.Toconfirmthefeasibilityoftheproposedmethod,experimentsarecarriedoutandtheresultsarediscussed.


A method of reducing number of pixels on display device for super-multiview displayYukiAdachi,TomohiroYendo,NagaokaUniv.ofTechnology(Japan)

Asupermulti-viewdisplayprovidessmoothmotionparallaxwithoutspecialglasses,anditisexpectedtheobserverisfreefromthevisualfatiguecausedbytheaccommodation-vergenceconflict.However,thereisaproblemahugenumberofpixelsarerequiredonadisplaydevicesuchasliquidcrystaldisplaypanelbecausehigh-densityraysarerequiredforgoodqualityimagesandeachrayneedscorrespondingpixel.Weproposedanewthree-dimensionaldisplaybasedonlenticularmethodtoreducetherequirednumberofpixels.Theraysareshotouttoonlyaroundobserver’spupils.Todothis,thelenticularlensofwhichviewingzoneangleisnarrowedisusedandthelenticularlensisilluminatedbyparallellightmadebycylindricallensesandLEDstosuppresssidelobes.Thedirectionoftheparallellightischangedtofollowtheobserver’seye.Wedesignedadisplayusingthesecomponentsandconfirmedtheavailabilityoftheproposedmethodbycomputersimulation.Asaresult,theparallaxpitchandtheviewingzonenearlyequaledtodesignedvalueswhenthedisplaywasobservedfromthefront,butthesevalueswereincreasedwiththeviewingangle.Afteranalysis,thereasonwhytheparallaxpitchandtheviewingzonewereexpandedisthoughtasthecurvatureoffieldofthelenticularlens.Thus,Futureworkincludesproposingthesolutionoftheeffectofthisaberration.


Development of super multiview head-up display and evaluation of motion parallax smoothnessHiroyukiNishio,YasuhiroTakaki,TokyoUniv.ofAgricultureandTechnology(Japan)

Asupermulti-viewhead-updisplay(SMV-HUD)wasdeveloped.ThesmoothmotionparallaxprovidedbytheSMVtechniqueenablesaprecisesuperpositionof3Dimagesonrealobjects.ThedevelopedSMV-HUDwasusedtoexploredisplayconditionstoprovidesmoothmotionparallax.Ithadthreeconfigurationsthatdisplay3Dimagesinshort-,middle-,andlong-distanceranges,assumingtheusageenvironmentsofPCmonitors,TVs,andadvertisementdisplays,respectively.Thesubjectiveevaluationwasperformedbychangingthedepthof3Dimagesandtheintervalofviewingpoints.Theintervalofviewingpointswaschangedbydisplayingidenticalparallaximagestosucceedingviewingpoints.Wefoundthattheratiooftheimageshiftbetweenadjacentparallaximagestothepixelpitchof3Dimagesdominatedtheperceptionofthediscontinuousmotionparallax.Whentheratiowassmallerthan0.2,thediscontinuitywasnotperceived.

Whentheratiowaslargerthan1,thediscontinuitywasalwaysperceivedandthe3Dresolutiondecreasedtwotimesattransitionpointsofviewingpoints.Whenthecrosstalkbetweenviewingpointswasrelativelylarge,thediscontinuitywasnotperceivedevenwhentheratiowasoneortwo,althoughtheresolutiondecreasedtwoorthreetimesthroughouttheviewingregion.


Subjective assessment of visual discomfort induced by binocular disparity and stimulus width in stereoscopic imageSeong-ilLee,YongJuJung,SohnHosik,YongManRo,KAIST(Korea,Republicof)

Stereopsisissuchacomplexmechanismthatstereoscopicdepthperceptioncanbeinfluencedbynotonlydisparitymagnitude,butseveralotherfactors.Forinstance,anumberofpsychophysicalevidenceshavebeenreportedthatbinocularfusionlimitcouldbeaffectedbycharacteristicsofvisualstimulus,suchasdisparitygradient,stimulussize,spatialfrequency,motion,etc.Subjectivesensationofvisualdiscomfortcouldalsobedifferentdependingoncharacteristicsofvisualstimulus.Inourpreviousstudy,wehaveobservedacluethatthehorizontalsizeofobjectscouldaffecttheperceivedvisualcomfort.However,nothoroughsubjectiveassessmenthasbeenconductedtoinvestigatetherelationamonghorizontalstimulussize,disparitymagnitude,andvisualcomfort.Tothebestofourknowledge,furtherstudyisnecessarytoinvestigatetheeffectofhorizontalstimulussizeanddisparitymagnitudeonvisualcomfortofstereoscopicimage.Inthispaper,weinvestigatetheinfluenceofstimulussizeonvisualcomfortofstereoscopicimage.Tothatend,weconductextensivesubjectiveassessmentsusingvisualstimuliwithvarioushorizontalsizesanddisparitymagnitudes.Thestatisticalanalysisresultsshowthattheperceiveddegreeofvisualdiscomfortissignificantlyincreasedasthehorizontalsizeofvisualstimulusbecomesmallerthanagivendisparityvalue.


Evaluation of the no-disparity realistic image from a sense of presence and low fatigueHisakiNate,NobutakaNatui,NaokiHayashi,KazuoIshikawa,TokyoPolytechnicUniv.(Japan);ToyohikoHatada,TokyoOptometricCollege(Japan);YutakaIchihara,NobuyukiMiyake,YoshijiroUshio,NikonCorp.(Japan)

Weevaluatedanobserver’sfatigueandasenseofpresence,inobservingthenodisparityrealisticimage(NDRimage).NDRimageisconsistedoftwoimages(rightandleftimage).Rightimageiscreatedbyshiftingallpixelsinleftimagesameamount.Consequently,therearenodisparitiesinallareaofNDRimage.NDRimagewhichisreconfiguredthecontentsthatcontainimagewithlargedisparitymighthaveapossibilitythatitsuppressanobserver’sfatigueandlethimfeelhighpresence.

Subjectsobservedthreecondition’svideos(stereoscopicvideo,NDRvideoand2Dvideo).Subjectsobservedtwovideos(Eachvideowasabout30minutes.)ineachcondition.Therewerepartswithlargedisparity(morethan5degrees)invideos.

SubjectsrespondedSSQ(SimulatorSicknessQuestionnaire)andVAST(VisualAnalogueScaleofTiredness).Afterobservingthefirstvideo,theyrespondedVAST.Afterobservingsecondvideo,theyrespondedSSQandVASTandansweredthequestionnaireaboutasenseofpresenceofthedisplayedimage.

ResultsshowedthatNDRimageletobserverfeelhighpresenceandanobserver’sfatiguewaslow.NDRimageiseffective,evenifcontentswhichcontainlargedisparityareconvertedintoNDRimage.

Conference 8648



Boosting paired comparison methodology in measuring visual discomfort of 3DTV: performances of three different designsJingLi,MarcusBarkowsky,PatrickLeCallet,Polytech’Nantes(France)

NoAbstractAvailable


Stereoscopic cameras for the real-time acquisition of panoramic 3D images and videosLuisE.Gurrieri,EricDubois,Univ.ofOttawa(Canada)

Therearedifferentpanoramictechniquestoproduceoutstandingomni-stereoscopicimagesofstaticscenes.However,acameraconfigurationcapabletosamplethesceneinreal-timetoproducestereoscopicpanoramasisstillasubjectofresearch.Inthispaper,twodifferentmultiple-cameraconfigurationscapabletoproducehigh-qualitystereoscopicpanoramasinreal-timearepresented.Unlikeexistingmethods,theproposedmultiple-camerasystemsacquirealltheinformationnecessarytorenderstereoscopicpanoramasatonce.Thefirstconfigurationexploitsmicro-stereopsisarisingfromanarrowbaselinetoproduceomni-stereoscopicimages.Thesecondconfigurationusesanextendedbaselinetoproducepoly-centricpanoramasandadditionaldepthinformation,e.g.disparityandocclusionmaps,whichareusedtosynthesizestereoscopicviewsatarbitrarydirections.Theresultsofsimulatingbothcameraconfigurationsandtheprosandconsofeachset-uparepresented.


Applications of 2D to 3D conversion for educational purposesYoshihisaKoido,HiroyukiMorikawa,SakiShiraishi,WataruMaruyama,SoyaTakeuchi,WasedaUniv.(Japan);ToshioNakagori,TokaiUniv.(Japan);MasatakaHirakata,HirohisaShinkai,ToppanPrinting(Japan);TakashiKawai,WasedaUniv.(Japan)

TherearethreemainapproachescreatingstereoscopicS3Dcontent:stereofilmingusingtwocameras,stereorenderingof3Dcomputergraphics,and2DtoS3Dconversionbyaddingbinocularinformationto2Dmaterialimages.Althoughmanual“off-line”conversioncancontroltheamountofparallaxflexibly,2Dmaterialimagesareconvertedaccordingtomonocularinformationinmostcases,andtheflexibilityof2DtoS3Dconversionhasnotbeenexploited.Ifthedepthisexpressedflexibly,comprehensionsandinterestsfromconvertedS3Dcontentsareanticipatedtobedifferedfromthosefrom2D.Therefore,inthisstudywecreatednewS3Dcontentforeducationbyapplying2DtoS3Dconversion.Forsurgicaleducation,wecreatedS3Dsurgicaloperationcontentunderasurgeonusingapartial2DtoS3Dconversiontechniquewhichwasexpectedtoconcentrateviewers’attentiononsignificantareas.Andforarteducation,weconvertedUkiyoeprints;traditionalJapaneseartworksmadefromawoodcut.Theconversionofthiscontent,whichhaslittledepthinformation,intoS3D,isexpectedtoproducedifferentcognitiveprocessesfromthoseevokedby2Dcontent,e.g.,theexcitationofinterest,andtheunderstandingofspatialinformation.Inaddition,theeffectsoftherepresentationofthesecontentswereinvestigated.


A stereoscopic archive for the heritage of industrial modernization and the evaluationHayoungYoon,KoheiAndo,WasedaUniv.(Japan);TakashiKawai,HiroyukiMorikawa,WasedaUniversity(Japan);KeijiOhta,QualityeXperienceDesignCo.,Ltd.(Japan)

Inrecentyears,recordandretentionmethodsofculturalassetsbystereoscopicimages,stereoscopicarchivehaveattractedpublicattentions,andvariousresearchesonthearchivebuildingmethodshavebeencarriedout.However,culturalassetswithouttangiblepropertiesatpresentcannotbepicturedandsorecordingandretainingthemasstereoscopicimagesisnotpossible.Toovercomethelimitation,theuseofthe2Dto3Dconversiontechnology,whichconverts2Dimagesto3Dimagesbyapplyingbinocularinformation,hasbeenemphasized.Inthisstudyproducedastereoscopicarchivefortheheritageofindustrialmodernization(theJVCKENWOODYokohamaFactoryfaçade),whichwasdismantledinMarch2011,byconvertingthe2Dimagesto3D,andexaminedtheimpressions.Moreover,thedrawingimagesoftheculturalassetwerealsoconvertedto3Dimagesinordertoevaluationtheexpressionasastereoscopicarchive.

Fortheexperiment,headmounteddisplayand42-inch3DTVwereused.30studentsintwentiesyearsofage.withnormalbinocularvision.participatedthroughquestionnairesandinterviewstocomparetheimpressions,betweentheconditions.


A wavelet-based image quality metric for the assessment of 3D synthesized viewsEmilieBosc,InstitutNationaldesSciencesAppliquéesdeRennes(France);FedericaBattisti,MarcoCarli,Univ.degliStudiRomaTre(Italy);PatrickLeCallet,Polytech’Nantes(France)

Inthispaperwepresentanovelimagequalityassessmenttechniqueforevaluatingvirtualsynthesizedviewsinthecontextofmulti-viewvideo.Inparticular,FreeViewpointVideos(FVV)aregeneratedfromuncompressedcolorviewsandtheircompressedassociateddepthmapsbymeansoftheViewSynthesisReferenceSoftware(VSRS),providedbyMPEG.Priortothesynthesisstep,theoriginaldepthmapsareencodedwithdifferentcodingalgorithmsthusleadingtothecreationofadditionalartifactsinthesynthesizedviews.Thecoreofproposedwavelet-basedmetricisintheregistrationprocedureperformedtoalignthesynthesizedviewandtheoriginalone,andintheskindetectionthathasbeenappliedconsideringthatthesamedistortionismoreannoyingifvisibleonhumansubjectsratherthanonotherpartsofthescene.TheeffectivenessofthemetricisevaluatedbyanalyzingthecorrelationofthescoresobtainedwiththeproposedmetricwithMeanOpinionScorescollectedbymeansofsubjectivetests.Theseresultsarealsocomparedagainstthoseofwellknownobjectivequalitymetrics(i.e.PSNR,SSIM,MSSIM).Inmoredetails,27observersassessed42FVVsequencesbasedonsixdifferentsequences,eachofthemprocessedwith7depthmapcodingalgorithms.


Stereoscopic display gray-to-gray crosstalk measurementYi-HengChou,FuHaoChen,Kuo-ChungHuang,KuenLee,Ching-ChiuLiao,Yi-henChen,IndustrialTechnologyResearchInstitute(Taiwan)

Thereareseveralstudiesonestimatingcrosstalkof3Ddisplays.Inpreviousstudy?graytograycrosstalkmodelhasbeenmodified.Inthispaper,weusesixcommercialstereoscopicdisplaysincludingpassivepolarizedglassesandactiveshutterglasses,bothleftandrighteyesweremeasuredbyaluminancemetertoverifythegrayto

Conference 8648


graycrosstalkmodelandanalyzethestabilityofthemeasurementdatausingourstandardoperatingprocedure(SOP).AccordingtotheSOP,weusesimplestatisticalmethodtoverifytherepeatabilityofdata.Theseresults,canbeusedasanimportantparametersinstereoscopicdisplayvisualquality,alsocanbeusedasadesignfeedbackforengineer.


Compressing stereo images in discrete Fourier transform domainCheeSunWon,DonggukUniv.(Korea,Republicof);ShahramShirani,McMasterUniv.(Canada)

FrequencyfeaturesofstereoimagesareinvestigatedintheDFT(DiscreteFourierTransform)domainbycharacterizingphaseandmagnitudepropertiesoriginatedfromthehorizontalparallaxofthestereoimages.Also,thewell-knownDFTpropertiesincludingtheconjugatesymmetrypropertyareutilizedtoidentifyessentialfrequencycomponentsofstereoimages.OurinvestigationrevealsthattheDFTofthestereoimageshasusefulpropertiesthatcanprioritizetheDFTcoefficientsforcompactrepresentationsandcompressions.


Depth inloop resampling using dilation filter for free viewpoint video systemSeokLee,SeungsinLee,HocheonWey,JaejoonLee,DusikPark,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Adepthdilationfilterisproposedforfreeviewpointvideosystembasedonmixedresolutionmulti-viewvideoplusdepth(MVD).Byapplyinggrayscaledilationfiltertodepthimages,foregroundregionsareextendedtobackgroundregion,andsynthesisartifactsoccuroutofboundaryedge.Thus,objectiveandsubjectivequalityofviewsynthesisresultisimproved.Adepthdilationfilterisappliedtoinloopresamplingpartinencoding/decoding,andpostprocessingpartafterdecoding.Accurateviewsynthesisisimportantinvirtualviewgenerationforautostereoscopicdisplay,moreovertherearemanycodingtoolswhichuseviewsynthesistoreduceinterviewredundancyin3Dvideocodingsuchasviewsynthesisprediction(VSP)anddepthbasedmotionvectorprediction(DMVP),andcompressionefficiencycanbeimprovedbyaccurateviewsynthesis.CodingandsynthesisexperimentsareperformedforperformanceevaluationofadilationfilterwithMPEGtestsequences.DilationfilterwasimplementedonthetopoftheMPEGreferencesoftwareforAVCbased3Dvideocoding.Byapplyingadepthdilationfilter,BD-rategainsof0.5%and6.0%intermsofPSNRofdecodedviewsandsynthesizedviews,respectively.


Adaptive hole filling for 3D warping-based virtual view synthesisTatsuroMori,NagoyaUniv.(Japan);MehrdadPanahpourTehrani,NationalInstituteofInformationandCommunicationsTechnology(Japan);ToshiakiFujii,NagoyaUniv.(Japan);MasayukiTanimoto,NagoyaIndustrialScienceResearchInstitute(Japan)

Asoneofmethodstorealizestereoscopicimagesynthesis,especiallymulti-viewpointgeneration,3DwarpingisusuallyemployedwithView+Depthformat.Thisprocessincludestheproblemthatholesappearinthevirtualimage.Inconventionalmethod,theholesweredealtwithbymedianfilter.Therearedifferentreasonswhyholesappear.So,treatingthemcollectivelyisimproper.Weanalyzethefactors,andrecognizethattwoonesexist,boundarybetweenforegroundandbackground,anddeclineofresolution.

Inthispaper,weproposeanewholefillingmethodconsideringthesefactors.Inthefirststep,weclassifynearbypixelsintoboundaryorsameobjectareaaccordingtothegradientofdepthvalue.Forboundarycase,weholdthemandrefertoothertworealcameras.Foranothercaseofsameobjectarea,wesetupsub-pixelsbetweennearbypixelsandwarpthemifthedepthareahaslayeroranobjectgetsmoreclosetocameraafter3Dwarping,whichprobablycausesdeclineofresolution.

Weimplementthesemethodsinsimulation.Astheresult,weconfirmthatproposedmethodpreventboundaryinvirtualimagefrombeingambiguous,andcanfillholesinobjectareamostlythoughholesremaininconventionalmethod.

Conference 8648


Conference 8649: The Engineering Reality of Virtual Reality 2013Monday-Tuesday4–5February2013

Part of Proceedings of SPIE Vol. 8649 The Engineering Reality of Virtual Reality 2013

8649-1, Session 1

CalVR: an advanced open-source virtual reality software frameworkJurgenSchulze,AndrewPrudhomme,PhilipWeber,ThomasA.DeFanti,Univ.ofCalifornia,SanDiego(UnitedStates)

Despiteavarietyofexistingvirtualrealityframeworks,wedecidedtodevelopourown,becausenoneoftheexistingsystemsofferthefeatureswewerelookingfor.OurnewsoftwaresystemiscalledCalVR,andisanopensourceproject,whichweoffertoanyoneintheacademiccommunityfreeofcharge.Keyfeaturesofoursystem,whicharenotpresentinexistingones,are:renderingsupportforautostereoscopicdisplays,multi-userandmulti-handinput,multi-vieweroutput,aclearseparationbetweenGUIfunctionalityandgraphicalimplementation,asynchronousinputdevicehandlingwithamessagequeue,andprogrammingobjectstosimplifythecreationofapplicationswithmultiple,independentobjectsonthescreen.Likeothersystems,wesupportVRdisplaysofanyshapeandsize,drivenbyPCclusters.

8649-2, Session 1

CAVE2: a Hybrid Reality Environment for Immersive Simulation and Information AnalysisAlessandroFebretti,ArthurNishimoto,TerranceThigpen,JonasTalandis,LanceLong,J.D.Pirtle,Univ.ofIllinoisatChicago(UnitedStates);TomPeterka,ElectronicVisualizationLaboratory(UnitedStates);AlanVerlo,MaxineBrown,DanaPlepys,DanielJ.Sandin,LucRenambot,AndrewJohnson,JasonLeigh,Univ.ofIllinoisatChicago(UnitedStates)

NoAbstractAvailable

8649-3, Session 2

MASCARET : creating virtual learning environments from system modellingRonanQuerrec,PaolaVallejo,CédricBuche,EcoleNationaled’IngénieursdeBrest(France)

ThedesignprocessforaVLEsuchasthatputforwardintheSIFORASproject(SImulationFORtrainingandASsistance)meansthatsystemspecificationscanbedifferentiatedfrompedagogicalspecifications.Equally,thesystemspecificationscanbeobtaineddirectlyfromthespecialists’expertise;thatistosaydirectlyfromtheProductLifecycleManagement(PLM)tools.Todothis,thesystemmodelneedstobeconsideredasapieceofVLEdata.Weproposeameta-modelwhichcanbeusedtorepresentthesesystemmodels:MASCARET.Inordertoensurethatthemeta-modeliscapableofdescribing,representingandsimulatingsuchsystems,MASCARETisbasedonastandarddefinedbyOMG:SYSML.

TodesignaVLEfromPLMspecifications,weneed4kindsofinformation:

•the3Dgeometry(Colladafiles),

•thesystemstructure(Blocks,properties,compositionsinSYSML),

•thebehaviouroftheentitieswithinthesystem(StateMachines,Event,SignalsinSYSML),

•thesystem’susageormaintenanceprocedures(Activities,partitions,actionsinSYSML).

8649-4, Session 2

Employing WebGL to develop interactive stereoscopic 3D content for use in biomedical visualizationSemayJohnston,Univ.ofIllinoisatChicago(UnitedStates)andBiomedicalandHealthInformationSciences(UnitedStates);LucRenambot,Univ.ofIllinoisatChicago(UnitedStates);DanielSauter,Univ.ofIllinoisatChicago(UnitedStates)andSchoolofArtandDesign(UnitedStates)

WebGraphicsLibrary(WebGL),theforthcomingwebstandardforrenderingnative3Dgraphicsinabrowser,representsanimportantadditiontothebiomedicalvisualizationtoolset.ThisprojectdefinesaWebGLdesignmethodologyforatargetaudienceofbiomedicalartistswithabasicunderstandingofweblanguagesand3Dgraphics.Themethodologywasinformedbyaliteraturereviewof3Dwebtechnologiesandtheirvalueinbiomedicaleducation,asurveyofcurrentWebGLresourcesandframeworks,andthedevelopmentofaninteractivewebapplicationdepictingtheanatomyandvariouspathologiesofthehumaneye.Theapplicationsupportsseveralmodesofstereoscopicdisplaysforabetterunderstandingof3Danatomicalstructures.Implementedbymostmodernbrowsers,theWebGLspecificationhasbegunchangingthelandscapeofthewebwithnativehardwareaccelerated3Dgraphics—otherwiseachievableonlythroughbrowserpluginslikeAdobeFlashandtheUnity3DWebPlayer.Thoughnotyetuniversallysupported,WebGLhassteadilygainedimportanceduetoshrinkingsupportforthird-partypluginsandisprojectedtobecomeamainstreammethodofdelivering3Donlinecontent.Finally,theavailabilityofstereoscopicdisplaysissteadilyincreasing(3DTV,desktop,andmobile),allowingthedeliveryofaVRexperiencetowebusers.ThevalueofWebGLforeducationhasbeenapparentsincethetechnology’sinception.Google’sBodyBrowserinof2009showcasedthepotentialofWebGLtobringarich3Dinteractivelearningenvironmenttothebrowser.AnumberofdevelopershaveexploredtheuseofWebGLnotonlyforcreatinginteractiveanatomicalmodels,butalsoforchemicalandmolecularvisualizationandweb-basedvolumerendering.Severalobstacleshavehinderedwidespreadadoptionofthistechnologyinbiomedicaleducation.TheWebGLspecificationiscurrentlyindraftstateandhasnotyetbeenimplementedbyallbrowsers.Additionally,becauseWebGLutilizeshardwareacceleration,oldergenerationmachinesmaylackthecomputationalpowertomeetthehigherrequirementsforrendering3Dgraphics.However,WebGLwilleventuallybecomeawebstandard,andnewergenerationcomputersandmobiledeviceswithdataaccessandembedded3Dgraphicsacceleratorswillreplaceolderequipment.LearningWebGLcanprovedifficultforpractitionersinbiomedicalvisualizationduealackofinstructionalliterature.ProgrammingdirectlyinWebGLisinherentlychallengingduetoitslow-levelnature.Fortunately,open-sourcedevelopershavecreatedhigh-levelframeworks.Theseframeworksvaryinqualityandpurpose.Despitethechallenges,thetaskoflearningtouseaWebGLframeworkisveryachievable.Manyprofessionalsinbiomedicalvisualizationalreadyhavetransferableskillsthatwillaidinthelearningprocess(3Dgraphicsanimation,illustrationandstereoscopic3D).Thepivotalroleofwebandmobilelearninginbiomedicaleducationdemandsthatpractitionersinbiomedicalvisualizationadapttotheshiftingtrajectoryoftheweb.Byembracingnewstandards,theywillgaintheabilitytodeliverqualityinteractiveVRcontentinaformatthatpromiseswidespreadaccessibilityandlongevity.ThisprojectaddressesamajorobstaclehinderingbroadadoptionofWebGLtechnologyinbiomedicalvisualizationbyexploringthequestion:HowcanbiomedicalartistswithabasicunderstandingofweblanguagesbeginusingWebGLtocreate3Dinteractivewebapplications?AnexampleworkflowforcreatingaWebGLapplicationisprovided.Thedemonstration,namedEye3D,isaninteractive3DeyemodelbasedontheUICVRMedLab’sVRproject,TheVirtual


Eye,whichremainslargelyinaccessibleduetotheneedforexpensiveequipmentwithwhichtodisplayit.Bringingthiscontenttothewebfulfillsthreeobjectivesof1)makingavailableavaluableeducationaltool,2)providingaplatformforlearningHTML5andWebGLtechnologiesand3)deliveringaVRexperiencethroughtheweb.

8649-5, Session 2

FreeVR: honoring the past, looking to the futureWilliamR.Sherman,IndianaUniv.(UnitedStates);DanielComing,Google(UnitedStates);SimonSu,BallAerospace&TechnologiesCorp.(UnitedStates)

TenyearsofexperienceindesigningandimplementingaVRintegrationlibraryhaveproducedawealthoflessonsuponwhichwecanfurtherbuildandimproveourcapabilitiesofwritingworthwhilevirtualrealityapplications.TheVRXvirtuallibraryisamaturelibrary,yetcontinuestoprogressandbenefitfromtheinsightsandrequestsproducedbyapplicationdevelopment.WecompareVRXwiththestandardprovisionsofvirtualrealityintegrationlibraries,andprovideanin-depthlookatVRXitself.Weexaminewhatdesigndecisionsworked,andwhichfellshort.Inparticular,welookathowthefeaturesofVRXservetorestoreapplicationsofthepastintoworkingconditionandaidinprovidinglongevitytonewlydevelopedapplications.

8649-6, Session 2

An industrial approach to design compelling VR and AR experienceSimonRichir,EcoleNationaleSupérieured’ArtsetMétiers(France);PhilippeFuchs,EcoledesMinesParisTech(France);DomitileLourdeaux,UTC(France);RonanQuerrec,CedricBuche,ENIB(France)

TheconvergenceoftechnologiescurrentlyobservedinthefieldofVR,AR,roboticsandconsumerelectronicreinforcesthetrendofnewapplicationsappearingeveryday.Butwhentransferringknowledgeacquiredfromresearchtobusinesses,researchlaboratoriesareoftenatalossbecauseofalackofknowledgeofthedesignandintegrationprocessesincreatinganindustrialscaleproduct.Infact,theinnovationapproachesthattakeagoodideafromthelaboratorytoasuccessfulindustrialproductareoftenlittleknowntoresearchers.

Theobjectiveofthispaperistopresenttheresultsoftheworkofseveralresearchteamsthathavefinalisedaworkingmethodforresearchersandmanufacturersthatallowthemtodesignvirtualoraugmentedrealitysystemsandenabletheiruserstoenjoy“acompellingVRexperience”.Thatapproach,called“theI2Imethod”,present11phasesfrom“Establishingtechnologicalandcompetitiveintelligenceandindustrialproperty”to“Improvements”throughthe“DefinitionoftheBehavourialInterface,VirtualEnvironmentandBehaviouralSoftwareAssistance”.Asaresultoftheexperiencegainedbyvariousresearchteams,thisdesignapproachbenefitsfromcontributionsfromcurrentVR&ARresearch.Ourobjectiveistovalidateandcontinuouslymovesuchmultidisciplinarydesignteammethodsforward.

8649-7, Session 2

3D interactive augmented reality-enhanced digital learning systems for mobile devicesKai-TenFeng,NationalChiaoTungUniv.(Taiwan);Po-HsuanTseng,NationalTaipeiUniv.ofTechnology(Taiwan);Pei-ShuanChiu,Jia-LinYang,Chun-JieChiu,NationalChiaoTungUniv.(Taiwan)

Withenhancedprocessingcapabilityofmobileplatforms,augmentedreality(AR)hasbeenconsideredapromisingtechnologyforachievingenhanceduserexperiences(UX).Augmentedrealityistoimposevirtualinformation,e.g.,videosandimages,ontoalive-viewdigital

display.UXonreal-worldenvironmentviathedisplaycanbeeffectivelyenhancedwiththeadoptionofinteractiveARtechnology.EnhancementonUXcanbebeneficialfordigitallearningsystems.ThereareexistingresearchworkbasedonARtargetingforthedesignofe-learningsystems.However,noneoftheseworkfocusesonprovidingthree-dimensional(3-D)objectmodelingforenhancedUXbasedoninteractiveARtechniques.Inthispaper,the3-Dinteractiveaugmentedreality-enhancedlearning(IARL)systemswillbeproposedtoprovideenhancedUXfordigitallearning.TheproposedIARLsystemsconsistoftwomajorcomponents,includingthemarkerlesspatternrecognition(MPR)for3-Dmodelsandvelocity-basedobjecttracking(VOT)algorithms.RealisticimplementationofproposedIARLsystemisconductedonAndroid-basedmobileplatforms.UXondigitallearningcanbegreatlyimprovedwiththeadoptionofproposedIARLsystems.

8649-8, Session 2

User contexts: identifying specificity of user needsDianeGromala,SimonFraserUniv.(Canada)

UserContexts:IdentifyingSpecificityofUserNeeds

Thispaperexploresthespecificneedsofusersontwolevels.FIrst,itreviewsthebasicneedsofusers,andextendsthisbasicapproachbyidentifyingneedsspecifictofemaleusers.Second,itidentifiesspecificneedsofuserswhoexperienceacuteandchronicpain.Userswhohavechronicpain,anewlyidentifieddiseasethatisestimatedtoaffect1in5peopleinindustrializednations,presentseveralestablishedneedsthatrangefromphysicalandperceptualsensitivitiesandimpairments.AlthoughtheseareknowninPainMedicine,verylittlehasbeenpublishedinVRresearchoutsideofacutepaincontexts.Thispaperspresentsuserneedsidentifiedinevidence-basedmedicalresearch,severalofwhichhavebeenrecentlybeenverifiedinimmersiveVRcontextsinthePainStudiesLabatSimonFraserUniversity.

Finally,wepresentanewframeworkforapproachingVRusersingeneral,basedonthelessonslearnedfromthesetwogroups,andfromVRresearchfromthe1980stothepresent.

8649-10, Session 3

Using the computer-driven VR environment to promote experiences of natural world immersionLisaA.Frank,BeckmanInstituteforAdvancedScienceandTechnology(UnitedStates)andTheWisconsinInstituteforDiscovery(UnitedStates)

InDecember,2011,nearly800peopleexperiencedtheexhibit,<1>:“der”//patternforavirtualenvironment,createdforthefullyimmersiveCAVE™attheUW-Madison.Thisexhibitiontookmy2Dnature-basedphotographicworkandreinterpreteditforVR.Variedresponsessuchas:“It’slikeamomentofjoy,”or“Ihadtoseeittwice,”or“I’mstillthinkingaboutitweekslater”werecommon.Althoughanimpliedgoalofmy2Dartworkistocreateaconnectionthatmakesviewersmoreawareofwhatitmeanstobeapartofthenaturalworld,thesesixVRenvironmentsopenedupanunexpectedareaofinquirythatmy2Dworkhasnot.Evenastheexperiencewasmediatedbymachines,therewasasofteningattheinterfacebetweentechnologyandhumansensibility.Somehow,forsomepeople,throughtheunlikelyauspicesofacomputer-drivenenvironment,theprojectspoketoahumanessencethattheyconnectedwithinawaythatwentbeyondallexpectationsandfeltcompletelyoutofmyhands.Otherinterestingbehaviorswerenoted:insomescenariossomespokeofintenseanxiety,acrophobia,claustrophobia–evenfearofdeathwhenthescenetookthemunderground.Theseenvironmentswerebelievableenoughtocauseextremeresponsesanddisorientationforsomepeople;werefun,pleasantandwonder-filledformost;andwereliberating,poeticandmeditativeformanyothers.Theexhibitionseemedtopromoteimaginativeskills,creativity,emotionalinsight,andenvironmentalsensitivity.ItalsorevealedtheCAVE™tobeapowerful

Conference 8649


toolthatcanencourageuniquelyproductiveexperiences.Quitebyaccident,Iwatchedasthesenature-basedenvironmentsrevealedandarticulatedanessentialrelationshipbetweenthehumanspiritandthephysicalworld.TheCAVE™iscertainlynotanaturalspace,butthereisclearpotentialtoexplorevirtualenvironmentsasapathtobetteranddeeperconnectionsbetweenpeopleandnature.We’velongassociatedcontactwithnatureasrestorative,butthosepoeticreflectionsofThoreauandothersarenowconfirmedbyresearch.Studiesareshowingthatcontactwithnaturecanproducefaster,greaterrecoveryfromstressandotherillnesses,reductioninanger,andanincreasedsenseofwell-being.Additionally,Idiscoveredthatthenoveltyofavirtualrealityexperiencecanbringnewfocusandfreshattentiontoelementsofourworldthatwehavegrownimmuneto.Possibly,theboletusedulisinonesceneseemedtohavebeenmademoreremarkableandmysteriousinVRthanifitwasseeninthebackyard.AVRenvironmentcanbeusedtocreateopportunitiestoexperiencebeingintheworlddifferently.Heretheycanbeinsideofaneggthatisinsideofanestthatisheldbytreebranchesoveracreekbedinafloatinglandscapewherealightspringsnowisfalling.Weareliberatedfromtheworldlylimitationsofourbody.Thequestionisthis:inananti-naturalenvironment,canimmersantsinaCAVE™becomemoreecologicallysympatheticandspirituallyconnected?Althoughtheexhibitwasnotputthroughanyformoftestingasofyet,myobservationsamounttoaremarkablevisionofwhatVRmightprovideforusasaninstrumenttoexpandconsciousnessandpromotewellness.Creatingexceptional,transformativeexperiencesmayseemlikealoftygoalforVRbutthatpurposeisattheheartofanyartmakingprocess.

8649-11, Session 3

A virtual reality environment of oriental residencesCatherineChi,IndianaUniv.(UnitedStates)

Thepresentationisavirtualrealityenvironmentbuiltwith3Dmodelsandsynthetictextures.Bynavigatingthroughthespace,theviewerscanfeelthemselvespersonallyimmersedintheworld,andinteractwiththebuildingstructures.IntheprojectIwanttoexpresstherelationshipbetweenarchitecturalstyleandculturaldifference,andhowtheemotionalconditionorcharacteristicsoftheresidentsareaffectedbytheirresidencies.Thecommunityweliveinisasymbolforself-reflection.Thedifferencesbetweenthegeographicalconditionsandculturalinfluenceschangetheappearanceofresidencesandcitylandscapes.

Iwasinspiredbythebuildingstructureinmyhometown-Taiwan,whichisreallydifferentwiththearchitecturestyleintheUnitedStates.IntheUSthebuildingsoftenspreadouthorizontally,butinTaiwanthebuildingstrytoextendverticallyanddenselyasmuchaspossibleduetothelimitationofspace.Thegrowthandtheslowlydisappearanceofthebuildingsrepresenttheprogressofhuman’sexistence.Thepiecedemonstratesthatavirtualrealityenvironmentenhancestheimmersiveexperience,whichmakestheviewerspresentinthespaceandatthatmoment.

8649-12, Session 3

Mrs. SquandertimeJosephineAnstey,DavePape,Univ.atBuffalo(UnitedStates)

InthispaperwediscussMrs.Squandertime,areal-time,persistentsimulationofavirtualcharacter,herlivingroom,andtheviewfromherwindow,designedtobeawall-size,projectedartinstallation.Throughherlargepicturewindow,theeponymousMrs.Squandertimewatchesthesea:boats,clouds,gulls,thetidegoinginandout,peopleontheseawall.Thehundredsofimagesthatcomposetheviewaredrawnfromhistoricalprintedsources.Theprogramthatassemblesandanimatestheseimagesisdrivenbyweather,time,andtidedataconstantlyupdatedfromarealphysicallocation.Thecharacterherselfisrenderedphotographicallyinaseriesofslowlydissolvingstillswhichcorrespondtothecharacter’scurrentbehavior.WeintendMrs.Squnadertimetobeaquietlycompelling,virtual,experienceforthe

viewer,tostimulatetheout-of-clock-time,regenerative,contemplativesenseofwell-beingthatareallandorseascapeoffers.Atatimewhengroupsasdiverseaseco-spiritualistsandearnestsocialscientistssuggestthatthementalandphysicalhealthofmuchoftheworld’sincreasinglyurbanpopulationisdamagedbylackofcontactwithnature,canvirtualwildspacebepartofasolution?

8649-13, Session 4

There’s an app for that shirt!: evaluation of augmented reality tracking methods on deformable surfaces for fashion designSilviaRuzanka,RensselaerPolytechnicInstitute(UnitedStates);KatherineBehar,BaruchCollege(UnitedStates);BenChang,RensselaerPolytechnicInstitute(UnitedStates)

InthispresentationwewilldiscussappAREL,acreativeresearchprojectattheintersectionofaugmentedreality,fashion,andperformanceart.AppARELisamobileaugmentedrealityapplicationthattransformsotherwiseordinarygarmentswith3Danimationsandmodifications.WithappAREL,entirefashioncollectionscanbeuploadedinasmartphoneapplication,and“newlooks”canbedownloadedinasoftwareupdate.Theprojectwillculminateinaperformanceartfashionshow,scheduledforMarch2013.

appARELincludestextiledesignsincorporatingfiducialmarkers,garmentdesignsthatincorporatemultiplemarkerswiththehumanbody,andiOSandAndroidappsthatapplydifferentaugments,or“looks”,toagarment.

WewilldemonstrateappAREL;discussourphilosophyforcombiningcomputer-generatedandphysicalobjects;andsharethechallengesweencounteredinapplyingfiduciarymarkerstothe3Dcurvaturesofthehumanbody.

8649-14, Session 4

Augmented reality: past, present, and futureLauraInzerillo,PalermoUniv.(Italy)

Agreatopportunityhaspermittedtocarryoutacultural,historical,architecturalandsocialresearchwithgreatimpactfactorontheinternationalculturalinterest.Wearetalkingabouttherealizationofamuseumwhosethemainthemeisthevisitandthediscoveryofamonumentofgreatprestige:themonumentalcomplexofSteriinPalermo.Themuseumrunsisdividedintosubthemesincludingtheoneaboveall,thathasarousedtheinternationalinterestsomuchthatitwasforwardtheinstancetoincludethemuseumlikeculturalheritageofUNESCO.ItistherealizationofamuseumpaththatregardsthecellsoftheInquisition,whicharelocatedjustinsideofsomebuildingsofthemonumentalcomplex.Theproject,asawhole,isfacedinasynergyviewbetweenthevariouscompetencesimplicated:historical,chemical,architectural,topographical,representativeinformatics.Thebirthofthemuseumwillbeasumsoftheresultsofallthesedisciplinesinvolved.

Methodology,implementation,fruition,virtualmuseum,parameters,potentials,setbacksontheculturalheritageandlandscapeknowledge,that’sandmorethanthathasbeenstudiedanddevelopmentduringalltheresearchphases.

8649-15, Session 4

Vroom: Designing an augmented environment for remote collaboration in digital cinema productionToddMargolis,TracyCornish,Univ.ofCaliforniaSanDiego(UnitedStates)

Asmediatechnologiesbecomeincreasinglyaffordable,compactandinherentlynetworked,newgenerationsoftele-collaborativeplatformscontinuetoarisewhichcapitalizeonthesenewaffordances.Virtual

Conference 8649


realityhasbeenprimarilyconcernedwithcreatingsimulationsofenvironmentsthatcantransportparticipantstorealorimaginedspacesthatreplacethe“realworld”.MeanwhileAugmentedRealitysystemshaveevolvedtoembedelementsfromVirtualRealityenvironmentsintothephysicallandscape.Perhapsnowthereisanewclassofsystemsthatreversethisprecepttocapitalizeondynamicmedialandscapesandphysicaldisplayenvironmentstoenabledataexplorationthroughcollaboration.

Vroom(VirtualRoom)isanext-generationreconfigurabletileddisplayenvironmentindevelopmentattheCaliforniaInstituteforTelecommunicationsandInformationTechnologyattheUniversityofCalifornia,SanDiego.Vroomenablesfreelyscalabledigitalcollaboratories,connectingdistributed,high-resolutionvisualizationresourcesforcollaborativeworkinthesciences,engineeringandthearts.Vroomtransformsaphysicalspaceintoacompletelyimmersivemixedmediaenvironmentwithlargeformatinteractivedisplaysurfaces,videoteleconferencingandspatializedaudiobasedonhigh-speedopticalnetworks.

Thesystemutilizesnarrowbezeldisplaywalls,directionalaudioandadvancedinteractiondevicestoshareultrahighresolutiondata.VroombuildsonpreviousresearchfromtheNSFfundedOptiPuterprojectbyintegratingmodularOptiPortables(portabletileddisplaysystems)withopen-sourcemiddlewareforlocalandremotecollaboration.

Vroomenablesgroupcollaborationforlocalandremoteparticipantstoshareknowledgeandexperiences.Possibleapplicationsinclude:remotelearning,brainstorming/storyboarding,post-productioneditorialreview,highresolutionvideoplayback,tele-immersive3Dinteraction,easy-to-usescreencastingandsimpleimage,videoandtextfilesharing.Tosupportthesevariousscenarios,Vroomfeaturessupportformultipleuserinterfaces(opticaltracking,touchUI,gestureinterface,etc.),supportfordirectionalandspatializedaudio,giga-pixelimageinteractivity,4Kvideostreaming,3Dvisualizationandtelematicproduction.

8649-16, Session 4

New perspectives and limitations in the use of virtual reality in the rehabilitation of motor disordersAlessandroDeMauro,AitorArdanza,Vicomtech-IK4(Spain);EstherMonge,Ctr.deTerapiaIntegraldelNiño,Momo(Spain);FranciscoMolinaRueda,Univ.ReyJuanCarlos(Spain)

Severalstudieshaveshownthatbothvirtualandaugmentedrealityaretechnologiessuitableforrehabilitationtherapyduetotheinherentabilityofsimulatingrealdailylifeactivitieswhileimprovingpatientmotivation.

Inthispaperwewillfirstpresentthestateoftheartintheuseofvirtualandaugmentedrealityapplicationsforrehabilitationofmotordisordersandsecondwewillfocusontheanalysisoftheresultsofourproject.Inparticular,requirementsofpatientswithcerebrovascularaccidents,spinalcordinjuriesandcerebralpalsytotheuseofvirtualandaugmentedrealitysystemswillbedetailed.

8649-17, Session P1

Art, science, and immersion: data-driven experiencesRuthG.West,Univ.ofCaliforniaSanDiego(UnitedStates);LauraMonroe,LosAlamosNationalLab.(UnitedStates);JacquelynFordMorie,TheUniv.ofSouthernCalifornia(UnitedStates);JulietaCristinaAguilera,AdlerPlanetarium&AstronomyMuseum(UnitedStates)

Thispanelanddialog-paperexploresthepotentialsattheintersectionofart,science,immersionandhighlydimensional,“big”datatocreatenewformsofengagement,insightandculturalforms.Wewilladdressquestionssuchas:“Whatkindsofresearchquestionscanbeaddressedattheintersectionofart+science+immersiveenvironmentsthatcan’tbeaddressedotherwise?”“Howis

art+science+immersiondistinctfromstate-of-theartvisualization?”“Whatdoesworkingwithimmersiveenvironmentsandvisualizationofferthatotherapproachesdon’torcan’t?”“Wheredoesimmersionfallshort?”Wewillalsoexplorecurrenttrendsintheapplicationofimmersionforgaming,scientificdata,entertainment,simulation,socialmediaandothernewformsofbigdata.Weaskwhatexpressive,arts-basedapproachescancontributetotheseformsinthebroadculturallandscapeofimmersivetechnologies.

8649-18, Session 5

Nomad devices for interactions in immersive virtual environmentsPaulGeorge,RenaultS.A.(France)andArtsetMétiersParisTech,InstitutImage,LE2I–CNRS(France)andUniv.deTechnologieCompiègne(France);AndrasKemeny,RenaultTechnocentre(France)andArtsetMétiersParisTech,InstitutImage,LE2I–CNRS(France);FrédéricMerienne,ArtsetMétiersParisTech,InstitutImage,LE2I–CNRS(France);Jean-RémyChardonnet,EcoleNationaleSupérieured’ArtsetMétiers(France);IndiraThouvenin,Univ.deTechnologieCompiègne(France);JavierPosselt,EmmanuelIcart,RenaultS.A.(France)

RenaultiscurrentlysettingupanewCAVE,a5rear-projectedwallvirtualrealityroomwithacombinedresolutionof~140Mpixels,distributedoversixteen4kprojectorsandtwo2kprojectoraswellasanadditional3DHDcollaborativepowerwall.

Renault’sCAVEaimsatansweringneedsofdifferentstepsofconceptionofavehicle.Startingfromvehiclearchitecturethroughthesubsequentphasesofdesign,ergonomicandperceivedqualitycontroltoproduction,Renaulthasbuiltupalistofuse-casesanddefinedthespecificationsoftheFrenchnationalcollaborativeprojectVARI3(Virtual&AugmentedRealityIntuitiveandInteractiveInterface)incollaborationwithON-X,Theoris,CEALISTandInstituteImage.

OneofthegoalsofthisprojectistostudyinteractionsinaCAVE,especiallywithnomaddevicessuchasIPhoneorIPadtoexplorethepossibilitiesofmanipulationandobservationofobjects.Inspiredbycurrentusesofnomaddevices(multi-touchgestures,IPhoneUIlook’n’feelandARapplications),wehaveimplementedanearlyfeaturesettakingadvantageofthesepopularinputdevices.Inthispaper,wepresentitsperformancethroughmeasurementdatacollectedinourtestplatform,a4-sidedhomemadelow-costvirtualrealityroom,poweredbyultra-short-rangeandstandardHDhomeprojectors.

8649-19, Session 5

Analysis of tactors for wearable simulator feedback: a tactile vest architectureDavidPrater,StephenGilbert,EliotWiner,IowaStateUniv.(UnitedStates)

Currenttrainingsimulatorsforpoliceofficersandsoldierslacktwocriticalqualitiesforestablishingacompellingsenseofimmersionwithinavirtualenvironment:astrongdisincentivetogettingshot,andaccuratefeedbackaboutthebodilylocationofashot.ThisresearchaddressestheseissueswithahardwarearchitectureforaTacticalTactileTrainingVest(T3V).Inthisstudy,wehaveevaluatedthedesignspaceofimpacttactorsandpresentaT3Vprototypethatcanbeviscerallyfelt.

Thisresearchfocusesondeterminingtheoptimaldesignparametersforcreatingmaximumtactorhittingenergy.Theenergytransferredtotheprojectiledirectlyrelatestothequalityofthedisincentive.ThecompleteT3Vdesignwillincludeanarrayofthesetactorsonfrontandbackofthebodytoofferaccuratespatialfeedback.

Theimpacttactorcreatedandtestedforthisresearchisanelectromagneticprojectilelauncher,similartoasolenoid,butlowerprofileandhigherenergy.Ourbesttactorproducedtheefficiencyatjustabove0.1%withprojectileenergyofapproximately0.08Joules.Usersinaninformalpilotstudydescribedthefeelingas“surprising,”“irritating,”and“startling,”suggestingthatthislevelofforceisapproachingourtargetlevelofdisincentive.

Conference 8649


8649-20, Session 5

Use of virtual reality to promote hand therapy post-strokeDariaTsoupikova,Univ.ofIllinoisatChicago(UnitedStates);NikolayStoykov,RehabilitationInstituteofChicago(UnitedStates);RandyM.Vick,TheArtInstituteofChicago(UnitedStates);YuLi,DerekKamper,MollyListenberger,RehabilitationInstituteofChicago(RIC)(UnitedStates)

Anovel,art-basedvirtualrehabilitationsystemwasdevelopedandtestedforuseasarehabilitationprotocolforupperextremitytherapyfollowingstroke.Thesystemwasdevelopedbyaninterdisciplinaryteamofengineers,arttherapists,occupationaltherapists,andVRartiststoimprovepatients’motivationandengagement.Herewedescribesystemdesign,development,andusertestingforefficiency,subject’ssatisfactionandclinicalfeasibility.Wereportresultsofthecompletedqualitative,pre-clinicalpilotstudyofthesystemeffectivenessfortherapy.Wearecurrentlyconductingalongitudinalinterventionstudyover6weeksinstrokesurvivorswithchronichemiparesis.Initialresultsfollowinguseofthesystemonthefirstsubjectsdemonstratethatthesystemisoperationalandcanfacilitatetherapyforpoststrokepatientswithupperextremityimpairment.

8649-21, Session 5

Collaborative imaging of urban forest dynamics: augmenting re-photography to visualize changes over timeRuthG.West,Univ.ofCaliforniaSanDiego(UnitedStates);JarlathO’Neil-Dunne,Univ.ofVermont(UnitedStates);RobertB.Pless,WashingtonUniv.inSt.Louis(UnitedStates)

Ecologicalmeasurementisachallengebecausethereisaneedmeasuresubtlechangesindatacapturedoverlargeareasandovervariedtime-scales.Furthermore,whilechangeshappenoverlargeregions,theyaremeasuredbasedonthegrowth/healthofindividualorganisms,andthemetricsofindividualsarenotalwaysaccuratelyreflectedinlarge-scalemeasurement.Thechallengeisthustomeasurepatternsofslow,subtlechangeoccurringalongmultiplespatialandtemporalscales,andthentovisualizethosechangesinawaythatmakesimportantvariationsvisceraltotheobserver.Thisresearchenvisionsacommunity-basedandparticipatoryapproachbasedaroundaugmentedre-photographyofecosystems.Weshowacasestudyforthepurposeofmonitoringtheurbantreecanopy.Thegoalistoexplore,forasetofurbanlocations,theintegrationofgroundlevelre-photographywithavailableLIDARdata,andtocreateadynamicviewoftheurbanforest,anditschangesacrossvariousspatio-temporalscales.Thiscasestudygivestheopportunitytoexplorevariousaugmentstoimprovethegroundlevelimagecaptureprocess,protocolstosupport3Dinferencefromthecontributedphotography,andbothin-situandwebbasedvisualizationsofthetemporalchangeovertime.


Integral virtual display for long distance viewKosukeShimazu,TomohiroYendo,NagaokaUniv.ofTechnology(Japan)

Indrivingavehicle,iftheinformationthatisneededtodrivecanbedisplayedlikeAugmentedReality(AR),adriveroperatesthevehicleeffectively.Forexample,itisefficienttodisplaytheimagesthatindicateanintersectionwherethevehicleshouldturnnext.UnlikeconventionalARsystemsuchasusingseethroughHeadMountedDisplay,theimageneedstobedisplayedtolongdistanceatenvironmentofthevehicle.Ontheotherhand,HeadUpDisplay(HUD)iscurrentlyusedonvehiclefordisplayingspeedmeter,tachometerandsoon.ButitisdifficultthattheHUDisappliedtoon-vehicleARbecauseviewingfieldisinsufficient.Torealizethissystem,itneedsaHUDthatdisplaystheopticalvirtualimagesforlongdistanceviewandcoverslargeviewing

field.Therefore,weproposeadisplaywhichisdividedupmanysmallopticalsystems.BasicideaisusingaconvexlensarrayandelementalimagessimilarlytoIntegralPhotography(IP).Eachlenshascorrespondingoneelementalimageinfront,andeachlensgeneratesthevirtualimage.Inthispaper,ontheproposedmethod,atheoreticalformulaofthepositionrelationamongtheelementalimagestocreatethecontinuousvirtualimagesissolved.Andwesimulatedthesystemwithraytracingmethod.

Conference 8649


Conference 8650: 3D Image Processing (3DIP) and Applications 2013Wednesday-Thursday6–7February2013

Part of Proceedings of SPIE Vol. 8650 Three-Dimensional Image Processing (3DIP) and Applications 2013

8650-1, Session 1

Depth image completion using a modified robust PCAHyunjungShim,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof)

Inrecentyears,depthcamerashavebeenwidelyadoptedintovariousapplicationsduetoitseaseofaccessibilityandanaffordableprice.However,thedepthimageacquiredbythosecamerasoftenincludesinvalidormissingmeasurementsduetothelimitofitssensingprinciple.Theseeventuallypreventfromrobustdepthsensing,yieldingasubstantialamountofmissingdatainmeasurements.

Inthispaper,weintroduceanovelapproachtocompletingthedepthimagewithmissingdatausingamodifiedrobustprinciplecomponentanalysis(PCA).TheoriginalformulationofrobustPCAconsiderstheentriesofdatasamplearestatisticallyindependent.Ourdatasampleisa3Dpositionandthreeentriesaresolelydependentondepthvalue.Itisbecausethe3Dpointshouldliesonaline,definedbyacenterofcameraanditspixelcoordinate.Hence,weproposeamodifiedrobustPCAsotoenforcetherayconstraintondepthcompletion.

ThebasicintuitionofproposedalgorithmisthatthedepthcanberepresentedbyacombinationofclustersandeachclusterholdstheregularityinPCAsubspace.Byimposingsuchconstraints,wedemonstratethattheproposedalgorithmiseffectivetohandlethemissingdataonrepeatable,unevensurfaces.

8650-2, Session 1

Depth image post-processing method by diffusionYunLi,MårtenSjöström,UlfJennehag,RogerOlsson,MidSwedenUniv.(Sweden)

Multi-viewthree-dimensionaltelevisionreliesonviewsynthesistoreducethenumberofviewsbeingtransmitted.Depthimagesfromstereopairsorrangecamerasmaycontainerroneousvalues,whichentailartifactsinarenderedview.Post-processingofthedatamaythenbeutilizedtoenhancethedepthimagewiththepurposetoreachabetterqualityofsynthesizedviewsutilizingthedata.

WeproposeapartialdifferentialequationPDE-basedinterpolationmethodforthereconstructionofthesmoothareasindepthimages,whilepreservingsignificantedges.Wemodeledandevaluatedthedepthimageandtheviewsynthesisqualitybyadjustingthresholdsfor(Canny)edgedetectionandauniformsparsesamplingfactorfollowedbythesecondorderPDEinterpolation.Theedgedetectoridentifiessignificantedgesandtheuniformsparsesamplescontrolstheagreementbetweenthereconstructeddepthimageandtheoriginalone,whiletheinterpolationenforcesthesmoothnessconstraintfortheareasbetweensignificantedges.Theobjectiveresultsshowthatadepthimageprocessedbytheproposedschemecanachieveabetterqualityofsynthesizedviewsthantheoriginaldepthimage.Visualinspectionconfirmedtheresults.

8650-3, Session 1

Beta-function B-spline smoothing on triangulationsLubomirT.Dechevsky,PeterZanaty,NarvikUniv.College(Norway)

Inthisworkweinvestigateanovelfamilyofsmoothrationalbasis

functionsontriangulationsforfitting,smoothing,denoisingandcompressinggeometricdata.

Inaparallelcommunication[Dechevsky,L.T.,Zanaty,P.,AsymptoticallyoptimalK-functionalTikhonovregularizationofmultivariatenon-parametricregressionproblemswithnon-uniformlysampleddeterministicdesign,ComputationalImagingXI(2013)]weexploretheperformanceofageneralsmoothconstructionoveraverygeneralclassofsimplyconnectedpartitionsorcoversof(possibly,notsimply)connectedmultidimensionaldomainswhenusingaspecificapproachtotheso-calledK-functionalTikhonovregularizationusingasmoothnesspenalty.Herewecarrythroughanupgradedversionofthisresearchprogramfortheimportantspecialcaseofsimplicialization(in2D,triangulation)partitionsandstar-1neighborhoodcoversforwhichwediscusstwonewconstructions:asemi-smoothupgradeoftheclassicalpiece-wiseaffineconstructionandasmoothupgradesimultaneouslyofthegeneralsmoothconstructionandthesemi-smoothconstructiononsimplicializations.

Weperformcomparativeanalysisandgraphicalcomparisonoftheresultsobtainedforbothsimplicializedconstructionswiththeanalogousresultsobtainedforthegeneralconstructioninthespecialcaseofsimplicializations.

Basedonthiscomparison,wederiveasetofconclusionsandrecommendationsaboutpossiblyadaptive,context-dependentchoiceofoneormoreoftheconsideredconstructionsanddata-dependenttuningofrespectiveparametersoftheselectedconstruction(s).

8650-4, Session 2

Evaluation of efficient high quality depth upsampling methods for 3DTVLucP.J.Vosters,TechnischeUniv.Eindhoven(Netherlands);ChrisVarekamp,TPVisionNetherlands(Netherlands);GerarddeHaan,TechnischeUniv.Eindhoven(Netherlands)

Depthmapsplayavitalroleforthegenerationandtransmissionof3D-TVcontent.Highquality3Dcontentgenerationrequireshighqualitydepthmaps.Inpractice,depthmapsgeneratedbystereo-matching,time-of-flightcameras,ordecoders,inherentlyhavealowresolutionandsufferfromunreliableestimatesandnoise.Depthpost-processingisthereforetypicallyrequiredtoenhancetheresult.

State-of-the-artmethodsfordepth-mapenhancementperformup-sampling,edge-alignmentandde-noisinginasinglestep.Depthup-samplingmethodscanberoughlyclassifiedintooptimizationandfilterbaseddepthup-sampling.Whiletheperformanceofoptimizationbaseddepthup-samplersishigh,theircomputationalcomplexityandmemoryusageiscurrentlyprohibitiveforreal-timeimplementations.Conversely,filterbasedtechniquesavoidthehighcomputationalcostandmemoryrequirementsandallowforscan-lineimplementations.Therefore,filterbasedmethodsarepotentialcandidatesforimplementationinreal-time3D-TVsystems.

Despitetheimportanceforhighquality3Dvideo,therehasneverbeenafairattempttoquantitativelyassesstheperformanceofthestate-of-the-artfilter-based,depthup-samplingmethodsonalargedatasetformultipleupscalefactorsanddifferentlevelsofnoise.Therefore,inthispaperweprovideathoroughbenchmarkingcomparingeachmethod’sdepthaccuracyandinterpolationqualityonalargenumberofimagesintheMiddleburystereodataset.Togetafaircomparisonwefirstcomputetheoptimalparametersettingforeachmethodbysearchingthewholeparameterspaceandminimizingthenumberofbadpixelsoveralltestimages.Additionally,weanalyzeeachmethod’scomputationalcomplexitywiththebigOnotationandmeasuretheruntimeofaGPUimplementationwebuiltforeachmethod.


8650-5, Session 2

Multiview ToF sensor fusion technique for high-quality depth mapDeukHyeonKim,JinwookChoi,KwanghoonSohn,YonseiUniv.(Korea,Republicof)

Inthispaper,weproposedanovelmethodtofuseandupsamplemultiviewdepthmapsobtainedbyTime-of-Flight(ToF)baseddepthsensors.TheproposedmethodcanberobusttothecameracalibrationerrorandeffectivelyappliedtotheMultiviewVideoplusDepth(MVD)formatbasedonmultipledepthsensors.ToFbaseddepthsensoriswidelyusedincomputervisionfields.However,therearesomeproblemsinasensethatitprovidesalowresolutionandunreliablez-deptharoundobjectboundariesandhighlyreflectiveregions.WeaddresstheseproblemsbyusingmultiviewToFsensorfusiontechnique.Itisessentialforhighqualitymultiview3Dcontents.Theproposedmethodisbasedonthedepthcalibrationwhichadjuststhedistributionofz-depthbetweentwodepthsensors.Itprovidesacoherentz-depthforthecorrespondingpointsbetweendepthmaps.Confidencemapbasedmultiviewdepthfusionmakesitpossibletorestoretheacquisitionerrorandbealignedwellwiththecolorimagebyusingonlyreliablez-depthintheMVDformat.Experimentalresultsshowthattheproposedmethodbasedonmultiviewdepthmapsissuperiortotheconventionalmethodwhichmeansa2D-plus-depthformatusingonecolorimageandonedepthmap

8650-6, Session 2

Time-of-flight depth image enhancement using variable integration timeSunKwonKim,OukChoi,ByongMinKang,JamesD.K.Kim,Chang-YeongKim,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Time-of-Flight(ToF)camerasareusedforavarietyofapplicationsbecauseitdeliversdepthinformationatahighframerate.Thesecameras,however,sufferfromchallengingproblemssuchasnoiseandmotionartifacts.Toincreasesignal-to-noiseratio(SNR),thecamerashouldcalculateadistancebasedonalargeamountofinfraredlight,whichneedstobeintegratedoveralongtime.Ontheotherhand,theintegrationtimeshouldbeshortenoughtosuppressmotionartifacts.WeproposeaToFdepthimagingmethodtocombineadvantagesofshortandlongintegrationtimesexploitinganimagefusionschemeproposedforcolorimaging.Tocalibratedepthdifferencesduetothechangeofintegrationtimes,adepthtransferfunctionisestimatedbyanalyzingthejointhistogramofdepthsinthetwoimagesofdifferentintegrationtimes.Thedepthimagesarethentransformedintowaveletdomainsandfusedintoadepthimagewithsuppressednoiseandlowmotionartifacts.Toevaluatetheproposedmethod,wecapturedamovingbarofametronomewithdifferentintegrationtimes.TheexperimentshowstheproposedmethodcouldeffectivelyremovethemotionartifactswhilepreservinghighSNRcomparabletothedepthimagesacquiredduringlongintegrationtimes.

8650-7, Session 2

Pseudo-random modulation for multiple 3D time-of-flight camera operationDong-KiMin,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof);IliaOvsiannikov,SAMSUNGElectronicsCo.,Ltd.(UnitedStates);YohwanNoh,WanghyunKim,SAMSUNGElectronicsSemiconductor(Korea,Republicof);SunhwaJung,JoonhoLee,DeokhaShin,HyekyungJung,LawrenceKim,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof);GregoryWaligorski,LilongShi,SamsungSemiconductor,Inc.(UnitedStates);YoondongPark,ChilheeChung,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof)

3Dtime-of-flightdepthcamerasutilizemodulatedlightsourcestodetectthedistancetoobjectsasphaseinformation.Aseriouslimitation

mayexistincaseswhenmultipledepthtime-of-flightcamerasareimagingthesamescenesimultaneously.Theinterferencecausedbythemultiplemodulatedlightsourcescanseverelydistortcaptureddepthimages.Topreventthisproblemandenableconcurrent3Dmulti-cameraimaging,weproposemodulatingthecameralightsourceanddemodulatingthereceivedsignalusingsequencesofpulses,wherethephaseofeachsequenceisvariedinapseudo-randomfashion.Theproposedalgorithmismathematicallyderivedandprovedbyexperiment.

8650-8, Session 2

Lossy contour-coding in segmentation-based intra depth map codingJanHanca,AdrianMunteanu,PeterSchelkens,VrijeUniv.Brussel(Belgium)andInterdisciplinaryInstituteforBroadbandTechnology(Belgium)

Efficientdepthmapcompressionismandatoryinordertofulfillthestrenuoustechnicaldemandsofvarious3Dvideoapplicationssuchas3DTVandFreeViewpointVideo.Inthispaperwepresentanintra-depthmapcodingsystememployingoptimizedsegmentationproceduresfordepthmapsandlossyorlosslesscodingofthesegmentcontours.Losslesscontourcodingmakesuseofchaincodingtechniqueswhilelossycontourcodingisperformedusingcontourapproximationandpredictivecodingmechanisms.Bothsystemsemployhighorderarithmeticentropycoding.DepthvaluesineachsegmentareencodedusingpredictivecodingandDCT-basedencodingoftheresiduals.TheexperimentalresultsshowthatthelosslesscontourcodecoutperformstheconventionalJBIG2codecemployedinothersegmentation-baseddepthmapcodingapproachesintheliterature.Theeffectoflossycontourcodingonthesynthesizedviewsisanalyzed:theexperimentsshowthatlossycontourcodingoutperformsthelosslesscodingmethodforvariousinputdata.Finally,theexperimentalcomparisonagainstJPEG-2000showsthattheproposedsystemisaviablealternativeforlightintracodingofdepthmaps.

8650-9, Session 3

Estimation of spreading fire geometrical characteristics using near infrared stereovisionLucileL.Rossi,TomToulouse,Univ.diCorsicaPasqualePaoli(France);MoulayAkhloufi,Ctr.ofRoboticsandVision(Canada);AntoinePieri,YvesTison,Univ.diCorsicaPasqualePaoli(France)

Infireresearchandforestfirefighting,thereisaneedofrobustmetrologicalsystemsabletoestimatethegeometricalcharacteristicsofoutdoorspreadingfires.Inrecentyears,weassisttoanincreasedinterestinwildfireresearchtodevelopnondestructivetechniquesbasedoncomputervision.Thispaperpresentsanewapproachfortheestimationoffiregeometricalcharacteristicsusingnearinfraredstereovision.Spreadingfireinformationlikeposition,rateofspread,heightandsurface,areestimatedfromthecomputed3Dfirepoints.Theproposedsystempermitstofollowfirespreadingonagroundareaof5mx10m.

8650-10, Session 3

Uniform grid upsampling of 3D lidar point cloud dataPrudhviGurram,ShuowenHu,JohnF.Dammann,U.S.ArmyResearchLab.(UnitedStates)

Airbornelaserscanninglightdetectionandranging(LIDAR)systemsareusedforremotelysensingtopologyandbathymetry.ThemostfrequentlyuseddatacollectiontechniqueusedinLIDARsystemsemploysalinearmodescanning.Theresultingscanningdataisanon-uniformlysampled3Dpointcloud.Tointerpretthe3Dpointcloud

Conference 8650


dataandfurtherprocessthedata,itismostcommonlyconvertedtodigitalelevationmodel(DEM).InordertoobtainDEMsinauniformandupsampledrasterformat,theelevationinformationfromtheavailablenon-uniform3Dpointclouddataismappedontotheuniformgridpoints.Aftermappingisdone,thegridpointswheretheelevationinformationisnotavailablearefilledbyusinginterpolationtechniques.Inthispaper,partialdifferentialequations(PDE)basedapproachhasbeenproposedtoperformtheinterpolationandupsamplethe3Dpointcloudontoauniformgrid.ByemployinghigherorderPDEs,smoothnessismaintainedoverhomogeneousregionsandsharpedgeinformationinthesceneisretained.Theproposedalgorithmwillreducethedrapingeffectsneartheedgesofobjectslikebuildingsinthescene,whicharecausedbyexistingalgorithms.Simulationresultswillbepresentedtoillustratetheadvantagesofthenewalgorithm.

8650-11, Session 3

Improvements on an MMI-based method for automatic texture mapping of 3D dense modelsPasqualeFerrara,IstitutoNazionalediOttica(Italy);FrancescaUccheddu,Univ.degliStudidiFirenze(Italy);AnnaPelagotti,IstitutoNazionalediOttica(Italy)

Notallrangedevicesacquire,alongwith3Ddata,theobject’stexture.Moreover,notalwaysthedesiredtextureisthevisiblelightimage.

Weproposetousedepthmapsfromseveralhierarchicalviewpointstomatchthehomologuepointsbetweenthe3Dmodelandthegiventextureimage.Everydepthmapkeepsineherentlythecorrespondancebetweenthe3Dpointsanditspixels.Byfindingthedepthmapthatbetteroverlapwiththegiventexturewefindthebettercorrespondencebetweenthe3Dverticesandthetextureimage.TheMaximizationofMutualInformation(MMI)methodsprovedtobehighlyreliableassimilaritymeasure,however,MIisstronglydependentonthenumberofsamplesonwhichitiscomputed.InordertocorrectlycompareestimatesofMI,itisthereforeadvisabletoperformtheanalysisonthesamenumberofpixels.

WeproposeaweightedestimationofMIdependingonhowmanypixelsareconsidered.ThenewdevelopedalgorithminvolvesLaplace’sruleofsuccessiontoestimateMI.Inthisway,anaprioriinformationisintroducedwhenlowamountofpixelsareinvolvedintheMIestimation.WealsopresentacomparisonoftheresultsincasetheMMIisappliedstraightforwardly,withaboundingboxandwiththeemployofLaplace’sruleofsuccession.


Passive stereoscopic panomorph systemAnne-SophiePoulin-Girard,SimonThibault,DenisLaurendeau,Univ.Laval(Canada)

Inthelastdecade,wide-anglestereoscopicsystemsusingfish-eyelenseshavebeenproposedbutthecompromisemadeforhavingalargefieldofviewislowresolutionandhighdistortionresultinginimprecisedepthestimationofobjectsina3Dscene.Highandnon-uniformdistortion,especiallyintheazimuthaldirection,isoftenconsideredasaweaknessofpanoramiclensesbecauseitissometimesdifficulttocompensateforbyimageprocessing.

Theaimofthispaperistopresentanalternativetoexistingstereoscopicpanoramicsystemsbytakingadvantageofnon-uniformdistortionandanamorphosisinPanomorphlenses.Therearemanychallengesrelatedtothisprojectlikethecalibrationofthesystemandthecreationofa3Ddepthestimationalgorithmthatsuitstheresolutionofthedifferentareasintheimages.

ThispaperpresentsthestepsleadingtothechoiceoftwospecificPanomorphlensesthatareusedinaPassiveStereoscopicPanomorphSystem.Italsolistsdifferentconfigurationsandtheirrelatedapplications.Then,wepresentpreliminaryresultsforthecalibrationofoneselectedconfiguration.Finally,forthisconfiguration,3DreconstructionofanobjectinascenelocatedattheROIispresented.


3D hand localization by low-cost webcamsCheng-YuanKo,Chung-TeLi,Chen-HanChung,Liang-GeeChen,NationalTaiwanUniv.(Taiwan)

Inrecentyears,HumanComputerInteraction(HCI)isanactiveresearchregion.Themostintuitivewayforusertointeractivewithdeviceisusinguser’shand.Thus,themostimportantinformationforsystemtointeractivewithuseristhelocationofuser’shand.Becauseofourgoaliscreateanewuserinterfaceandwidelyusedinanyconsumerelectronics,suchasinteractive3DTVorinteractiveTabletPersonalComputer,sothecostisanimportantfactor.Weusetwolowcostwebcamstobuildthe3Dhandlocalizationandusestereomatchingalgorithmtogetthedepthmap.However,thequalityofthedepthmapissopoorduetothenoiseincapturedimagepair,sounliketocommonsegmentationmethodindepthmapwhichsetsthethresholdandthensegmentstheuser’shandeasily,wehavemorethingstodo.Inthispaper,unliketraditional2Dhandlocalizationwhicharebasedonskin-colororfeatureextraction,orRGB-Dcamerasusebothskincolordetectionanddepth,weonlyusetwolowcostcommoditywebcamstobuildastereosystemandgetthedepthmapbystereomatching.Accordingtotheexperimentresults,itisrobusttoanyhandshapes.


3D shape extraction of internal and external surfaces of glass objectsAlbanBajard,OlivierAubreton,FrédéricTruchetet,Univ.deBourgogne(France)

Three-dimensional(3D)digitizationofmanufacturedobjectshasbeeninvestigatedforseveralyearsandconsequently,manytechniqueshavebeenproposed.Evenifsometechniqueshavebeensuccessfullycommercialized,mostofthemassumeadiffuseorneardiffusereflectanceoftheobject’ssurface,anddifficultiesremainfortheacquisitionof“opticallynoncooperative”surfaces,suchastransparentorspecularones.Toaddresssuchsurfaces,weproposeanonconventionaltechnique,called“ScanningfromHeating”(SfH).Incontrasttoclassicalactivetriangulationtechniquesthatacquirethereflectionofvisiblelight,wemeasurethethermalemissionoftheheatedsurface.Theaimofthispaperistodemonstrate,byusingtheexperimentalsetupdesignedforspecular(transparentornot)objects,howthismethodallowsreconstructionbothofinternalandexternalsurfacesofglassobjectsfromauniquemeasure.


Real-time self-calibration of depth sensing systems based on structured-light 3DVikasRamachandra,KalinAtanassov,JamesNash,SergioR.Goma,QualcommInc.(UnitedStates)

Astructured-lightsystemfordepthestimationisatypeof3Dactivesensorthatconsistsofastructured-lightprojector,thatprojectsanilluminationpatternonthescene(e.g.maskwithverticalstripes)andcamera,whichcapturestheilluminatedscene.Basedonthereceivedpatterns,depthsofdifferentregionsinthescenecanbeinferred.Forthissetuptoworkoptimally,thecameraandprojectormustbealignedsuchthattheprojectionimageplaneandimagecaptureplaneareparallel,i.e.freeofanyrelativerotations(yaw,pitchandroll).

Inreality,duetomechanicalplacementinaccuracy,theprojector-camerapairwillnotbealigned.Inthispaperwepresentacalibrationprocesswhichmeasuresthemisalignment.Wealsoestimateascalefactortoaccountfordifferencesinfocallengthsofprojectorandcamera.

The3anglesofrotationcanbefoundbyintroducingaplaneinthefieldofviewofthecameraandilluminatingitwithprojectedlightpatterns.Thisplaneimageiscapturedandprocessedtoobtaintherelativepitch,yawandrollangles,aswellasscalethroughaniterativeprocess.Thisalgorithmusestheeffectsofrotationsonthedepthmapoftheplaneimage.

Conference 8650



Discovering unexpected information using a building energy visualization toolBenoitLange,WilliamPuech,NancyRodriguez,Lab.d’InformatiquedeRobotiqueetdeMicroelectroniquedeMontpellier(France)

Buildingenergyconsumptionisanimportantprobleminconstructionfield,oldbuildingsaregapofenergyandtheyneedtoberefactored.Energyfootprintofbuildingsneedstobereduced.Newbuildingsaredesignedtobesuitablewithenergyefficiencyparadigm.Toimproveenergyefficiency,BuildingManagementSystems(BMS)areused:BMSareIT(InformationTechnology)systemscomposedbyarulesengineandadatabaseconnectedtosensors.Unfortunately,BMSareonlymonitoringsystems:theycannotpredictandmineefficientlybuildinginformation.RIDERprojecthasemergedfromthisobservation.ThisprojectisconductedbyseveralFrenchcompaniesanduniversities,IBMatMontpellier,France,leadstheproject.ThemaingoalofthisprojectistocreateasmartandscalableBMS.ThisnewkindofBMSwillbeabletodigintodataandpredictevents.ThisITsystemisbasedoncomponentparadigmandthecorecanbeextendedwithexternalcomponents.Someofthemaredevelopedduringtheproject:datamining,buildinggenerationmodelandvisualization.Allofthesecomponentswillprovidenewfeaturestoimproverulesusedbythecore.Inthispaper,wewillfocusonthevisualizationcomponent.Thisvisualizationuseavolumerenderingmethodbasedonsensorsdatainterpolationandacorrelationmethodtocreatenewviews.Wewillpresentthevisualizationmethodusedandwhichrulescanbeprovidedbythiscomponent.


An efficient anaglyph stereo video compression pipelineAdhatusSolichahA.,NationalTaiwanUnivofScienceandTechnology(Taiwan);Guan-MingSu,DolbyLabs.,Inc.(UnitedStates);Kai-LungHua,Yu-ChiLai,NationalTaiwanUnivofScienceandTechnology(Taiwan)

Inthispaper,weconsidertheend-to-endstereovideodistributionpipelineconsistingofsinge-sensordigitalcamerapairs,thelegacyconsumer-gradevideodecoder,andanaglyphdisplays.Asweobservethatonlypartofthecolorinformationisusedinthefinalreconstructedanaglyphvideo,weproposeanefficientvideocompressionandprocessingpipelinetosignificantlyimprovethefinalreconstructedanaglyphvideoqualitywithlowercomputationcomplexity.Unlikethetraditionalpipelinewhichdemosiackesthecolorfilterarray(CFA)rawpixelsdirectlyafterthecameracapturingstageatthestereovideoencoderside,theproposedpipelinepackstherequiredcolorinformationusedinthefinalanaglyphimagefromtherawCFAstereoimageandcompressesthepackedpixelsusingthelegacyvideocodec.Thedemosaickingisconductedatthestereovideodecodersideafterthepixelsareunpacked.Withthisnewproposedpipeline,wecaneliminatethecolortransformandchromareformattingoperations.Besides,thestereovideocompressionefficiencycanbefurtherimproved.Thesimulationresultsshowthattheproposedframeworkoutperformsthetraditionaloneatallbitratesandcanhaveupto4.66dBPSNRgain,whichdemonstratesthesuperiorperformanceoverthetraditionaldistributionpipeline.


3D/2D image registration by image transformation descriptors (ITDs) for thoracic aorta imagingPawelJ.Lubniewski,LaurentSarry,Univ.d’AuvergneClermont-FerrandI(France);BrunoMiguelM.D.,CHUdeClermont-Ferrand(France);ChristopheLohou,Univ.d’AuvergneClermont-FerrandI(France)

Inthisarticle,wepresentanovelimageregistrationtechnique.Unlikemoststateoftheartmethods,ourapproachallowsustocomputedirectlytherelationshipbetweenimages.Theproposedregistrationframework,builtinamodularway,canbeadjustedtoparticularproblems.Testsonsampleimagedatabaseprovethatourmethodisfastandrobustandcouldbesuccessfullyusedformanycases.

Theproposedregistrationmethod,unlikemostcurrenttechniques,doesnotuseaniterativeframework.Thetransformationparametersareobtainedviacertainquantitiesissuedfromimages-ImageTransformationDescriptors(ITD).AnITDestimatessingletransformationparameterforeveryimageindependentlyandthetransformationiscalculatedinasinglepass.Suchapproachiseffectiveforregisteringmultipleimages.Itiseasiertoestimatethetimeofdataprocessingthanforiterativealgorithms.

ThepresentedtechniquebasedonITDisinterestingalternativeforclassicalgorithms.Itsuniquepropertiescanbeadvantageousformanyimagealignmentproblems.Thepossibilityofusingdifferentdescriptors,adaptedforparticularcases,makesourapproachveryflexible.Fasttimeofcomputingisanimportantfeatureandmotivatestouseourtechniqueevenasaninitializationstepbeforeexecutionofotheralgorithmswhichcouldbemoreprecise,butslowandsensitivetoinitializationoftheparameters.


Stereo matching with partial informationYusufCemSubakan,OmerCanGurol,CagatayDikici,BogaziçiÜniv.(Turkey)

Inthispaper,weaddressthestereomatchingprobleminstereovideosusingpartiallyinformedMarkovRandomFields(MRFs)usingthemotionasasideinformation.Wefirstestimatethemotionwithinoneofthevideosandthenregularizethestereomatchingusingtheestimatedmotionfieldfromoneofthevideos.Theproposedschemeisappliedtobothdenseandsparseimagessuchasedgemaps.

8650-12, Session 4

Analysis of weighting of normals for spherical harmonic cross-correlationRobertL.Larkins,MichaelJ.Cree,AdrianA.Dorrington,TheUniv.ofWaikato(NewZealand)

Theregistrationofoverlappingpointcloudshasrecentlyreceivedincreasedattentionduetothegrowingavailabilityof3Dsensors.Sphericalharmoniccross-correlationisarobustregistrationalgorithmthatusesthenormalsoftwooverlappingpointcloudstobringthemintocoarserotationalalignment.However,notallofthesenormalsarebeneficial,asthereareoftennormalsthatcontributenoisetothecross-correlation,detrimentallyaffectingregistration.Byapplyingaweightingschemetothenormals,theaccuracyofthecross-correlationcanbeimproved.Inthispaperweperformanin-depthanalysisinvestigatinghowdifferentweightingschemesimpacttheregistrationaccuracyofthecross-correlation.Itisfoundthattheinitiallytestedscheme,inwhicheachnormalisgivenabinaryclassification,increasedthelikelihoodofcorrectregistrationwhencontrastedagainstthebaseapproachofequallyweightingeverynormal.Schemeswithgreatercomplexityareshowntofurtherimproveregistrationalignment,evenwithdecreasedoverlapbetweenpointclouds.Theprimarygoalofthisanalysisistoidentifywhichschemesforweightingnormalsprovidesthebestaidforpointcloudregistrationwhenusedwithsphericalharmoniccross-correlation.

8650-13, Session 4

Edge-aided virtual view rendering for multiview video plus depth SuryanarayanaM.Muddala,MårtenSjöström,RogerOlsson,SylvainTourancheau,MidSwedenUniv.(Sweden)

Conference 8650


Depth-Image-BasedRendering(DIBR)ofvirtualviewsisanessentialmethodinthreedimensional3-Dvideoapplicationstoproduceadditionalperspectivesfromtextureanddepthinformation,suchasthemulti-view-plus-depth(MVD)format.However,therearestillartifactsinvirtualviewsasaconsequenceofrenderingusingexistingDIBRmethods.Inthispaper,weproposeanalternativeDIBRmethodforMVD.Theproposedmethodintroducesanedgeaidedpixelandinterpolatespixelvaluesinthevirtualviewusingtheactualprojectedcoordinatesfromtwoadjacentviews,bywhichcracksanddisocclusionsareautomaticallyfilled.Inparticular,weproposeamethodtomergepixelinformationfromtwoadjacentviewsinthevirtualviewbeforetheinterpolation;aweightedaveragingofprojectedpixelsisappliedwithintherangeofonepixelinthevirtualview.Wecomparedvirtualviewimagesrenderedbytheproposedmethodtothecorrespondingviewimagesrenderedbystate-of-the-artmethods.Objectivemetricsdemonstratedanadvantageoftheproposedmethodformostinvestigatedmediacontents.Subjectivetestsgavenosignificantdifferencetostate-of-the-artmethods,whichwefindencouragingastheproposedmethodomitsspecificprocessingstepstoremovedifferentartifacts.

8650-14, Session 4

Smarter compositing with the KinectAlexS.Karantza,RoxanneL.Canosa,RochesterInstituteofTechnology(UnitedStates)

Inthiswork,weapplytheconceptofadeferredshadingpipelinetothedatagatheredbyadepthsensor.Thedepthimagecorrespondstoadepthbuffer,fromwhichthepositionofapixelin3Dspacecanbecalculated,assumingknowledgeofthecamera’sfieldofview.AnotherbuffercontainingsurfacenormalsisrepresentedasanXYZtripletrepresentingthedirectionofeachsurfaceelementrelativetothecamera.Thisinformationisfoundfromthegeometryofthesceneandisusuallycombinedwithatexturemapthatrepresentsfinesurfacedetail.Weproposeusingthedepthimagedirectly,insteadofatexturemap,toapproximatethesurfaceorientation.Oncethebuffersarepopulatedwithrealdata,thesyntheticsceneisrenderedbycombiningthedataintoasinglesetofbuffersandamaskisusedtooverlaythedepthinformation.Thissetofbuffersisusedtore-shadethefinalcomposite,incorporatingeffectsfromconventionalcomputergraphicstoaddrealismtothefinalscene.

8650-15, Session 4

Real time 3D object model reconstruction using self-imagingArezooMovaghar,RezaSafabakhsh,KhosroMadanipour,AmirkabirUniv.ofTechnology(Iran,IslamicRepublicof)

Amongthecurrentmethodsof3Dimagingandvisualization,opticaltopographyisoneofthemostpopularone.Itissimple,inexpensiveandanondestructiveprocedurewhichhasnocontactwiththeobject.Inthispaperweareusingfringeprojectioninamodernwayinordertoacquirehighly-detailedmodelsof3Dobjects.Acollimatedlaserbeamisprojectedonagridandproducesself-imagesofthegridindifferentdistances.Acompletesinusoidalpatternisprojectedontheobject’ssurfaceandthedeformedpatternwasusedforanalyzingthreedimensionalprofileoftheobject.

8650-17, Session 5

Novel calibration procedure for SVBRDF estimation of 3D objects using directional illumination and structured light projectionJakubF.Krzeslowski,RobertSitnik,GrzegorzMaczkowski,WarsawUniv.ofTechnology(Poland)

Estimationofgeometryandreflectanceof3Dobjectsrequiresthatsurfacegeometryisregisteredtogetherwithphotometricdata.We

presentamethodwhichcombinesgeometricalcameracalibrationandphotometriccalibrationintoasingleprocedureutilizingonlyonecalibrationtarget.Usingstructuredlightprojectionanddirectionalillumination,thesurfaceofa3Dobjectcanberegisteredwithanintegratedmeasuringdevice.Toestimatespatialdistributionofreflectanceparameters,aSpatiallyVaryingBidirectionalReflectanceDistributionFunction(SVBRDF)modelisused.Wealsoshowa3DimageprocessingmethodtoestimateSVBRDFparametersusinganarbitrarydefinedarrayofilluminatorsandalgorithmstoreconstructthissurfaceusingspecializedvisualizationsoftware.Thisapproachallowsforeffectivemeasurementofgeometryandvisualpropertiesof3Dobjectsrepresentedbyadensepointcloudmodel.Itcanbecomeavaluabletoolfordocumentationofdigitalheritageandinindustrialcomputervisionapplications.

8650-18, Session 5

Wide range time-of-flight camera: design, analysis, and simulationOukChoi,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Time-of-flightcamerasmeasurethedistancestoscenepointsbyemittinganddetectingamodulatedinfraredlightsignal.Themodulationfrequencyofthesignaldeterminesacertainmaximumrangewithinwhichthemeasureddistanceisunambiguous.Iftheactualdistancetoascenepointislongerthanthemaximumrange,themeasureddistancesuffersfromphasewrapping,whichmakesthemeasureddistancetobeshorterthanitsactualdistancebyanunknownmultipleofthemaximumrange.Thispaperproposesatime-of-flightcamerathatiscapableofrestoringtheactualdistancebysimultaneouslyemittinglightsignalsofdifferentmodulationfrequenciesanddetectingthemseparatelyindifferentregionsofthesensor.Weanalyzethenoisecharacteristicofthecamera,andacquiresimulateddepthmapsusingacommerciallyavailabletime-of-flightcamera,reflectingtheincreasedamountofnoiseduetotheuseofdual-frequencysignals.Wefinallyproposeaphaseunwrappingmethodforrestoringtheactualdistancesfromsuchadual-frequencydepthmap.Throughexperiments,wedemonstratethattheproposedmethodiscapableofextendingthemaximumrangetoatleasttwice,withhighsuccessrates.

8650-19, Session 5

Efficient intensity-based camera pose estimation in presence of depthMahaElChoubassi,AmericanUniv.ofBeirut(Lebanon);OscarNestares,YiWu,IgorKozintsev,HorstHaussecker,IntelCorp.(UnitedStates)

ThewidespreadsuccessofKinectenablesuserstoacquirebothimageanddepthinformationwithsatisfyingaccuracyatrelativelylowcost.WeleveragetheKinectoutputtoefficientlyandaccuratelyestimatethecameraposeinpresenceofrotation,translation,orboth.Theapplicationsofouralgorithmarevastrangingfromcameratracking,to3Dpointscloudsregistration,andvideostabilization.

Thestate-of-the-artapproachusespointcorrespondencesforestimatingthepose.Moreexplicitly,itextractspointfeaturesfromimages,e.g.,SURForSIFT,andbuildstheirdescriptors,andmatchesfeaturesfromdifferentimagestoobtainpointcorrespondences.However,whilefeatures-basedapproachesarewidelyused,theyperformpoorlyinsceneslackingtextureduetoscarcityoffeaturesorinsceneswithrepetitivestructureduetofalsecorrespondences.Ouralgorithmisintensity-basedandrequiresneitherpointfeatures’extraction,nordescriptors’generation/matching.Duetoabsenceofdepth,theintensity-basedapproachalonecannothandlecameratranslation.WithKinectcapturingbothimageanddepthframes,weextendtheintensity-basedalgorithmtoestimatethecameraposeincaseofboth3Drotationandtranslation.Theresultsarequitepromising.

Conference 8650


8650-20, Session 5

Depth correction in ToF cameraByongMinKang,KeechangLee,JamesD.K.Kim,Chang-YeongKim,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

AcaptureddepthusingaTime-of-Flight(ToF)cameraissometimesdistortedwhenToFcameracapturesadepthofsomeobjectsclosely.Inthiscase,theToFcameracapturesdepthofobjectsdifferently,althoughobjectsarelocatedatsamedistancefromcamera.ThisproblemcauseserrorsorlowqualityinseveralapplicationsusingToFcamerasuchasgame(gesturerecognition),3Dreconstructionand3Ddisplay(multi-viewgeneration).Thereareseveralreasonsforthissuchassaturation,non-linearityandasymmetryofcapacitancesinpixel.Inthispaper,wewilltakesaturationintoaccountandproposeadepthcorrectionmethodfordistortion.

8650-21, Session 6

Karate moves recognition from skeletal motionSimoneBianco,FrancescoTisato,Univ.degliStudidiMilano-Bicocca(Italy)

Theproposedworksaimsatautomaticallyrecognizingsequencesofcomplexkaratemovementsandgivingameasureofthequalityofthemovementsperformed.Sincethisisaproblemwhichintrinsicallyneedsa3Dmodel,inthisworkweproposeasolutiontakingasinputsequencesofskeletalmotionsthatcanderivefrombothmotioncapturehardwareorconsumer-level,offtheshelf,depthsensingsystems.

Theproposedsystemisconstitutedbyfourdifferentmodules:skeletonrepresentation,poseclassification,temporalalignment,andscoring.

Theproposedsystemistestedonasetofdifferentpunch,kickanddefensekaratemovesexecutedstartingfromthesimplestcase,i.e.fixedstaticstances(heikodachi)uptosequencesinwhichthestartingstancesisdifferentfromtheendingone.

ThedatasethasbeenrecordedusingasingleMicrosoftKinect.Thedatasetincludestherecordingsofbothmaleandfemaleathleteswithdifferentskilllevels,rangingfromnovicestomasters.

TheexemplarsequenceswererecordedfromathletesoftheWorldChampionItalianKarateNationalteam.Thedatasetwillbemadefreelyavailableaftertheconference

8650-22, Session 6

Analyzing the relevance of shape descriptors in automated recognition of facial gestures in 3D imagesJulianS.Rodriguez,FlavioPrieto,Univ.NacionaldeColombia(Colombia)

Thepresentarticleshowsthedescriptionandresultsemergingfromanalyzingthebehavioroftwoshapedescriptors(DESIREandSphericalSpinImage)atrecognizingfacialgesturesin3Dimages.DESIREismadeofdepthimages,silhouettesandextendedraysfromthepolygonalmesh;incontrast,SphericalSpinImage(SSI)isassociatedtoapointinthepolygonalmeshanditisa2Dhistogrambuiltfromneighboringpointsusingpositioninformationthatcapturesthefeaturesinalocalway.Thedatabaseusedcontainsimageswith6facialexpressionsoutofwhich3wererecognizedingreatpercentagebythefirstdescriptorimplementinganeuronalmeshasclassifierwhereastheseconddescriptoronlyoneofthefacialexpressionsisrecognized.

8650-23, Session 6

3D segmentation of the true and false lumens on CT aortic dissection imagesNawelFetnaci,PawelJ.Lubniewski,Univ.d’AuvergneClermont-FerrandI(France);BrunoMiguelM.D.,Univ.d’AuvergneClermont-FerrandI(France)andCHUGabrielMontpied,ServicedeChirurgieCardio-Vasculaire,PôleCardiologie(France);ChristopheLohou,Univ.d’AuvergneClermont-FerrandI(France)

Ourworksarerelatedonaorticdissectionswhichareamedicalemergencyandcanquicklyleadtodeath.Inthispaper,wewanttoretrievethefalselumenandthetruelumenwhichareaorticdissectionfeatures.Ouraimistoprovidea3Dviewofthelumens(augmentedreality)thatwecan’tobtaindirectlybyvolumerenderingorbyanothervisualizationtoolwhichgivesonlytheoutercontouroftheaorta.Wechosetosegmentthesedatabyusingadeformablemodelbasedonthefastmarchingmethod.

Intheclassicalfastmarchingapproach,aspeedfunctionisusedtocontrolthefrontpropagationofthedeformingcurve.Thespeedfunctionisonlybasedontheimagegradient.InourCTimages,duetothelowresolution,thefrontpropagatesfromalumentotheother;therefore,thegradientdataisinsufficienttohaveaccuratesegmentationresults.

Inthepaper,wehavemodifiedthefastmarchingmethodtosegmentthetwolumensseparately,byadaptingthespeedfunction.

Wegiveoursegmentationresults[ontheuploadeddocument]onanaorticdissectionimagewiththeclassicalfastmarchingmethodandwiththeadaptedonewithournewspeedfunction.

8650-24, Session 7

Adaptive quality assurance of the product development process of additive manufacturing with modern 3D data evaluation methodsJuliaW.Kroll,SabineBotta,JannisBreuninger,AlexanderVerl,Fraunhofer-InstitutfürProduktionstechnikundAutomatisierung(Germany)

Inthisabstract,thepossibilitiesofmodern3Ddataevaluationformetrologyandqualityassurancearepresentedforthespecialapplicationoftheplasticlasersinterprocess,especiallytheAdditiveManufacturingprocess.Weusetheadvantagesofthecomputertomographyandofthe3Dfocusvariationtoallstagesofaproductionprocessforanincreasedqualityoftheresultingproducts.WiththeCTandthe3Dfocusvariationthemodernqualityassuranceandmetrologyhaveuptodateinstrumentsthatallownon-destructive,completeandaccuratemeasuringofparts.Therefore,thesemetrologicalmethodscanbeusedinmanystagesoftheproductdevelopmentprocessfornon-destructivequalitycontrol.Inthiswork,studiesandevaluationof3D–dataandtheconclusionsforrelevantqualitycriteriaarepresented.Additionally,newdevelopmentsandimplementationsforadaptingtheevaluationresultsforqualityprediction,comparisonandforcorrectionaredescribedtoshowhowanadequateprocesscontrolcanbeachievedwiththehelpofmodern3Dmetrologytechniques.Thefocusisontheoptimizationoflasersinteringcomponentswithregardtotheirqualityrequirementssothatthefunctionalityduringproductioncanbeguaranteedandquantified.

Conference 8650


8650-25, Session 7

Quality assessment of adaptive 3D video streamingSamiraTavakoli,JesúsGutiérrezSánchez,Univ.PolitécnicadeMadrid(Spain);NarcisoGarcíaSantos,Univ.PolitécnicadeMadrid(Spain)

HTTPAdaptiveStreaming(HAS)representsagreatcomplementformanyapplications.Yet,currentstudiesgenerallyfocusontechnicalenhancementoftechnologywhichcanimprovetherobustnessofIPvideodeliveryoveranunmanagednetwork.However,consideringtheuserastheultimatereceiver,studyabouttheQualityofExperience(QoE)ofthistechnologyisacrucialissuewhichhasnotbeendoneextensively.Tothisaim,wediscusstheimpactofpossibleHASclient’sbehaviorwhileadaptingtothenetworkcapacityonend-user.Thishasbeendonethroughasubjectiveexperimentoftestingtheend-userresponsetovariousclients’behaviorforincreasing,decreasing,andoscillationofqualityin3Dvideo.Furthermore,theend-user’sQoEofsomeoftheHAStypicalimpairmentsduringtheadaptationhasbeenevaluated.Theresultshaveshownusthatthemajorityoftheproposedadaptionscenarioshaveefficientlyprovidedagoodperceptualqualitytotheend–user.Theexperimentalconclusionshavemadegoodinsightintotheuser’sresponsetodifferentadaptionscenariosandvisualimpairmentswhichcanbeusedtodeveloptheadaptivestreamingalgorithmtoimprovetheQoE.

8650-26, Session 7

Evaluation methodology for structured light 3D based depth mapsTomOsborne,VikasRamachandra,KalinAtanassov,SergioR.Goma,QualcommInc.(UnitedStates)

Astructured-lightsystemfordepthestimationisatypeof3Dactivesensorthatconsistsofastructured-lightprojector,thatprojectsanilluminationpatternonthescene,andcamera,whichcapturestheilluminatedscene.Basedonthereceivedpatterns,depthsofdifferentsceneregionscanbeinferred.

Here,weproposeanevaluationframeworkforthedepthmapsobtainedusingdifferentstructured-lightsystemsbasedoncertaincriteriadescribedbelow.

a.Resolution:Thistestthesmallestobjectsizethatcanbedetectedatagivendistancefromthesystem

b.Objectcontourfidelity:Thiscriterionmeasurestheshapefidelity,especiallyofobjectswiththincontours,likehumanfingers.

c.Depthgranularity:Measuresthesmallestdetectablechangeindepthwhenanobjectmoves.

d.Flatnessofplanes:Whenaplanarobjectisplacedinthefieldofviewandparalleltothesystem,itsdepthmapshouldappearflat.Thismaynotbethecaseinreality,andiscloselylinkedtothefactthattheprojector/camerapairmightnotbewellcalibrated.

e.Sensitivitytohumanbodyfactorsincludinghumanbodypartdetection.

f.Temporalconsistencyartifactsduetospeedandpostprocessing.

Conference 8650


Conference 8651: Human Vision and Electronic Imaging XVIIIMonday-Thursday4–7February2013

Part of Proceedings of SPIE Vol. 8651 Human Vision and Electronic Imaging XVIII

8651-2, Session K1

Predicting visual memorability (Keynote Presentation)AudeOliva,MassachusettsInstituteofTechnology(UnitedStates)

WhenglancingatamagazineorbrowsingtheInternet,wearecontinuouslyexposedtoimages.Despitethisoverflowofvisualinformation,humansareextremelygoodatrememberingthousandsofpicturesalongwiththeirvisualdetails.Butnotallimagesarecreatedequal.Whereassomeimagesstickinourminds,othersareignoredorquicklyforgotten.Whatmakesanimagememorable?Ourrecentworkshowsthatonecanpredictimagememorability,openinganewdomainofinvestigationattheinterfacebetweenhumancognitionandcomputervision.

8651-3, Session K1

World, environment, umwelt, and innerworld: a biological perspective on visual awareness (Keynote Presentation)JanJ.Koenderink,TechnischeUniv.Delft(Netherlands)

The“world”isallphysicalreality(Higgsbosons,andsoforth),the“environment”isageographicallocality(yourcity,…),the“Umwelt”isthetotalityofpossibleactionsoftheenvironmentonthesensitivebodysurfaceofanagent(you,yourdog,…)andthepossibleactionsoftheagentontheenvironment(mechanical,chemical,…),whereasthe“innerworld”iswhatitisfortheagenttobe,thatisawareness.Awarenessispre-personal,proto-conscious,and(perhaps)proto-rational.Thevarious“worlds”describedaboveareondistinctontologicallevels.Theworld,andtheenvironmentarestudiedintheexactsciences,theUmweltisstudiedbyphysiologyandethology.Ethologyislikebehavioristicpsychology,withthedifferencethatitappliestoallanimals.Itskipstheinnerworld,e.g.,itconsidersspeechtobeamovementofairmolecules.Theinnerworldcanonlybeknownthroughfirstpersonreports,thusisintrinsicallysubjective.Itcanonlybeapproachedthrough“experimentalphenomenology”,whichisbasedonintersubjectivityamonghumans.Inthissettingspeechmaymeansomethinginadditiontothemovementsofmolecules.Theseviewsleadtoamodelofvisionasan“opticaluserinterface”.Ithasconsequencesformanyapplications.

8651-4, Session K1

Does evolution favor true perceptions? (Keynote Presentation)DonaldDHoffman,Univ.ofCalifornia,Irvine(UnitedStates);ManishSingh,RutgersUniversity(UnitedStates);JustinMark,Univ.ofCalifornia,Irvine(UnitedStates)

Doesnaturalselectionfavorveridicalperceptions,thosethatmoreaccuratelydepicttheobjectiveenvironment?Studentsofperceptionoftenclaimthatitdoes.Butthisclaim,thoughinfluential,hasnotbeenadequatelytested.InthistalkIformalizetheclaimandafewalternatives.Totestthem,Iintroduce“interfacegames,”aclassofevolutionarygamesinwhichperceptualstrategiescompete.IpresenttheresultsofMonteCarlosimulationsofsomesimplergamesthatassumefrequency-dependentselectionandcompletemixingininfinitepopulations.Thesesimulationsshowthatveridicalperceptionscanbedriventoextinctionbynon-veridicalstrategiesthataretunedtoutilityratherthanobjectivereality.Thissuggeststhatnaturalselection

neednotfavorveridicalperceptions,andthattheeffectsofselectiononsensoryperceptiondeservefurtherstudy.

8651-5, Session 1

Lightness perception in imaging and art (Invited Paper)AlanL.Gilchrist,Rutgers,TheStateUniv.ofNewJersey(UnitedStates)

Thehighdynamicrangeofmostnaturalimages,duetovariationsinbothreflectanceandilluminationlevel,posesachallengebothforthevisualsystemandfortheartist.Thevisualsystemmustassignalimitedrange(about30:1)ofreflectances(white:90%,toblack:3%)toarangeofimageluminancesthatoftenfarexceeds1000:1.Thepaintermustrepresentthatlargerangeofintensitiesusingthesame(30:1)lowrangeofreflectances.Whilethevisualsystemorganizesthesehighrangeimagesintoregionsofhighandlowillumination,eachcomprisingalimitedrange(<30:1)theartistmustcompressthehighrangeimageintothelimitedrangeofpigment.Whenthepaintingisviewed,theobserver’svisualsystemthenexpandsthecompressedrange.Measurementsofperceivedreflectancerangehaveshownexpansioninpaintingsandlow-rangeabstractMondrianpatterns,butdramaticcompressioninveryhighrange(>5000:1)Mondrians,revealingatendencytonormalizetheperceivedrangetowardsacanonicalwhite-to-blackrange.Whena3Dcanvasisused,asinaPatrickHughes-typeReverspective,directionalilluminationcanbeexploitedtoextendtheluminancerange,andtheobservercanhaveagenuineexperienceoflight-emittingself-luminosity.

8651-6, Session 1

Human lightness perception is guided by simple assumptions about shape and reflectanceRichardF.Murray,YorkUniv.(Canada)

Lightnessconstancyistheremarkableabilityofhumanobserverstoperceivesurfacereflectanceaccuratelydespitevariationsinilluminationandcontext.TwosuccessfulapproachestounderstandinglightnessperceptionthathavedevelopedalongindependentpathsareanchoringtheoryandBayesiantheories.Anchoringtheoryisasetofrulesthatpredictlightnessperceptsunderawiderangeofconditions(Gilchrist,2006).Someoftheserulesarecounterintuitive,e.g.,arulethatlargesurfacestendtolooklighterthansmallsurfaces.Bayesiantheoriesareformulatedasprobabilisticassumptionsaboutlightsandobjects,andtheymodelperceptsasrationalinferencesfromsensorydata(e.g.,Adelson,2000).HereIreconcilethesetwoseeminglydivergentapproachesbyshowingthatmanyrulesofanchoringtheoryfollowfromsimpleprobabilisticassumptionsaboutlightingandreflectance.IdescribeasimpleBayesianmodelthatmakesmaximumaposterioriinterpretationsofluminanceimages,andIshowthatthismodelpredictsmanyofthephenomenadescribedbyanchoringtheory,e.g.,anchoringtowhite,scalenormalization,andarticulationeffects.ThusanchoringtheorycanbenaturallyformulatedinaBayesianframework,andthisapproachshowsthatmanyseeminglyidiosyncraticpropertiesofhumanlightnessperceptionareactuallyrationalconsequencesofsimpleassumptionsaboutlightingandreflectance.


8651-7, Session 1

Spatial imaging in color and HDR: Prometheus unchained (Invited Paper)JohnJ.McCann,McCannImaging(UnitedStates)

TheHumanVisionandElectronicImagingConferenceatEIhasbroughttogetherresearchinthefundamentalsofbothvisionanddigitaltechnology.Thisconferencehasincludedmanycolordisciplinesthathavecontributedtothetheoryandpracticeoftoday’simaging:theoryofhumanvision,colorconstancy,modelsofvision,digitaloutput,highdynamicrangeimaging,andtheunderstandingofperceptualmechanisms.Beforedigitalimaging,silverhalidecolorwasapixelbasedmechanism.Colorfilmsarecloselytiedtocolorimetry,thescienceofmatchingpixelsinablacksurround.Thequantacatchofthesensitizedsilversaltsdeterminestheamountofcoloreddyesinthefinalprint.Therapidexpansionofdigitalimagingoverthepast25yearshaseliminatedthelimitationsofusingsinglepixelsinformingimages.Spatialinteractionscannowgenerateimagesmorelikevision.Sincethe1950’s,neurophysiologyhasshownthatpost-receptorneuralprocessingisbasedonspatialinteractions.Theseresultsreinforcedthefindingsof19thcenturyexperimentalpsychology.ThispaperreviewstheroleofHVEIincolor,emphasizingtheinteractionofresearchonvisionandthenewalgorithmsandprocessesmadepossiblebyelectronicimaging.

8651-8, Session 2

Presenting visual stimuli: past and present (Invited Paper)GeraldWestheimer,Univ.ofCalifornia,Berkeley(UnitedStates)

Candle,lightbulb,Xenonarc,LED,computermonitor;NicolprismandPolaroid;cobaltglass,interferencefilterandmonochromaticlaserlight.Inhis60yearsasanexperimentalvisionresearcherthespeakerhasusedthemall,excepttheveryfirst.Buteventhatiscoveredinthistalkontheinfluenceofthestateoftechnologicaldevelopmentontheaspectsofvisionstudiedatanyonetimeandthewayitisstudied,fromMaxwellandMachtothepresent.

8651-9, Session 2

Emerging technologies: 25 years (Invited Paper)HawleyK.RisingIII,Consultant(UnitedStates)

Thispaperwilltalkaboutthetechnologiesthathavebeenemergingoverthe25yearssincetheHumanVisionandElectronicImagingconferencebeganthattheconferencehasbeenapartof,andthathavebeenapartoftheconference,andwilllookatthosetechnologiesthatareemergingtoday,suchassocialnetworks,haptictechnologies,andstillemergingimagingtechnologies,andwhatwemightlookatforthefuture.

Twenty-fiveyearsisalongtime,anditisnotwithoutdifficultythatwerememberwhatwasemerginginthelate1980s.Yettobedeveloped:Thefirstcommercialdigitalstillcamerawasnotyetonthemarket,althoughtherewerehandheldelectroniccameras.Personalcomputerswerenotdisplayingstandardizedimages,andimagequalitywasnotsomethingthatcouldbetalkedaboutinastandardizedfashion,ifonlybecauseimagecompressionalgorithmswerenotstandardizedyetforseveralyearshence.Evenfurtherawaywereanystandardsformoviecompressionstandards,therewasnopersonalcomputerevenonthehorizonwhichcoulddisplaythem.Whatbecameanemergenttechnologyandfilledmanysessionslater,imagecomparisonandsearch,wasnotpossible,northecurrentemergingtechnologyofsocialnetworks--theworldwidewebwasstillseveralyearsaway.Printertechnologywasstilldevisingdithersandimagesizemanipulationswhichwouldconsumemanyyears,aswouldscanningtechnology,andimagequalityforbothwasamajorissuefordithersandFouriernoise.

Fromthesehumblebeginningstothecurrentmovesthatarechanging

computingandthemeaningofbothelectronicdevicesandhumaninteractionwiththem,wewillseeacoursethroughthechangingtechnologythatholdssomefeaturesconstantformanyyears,whileotherscomeandgo.

8651-10, Session 2

What brain imaging technology can tell us about perception and consciousness (Invited Paper)LoraT.Likova,TheSmith-KettlewellEyeResearchInstitute(UnitedStates)

Theultimategoalofelectronicimagingistoprovideeffectivepresentationofvisualinformation,andtheHVEIConferenceistheuniqueinterfacebetweenelectronicmediaandhumanperception/cognition.Theinclusionofbrainimagingisthelatestmanifestationofthislinkage,whichhasbloomedintoathemeforwell-attendedSpecialSessionsinthelastseveralyears.Thetopicshaverangedfromdetailedmappingofprimaryvisualcortexandsubsequentcorticalhierarchytotheneuralcircuitryunderlyingtheperceptionofdynamicimages,includingthosegivingrisetothe3Dperceptsofdepthandstereomotion,aswellastotheissueofvisualattentionandvisualconsciousness.Thesefieldshaveevenextendedtocross-modalbrainplasticityandawarenessinnotonlyimageencodingandretrieval,butinsensorimotorbrainprocessingofmotorcontrolandhumanimagereproduction(i.e.,drawing).Theprogressiveincorporationofthebrainimagingfieldintoourinterdisciplinarycommunitywillbecrucialforbreakthroughadvancetothenextgenerationofelectronicmedia.

8651-11, Session 2

Perceptual approaches to finding features in data (Invited Paper)BerniceE.Rogowitz,VisualPerspectivesConsulting(UnitedStates)

Electronicimagingapplicationshingeontheabilitytodiscoverfeaturesindata.Forexample,doctorsexaminediagnosticimagesfortumors,brokenbonesandchangesinmetabolicactivity.Financialanalystsexplorevisualizationsofmarketdatatofindcorrelations,outliersandinteractioneffects.Seismologistslookforsignaturesingeologicaldatatotellthemwheretodrillorwhereanearthquakemaybegin.Thesedataareverydiverse(images,numbers,graphs,3-Dgraphics,text,etc.)and,mainlybecauseofautomaticdatacollectiontechnologiessuchassensorsanddigitalimaging,aregrowingexponentially.Thispaperexploreshowtheartandscienceoffindingfeaturesindatahaveevolvedoverthepast25years,andhowresearchinhumanvisionandcognitionhavedriventhisprogress.WewilldiscusskeycontributionsfromtheConferenceonHumanVisionandElectronicImaging,coveringkeytopicssuchasshapeperception,interactivevisualization,virtualrealityandhapticinterfaces,andwillexploreresearchtrendsthatwilldrivethenextgenerationofpattern-recognitiontechnologies.

8651-12, Session 3

Is image quality a function of contrast perception? (Invited Paper)AndrewM.Haun,EliPeli,SchepensEyeResearchInstitute(UnitedStates)

Inthisretrospectivewetracethedevelopmentofimagequalitymeasuresbasedonmodelsoftheearlystagesofthehumanvisualsystem(HVS).Luminancecontrastisthefundamentalattributeofnaturalimages,andunderstandingitsnaturalandpsychophysicalpropertieshasbeenimportantindevelopingnumerousdigitalimagetechnologies.WhiletheSPIEHumanVisionandElectronicImagingmeetinghasstrivedtofindpointsofcontactbetweenthestudyofhumanpsychophysicsandthedevelopmentofcomputervisionand

Conference 8651


imagequalityalgorithms,progressinthefieldhasnotalwaysbeenmadeontheseterms.Itwasassumedthatbyunderstandingthepsychophysicalpropertiesofcontrastperception,andbymodelingthesemoreandmoreprecisely,modelsofimagequalitycouldbedesigned.EncodedimagescouldbepresentedtoasimulatedHVS,whichwouldthenreturnaverdictonquality.Instead,themostsuccessfulimagequalitymetricstodaytendtoincludeonlyhighlyabstractedHVScomponents.Weconcludewithapuzzle:whydodifferentqualitymetricswithsuchdisparatestructuresastheVSNR,VIF,andSSIMtonamejustafew,allworksowell?Whatdotheyhaveincommon,andwhatdotheyhavetodowithhumanspatialvision?

8651-13, Session 3

Visible contrast energy metrics for detection and discrimination (Invited Paper)AlbertJ.Ahumada,NASAAmesResearchCtr.(UnitedStates)

ContrastenergywasproposedbyWatson,Robson,&Barlow(Science,1982)asausefulmetricforrepresentingluminancecontrasttargetstimulibecauseitrepresentsthedetectabilityofthestimulusinphotonnoiseforanidealobserver.Liketheeye,theearisacomplextransducersystem,butrelativelysimplesoundlevelmetersareusedtocharacterizesounds.Thesemetersprovidearangeoffrequencysensitivityfunctionsandintegrationtimesdependingontheintendeduse.Weproposeheretheuseofarangeofcontrastenergymeasureswithdifferentspatialfrequencycontrastsensitivityweightings,eccentricitysensitivityweightings,andtemporalintegrationtimes.Whendetectionthresholdareplottingusingsuchmeasures,theresultsshow“whattheeyeseesbest”whenthesevariablesaretakenintoaccountinastandardway.ThesuggestedweightingfunctionsrevisetheStandardSpatialObserver(Watson&Ahumada,J.Vision,2005)forluminancecontrastdetectionandextenditintothenearperiphery.Undertheassumptionthatthedetectionislimitedonlybyinternalnoise,discriminationperformancecanbepredictedbymetricsbasedonthevisibleenergyofthedifferenceimages(Watson&Ahumada,J.Vision,2008).

8651-14, Session 4

Visual image quality: coding (Invited Paper)SheilaS.Hemami,CornellUniv.(UnitedStates)

NoAbstractAvailable

8651-15, Session 4

Initial spatio-temporal domain expansion of the Modelfest databaseThomCarney,SaharMozaffari,SeanSun,RyanJohnson,SharonaShirvastava,PriscillaShen,EmmaLy,Univ.ofCalifornia,Berkeley(UnitedStates)

ThefirstModelfestgrouppublicationappearedintheSPIEHumanVisionandElectronicImagingconferenceproceedingsin1999.“Oneofthegroup’sgoalswastodevelopapublicdatabaseoftestimageswiththresholddatafrommultiplelaboratoriesfordesigningandtestingHumanVisionModels”Thegroupcollectedandmadepublicdetectionthresholddataforstaticimagestobeusedtotesthumanvisionmodels.Inthesameveinwehavebegunextendingthedatabaseintothetemporaldomainwithanexpandedstimulussetof41videoclips,partiallyoverlappingwiththeoriginalstimulibutspanningabroadrangeoftemporalfrequencies.Averagedetectionthresholdsforsevensubjectsarepresentedalongwithhistoricalvaluesfromtheliterature.LiketheoriginalModelfestdataset,theparticipationofmultiplelaboratoriesisneededensurethereliabilityoftheresultsandtocomparemethodsacrosslaboratories.Welookforwardtoothergroupsjoiningusinthisendeavor.

8651-16, Session 4

A database of local masking thresholds in natural imagesMd.MushfiqulAlam,KedarnathP.Vilankar,DamonM.Chandler,OklahomaStateUniv.(UnitedStates)

Adatabaseoflocalcontrastdetectionthresholdsoffifteennaturalimageswaspresented.Currently,weareextendingthedatabasebyexperimentingonadditionalfifteennaturalimages.Throughquantitativemeasures,weprovedtheintra-subjectandinter-subjectconsistencyofourexperimentalprocedure.Wealsoanalyzedtherelationshipbetweensomelocallow-levelmaskfeaturesandcorrespondingthresholds.Wefoundthat,exceptsharpness,individualfeatureshaveweakrelationshipwithcontrastthresholds.Ournextstepistostudy,whetherthecombinedlow-levelfeatureshavestrongrelationshipwiththedetectionthresholds.WealsoanalyzedtheperformanceofWatsonandSolomon’sneuralmaskingmodelandtwoimage-qualityassessmentalgorithms-MADandMS-SSIMinpredictingthelocalcontrastdetectionthresholds.Amongthethreemodels,WatsonandSolomon’sneuralmodelperformedthebestinpredictingthethresholds.Nevertheless,thepredictionsarenotquiteclosetotheground-truththresholds.Theseresultssuggestawide-scopeofimprovingtheneuralmaskingmodelsinpredictinglocalcontrastdetectionthresholdsinnaturalimages.Ourdatabaseprovidestheprimarydataforthismodeling.Currently,wearedesigninganeuralmaskingmodelfornaturalimageswiththeconsiderationofmultiplelow-levelmaskfeatures.Weintendtoreporttheoutcomesofourstudyinthefull-paper.

8651-17, Session 4

Interplay between image coding and quality estimationGuilhermeO.Pinto,SheilaS.Hemami,CornellUniv.(UnitedStates)

Inimageprocessing,imagequalitymodelsareusefulinseveralareas,suchasinimagequalityestimationandinimagecoding.Intheformer,theimagemodelsareusedtoestimatetheperceivedqualityofimagesthathaveundergonedifferenttypesofdistortions,includingbutnotlimitedtowhitenoise,transmissionerrors,lossy-compressionandblur.Inthelatter,theimagemodelsareusedtoachieveaperceptuallyoptimalbit-allocationacrossthedifferentsubbandsandcode-blocks.Intheareaofimagequalityestimation,severalalgorithms,suchasSSIM,VIF,MADandVSNR,havebeenproposedinordertoovercometheknownlimitationsofthemeansquarederror(MSE)distortionmetric.Inaddition,arecentworkinimagequalityandutilityestimationhasalsoindicatedthattheMSEqualityestimatorpresentsapoorperformanceintheLow-QualityRegime(LQR),whileotherqualityandutilityestimatorsareabletoperformwell.Althoughtherehavebeensomesignificantdevelopmentsinthefieldofqualityestimation,theJPEG-2000standardstillusestheMSEdistortionmetric.Onepossiblereasonforthisliesinthefactthatmanyofthestate-of-the-artqualityandutilityestimationalgorithmsweredevelopedafterthedevelopmentofJPEG-2000imagecodingstandard.Thisbecausemanyofthestate-of-the-artqualityestimationalgorithmsweredevelopedafter2000,whichcorrespondstotheyearofdevelopmentofJPEG-2000.Thefinalpaperwillstudytheuseofdifferentimagequalityandutilityestimators,suchasMS-SSIM,VIFandMS-NICE,inimagecodingapplications.%Foragivencodingpassandestimator,itispossibletocomputetheresultingdistortionintwodifferentways.Thedistortioncanbecomputedforagivencode-blockandsubbandinthewaveletdomain,orthesameestimatorcanbeusedtocomputetheresultingdistortioninthecodedimage,inthepixeldomain.Consideringdifferentqualityandutilityestimators,thefinalpaperwillcomparetheperformancesofthesetwodifferentapproaches.Thepreliminaryresultsgiveninthisabstractfocusontheformerapproach.

Conference 8651


8651-18, Session 5

From image quality to atmosphere experience: how evolutions in technology impact experience assessment (Invited Paper)IngridHeynderickx,PhilipsResearchNederlandB.V.(Netherlands)andTechnischeUniv.Delft(Netherlands);HuibdeRidder,TechnischeUniv.Delft(Netherlands)

Imagequalityisaconceptthatforlongverywellservedtooptimizedisplayperformanceandsignalquality.Newtechnologicaldevelopments,however,forcedthecommunitytolookintohigherlevelconceptstocapturethefullexperience.Termsasnaturalnessandviewingexperiencewereusedtooptimizethefullexperienceof3D-displaysandAmbilightTV.Thesehigherlevelconceptscapturedifferencesinimagequalityanddifferencesinperceiveddepth,orperceivedviewingfield.Withtheintroductionofsolidstatelighting,furtherenhancingthemultimediaexperience,yetmoreadvancedqualityevaluationconceptswillbeneededinthefuturetooptimizetheoverallexperience.

8651-19, Session 5

Preference limits of the visual dynamic range for ultra high quality and aesthetic conveyanceScottJDaly,DolbyLabsInc(UnitedStates);TimoKunkel,XingSun,SuzanneFarrell,PoppyCrum,DolbyLabs.,Inc.(UnitedStates)

NoAbstractAvailable

8651-20, Session 5

Quantifying image quality in graphics: perspective on subjective, objective metrics, and their performance in computer graphics applicationsRafalMantiuk,BangorUniv.(UnitedKingdom)

Weexplorethreeproblemsrelatedtoqualityassessmentincomputergraphics:thedesignofefficientuserstudies;thescene-referredmetricsforcomparinghigh-dynamic-rangeimages;andthecomparisonofmetricperformanceforthedatabaseofcomputergraphicsdistortions.Thispapersummarizesthemostimportantobservationsfrominvestigationoftheseproblemsandgivesahighlevelperspectiveontheproblemofqualityassessmentingraphics.

8651-21, Session 5

Visualizing lighting with images: converging between the predictive value of renderings and photographsUlrichEngelke,MariskaG.M.Stokkermans,PhilipsResearch(Netherlands);MichaelJ.Murdoch,PhilipsResearchNederlandB.V.(Netherlands)

Lightingsystemsareusuallydesignedbasedonreal-worldprototypes,whichareexpensiveandtimeconsumingtobuild.Computer-basedrenderingsaretypicallyusedtocircumventtheseshortcomings.Thesuccessofavirtualprototypedependsonitsabilitytopredicthumanperceptionofthereal-world.Previouslyweshowedthatrenderingspredictareallightingsystemreasonablywell.Still,amarginofpredictionerrorremainedevenafteridentifyingthemostsuitablerenderingpipeline.Thiscouldbepartlyduetopresentinga3Dreal-worldsceneona2Ddisplay,thus,losingonedimensionthatmightbeinstrumentalinjudginglightingperception.

Thisworkaimsatbetterunderstandingtheupperlimitofpredictionaccuracythatcanbeachievedwithanimagecomparedtothereal-world.Weconsiderphotographsasagroundtruthbecauseoftheirgeometricalaccuracy.Weperformedpsychophysicalexperimentsinwhichpeopleratedthelightingofarealofficeroomaswellasofphotographsandrenderingsofthatroom.Theoutcomesrevealthatanerrormarginremainsalsobetweenthephotographsandthereal-world.However,thephotographsgenerallypredictlightingperceptionsuperiortotherenderings.Theresultsareutilizedtofurtherimprovetherenderingstoconvergetowardsthepredictivevalueofphotographs.

8651-22, Session 6

A survey on 3D quality of experience and 3D quality assessmentAnushK.Moorthy,TexasInstrumentsInc.(UnitedStates);AlanC.Bovik,TheUniv.ofTexasatAustin(UnitedStates)

Thisarticlesummarizesrecentadvancesinalgorithmicallyevaluatingthequalityofstereoscopicpresentations.Weundertakeabriefreviewofresearchintheareaof3Dqualityofexperiencewhichencompasses(1)ge-ometry,visualdiscomfortetc.,(2)displayissuessuchascross-talkand(3)qualityassessmentaspreviouslydefined.Wedescribealgorithmsanddatabasesthathavebeenproposedintheliteratureandanalyzethemeritsanddemeritsoftheproposedapproaches.Webeginbysummarizingwhathasbeendoneintheareaofvisualdiscomfortandgeometrywhenviewing3Dpresentations.Weexplainthestate-of-the-artof3Dqualityassess-ment,althoughthefieldisstillnascentandthetopicsodiversethatoursurveyisbynomeanscomprehensive.Ratherweprovideabirds-eyeviewofcurrentprevailingthoughtonthetopic.Wesummarizedatabasesthathaverecentlybeenproposedfor3Dqualityassessment.Finally,wedescribearecent,extremelyusefultoolforresearchersintheareaof3Dimagequalityassessment(IQA)–theLIVE3DIQAdatabase.TheLIVE3DIQAdatabaseistheonlydatabaseofitskindthatnotonlyincorporatesreferenceanddistortedimages,butalsotruedepthinformationcapturedusingahigh-precisionlaserrangescanner.TheLIVE3DIQAdatabaseisfreelyavailabletoresearcherstoenablerapidprogressin3DIQAresearch.

8651-23, Session 6

Visual quality beyond artifact visibilityJudithA.Redi,TechnischeUniv.Delft(Netherlands)

Inthelastthreedecadesalotofefforthasbeendevotedtothedevelopmentoftechnologiesthatcanpredictthevisualqualityofimagesandvideos,asabasisforthedeliveryofoptimalvisualqualitytotheuser.Thesesystemshavebeenbasedforthemostpartonavisibility-centricapproach,assumingthathighartifactvisibilitycorrespondstohighannoyanceand,therefore,tolowvisualquality.Despitetheremarkableresultsachievedwiththisapproach,recentlyanumberofstudiessuggestedthatthevisibility-centricapproachtovisualqualitymighthavelimitations,andthatotherfactorsmightinfluencetheoverallqualityimpressionofanimageorvideo,dependingoncognitiveandaffectivemechanismsthatworkontopofperception.Inparticular,interestinthevisualcontent,engagementandaestheticappealhavebeenfoundtopositivelyimpactontheoverallqualityimpressionoftheimage/video.Inthispaper,wereviewthesestudiesandexploretheimpactthataffectiveandcognitiveprocesseshaveonthevisualquality.Inaddition,asacasestudy,wewillpresenttheresultsofanexperimentinvestigatingontheimpactofaestheticappealonvisualquality.

8651-24, Session 6

Subjective matters: from image quality to image psychology (Invited Paper)ElenaA.Fedorovskaya,RadixNova(UnitedStates);HuibdeRidder,TechnischeUniv.Delft(Netherlands)

Conference 8651


Fromtheadventofdigitalimagingthroughseveraldecadesofstudies,humanvisionresearchcommunitysystematicallyfocusedonperceivedimagequalityanddigitalartifactsduetoresolution,compression,gamma,dynamicrange,captureandreproductionnoise,blur,etc.,tohelpovercomeexistingtechnologicalchallengesandshortcomings.Technologicaladvancesmadedigitalimagesanddigitalmultimedianearlyflawlessinquality,andubiquitousandpervasiveinusage,providinguswiththeexcitingpossibilityandatthesametimedemandingtoturntothedomainofhumanexperienceincludinghigherpsychologicalfunctions,suchascognition,emotion,awareness,socialinteraction,consciousnessandSelf.

Inthepaperwewilloutlinetheevolutionofhumancenteredmultidisciplinarystudiesrelatedtoimagingasweseeitandproposestepsandpotentialfocioffutureresearch.

8651-25, Session 7

The rough side of texture: texture analysis through the lens of HVEI (Invited Paper)ThrasyvoulosN.Pappas,NorthwesternUniv.(UnitedStates)

NoAbstractAvailable

8651-26, Session 7

Adapting environments to observers (Invited Paper)MichaelA.Webster,Univ.ofNevada,Reno(UnitedStates)

Visualcodingcontinuallyadaptstomatchthecurrentenvironment.Yettheextenttowhichthisadaptationaffectsperceptionandperformanceremainsuncertain,inpartbecauseadaptationistypicallystudiedonlyoverbrieftimescales.Weexploredpotentialconsequencesofverylong-termadaptationtocolorindifferentenvironments,byadjustingmagestosimulatetheirappearanceundertheoreticallycompleteadaptation.Thesimulationisbasedonplausiblepropertiesofcolormechanismsandhowtheywouldadapttonaturalcolordistributions,withimagesrenderedbasedontheadaptedresponses.Thesimulatedimagesallowustoexploreanumberofquestionsabouttheconsequencesandfunctionsofadaptationthataredifficulttostudybyinsteadadaptingtheobserver.Theseincludehowmuchcolorappearanceispredictedtovarywhenthesameobserverisplacedindifferentenvironments,orwhendifferentobserversareexposedtoacommonenvironment.Measuresofvisualsearchalsorevealtheextenttowhichcompleteadaptationcouldenhancesensitivityandsalience.Pre-adaptingimagescouldespeciallyenhanceinformationinuncommonenvironments.Themethodthusprovidesanoveltheoreticalapproachtounderstandingadaptationatlongtimescalesandanoveltechniqueforprocessingimagestofacilitatehowwellobserverscaninteractwiththem.

8651-27, Session 8

Efficient image representations and features (Invited Paper)ErhardtBarth,Univ.zuLübeck(Germany);MichaelDorr,HarvardMedicalSchool(UnitedStates);EleonoraVig,HarvardUniv.(UnitedStates)

Interdisciplinaryresearchinhumanvisionandelectronicimaginghas

greatlycontributedtothecurrentstateoftheartinimagingtechnologies.Imagecompressionandimagequalityareprominentexamplesandtheprogressmadeintheseareasreliesonabetterunderstandingofwhatnaturalimagesareandhowtheyareperceivedbythehumanvisualsystem.Akeyresearchquestionhasbeen:giventhe(statistical)propertiesofnaturalimages,whatarethemostefficientandperceptuallyrelevantimagerepresentations,whatarethemostprominentanddescriptivefeaturesofimagesandvideos?

Wegiveanoverviewofhowthesetopicshaveevolvedoverthe25

yearsofHVEIconferencesandhowtheyhaveinfluencedthecurrentstateoftheart.Thereareanumberofstrikingparallelsbetweenhumanvisionandelectronicimaging.Theretinadoeslateralinhibition,oneoftheearlycoderswasusingaLaplacianpyramid;primaryvisualcorticalareashaveorientation-andfrequency-selectiveneurons,thecurrentJPEGstandarddefinessimilarwavelettransforms;thebrainusesasparsecode,engineersarecurrentlyexcitedaboutsparsecodingandcompressedsensing.SomeofthishasindeedhappenedattheHVEIconferencesandwewouldliketodistillthat.

8651-28, Session 8

Highly overcomplete sparse codingBrunoA.Olshausen,HelenWills,Univ.ofCalifornia,Berkeley(UnitedStates)

NoAbstractAvailable

8651-29, Session 8

Blind image quality assessment without training on human opinion scoresAnishMittal,RajivSoundararajan,GautamS.Muralidhar,AlanC.Bovik,JoydeepGhosh,TheUniv.ofTexasatAustin(UnitedStates)

Weproposeafamilyofimagequalityassessment(IQA)modelsbasedonnaturalscenestatistics(NSS),thatcanpredictthesubjectivequalityofadistortedimagewithoutreferencetoacorrespondingdistortionlessimage,andwithoutanytrainingresultsonhumanopinionscoresofdistortedimages.These‘completelyblind’modelscompetewellwithstandardnon-blindimagequalityindicesintermsofsubjectivepredictiveperformancewhentestedonthelargepubliclyavailable‘LIVE’ImageQualitydatabase


Picture perception and visual fieldAndreaJ.vanDoorn,HuibdeRidder,TechnischeUniv.Delft(Netherlands);JanJ.Koenderink,TechnischeUniv.Delft(Netherlands)andKatholiekeUniv.Leuven(Belgium)

Lookingatapicturefillspartofthevisualfield.Inthecaseofstraightphotographsthereisanotionofthe“FieldofView”ofthecameraatthetimeofexposure.Isthereacorrespondingnotionfortheperceptionofthepicture?Inmostcasesthepartofthevisualfield(asmeasuredindegrees)filledbythepicturewillbequitedifferentfromthefieldofviewofthecamera.Thecaseofworksofartsisevenmorecomplicated,thereneednotevenexistawelldefinedcentralviewpoint.Withseveralexamplesweshowthatthereisessentiallynonotionofacorresponding“fieldofview”inpictorialperception.Thisiseventhecasefordrawingsinconventionallinearperspective.Apparentlythe“mentaleye”oftheviewerisoftenunrelatedtothegeometryofthecamera(orperspectivecenterusedindrawing).Observersoftensubstitutetemplatesinsteadofattemptingananalysisofperspective.


Measurements of achromatic and chromatic contrast sensitivity functions for an extended range of adaptation

luminanceKilJoongKim,SeoulNationalUniv.(Korea,Republicof);RafalMantiuk,BangorUniv.(UnitedKingdom);KyoungHoLee,SeoulNationalUniv.(Korea,Republicof)andBundangHospital(Korea,Republicof)

Introduction:Thecontrastsensitivityfunctions(CSFs)areoneofthe

Conference 8651


maincomponentsofvisualmodelsandmetrics.MostmodelsofCSFsweredevelopedbasedonthedatafromnumeroushistoricalpsychophysicalexeperiments,whichoftendifferedintheirmethods,stimuliandviewingconditions.Also,thesuchdataisusuallyavailableforverylimitedrangeofspatialfrequenciesandluminanceadaptationlevels.VeryfewmeasurementsweredoneforthechromaticCSFsandthosemeasurementsweremeasuredforthesensitivitytoequiluminantcolorcontrastratherthanluminancecontrast.Forthosereasons,weconductedaseriesofexperimentstomeasureachromaticandchromaticCSFsatdifferentlevelsofadaptationluminance,rangingfrom0.02to200cd/m2.

Experimentsetting:TheexperimentdesignwasinspiredbytheModelFestandColorFestdatasets.Thestimuliwereshownona24’’LCDdisplaywitha10-bitpanelandRGBLEDbacklight.Stimuliwereobservedfromafixeddistanceof93cm,whichgaveanangularresolutionof60pixelspervisualdegree.Thedisplaywascalibratedusingaspectro-radiometer.ThedisplaywhitepointwasfixedatD65.Thestimuliweregeneratedfordifferentcolordirections,frequencies,andlevelsofadaptationluminance.

Experimentprocedure:Fiveobserversparticipated.Foreachstimulus,eachobserverfoundthethresholdusingtheQUESTprocedurewith25trials.Theexperimentinvolveda4-alternative-forced-choicemethodinwhichanobserverwasaskedtochooseoneofthefourstimuli,ofwhichonlyonecontainedthepattern.Thestimuliwereshownside-by-sideonthesamescreenandthepresentationtimewasnotlimited.

Stimuli:Thestimuliconsistedofverticalsine-gratingsattenuatedbytheGaussianenvelopewithsigmaof1.5deg.Thestimuliincludedthreecolordirections,black-white,red-green,andblue-yellow.Thefrequencyrangeofthesinegratingsvariedfrom0.125to16cyclesperdegree(cpd).Thebackgroundluminancevariedfrom0.02to200cd/m2.Theluminancelevelsbelow10cd/m2wereachievedbywearingmodifiedweldinggogglesinwhichtheprotectiveglasswasreplacedwithneutraldensityfilters.Stimuliwereobservedwithanaturalpupil.

Results:Theresultsweretabulatedinthefiguresbelow.Thedatawillbediscussedinthefullpaperifitisaccepted.


Viewer preferences for adaptive playoutSachinG.Deshpande,SharpLabs.ofAmerica,Inc.(UnitedStates)

Adaptivemediaplayouttechniquesareusedtoavoidbufferunderflowinadynamicstreamingenvironmentwheretheavailablebandwidthmaybefluctuating.Inthispaperwereporthumanperceptionsfromaudioqualitystudiesthatweperformedonspeechandmusicsamplesforadaptiveaudioplayout.TestmethodsfromITU-RBS.1534-1recommendationwereused.Studieswereconductedforbothslowplayoutandfastplayout.Twoscales-acoarsescaleandafinerscalewasusedfortheslowandfastaudioplayoutfactors.Resultsfromourstudycanbeusedtodetermineacceptableslowandfastplayoutfactorsforspeechandmusiccontent.Anadaptivemediaplayoutalgorithmcoulduseknowledgeoftheseupperandlowerboundstodecideitsplaybackschedule.


The effect of familiarity on perceived interestingness of imagesElenaA.Fedorovskaya,RadixNova(UnitedStates);SharonLynnChu,VirginiaPolytechnicInstituteandStateUniv.(UnitedStates);JeffreySnyder,KodakResearchLabs.(UnitedStates);FrancisQuek,VirginiaPolytechnicInstituteandStateUniv.(UnitedStates)

Thispaperaddressestheindividualizedcontentadaptationofmediaforimageexperienceandconsumption.Weinvestigatefactorsbywhichimages/mediamaybepersonalizedwithrespecttointerestingness,consideringhowtwodimensionsoffamiliarity:facialfamiliarityandfamiliaritywithimagecontextorsetting,relatestointerestingness.Throughasequenceofexperiments,welookedat1.Howmanipulationoffacialsimilaritythroughmorphingmayinfluencethesenseoffamiliaritywithanimage;2.Whethervaryingthesource

faceinthemorphingproceduretowardsatargetfaceaffectsperceivedvisualsimilarityofthemorphresultwiththetargetface;and,3.Howtheviewer’sfamiliaritywiththepersonandcontextinphotographscaninfluencetheirinterestinthephoto.Wepositedthatbycreatingpersonalmeaning,familiaritycancauseimagestobeperceivedasinteresting.Fourlevelsofpersonfamiliarityvaryingindegreeofpersonknowledge,andtwolevelsofcontextfamiliarityvaryinginfrequencyofexposure,wereconsidered:Self,Friend,Celebrity,andStrangerinFamiliarandUnfamiliarcontexts.Personfamiliaritywasobtainedthroughfacemorphing.Experimentalresultsshowedinterestingsignificantmaineffectsofcontextandpersonfamiliarity.Ourfindingsdeepenourunderstandingofthecriticalelementoffamiliarityanditsrelationtointerestingnessinthecontextofimages,andcanimpactthedesignofimage-relatedsystems.Wediscusshowourfindingsmaybeappliedinmediaadaptationstoinfluenceinterestingnesstoculturalgroupsandindividuals.


Quantifying patterns of dynamics in eye movement to measure goodness in organization of design elements in interior architecture HastiMirkia,ArashSangari,MarkNelson,AmirH.Assadi,Univ.ofWisconsin-Madison(UnitedStates)

Architecturebringstogetherdiverseelementstoenhancetheobserver’smeasureofestheticsandtheconvenienceoffunctionality.Architectsoftenconceptualizesynthesisofdesignelementstoinvoketheobserver’ssenseofharmonyandpositiveaffect.Howdoesanobserver’sbrainrespondtoharmonyofdesignininteriorspaces?Oneimplicitconsiderationbyarchitectsistheroleofguidedvisualattentionbyobserverswhilenavigatingindoors.PriorvisualexperienceofnaturalscenesprovidestheperceptualbasisforGestaltofdesignelements.Incontrast,Gestaltoforganizationindesignvariesaccordingtothearchitect’sdecision.Weoutlineaquantitativetheorytomeasurethesuccessinutilizingtheobserver’spsychologicalfactorstoachievethedesiredpositiveaffect.Weoutlineaunifiedframeworkforperceptionofgeometryandmotionininteriorspaces,whichintegratesaffectiveandcognitiveaspectsofhumanvisioninthecontextofanthropocentricinteriordesign.Theaffectivecriteriaarederivedfromcontemporarytheoriesofinteriordesign.Ourcontributionistodemonstratethattheneuralcomputationsinanobserver’seyemovementcouldbeusedtoelucidateharmonyinperceptionofform,spaceandmotion,thusameasureofgoodnessofinteriordesign.Throughmathematicalmodeling,wearguetheplausibilityoftherelevanthypotheses.


Development of a human vision simulation camera and its application: implementation of specific color perceptionHiroshiOkumura,ShoichiroTakubo,ShoichiOzaki,TakeruKawasaki,IndraN.Abdullah,KoheiArai,SagaUniv.(Japan);OsamuFukuda,NationalInstituteofAdvancedIndustrialScienceandTechnology(Japan)

Humaneyehasalotofphotoreceptorcellsinitsretina.Humanphotoreceptorcellsconsistof“cone”typeand“rod”type.

Conecellsthatareresponsibleforcolorvisionareconcentratedatthemaculaandrodcellsareconcentratedarroundthemaculaandareusedinperipheralvision.Moresensitivethanconecells,rodcellsarealmostentirelyresponsibleforscotopicvision.Purkinjeeffectformesopicandscotopicvisionandadaptationarecausedbyunevendistributionandsensitivityofphotoreceptorcells.Inthisstudy,HuVisCam,ahumanvisionsimulationcamera,thatcansimulatenotonlyPurkinjeeffectformesopicandscotopicvisionbutalsodarkandlightadaptation,abnormalmiosisandabnormalmydriasiscausedbytheinfluenceofmydriasismedicineornerveagentisdeveloped.Inthisarticle,improvementofHuVisCamforspecificcolorperceptionisdiscussed.

Conference 8651


Forpersonswithnormalcolorperception,simulationfunctionofvarioustypesofspecificcolorperceptionisprovided.

Inaddition,forpersonswithspecificcolorperception,colorinformationpickerisalsoprovided.


IMF-based chaotic characterization of AP and ML visually-driven postural responsesHanifAzhar,GuillaumeGiraudet,JocelynFaubert,Univ.deMontréal(Canada)

Theobjectivewastoanalyzevisuallydrivenposturalresponsesandcharacterizeanynonlinearbehaviour.Werecordedphysiologicalresponsesfortwoadults,260trialseach.Thesubjectsmaintainedquitestancewhilefixatingforfoursecondswithinanimmersiveroom,EONIcube™[9],wherethereferencetothevisualstimuli,i.e.,thevirtualplatform,randomlyoscillatedinGaussianorientation900and2700forantero-posterior(AP),and,00and1800formedio-lateral(ML)atthreedifferentfrequencies(0.125,0.25,and0.5Hz).Weaccomplishedstationaryderivativesofposturetimeseriesbytakingtheintrinsicmodefunctions(IMFs).ThephasespaceplotofIMFshowsevidenceoftheexistenceofnonlinearattractorsinbothMLandAP.CorrelationintegralslopewithincreasingembeddingdimensionissimilartorandomwhitenoiseforML,andsimilartononlinearchaoticseriesforAP.Next,recurrenceplotsindicatetheexistenceofmorenon-linearityforAPthanthatforML.Thepatternsofthedotsafter200thtimestamp(nearonset)appearstobeaperodicinAP.Athighertemporalwindows,APentropytendsmoretowardchaoticseries,thanthatofML.Therearestrongernon-linearcomponentsinAPthanthatinMLregardlessofthespeedconditions.


Application of imaging technology for archaeology researches: framework design for connectivity analysis in pieces of Jomon potteryKimiyoshiMiyata,NationalMuseumofJapaneseHistory(Japan);RyotaYajima,KenichiKobayashi,ChuoUniv.(Japan)

JomonpotteryisonekindofearthenwareproducedinJomonperiodinJapan.Potteriesarefoundbytheexcavationsinarchaeologicalsites,howevertheiroriginalwholeshapeshavebeendismissedbecausethosearebrokenandseparatedintosmallpieces.Intheinvestigationprocess,reproductionofthewholeshapeofthepotteriesisanimportantanddifficulttaskbecausetherearealotofpiecesandthenumberofcombinationsamongthepiecesishuge.

Inthispaper,aframeworkfortheapplicationoftheimagingtechnologyisdiscussedatfirst,thenconnectivityanalysisamongthepiecesofJomonpotteriesisfocusedontoreducethenumberofthetrialanderrortofindconnectablecombinationsofthepieces.Therealpiecesarechosenandtakenbyadigitalcamera,andeachpieceintheimageislabeledtocalculatethestatisticalinformationincludinghistogramsusedasfeaturesintheanalysisoftheconnectivity.Anindexshowingtheconnectivityofthepiecesisdefinedandcalculatedtoindicatetheconnectivityasamatrixshowingpossibilityofallofcombinationsamongthepieces.Finally,validityofcalculatedindexisconfirmedbythearchaeologistasanexperimentalresult.


Top-down visual search in WimmelbildJuliaBergbauer,TechnischeUniv.München(Germany);SibelTari,MiddleEastTechnicalUniv.(Turkey)

Wimmelbildwhichmeans“teemingfigurepicture”isapopulargenreofvisualpuzzles.AbundantmassesofsmallfiguresarebroughttogetherincomplexarrangementstomakeonesceneinaWimmelbild.Itis

picturehuntgame.Assuch,ithasthepotentialtobeanexcellenttestbedfortop-downvisualsearchalgorithms.

FigurehuntinggamesstartingwithearlyGestalthaveplayedanoticeableroleinvisualperceptionstudies.Today,aquicksearchininternetretrievesplentyofchildren-orientedsamplesofthegenreofWimmelbildaswellasstrikingsamplesofcamouflageart,formingawidergenreofcamouflagedobjectspictures.Thesepicturesvaryinstyle,buthardlyanyofthemseemamenabletopurelybottom-upanalysis.Forsomeofthem,itmaybepossiblethatprocessingstartsbottom-up,deliveringinitialgroupingsaswellaslocationsofsalience.Forsomedeceptivelysimpleblackandwhitedrawings,however,itappearstousthattheprocessishighlylikelytostarttop-down.

Here,weaddresstheproblemoflocatingagivenforminaWimmelbild.Akeypointisthatweresorttofieldmodelsofshaperepresentationtohavesmoothwell-conditioneddependenceofthegoodness-of-fittoposeandscaleparameters.


Visual discrimination and adaptation using non-linear unsupervised learningSandraJiménez,ValeroVLaparra,JesusMaloLopez,Univ.deValència(Spain)

Understandinghumanvisionnotonlyinvolvesempiricaldescriptionsofhowitworks,butalsoorganizationprinciplesthatexplainwhyitdoesso(Barlow1961).Identifyingtheguidingprinciplesofvisualphenomenarequireslearningalgorithmstooptimizespecificgoals.Moreover,thesealgorithmshavetobeflexibleenoughtoaccountforthenon-linearandadaptivebehaviorofthesystem.Forinstance,linearredundancyreductiontransformscertainlyexplainawiderangeofvisualphenomena(BuchsbaumandGottschalk1983;Aticketal.1992,1993;OlshausenandField1996;BellandSejnowski1997;HoyerandHyvarinen2000;SimoncelliandOlshausen2001;Doietal.2003).However,thegeneralityofthisorganizationprincipleisstillinquestion(Barlow2001):itisnotonlythatandadditionalconstraintssuchasenergycostmayberelevantaswell(Laughlin2004),butalso,statisticalindependencemaynotbethebettersolutiontomakeoptimalinferencesinsquarederrorterms(MacLeod2003;Simoncelli2009;Laparraetal.2012a).Moreover,linearmethodscannotaccountforthenon-uniformdiscriminationindifferentregionsoftheimageandcolorspace:linearlearningmethodsnecessarilydisregardthenon-linearnatureofthesystem.Therefore,inordertoaccountforthenon-linearbehavior,principledapproachescommonlyapplythetrickofusing(alreadynon-linear)parametricexpressionstakenfromempiricalmodels(SchwartzandSimoncelli2001;Kayseretal.2003;Lyu2011).Thereforetheseapproachesarenotactuallyexplainingthenon-linearbehavior,butjustfittingittoimagestatistics.Insummary,aproperexplanationofthebehaviorofthesystemrequiresflexibleunsupervisedlearningalgorithmsthat(1)aretunabletodifferent,perceptuallymeaningful,goals;and(2)makenoassumptiononthenon-linearity.OverthelastyearswehaveworkedonthesekindoflearningalgorithmsbasedonnonlinearICA(MaloandGuti´errez2006),Gaussianization(Laparraetal.2011),andprincipalcurves(Laparraetal.2012a,b).Inthisworkwestressthefactthatthesemethodscanbetunedtooptimizedifferentdesignstrategies,namelystatisticalindependence,errorminimizationunderquantization,anderrorminimizationundertruncation.Then,weshow(1)howtoapplythesetechniquestoexplainanumberofvisualphenomena,and(2)suggesttheunderlyingorganizationprincipleineachcase.


Chromatic induction and contrast masking: similar models, different goals?SandraJiménez,Univ.deValència(Spain);XavierOtazu,Univ.deBarcelona(Spain);ValeroVLaparra,JesusMaloLopez,Univ.deValència(Spain)

Normalizationofsignalscomingfromlinearsensorsisanubiquitousmechanismofneuraladaptation(CarandiniandHeeger2012).

Conference 8651


Localinteractionbetweensensorstunedtoaparticularfeatureatcertainspatialpositionandneighborsensorsexplainsawiderangeofpsychophysicalfactsincluding(1)maskingofspatialpatterns(WatsonandSolomon1997),(2)non-linearitiesofmotionsensors(SimoncelliandHeeger1998),(3)adaptationofcolorperception(HillisandBrainard2005),(4)brightnessandchromaticinduction(Otazuetal.2008,2010),and(5)imagequalityassessment(Laparraetal.2010).However,theformalandthequalitativesimilaritiesofthelocalinteractionintheabovemodelsdonotnecessarilymeanthatthemechanismsinvolvedarepursuingthesamestatisticalgoal.Forinstance,inthecaseofchromaticmechanisms(disregardingspatialinformation),differentparametersinthenormalizationgiverisetooptimaldiscriminationoradaptation(Abramsetal.2007),anddifferentnon-linearitiesmaygiverisetoerrorminimizationorcomponentindependence(Laparraetal.2012).Inthecaseofspatialsensors(disregardingcolorinformation),anumberofstudieshavepointedoutthebenefitsofmaskinginstatisticalindependenceterms(SchwartzandSimoncelli2001;MaloandGuti´errez2006;MaloandLaparra2010;Lyu2011).However,suchstatisticalanalysishasnotbeenperformedforspatio-chromaticinductionmodelswherechromaticperceptiondependsonspatialconfiguration.Inthisworkweinvestigatewhetherasuccessfulspatio-chromaticinductionmodels(Otazuetal.2010),increasescomponentindependenceaspreviouslyreportedformaskingmodels.Mutualinformationanalysissuggeststhatdespitetheformalsimilarityofthemodels,theirstatisticalrolemaybedifferent.


Aesthetics and entropy II: a critical examinationMelvilleRSahyun,Consultant(UnitedStates)

NoAbstractAvailable

8651-30, Session 10

The evolution of attention research: exploring the interplay of bottom-up and top-down processes (Invited Paper)LaurentItti,TheUniv.ofSouthernCalifornia(UnitedStates)

NoAbstractAvailable

8651-31, Session 10

Saliency identified by absence of background structureFredW.Stentiford,Univ.CollegeLondon(UnitedKingdom)

Visualattentioniscommonlymodelledbyattemptingtocharacteriseobjectsusingfeaturesthatmakethemspecialorinsomewaydistinctiveinascene.Theseapproacheshavethedisadvantagethatitisnevercertainwhatfeatureswillberelevantinanobjectthathasnotbeenseenbefore.Thispaperprovidesabriefoutlineoftheapproachestomodelinghumanvisualattentiontogetherwithsomeoftheproblemsthattheyface.Agraphicalrepresentationforimagesimilarityisdescribedthatreliesonthesizeofmaximallyassociativestructures(cliques)thatarefoundtobereflectedinpairsofimages.Whilecomparinganimagewithitself,thesimilaritymechanismisshowntomodelpop-outeffectswhenconstraintsareplacedonthephysicalseparationofpixelsthatcorrespondtonodesinthemaximalcliques.Backgroundregionsarefoundtocontainstructureincommonthatisnotpresentinthesalientregionswhicharetherebyidentifiedbyitsabsence.Theapproachisillustratedwithfiguresthatexemplifyasymmetryinpop-out,theconjunctionoffeatures,orientationdisturbancesandtheapplicationtonaturalimages.

8651-32, Session 10

Investigation of eye-catching colors using eye trackingMokryunBaik,Hyeon-JeongSuk,JeongminLee,KyungahChoi,KAIST(Korea,Republicof)

Aneyetrackingexperimentwascarriedoutinordertoinvestigatetherelationshipbetweeneyegazingmovementsandthecharacteristicsofcolorattributes.Theexperimentfocusedonidentifyingthecolorattributesthataredecisiveforeye-catchingefficiency.Theiconarrayusedintheexperimenttodesignstimuliwasadoptedwithconsiderationtothecontextofrecentcommunicationmedia,suchassmartphonesorsmartTVs.Eachstimulussetwascomposedof25colorsquarepatchesarrayedintheformatofa5by5grid.Theexperimentwasdividedintothreepartstofocusonaspecificattributeofacoloroneatatime,whilecontrollingitsotherattributes:InPartI,huedifferencewasexamined,andthuseachstimulussetcontained25hueswhilethetoneremainedcontrolled.Itwasrevealedthatsubjectsweremoreattentivetowarmcolorsthantocoolcolors,particularlywhenwarmcolorswerearrangedalongthehorizontalandverticalaxes;InPartII,theexperimentdealtwithtonedifference,andthus25tonevariationswithinredorbluewereprovidedasastimulusset.However,theresultindicatedthatchangesintonedoesnothaveasignificantinfluenceonsubjects’initialattention;Lastly,inPartIII,whethercolorcombinationshaveaninfluenceonparticipants’attentioninamannerdifferentfromthatofsinglecolorswasexamined.Inthestimulusset,25iconswerearranged.Amongthem,iconswithcomplementarycolorcombinationsgainedthegreatestattention.Intheexperiment,fifteencollegestudentswererecruitedtoparticipateinallthreeparts.Throughouttheexperiments,eitherblackorwhitewereappliedasbackground,butthecontrasteffectbetweenforegroundandbackgroundcolordidnothaveanoticeableinfluenceonone’sattention.Basedontheempiricalresults,thispaperprovidesaguidelineforaneye-catchingcolorcompositionanddiscussesitsapplications.

8651-33, Session 10

Can skill be determined from a photographic portfolio?AbhishekAgrawal,RamakrishnaKakarala,NanyangTechnologicalUniv.(Singapore);RajeshSomavarapu,TheUniv.ofTexasatDallas(UnitedStates)

Weplantostudyandunderstandtherolethatphotographicskillhastoplayindecidingimagecompositionandaesthetics.Fewscientificstudieshavebeencarriedoutinthepastinthisdomainandevenfewertoaddressthesuppositionthatskillisapparentinphotography,evenwithoutconsideringthequalityoftheinstrumentusedtotaketheimage.Weconductandobtainresultsfromanexperimentinwhich33subjectsusedidenticalequipmenttophotograph7pre-determinedcommonplacescenessuchasaportrait,zebracrossing,buildingcorneretc.WeexploreanddemonstratetheuseofAmazonMechanicalTurkplatformforthecollectionofscientificdataandobtainratingsonourimagesfromalargenumber(bet.approx.150-200)ofjudges.Wealsousethisplatformtodiscernandcategorizethejudgesbyobtaininginformationontheirskill,knowledgeandpriorexperienceinphotography.Weimprovetheexperimentaldesignusedinthepreviousstudiesandcompareourfindings.Ourresultsdemonstratetherolethatphotographicskillhastoplayindecidingcompositioninperceptionbasedimaging,andhowitrelatestojudges’ratingsoftheimagestoastatisticallysignificantlevel.

8651-34, Session 11

Binocular eye movements in health and disease (Invited Paper)ChristopherW.Tyler,TheSmith-KettlewellEyeResearchInstitute(UnitedStates)

Conference 8651


Recordingbinoculareyemovementsforstereoscopictargetscanbechallengingbecausetherangeofusefuldisparitiesisonlyafewdegreesandtherangewithinstereogramsisevennarrower.ThisissuewillbeaddressedinthecontextoftherecentpapersatSPIEonbinocularcontrolandstereoscopicimagequalityin3Ddisplays.Withcarefulcalibrationitispossibletorecordbinocular(vergence)eyemovementstoanaccuracyofafewarcminutes.Iherereporttheresultsofthefirstquantitativesurveyofthedynamicsofvergenceeyemovementsinafull-spectrumnon-academicpopulationtopurelystereoscopic,purelyaccommodativeandfull-cuestimulation.Theresultsshowawidevarietyofvergencebehaviorsthatneedtobetakenintoaccountinthedesignofstereoscopicdisplays.Althoughoptimalvergencemovementsmaybeaccomplishedinaslittleas300ms,amajorityofthepopulationhaveslowvergencemovementsinatleastonedirectionofvergence,takingover500mstoreachthetargetdisparity(usuallyfordivergence),orotherdeviationsfromoptimalvergencecapabilities.Forlargervergencemovements,theeyesmaytakeasmuchas2storeachthetargetdisparity,evenunderfullcueconditions.

Asubstantialproportionofthepopulationhassufferedfromanincidentofdiffusetraumaticbraininjury(dTBI;animpacttotheheadresultinginlossofconsciousnessbutnoevidenceoftissuedamage),whichinturncanoftenresultinimpairmentofbinocularvergencecontrol.Atupto2millionincidentsperyearintheUS,dTBImayaffectasmanyas20%ofthepopulation,ofwhich50%ormoremayshowvergencecontrolproblems.WefindthatmanydTBIsufferersexhibitnovergenceatalltodisparitychangestimuli,despitehavingnormalstereopsis.

Itisimportanttotakethisrangeofvergencecapabilitiesintoaccountinthedesignofstereoscopic3Ddisplaysandfilmproducts.

8651-35, Session 11

Reflexive and voluntary control of smooth eye movementsJeffreyB.Mulligan,NASAAmesResearchCtr.(UnitedStates);ScottB.Stevenson,Univ.ofHouston(UnitedStates)

Anunderstandingofvisuallyevokedsmootheyemovementsisrequiredtopredictthevisibilityandlegibilityofmovingdisplays,suchasmightbeencounteredinvehicleslikeaircraftandautomobiles.Wehavestudiedtheresponseoftheoculomotorsystemtovariousclassesofvisualstimuli,andanalyzedtheresultsseparatelyforhorizontalandverticalversion(inwhichthetwoeyesmovetogether),andhorizontalandverticalvergence(wheretheymoveinoppositedirections).Ofthefourtypesofmotion,onlyverticalvergencecannotbeperformedundervoluntarycontrol,andwefoundthatcertainstimuli(allhavingrelativelylonglatencies)wereincapableofevokingit.Inanotherexperiment,weinstructedobserverstotrackoneoftwotargets,andmeasuredweakbutreliableresponsestotheunattendedtarget,inwhichthelong-latencycomponentoftheresponseisabolished.Ourresultsareconsistentwithasystemcontainingtwodistinctprocesses,afastreflexiveprocesswhichrespondstoarestrictedclassofstimuli,andaslowervoluntaryprocesscapableoffollowinganythingthatcanbeseen,butincapableofcontrollingverticalvergence.

8651-36, Session 11

Simple gaze-contingent cues guide eye movements in a realistic driving simulatorLauraPomarjanschi,Univ.zuLübeck(Germany);MichaelDorr,PeterJ.Bex,SchepensEyeResearchInstitute(UnitedStates);ErhardtBarth,Univ.zuLübeck(Germany)

Lookingattherightplaceattherighttimeisacriticalcomponentofdrivingskill.Therefore,gazeguidancehasthepotentialtobecomeavaluabledrivingassistancesystem.Inpreviouswork,wehavealreadyshownthatcomplex

gaze-contingentstimulicanguideattentionandreducethenumberofaccidentsinasimpledrivingsimulator.Weheresetouttoinvestigatewhethercuesthataresimpleenoughtobeimplementedinarealcarcanalsocapturegazeduringamorerealisticdrivingtaskinahigh-

fidelitydrivingsimulator.Thisimmediatelyraisesanotherquestion,namelyhowsuchcueswouldinterferewiththedrivingtaskitself.

Weusedastate-of-the-art,wide-field-of-viewdrivingsimulatorwithanintegratedeyetracker.Gaze-contingentwarningswereimplementedusingtwoarraysoflight-emittingdiodeshorizontallyfittedbelowandabovethesimulatedwindshield.Twelvevolunteersdrovealongpredeterminedroutesinthesimulatedenvironmentpopulatedwithautonomoustraffic.Warningsweretriggeredduringtheapproachtohalfoftheintersections,cueingeithertowardstherightortotheleft.Theremainingintersectionswerenotcued,andservedascontrols.Apreliminaryanalysisshowsthatgaze-contingentcuesledtoasignificantshiftingazepositiontowardsthehighlighteddirection.

8651-37, Session 11

Designing an obstacle display for helicopter operations in degraded visual environmentPatriziaM.Knabl,NiklasPeinecke,DeutschesZentrumfürLuft-undRaumfahrte.V.(Germany)

Flyingindegradedvisualenvironmentisanextremelychallengingtaskforahelicopterpilot.Thelossoftheoutsidevisualreferencecausesimpairedsituationawareness,highworkloadandspatialdisorientationleadingtoincidentslikeobstacleorgroundhits.DLRisworkingonidentifyingwaystoreducethisproblembyprovidingthepilotwithadditionalinformationfromfusedsensordata.Therefore,differentdisplaydesignsolutionsarebeingdeveloped.

Inafirststudy,thedesignfocusedontheuseofasynthetichead-downdisplay,consideringdifferentrepresentationsforobstacles,colorcodingandterrainfeatures.Resultsshowasubjectivepreferenceforthemostdetailedobstacledisplay,whileobjectiveresultsrevealbetterperformanceforthedisplaywithlittlelowerdetail.Duetothefactthatattentionislimited,averycomplexrepresentationmightactuallyhinderfastandaccurateinformationprocessinginahighworkloadsituationrespectivelywhendividedattentionisrequired.Inasecondstudy,symbologyforahelmet-mounteddisplayisbeingdesignedandwillbeevaluatedinautumn2012.Designconsiderationswillfocusondifferentlevelsofterraintransparency,shape,sizeandcomplexityofobstaclesandassociatedvisualandperceptualaspects(e.g.attentionaltunneling).

8651-38, Session 12

Visual storytelling in 2D and stereoscopic 3D video: effect of blur on visual attentionQuanHuynh-Thu,CyrilVienne,LaurentBlondé,TechnicolorS.A.(France)

Visualattentionisaninherentmechanismthatplaysanimportantroleinthehumanvisualperception.Visualcompositionofscenesinstereoscopic3Dcontentcreationtodaystillmostlyfollowsthesametechniquesthanthoseusedin2D.Inparticular,out-of-focusblurisoftenusedin2Dmotionpicturesandphotographytodrivetheviewer’sattentiontowardsasharpareaoftheimage.However,instereoscopic3Dcontentviewing,aconflictmayappearwithdefocusedforegroundobjectsasourvisionisnaturallydrawnbothbysharpareasintheimageandbyobjectsappearingclosesttous.Inthispaper,westudytheimpactofdefocusedforegroundobjectsonvisualattentiondeploymentinstereoscopic3Dcontent.Forthatpurpose,weconductedaneye-trackingexperimentusingacontrolledproductionofstereoscopic3Dvideocontent.Contentwasviewedusingpassivepolarizedglassestechnologyinafree-viewingscenario.Observerswereaskedtoviewthe2Dand3Dversionsofthesamevideocontentpresentedwiththefollowingtestconditions:presenceorabsenceofforegroundobject,focusordefocusonthisforegroundobject,magnitudeofdisparitydifferencebetweenforeground/backgroundobjectplanes.Wediscussourresultsinthecontextofstereoscopiccontentcreation,senseofimmersionandvisualcomfort.

Conference 8651


8651-39, Session 12

Using natural versus artificial stimuli to perform calibration for 3D gaze trackingChristopheMaggia,NathalieGuyader,AnneGuérin-Dugué,Gipsa-Lab(France)

Thepresentedstudytestswhichtypeofimage,naturalorartificial,ismoreadaptedtoperformefficientandreliablecalibrationtotrack3Dgazeinspaceusingclassical2Deyetracker.Runninganexperimentwithfourobservers,wecomparednaturalsceneimages(indoorandoutdoor)andgridofsquareson3Dgazecalibration.Twotypesofmodels,linearandnon-linear,weredevelopedtopredictrealdisparityfromthedisparityrecorded(differenceofthex-coordinatesofthetwoeyes).Modelswerecomputedfordifferentspatialpositionsandthenmergedoverthewholespace.WecomparedthetypeofimagesandthemodelsusingRootMeanSquareerrorsbetweenpredictedandrealdisparities.Wedidnotobtainanydifferencefortwosubjectswhereasforthetwoothersnaturalimagesprovidedbetterresults.Thissuggeststhatitmightbeimportanttotestwhichkindofstimulusisbestsuitableforparticularobservertoperform3Dcalibrationonthechosenstimulusbeforerunninganexperiment,forexample,on3Dnaturalsceneimagestostudyvisualattentiononstereoimages.

8651-40, Session 12

Study of center-bias in the viewing of stereoscopic image and a framework for extending 2D visual attention models to 3DJunleWang,MatthieuPerreiraDaSilva,PatrickLeCallet,VincentRicordel,Univ.deNantes(France)

Comparedtothegoodperformancethatcanbeachievedbymany2Dvisualattentionmodels,predictingsalientregionsofa3Dsceneisstillchallengingbecauseoftheadditionaldepthinformationledbybinoculardisparity.Anefficientwaytoachievethiscanbetoexploitexistingstudiesdedicatedfor2Dcontent.Sinceithasbeendemonstratedtheinfluenceof2Dvisualfeatures(e.g.color,contrast,orientation,andcenterofthescreen)instereoscopic3Dviewingcondition.

Inthispaper,wequantitativelyevaluatethedegreeofcenter-biasduringtheviewingofstereoscopic3Dimagesofnaturalcontent.Wealsoproposeasimplecomputationalmodelof3Dvisualattentionwhichcaneasilytakeadvantageofcenter-biasandexisting2Dmodels.Forvalidation,aneye-trackingexperimentisconducted,inwhich35subjectsareinvolved.Wethuscreateadatabasecontainingeye-movementdatarecordedduringtheviewingofeighteen3Dimagesandtheircorresponding2Dversion.Ourresultsindicateacleardifferencebetweencenter-biasin2Dand3Dviewingconditions.Byintegratingcenter-biasintheproposedmodel,anaddedvalueofcenter-biashasbeendemonstratedinthepredictionofsaliencymapsfor3Dimages.

8651-41, Session 12

How visual attention is modified by disparities and textures changes?DaryaKhaustova,JérômeFournier,EmmanuelWyckens,FranceTelecomR&D(France);OlivierLeMeur,IRISA/INRIARennes(France)

3Dimage/videoqualityofexperienceisamultidimensionalconceptthatdependson2Dimagequality,depthquantityandvisualcomfort.Therelationshipbetweentheseparametersisnotclearlydefinedyet.Withthisperspective,wetrytounderstandhowtexture,depthquantityandvisualcomfortinfluencethewaypeopleobserve3Dcontentincomparisonto2D.FivesceneswithdifferentstructuralparametersweregeneratedusingBlendersoftware.Forthesefivescenesthefollowingparametersweremodified:texture,camerabaselineandconvergencedistancecreatingthendifferentcontentsfordifferentpresentationstoeachobserver.Ourstudywasconductedusingan

eye-trackeranda3DTVdisplay.Duringpsychovisualexperimenteachobserverexaminedfreelyeachscenewithdifferenttextures,camerabaselinesandconvergencedistances.Toavoidmemorybias,wemadesurethateveryobserverhadseenscenecontentonlyonce.Collectedfixationdataareusedtobuildsaliencymapsandtoanalyzedifferencesbetween2Dand3Dcondition.Iftheregionsofinterest(RoI)arecoincidedinbothconditions,itwasinterestingtotrackthegazein3D.ThediscomfortintroducedbythebackgroundortheforegroundmightbeneglectedincasewhenobserverspayattentiontoRoI.Fromthisanalysis,wecananswerthequestionif3Dvisualattentioniscontentdependentandwhetherqualityofexperienceinfluencethewayweobservestereoscopicpictures.

8651-43, Session 13

Copy-paste in depth: quantifying the pictorial space of paintingsMaartenW.A.Wijntjes,TechnischeUniv.Delft(Netherlands)

Whereaspictorialspaceplaysanimportantroleinarthistoricdiscussions,thereislittleresearchonthequantitativestructureofpictorialspaces.Recently,anumberofmethodshavebeendeveloped,oneofwhichreliesonsizeconstancy:twospheresarerenderedintheimagewhiletheobserversadjuststherelativesizessuchthattheyappearofequalsizeinpictorialspace.Thismethodisbasedonpair-wisecomparisons,resultinginn(n-1)/2trialsfornsamples.Furthermore,itrendersaprobeintheimagethatdoesnotconformtothestyleofthepainting:itmixescomputergraphicswithapainting.

Themethodproposedhereusesprobesthatarealreadyinthescene,notviolatingthepaintings’style.Anobjectiscopiedfromtheoriginalpaintingandshowninadifferentlocation.Theobservercanadjustthescalingsuchthatthetwoobjects(oneoriginallyinthepainting,andtheothercopy-pasted)appeartohaveequalsizesinpictorialspace.Sincetheoriginalobjectservesasareference,thenumberoftrialsismerelyn-1,anexperimentalreductionof1/ntrialswithrespecttotheoriginalmethod.

Wemeasuredthepictorialspacesoftwopaintingsusingourmethod,oneCanalettoandoneBreughel.Wefoundthatobserverstypicallyagreedwillwithrespecttoeachother,coefficientsofdeterminationashighas0.9werefoundwhentheprobewasahuman,whileotherprobesscoredsomewhat(butsignificantly)lower.Theseinitialfindingsappearverypromisingforthestudyofpictorialspace.

8651-44, Session 13

Drawing accuracy measured using polygonsLindaC.Carson,Univ.ofWaterloo(Canada);MatthewJ.H.Millard,StanfordUniv.(UnitedStates);NadineQuehl,JamesDanckert,Univ.ofWaterloo(Canada)

Thestudyofdrawing,foritsownsakeandasaprobeintohumanvisualperception,generallydependsonratingsbyhumancriticsandself-reportedexpertiseofthedrawers.Tocomplementthoseapproaches,wehavedevelopedanobjectivecontinuousperformance-basedmeasureofdrawingaccuracy.Thismeasureisbasedonrepresentingdrawingsassetsoflandmarkpointsandstudyingfeaturesofparticularresearchinterestbycomparingpolygonsofthosefeatures’landmarkpointswiththeircounterpartpolygonsinagroundtruthimage.Anyelementofadrawing—anobject,aplane,ashadow,aspace—couldberepresentedbythepolygonformedbyitslandmarkpoints.Thisapproachproducesbothlocalaccuracymeasures(foreachpolygon)andaglobalaccuracymeasure(themeanacrossseveralpolygons)foranalysis.Furthermore,therearefourdistinctpropertiesofapolygontomeasure:itssize,itsposition,itsorientationandtheproportionalityofitsshape.Wecandecomposeerrorintofourcomponentsandinvestigatehoweachcontributestodrawingperformance.Webrieflydescribethemethodanditspotentialapplicationstoresearchindrawingeducationandvisualperception,thenreportonitsapplicationtoaspecificresearchquestion:Arewemoreaccuratewhendrawingintheso-called“positivespace”(orfigure)?

Conference 8651


8651-45, Session 13

Fractals in art and nature: why do we like them?BrankaSpehar,TheUniv.ofNewSouthWales(Australia);RichardP.Taylor,Univ.ofOregon(UnitedStates)

Fractalshaveexperiencedconsiderablesuccessinquantifyingthevisualcomplexityexhibitedbymanynaturalpatterns,andcontinuetocapturetheimaginationofscientistsandartistsalike.Fractalpatternshavealsobeennotedfortheiraestheticappeal,asuggestionfurtherreinforcedbythediscoverythatthepouredpatternsoftheAmericanabstractpainterJacksonPollockarealsofractal,togetherwiththefindingsthatmanyformsofartresemblenaturalscenesinshowingscale-invariant,fractal-likeproperties.Whilesomehavesuggestedthatfractal-likepatternsareinherentlypleasingbecausetheyresemblenaturalpatternsandscenes,therelationbetweenthevisualcharacteristicsoffractalsandtheiraestheticappealremainsunclear.Motivatedbyourpreviousfindingsthathumansdisplayaconsistentpreferenceforacertainrangeoffractaldimensionacrossfractalimagesofvarioustypesweturntoscale-specificprocessingofvisualinformationtounderstandthisrelationship.WeextendourinvestigationstothevisualpreferenceforrandomnoiseimagesvaryingintermsoftheslopeoftherotationallyaveragedFourieramplitudespectrum.Furthermore,wemakeadirectcomparisonbetweenvisualpreferenceto1/fgrayscaleimagesandacomparisonsetofthresholded(blackandwhite)images.Wefoundnosignificantdifferencesinpreferencesbetweengray-scaleimagesandbinarycomparisonimagesobtainedbysimplythresholdingtheoriginalgray-scaleimages.Forbothsetofimages,thevisualpreferencepeakedforimageswiththeamplitudespectrumslopesfrom1.25to1.5,thusconfirmingandextendingthepreviouslyobservedrelationshipbetweenfractalcharacteristicsofimagesandvisualpreference.

Conference 8651


Conference 8652: Color Imaging XVIII: Displaying, Processing, Hardcopy, and ApplicationsMonday-Wednesday4–6February2013

Part of Proceedings of SPIE Vol. 8652 Color Imaging XVIII: Displaying, Processing, Hardcopy, and Applications

8652-1, Session 1

A spherical perceptual color model (Invited Paper)TielingChen,Univ.ofSouthCarolina(UnitedStates);ZhongminDeng,WuhanTextileUniv.(China);JunMa,WuhanTextileUniversity(China)

Thepaperintroducesatransformedsphericalmodeltorepresentthecolorspace.Thecircularconewithasphericaltoptightlycircumscribingthecolorcubeisequippedwithasphericalcoordinatesystem,withtheoriginattheblackvertexandthediagonalastheverticalaxis.Everypointinthecolorcubeisthenrepresentedbythreesphericalcoordinates,withtheradiusRhomeasuringthedistancetotheorigin,indicatingthebrightnessattributeofthecolor,theazimuthalangleThetameasuringtheangleonthehorizontalplane,indicatingthehueattributeofthecolor,andthepolaranglePhimeasuringtheopeningofthecircularconewiththeverticalaxisasitscenter,indicatingthesaturationattributeofthecolor.SimilartotheHSVandHSLmodels,thesphericalmodelspecifiescolorsbydescribingthecolorattributesrecognizedbyhumanvision.TheconversionformulasaremuchmathematicallyneaterthanthatoftheHSVmodelandHSLmodel,andtheinterpretationofthemodelismoreintuitivetoo.Mostimportantly,thesphericalcolormodeldoesnothavetherayphenomenonthatoccursintheHSVandHSLmodels.Applicationsofthesphericalmodelincludingcolorcomparisonsarestudiedinthepapertoo.

8652-2, Session 1

Chroma-preserved luma controlling technique using YCbCr color spaceSooyeonLee,YoungshinKwak,UlsanNationalInstituteofScienceandTechnology(Korea,Republicof);YounJinKim,SamsungElectronicsCo.,Ltd.(Korea,Republicof)

YCbCrcolorspacecomposedoflumaandchrominancecomponentsispreferredforitseaseofimageprocessing.Howeverthenon-uniformityandnon-orthogonalityofYCbCrbetweenYCbCrcomponentsinducechangeofperceivedlightness,perceivedchromaandperceivedhueascontrollingluma,CbandCrvalues.Inthisstudy,anewmethodwasdesignedforthechromacompensationgeneratedbylumachange.The6YCC_hueangleswereselectedtodesignthis.ForeachYCC_hueangle,datapointsnamed‘Originaldata’generatedwithuniformlydistributedvariouslumaandYCC_chromavalue.Thentheweightvalueswereappliedtolumavalueof‘Originaldata’as‘Testdata’.AndnewYCC_chromahavingminimizedCIECAM02?Cbetweenoriginalandtestdatawascalculated.Thenewwasusedtodesignthismodel.Thismodelimplementedforimageprocessingaslumacontrollingalgorithmhavingconstantperceivedchroma.Theperformancewastestednumericallyandperceptually.For1324‘testdata’bycomparisonwith‘Originaldata’theCIECAM02?Chasbeendecreasedwith53.31%.Whenthenewmethodwasappliedtolumachangedimage,ithas67.3%improvementofCIECAM02?Cfor130%incrementofluma

8652-3, Session 1

Analysis of a color space conversion engine implemented using dynamic partial reconfigurationRyanToukatly,DorinPatru,EliSaber,EricPeskin,RochesterInstituteofTechnology(UnitedStates);GeneRoylance,BradLarson,Hewlett-PackardCo.(UnitedStates)

DynamicPartialReconfigurationallowspartsofaFieldProgrammableGateArraytobereconfigured,whiletherestofthesystemcontinuesuninterruptedoperation.AColorSpaceConversionEngineisadigitalimage-processingpipeline,whichrequiresfrequentreconfigurationofsome,butnotallofitsstages.Therefore,itisadigitalsignalprocessingsystemthatpresumablycantakeadvantageofdynamicpartialreconfiguration.Thispaperdescribesthenecessarydesignchanges,testing,andperformanceanalysisofacolorspaceconversionengineimplementedontoafieldprogrammablegatearrayusingdynamicpartialreconfiguration.Theanalysisprovidesinsightintotheoperationalscenariosinwhichdynamicpartialreconfigurationisadvantageousornot.

8652-4, Session 2

Color reproductivity improvement with additional virtual color filters for WRGB image sensorShunKawada,RihitoKuroda,ShigetoshiSugawa,TohokuUniv.(Japan)

WehavedevelopedahighaccuracycolorreproductionmethodbasedonanestimatedspectralreflectivityofobjectsusingadditionalvirtualcolorfiltersforawidedynamicrangeCMOSimagesensorwithWRGBcolorfilterwhichwehavedeveloped.TheWRGBimagesensordoesnotneedtoswitchthewayofsignalprocessingdependingonthelightintensityevenwherealargesensitivitydifferenceisexistbetweenthecolorpixels.Thatisbecausethefoursignalsareavailabletothepointofsaturationlightintensity.ThevirtualcolorfiltersarecreatedbymultiplyingthespectralsensitivityofWpixelbynormaldistributionfunctions,andthevirtualsensoroutputsofthosevirtualfiltersareestimatedfromthefouroutputsignalsoftheWRGBimagesensor.TheaccuracyofcolorreproductionwasevaluatedwiththeMacbethColorChecker.Theaveragedvalueofthecolordifferencedelta-Eabof24colorsis2.10byusingtheminimumnormmethodbasedonsixsignalswhichareacombinationofthethreerealRGBsignalsandthethreevirtualsignals.

8652-5, Session 2

Glare and shadow reduction for desktop digital camera capture systemsThanhH.Ha,Chyuan-TyngWu,PurdueUniv.(UnitedStates);PeterMajewicz,FlextronicsInternational(UnitedStates);KurtR.Bengtson,Hewlett-PackardCo.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates)

Thequalityofimagesofobjectswithsignificant3Dstructure,capturedatcloserangeunderaflash,maybesubstantiallydegradedbyglareandshadowregions.Inthispaper,weintroduceanimagingsystem


andcorrespondingalgorithmtoaddressthissituation.Theimagingsystemcapturesthreeframesofthestationarysceneusingasinglecamerainafixedposition,butanilluminationsourceinthreedifferentpositions,oneforeachframe.Thealgorithmincludestwoprocesses:shadowdetectionandimagefusion.Throughshadowdetection,wecanlocatetheareaofshadows.Aftergettingtheshadowmaps,wegenerateamorecompletefinalimagebyimagefusion.Ourexperimentalresultsshowthatinmostcases,theshadowandglarearemarkedlyreduced.

8652-6, Session 2

Reducing flicker due to ambient illumination in camera captured imagesMinwoongKim,PurdueUniv.(UnitedStates);KurtR.Bengtson,LisaLi,Hewlett-PackardCo.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates)

NoAbstractAvailable

8652-8, Session 3

Binary image compression using conditional entropy-based dictionary design and indexingYandongGuo,PurdueUniv.(UnitedStates);DejanDepalov,PeterBauer,BrentBradburn,Hewlett-PackardCo.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates);CharlesABouman,PurdueUniversity(UnitedStates)

Detailswithheldpendingsubmissionofapatentapplicationthatisinpreparation,andwhichwillcoverthetechnologytobedescribedinthispaper.

8652-9, Session 3

Segmentation for better rendering of mixed-content pagesYi-TingChen,PurdueUniv.(UnitedStates);Di-YuanTzeng,TerryNelson,MarkQ.Shaw,Hewlett-PackardCo.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates)

Weproposeamethodforimprovingrenderingofmixed-contentpagesbysegmentingandclassifyingtheregionsofthesepagesaccordingtothenatureofthecontent.Ouralgorithmusesedgeoperatorsandconnectedcomponentsappliedtoboththecontinuous-tonepageimageandanobjectmapgeneratedduringrippingofthepagefrompagedescriptionformattorasterformat.Therenderingisdoneusinganobject-basedhalftoningstrategythatwasreportedatthisconferencein2012.

8652-11, Session 3

YACCD2: yet another color constancy database updatedAlessandroRizzi,CristianBonanomi,DavideGadia,GiuseppeRiopi,Univ.degliStudidiMilano(Italy)

In2003,atthisconference,wehavepresentedanimagedatabase(IDB)totestcolorconstancyandotherkindsofalgorithms.ThemotivationtoaddanotherIDBtothemanyalreadypresentonthewebisduetothefactthateachIDBischaracterizedbychoicesthatcanfitornotwiththealgorithmstotest.E.g.adatabaseofimagescontainingawhiteareaissuitableforalgorithmsbasedontheWhitePatchapproach,onthecontrary,thecompleteabsenceofwhiteareascanadvantagealgorithmswithalternativeapproaches.

YACCDhasthefollowingcharacteristics:

-differentbackgroundswithawidefrequencyrange,containingwhitebutwithanaveragereflectanceclosetomiddlegray

-asetofnatural,artificialandextremeilluminants

-imageswithandwithoutcastedshadows

WedecidedtoredoYACCDinordertoaddthefollowingcharacteristics:

-multipleexposuresforHDRimaging

-RAWandjpgformats

-reflectancedata

InthiswayYACCD2canbesuitabletotestawidervarietyofalgorithmslikee.g.computationalcolorconstancy,humanvisionmodels,HDRtonerendering,intrinsicimagesandothercomputervisionalgorithms.

8652-42, Session 3

An efficient flicker noise reduction method for single imagesPanPan,YuanHe,ShufuXie,JunSun,SatoshiNaoi,FujitsuResearchandDevelopmentCenterCo.,Ltd.(China)

Inthispaper,wepresentanovelefficientflickernoisereductionmethodforsingleimagesscannedbyoverheadlinesensors.Theflickernoisehereisperceivedashorizontalbandswhicharenotnecessarilyperiodic.Weviewtheflickerpatternasthenoiseofrowcumulativehistogramalongtheverticaldirection,andproposetwonovelcumulativehistogramfilteringapproachestosmooththeartifact,includingusingdifferentGaussianvarianceandpaddingtheimage.Theproposedalgorithmisthenusedtoreducetheflickernoiseinourscannedcolorimages.Thecomputationalcomplexityoftheproposedalgorithmisfurtheranalyzed.Thealgorithmoperatesonsingeimages,anddoesnotrelyonthefrequencyofalternativecurrency,norrequiresthehorizontalbandsareperiodic.Experimentalresultsshowthesuperiorperformanceoftheproposedmethodincomparisontootherexistingmethods.

8652-12, Session 4

Gray-world-assumption-based illuminant color estimation using color gamuts with high and low chromaHarumiKawamura,NipponTelegraphandTelephoneCorp.(Japan);ShunichiYonemura,ShibauraInstituteofTechnology(Japan);JunOhya,WasedaUniv.(Japan);AkiraKojima,NipponTelegraphandTelephoneCorp.(Japan)

Thispaperproposesanewapproachforestimatingilluminantcolorbasedonthegrayworldassumptioncombinedwithcolorgamutmethod.Thisassumptionhypothesizestheaveragecolorofalltheobjectsinthesceneisachromatic,therefore,itisdifficulttoestimateanilluminantcolorcorrectlyifthecolorsoftheobjectsinascenearedominatedbycertaincolors.Ourpreviousmethodextendstheimagestowhichthishypothesiscanbeappliedbyusingseveralbutnotallcolors.Withthismethod,however,illuminantcolorscannotbeestimatedincaseswherethereareseveralcolorsintheimageandtheyarelocalizedinonepartofthecolorspace.Tosolvethisproblem,ourapproachusestwokindsofcolorgamuts,ahigh-chromaoneandalow-chromaone.Thehighchromagamutisusedforaddingthecolorstotheimagetomakesitpossibletoselectcolorstomeettheassumptionandthelowchromaoneisusedtoestimatethecandidatesforilluminantcolors.Experimentalresultsusingcolorimagesshowthattheilluminantcolorestimationobtainedwithourapproachisstatisticallyclosertothecorrectonethanthatobtainedwiththeconventionalmethod.

8652-13, Session 4

Estimation of reflectance based on properties of selective spectrum with adaptive Wiener estimation

Conference 8652


Ji-hoonYoo,Wang-JunKyung,Ho-GunHa,Yeong-HoHa,KyungpookNationalUniv.(Korea,Republicof)

Inthispaper,weproposeanadvancedestimationofreflectancebyupdatingtheautocorrelationmatrixofthereflectanceinWienerestimationandusingtheselectivewavelengthofthereflectancefromaspectralanalysis.ThekeypointofWienerestimationistofindatransformmatrixwhoseroleconvertstheacquiredimagesintotheestimatedreflectance.Thetransformmatrixisdefinedasminimizingameansquareerrorbetweenthemeasuredandtheestimatedreflectance.Also,thismatrixconsistsofanautocorrelationmatrixofareflectance,anautocorrelationmatrixofanoise,andamatrixofaspectralsensitivitywhichinvolvesanilluminant,afiltertransmittance,andcamerasensitivity.Inproposedmethod,theautocorrelationmatrixofthereflectanceisupdatedbycalculatingthesimilaritybetweenthemeasuredandtheestimatedreflectance.Thetrainingsamplesisusedas1485colorpatchesofMunsellandthetestsampleisusedas24colorpatchesofMacbethColorChecker.Also,theselectivewavelengthofthereflectanceisusedtoestimateaccuratereflectance.Asaresult,theproposedmethodshowedmoreaccurateestimationofreflectancethantheconventionalWienerestimationinexperiments.

8652-14, Session 4

Metal-dielectric object classification by combining polarization property and surface spectral reflectanceShojiTominaga,HidekiKadoi,KeitaHirai,TakahikoHoriuchi,ChibaUniv.(Japan)

Mostobjectsurfacesinnaturalscenescanbeclassifiedintotwomaterialclasses:metalanddielectric.Inconventionalmethodsformaterialclassification,therearetwoapproaches;acolor-basedapproachandapolarization-basedapproach.Thispaperproposesamethodforclassifyingmultipleobjectsurfacesintothetwomaterialclassesbycombiningsurfacespectralreflectanceandpolarizationproperty.Weutilizepolarizationpropertyinordertoclassifytheobjectsintometalanddielectric,andsurface-spectralreflectanceinordertosegmentthesceneimageintodifferentobjectsurfaceregions.Animagingsystemisdevelopedusingaliquidcrystaltunablefilterforcapturingbothpolarizationandspectralimagessimultaneously.Ourclassificationalgorithmconsistsofthreestages;(1)highlightdetectionbasedonluminancethreshold,(2)materialclassificationbasedonthespatialdistributionofthedegreeofpolarizationatthehighlightarea,and(3)imagesegmentationbasedonilluminant-invariantrepresentationofthespectralreflectance.Thefeasibilityoftheproposedmethodisexaminedindetailinexperimentsusingreal-worldobjects.

8652-15, Session 5

An experiment on the color rendering of different light sourcesSimonettaFumagalli,ENEA(Italy);CristianBonanomi,AlessandroRizzi,Univ.degliStudidiMilano(Italy)

Thecolorrenderingindex(CRI)attemptstomeasurehowmuchthecolorappearanceofobjectsispreservedwhentheyareilluminatedbyagivenlightsource.Thisproblemisofgreatimportanceforvariousindustrialandscientificfields,suchaslightingarchitecture,design,ergonomics,etc.

AlightsourcecanbespecifiedthroughtheCorrelatedcolortemperatureorCCT.Howevertwo(ormore)lightsourceswiththesameCCTbutdifferentspectralpowerdistributioncanexist.ThereforecolorsamplesviewedundertwolightsourceswithequalCCTscanappeardifferent.Hence,theneedforamethodtoassessthequalityofagivenilluminant.

RecentlyCRIhashadarenewedinterestbecauseofthenewLED-basedlightingsystems.Theyusually,haveacolorrenderingindexratherlow,butgoodpreservationofcolorappearanceandapleasantvisualappearance(visualappeal).Variousattemptstodevelopa

newcolorrenderingindexhavebeendonesofar,butstillresearchisworkingforabetterone.

Thisarticledescribesanexperimentperformedbyhumanobserversconcerningtheappearancepreservationofsomelightsources,comparingitwitharangeofavailablecolorrenderingindices.

8652-16, Session 5

Color universal design: analysis of color category dependency on color vision type (4)TomohiroIkeda,KogakuinUniv.(Japan);YasuyoG.Ichihara,KogakuinUniv.(Japan)andNPOColorUniversalDesignOrganization(CUDO)(Japan);NatsukiKojima,KogakuinUniv(Japan);HisayaTanaka,KogakuinUniv.(Japan);KeiIto,TheUniv.ofTokyo(Japan)andNPOColorUniversalDesignOrganization(CUDO)(Japan)

ColorUniversalDesignisaneasy-to-understandsystemthatwascreatedtoconveycolor-codedinformationaccuratelytomostpeople.Inthisstudy,weconductedtwoexperimentstodevelopColorUniversalDesign.Inthefirstexperiment,theconfusionlocuswasverified.WeresearchedtheinconsistencyoftheconfusionlocusthatwefoundinapreviousstudyusinganexperimentalcolorchartbasedontheCIELABuniformcolorspace.ThesubjectshadP-typeandD-typecolorvision.Inthisexperiment,apracticalconfusionlocuswasdeterminedandverified.Thesecondexperimentinvolvedcolorclassificationusingthe100HueTest.WeinvestigatedacolorclassificationsystemthatdoesnotdependonthecolornamesfromC-type,P-type,andD-typecolorvision.Inthisexperiment,wecouldidentifyuniquecolorcategoriesthatdidnotdependonthecolornamesfromeachtypeofcolorvision.

8652-17, Session 5

Color naming 65,274,705,768 pixelsNathanMoroney,Hewlett-PackardLabs.(UnitedStates);GiordanoBeretta,Hewlett-PackardCo.(UnitedStates)

StartingwithasnapshotofaquarterofamillionWikipediaimages,roughlytwohundredthousandJPEGfileswereanalyzedusinganumberofalgorithms,includingamachinecolornamingmodule.Theprocessingtechniquesandcorrespondingresultsofcolornaming65,274,705,768pixelsisbothchallengingandinformative.ThespecificimagesanalyzedarebasedontheImageCLEF2010wikipediaretrievaltaskandbasicimageprocessingoperations,suchasdecompression,madeuseofopensourceimagingcode.Customanalysis,suchasmachinecolornaming,wasimplementedasaseparatemoduleinwhichalistingofinputimageswasprocessedtoderivedescriptivemetricsperimage.ThescaleofthedataanduniquenessoftheresultingmetricswasalsosuchthatitwasmoreefficienttocreatecustomHTML52Dcanvasvisualizationsoftheresults.Thispaperprovidesamoredetaileddescriptionoftheprocessingstepsandthecorrespondingresults.Thisisincludesanintroductorydiscussionofdata-intensiveresearchforimagingandvisualization.Onarelativescale,theroughly20gigabytesofwikipediaimagedataisbothmuchsmallerthantheterabyteimagedatabasesgeneratedbysomeprojectsandalsomuchlargerthanthehalfdozenorsoimagesprocessedandevaluatedbyotherprojects.Howeverwikipediaisawidelyknownwebreferenceandgeneralobservationsaboutthepropertiesofthiscollectionofimagesshouldbeofgeneralinterest.Afterthisbackgroundsection,anumberofbasicimageanalysismodulesaredescribedandtheresultsreportedbeforethemachinecolornamingisperformed.Thisbackgroundanalysisprovidesabettersummaryoftheimagesandincludesmetricssuchasaspectratio,mediandevicecolor,histogramstatistics,andinferredwhiteandblackpointsfortheimages.Thetrendsanddistributionsforeachoftheseisconsideredinsummaryform.Finally,themachinecolornamingisappliedandtheresultingdistributionsarereported.Theseresultsarebasedonanalysisthatusesafixednumberofpre-definedcolortermsorcolorvocabulary.Foraroughly30colortermvocabularyitiseasytoidentify

Conference 8652


theroughly12%ofwikipediaimagesthanonlyincludeachromaticcolorterms.Oftheremainingimagesthemostfrequentlyassignedcolortermsaredescribedandillustrated.Thecolortermsusedperimageisalsoconsideredandatrendforamostlymonotonicallyincreasingnumberofcolortermsperimageisinvestigated.

8652-18, Session 5

Analysis of brain activity and response to colour stimuli during learning tasks: an EEG studyRaffaellaFolgieri,ClaudioLucchiari,DanieleL.R.Marini,Univ.degliStudidiMilano(Italy)

Allweknowtheattractivepowerofcolours,andmanystudieshavebeendoneaboutthepsychologyofcoloursandtheirimpactonhumanactivities,perception,memoryandeducation.

Inthisworkweanalyseddatacollectedfromasampleofstudentsinvolvedinalearningprocessduringwhichtheyreceivedvisualstimulibasedoncolourvariation.Thestimuliconcernedboththebackgroundofthetexttolearnandthecolourofthecharacters.Theexperimentindicatedsomeinterestingresultsconcerningtheuseofprimaryandcomplementarycolours,identifiedfollowingthedefinitionoftheChevreul’scircle.Theobserveddata,collectedthroughtheregistrationofEEGwaves(byaBCIdevice)andthroughafinalevaluationtest,andthefollowinganalysisindicatedsomevariationdependingonthechooseofthecolours(primaryorcomplementary).Theobtainedresultsencouragedustocontinueexperimentsinafuturephaseextendingtheresearchtoother,morecomplex,stimuli.

8652-41, Session 5

Prototypical colors of skin, green plant, and blue skyHuanzhaoZeng,QualcommInc.(UnitedStates);M.RonnierLuo,Univ.ofLeeds(UnitedKingdom)

Coloursofskin,greenplant,andblueskyofdigitalphotographiccolourimageswerestudiedformodellinganddetectionofthesethreeimportantmemorycolourregions.ThecolourmodellingofthesethreeregionsinCIELABandCAM02-UCSwaspresented,andthepropertiesofthesethreecolourgroupswereinvestigated.

8652-19, Session 6

Direct binary search (DBS) algorithm with constraintsKartheekChandu,MikelJ.Stanich,RicohProductionPrintSolutions,LLC(UnitedStates);ChaiWahWu,BarryM.Trager,IBMThomasJ.WatsonResearchCtr.(UnitedStates)

Inthispaper,wedescribeaddingconstraintstotheDirectBinarySearch(DBS)algorithm,suchasrequiringthehalftonepatterntohaveonlyonedotpercolumnandrow,whichisimplementedusingmorethantwotogglesduringeachtrialoperation.ImplementationsoftheDBSalgorithmtraditionallylimitoperationstoeitheronetoggleorswapduringeachtrial.Intheexamplecase,usedtoillustratetheDBSalgorithmwithaconstraintinthispaper,thealgorithmproducesawrap-aroundpatternwithuniformlydistributedONpixelswhichhasapleasingappearancethatcontainspreciselyoneONpixelpereachcolumnandrow.ThealgorithmstartswithaninitialcontinuoustoneimageandaninitialpatternhavingonlyoneONpixelpercolumnandrow.TheautocorrelationfunctionofHumanVisualSystem(HVS)modelisdeterminedalongwithaninitialperceivederror.MultipleoperationpixelerrorprocessingduringeachiterationisusedtoenforcetheoneONpixelpercolumnandrowconstraint.TheconstraintofasingleONpixelpercolumnandrowisusedasanexampleinthis

paper.FurthermodificationoftheDBSalgorithmforotherconstraintsispossible,basedonthedetailsgiveninthepaper.AmathematicalframeworktoextendthealgorithmtothemoregeneralcaseofDirectMulti-bitSearch(DMS)ispresented.

8652-20, Session 6

Improved spectral vector error diffusion by dot gain compensationDanielNyström,LinköpingUniv.(Sweden);OleL.Norberg,VoxvilAB(Sweden)

SpectralVectorErrorDiffusion,sVED,isanalternativeapproachtoachievespectralcolorreproduction,i.e.reproducingthespectralreflectanceofanoriginal,creatingareproductionthatwillmatchunderanyillumination.Foreachpixelinthespectralimage,thecolorantcombinationproducingthespectrumclosesttothetargetspectrumisselected,andthespectralerrorisdiffusedtosurroundingpixelsusinganerrordistributionfilter.However,sincethecolorantseparationandhalftoningisperformedinasinglestepinsVED,thecompensationfordotgaincannotbemadeforeachcolorchannelindependently,asinaconventionalworkflowwherethecolorantseparationandhalftoningisperformedsequentially.Inthisstudy,wemodifythesVEDroutinetocompensateforthedotgain,applyingtheYule-Nielsenn-factortomodifyeachofthetargetspectra.Aglobaln-factor,optimalforeachprintresolution,reducesthespectralreproductionerrorsbyafactorof4,whileann-factorthatisindividuallyoptimizedforeachtargetspectrumreducesthespectralreproductionerrorto7%ofthatfortheunmodifiedprints.TheresultsclearlyillustratethenecessitytoaccountforthedotgainintheprintingprocessinordertomakeSpectralVectorErrorDiffusionarealisticalternativeforspectralcolorreproduction.

8652-21, Session 6

Extending color primary set in spectral vector error diffusion by multilevel halftoningOleL.Norberg,VoxvilAB(Sweden);DanielNyström,LinköpingUniv.(Sweden)

Inviewofthefactthatthehumanvisualsystemincludesthreedifferenttypesofcolorreceptors,threecolorchannelsaresufficienttoapproximatetherangeofcolorsseenbythehuman.However,thistricromaticapproachonlyhastheabilitytomatchcolorswhentheilluminationforthereproductionagreeswiththatoftheoriginal.Byaddingadditionalcolorchannelstotheprintingsystemanoptionalapproach,spectralprinting,canbeappliedtoreproducethespectralinformationoftheoriginalcolor.AtemptingapproachtospectralprintingisSpectralVectorErrorDiffusion,sVED,werethetransformationfromspectraldatatomulti-binarycolorantimageisperformedinonesingleprocessincludingbothcolorseparationandhalftoning.

EssentialfortheperformanceofthesVEDisthesetofavailablecolorantcombinations.Largercolorsetsandoptimalspectralcharacteristicsareexpectedtosignificantlyimprovethecoloraccuracyofthespectralreproduction.

Inthisstudy,sVEDhasbeenappliedonatenchannelinkjetsystem.Theoutputresolutionhasbeenreducedandtheunderlyinghigherprintresolutionhasbeenusedformulti-levelhalftoningtechniqueandtherebysignificantlyexpandedtheavailablecolorantcombinationsfromtheNeugebauerprimaries.ResultsfromthisstudyshowsthatbycombineNeugebauerprimarieswithlightercombinationstheaveragecolordifferencecanbereducedtoonetenthofthereproductionbyNeugebauerprimariesonly.Theresultsillustratestheimportanceoftheavailablecolorantcombinationsandintheprolongationalsothespectralcharacteristicsoftheprocessinksused.

Conference 8652


8652-22, Session 6

Reducing auto moiré in discrete line juxtaposed halftoning VahidBabaei,RogerD.Hersch,EcolePolytechniqueFédéraledeLausanne(Switzerland)

Discretelinejuxtaposedhalftoningcreatescolorhalftoneswithdiscretelinesofvariousrationalthicknesseslaidoutsidebyside,formingthedifferentcolorantsurfaces.Screenelementsaremadeofparallelogramscreentilesincorporatingthecolorantdiscretelines.Therepetitionofdiscretelinesfromonescreenelementtothenextmaycreateautomoiréartifacts.Bydecomposingeachscreentileintotwosub-tilesofrationalthicknesses,weensurethattwosuccessivediscretelineshavedifferentphasesinrespecttotheunderlyingpixelgrid.Thisresultsinahigherscreenfrequencyandinarepetitionvectorwhichisdifferentfromonediscretelinetothenextdiscretelineofthesamecolorant.Themainlow-frequencyartifactisbrokenintohigherfrequencyartifactswhicharenotvisibleanymore.

8652-23, Session 7

Optimizing CMYK mapping for high speed digital inkjet webpressRuzhuZeng,LimingVocationalUniv.(China);HuanzhaoZeng,QualcommInc.(UnitedStates)

TheCMYKtoCMYKmappingpreservingtheblackchannelisamethodtofixtheprobleminstandardICCcolormanagementthatcannotpreservetheKchannelinprintingCMYKcontents.Whilethemethodhasbeensuccessfullyusedfordigitalcommercialprinting,limitationsandareasforimprovementarefound.ToaddresstheseproblemsingeneratingCMYKre-renderingtables,analternativemethodisdeveloped.TheKusageandtotalinkusageareoptimizedinacolorseparationstep.InsteadofpreservingtheKchannelglobally,itonlypreservesK-onlygraysanddeterminesthemappingofothercolorsbyoptimizingtheprintquality.

8652-24, Session 7

Estimating toner usage with laser electrophotographic printersLuWang,PurdueUniv.(UnitedStates);DennisAbramsohn,ThomIves,MarkQ.Shaw,Hewlett-PackardCo.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates)

Estimatingtonerusageisanimportantproblemformanufacturersoflaserelectrophotographicprinters.Wehavedevelopedanewapproachtothisproblemthatcombinesimageanalysistechniqueswithphysicalmeasurements.Wewillpresentexperimentalresultsthatdemonstratetheefficacyofourmethod.

8652-25, Session 7

Perceived acceptability of colour matching for changing substrate white point KwameF.Baah,Univ.oftheArtsLondon(UnitedKingdom);PhilGreen,GjøvikUniv.College(Norway);MichaelR.Pointer,LondonCollegeofCommunication(UnitedKingdom)

Productionandproofingsubstratesfrequentlydifferintheirwhitepoints.Substratewhitepointsfrequentlydifferbetweenreferenceandsample,forexamplebetweenproofandprint,orbetweenatargetpapercolourandanactualproductionpaper.Toachieveanacceptablevisualmatchbetweencoloursspecifiedforthereferencesubstrate,whenprintedonadifferentmaterial,printedcolourscanbeadjustedtocompensateforobserveradaptationtothesubstratewhitepoint.Awidely-usedmethodofadjustmentistoconvertallmeasurementdatatomedia-relativemeasurements,thusscalingthetristimulusvaluesbytheratioofreferencetosamplewhitepoints.

Usingthisapproachevaluationsofcolourdifferenceofacceptabilitybetweenreproduciblecolouranditsperceivedappearanceforsamplesubstrateswereconductedusingthreemodesofproof-to-printpsychophysicalassessmentsnamely,simultaneous,sequentialanddisplay.

Thepsychophysicaltestsshowedthatsimultaneousandsequentialmodeshadsimilarresultsbutthedisplaymodeproducedahighertolerance.Referencepaperthresholdsformediatypecorrectionsofsolidcoloursproducedacolourdifferencethatwaslargerthanretargetingtoasimilarpapertype.Forlighttintsthethresholdwastheoppositewithathresholdoflessthan2.5DEabwhichislessthanwhatisexpectedfromretargetingsimilarpaper.

8652-26, Session 7

The development of vector based 2.5D print methods for a painting machineCarinnaE.Parraman,Univ.oftheWestofEngland(UnitedKingdom)

Throughrecenttrendsintheapplicationofdigitallyprinteddecorativefinishestoproducts,CAD,3Dadditivemanufacturingandresearchinmaterialperception,thereisagrowinginterestintheaccuraterenderingofmaterialsandtangibledisplays.Althoughcurrentadvancesincolourmanagementandinkjetprintinghasmeantthatuserscantakeforgrantedhigh-qualitycolourandresolutionintheirprintedimages,digitalmethodsfortransferringaphotographiccolouredimagefromscreentopaperisconstrainedbypixelcount,filesize,colorimetricconversionbetweencolourspacesandthegamutlimitsofinputandoutputdevices.Thispaperconsidersnewapproachestoapplyingalternativecolourpalettesbyusingavector-basedapproachthroughtheapplicationofpaintmixtures,towardswhatcouldbedescribedasa2.5Dprintingmethod.Theobjectiveistonotapplyanimagetoatexturedsurface,butwheretextureandcolourareintegraltothemark,thatlikeabrush,delineatesthecontoursintheimage.Thepaperdescribesthedifferencebetweenthewayinksandpaintsaremixedandapplied.Whentranscribingthefluidappearanceofabrushstroke,thereisadifferencebetweenahalftoneprintedmarkandapaintedmark.Theissueofsurfacequalityissignificanttosubjectivequalitieswhenstudyingtheappearanceofinkorpaintonpaper.Thepaperprovidesexamplesofarangeofvectormarksthatarethentranscribedintobrushstokesbythepaintingmachine.

8652-27, Session 7

Anti-counterfeiting model of printing micro-mirror in CMYK color spaceTangCheng,YixinZhang,JiangnanUniv.(China)

Opticalvariabledevices(OVD)foranti-counterfeitingapplicationsmainlyarerealizedbymicro-gratingstructure.Thediffractiongratingshouldbelessthan1micronandneedsexpensivehotstampingfoilasthecarrierofOVDinformation.Thepurposeofthispaperistodevelopareflectiveprintingmicro-mirrorstructureforthedesignofopticallyvariabledevices(OVD),whichcanbeduplicatedontothesecuritydocumentinnormalprintingormoldingprocess.Basedonthereflectivepropertiesofprintingmicro-mirrorandmodelofCMYKcolorspace,weestablishanewprintingmicro-mirrorgeometrymodel.Inthismodel,consideringtheeffectsofmicro-mirrorsurfaceshapeandmirrorslopeangleonthereflectedbeams,andtheutilizingoftheJacobianofmicro-mirrorsurfacefunctionandgradientfunction,weanalyzethereflectionbrightnessandstabilityofCMYKprintingmicro-mirror.OurresultsshowthatJacobianisapproximatelyequalto0andgradientisafunctionofcoordinatevalues.Itisprovedthatreflectedbeamsofprintingmicro-mirrorsurfacehavesufficientbrightnessandstability.Therefore,theprintingmicro-mirrorissuitabletocreateanti-counterfeitingopticalvariabledevice,whoseintensityofreflectedbeamswillbespatiallychangedwhenpositionorangleofviewingmodulate.Inthispaper,wedesignananti-counterfeitingopticalvariabledevicewhichusessomeprintingmicro-mirrorofdifferentCMYKvaluesandasimulatingsecuritypatternofprintingmicro-mirrorarrayisprovided.

Conference 8652


8652-28, Session 8

Unsupervised correction of relative longitudinal aberrations for multispectral imaging using a multiresolution approachJulieKlein,RWTHAachen(Germany)

Longitudinalaberrationsappearinmultispectralcamerasfeaturingamonochromesensorwithseveralopticalfiltersinfrontofit.Duetotheslightlydifferentopticalpropertiesofthefilters,thefocallengthsaredifferentandtheimagescannotremainsharpforallthecolorchannels.Weseekforanunsupervisedcorrectionoftheseaberrations,relativetoagivenreferencecolorchannel.“Unsupervised”meansherethatnocalibrationofthesystemisneeded.Weuseamultiresolutionapproachthattakesadvantageofthehighcontrastpresentinthereferencechannelandthatutilizesthisinformationfortheother,moreblurredchannels.Theresultsofthiscorrectionareevaluatedusingthesharpnessofthecorrectedimagewithrespecttotheoriginalblurredimageandusingthecoloraccuracy:analgorithmthatwouldcorruptthespectralinformationofmultispectralimageswouldnotbehelpful.Moreover,usingtheoriginalimageandtheonecorrectedwiththealgorithm,wecancalculatethepointspreadfunctionofthelongitudinalaberrations.Wethencompareittothepointspreadfunctionobtainedwithanothermethodwhichisbasedonthecaptureofanoisechartandthusrequirescalibration.

8652-29, Session 8

Acquisition of multi-spectral flash image using optimization method via weight mapBong-SeokChoi,DaechulKim,KyungpookNationalUniv.(Korea,Republicof);Oh-SeolKwon,ChangwonNationalUniv.(Korea,Republicof);Yeong-HoHa,KyungpookNationalUniv.(Korea,Republicof)

Acquisitioningmulti-spectralflashimage,Inthispaperproposeoptimizationmethodbasedonweightmaptocompensatecolorrepresentation.WegeneratedweightmapbyapplyingcannyedgeoperatortoYchannelofUV-IRspectrumimage.Thisweightmaprepresentfeatureofthescene.inorderword,Color,detail,andfeatureinformationareappliedtooptimizationprocess.Weapplyweightmapinordertodiscriminateweightinginoptimizationprocesswithregionandedge.Intheoptimizationprocess,smoothregiontoincreasingcolorinformationweight,decreasingedgeinformationweight.Onthecontrarytothis,edgeregiontoincreasingedgeinformationweight,decreasingcolorinformationweight.Ourmethodcanbeenhancingcolorreproductionandremovingartifacts.WeevaluatetheperformanceoftheproposedalgorithmusingPSNR,MSSIM,andz-scorewithlongexposureimage.Experimentsshowthatbetterresultthanpreviousmethod.

8652-31, Session 9

Adaptive local backlight dimming algorithm based on local histogram and image characteristicsEhsanNadernejad,NinoBurini,JariKorhonen,SørenO.Forchhammer,ClaireMantel,TechnicalUniv.ofDenmark(Denmark)

LiquidCrystalDisplays(LCDs)withLEDbacklightaretodayaverypopulardisplaytechnology,usedforinstanceintelevisions,monitorsandmobilephones.Itispossibletoincreasetheenergyefficiencyofsuchdisplaysandpreservetheimagequalitythroughtheuseofthebacklightdimming.Therearedifferentbacklightdimmingalgorithms,eachonewithadifferenttradeoffbetweenpowersavingsandimagequality.Inthispaper,wepresentalowcomplexityandadaptivealgorithmfordimmingofLED-backlightdisplaysthatusescharacteristicsofthetargetimage.Theproposedalgorithmusesthelocalfeaturesofthetargetimagesuchasthelocalhistogramandtheaveragetofindtheoptimalbacklightluminance.Theproposedmethod

hasbeenappliedontwomodeledscreensusingalargeselectionofimagesofdifferentkindsandsourcesincluding,forinstance,high-contrastandhigh-luminanceimages;onescreenhasahighresolutiondirect-litbacklight,whilethebacklightoftheotherscreenisedge-litwith16segmentsplacedintwocolumnsandeightrows.Resultsshowthattheproposedalgorithmcansavethepowerconsumptionandimagequalityoftheliquidcrystaldisplaywhencomparingagainstotheralgorithms.

8652-32, Session 9

Color generation and modulation from transparent electroactive polymer gratingsEunkyoungKim,HaejinShin,XuYang,ByeonggwanKim,YonseiUniv.(Korea,Republicof)

Nanostructuresofthesolution-processableelectroactivepolymerscouldbeefficientlyobtainedatlowtemperatureviaMIMICanimprintingmethods.Subwavelengthgratingswithelectroactivepolymerssuchaspoly(3-hexylthiophene)(P3HT)andpoly(3,4-propylenedioxythiophene-phenylene)(P(ProDOT-Ph))controlledthecolorintensityforvariousvisiblecolorsofdiffractedlightinasingledevice.Undertheilluminationofawhitelight,atafixedangleofincidence,thecolorintensityofthediffractedlightwasreversiblyswitchedfromthemaximumvaluedownto15%(85%decrease)byapplying?2to2Vduetoelectrochemical(EC)reaction.Allspectralcolorsincludingred,green,andblueweregeneratedbychangingtheangleofincidence,andtheintensityofeachcolorwasmodulatedelectrochemicallyatasingleECdevice.Withelectroactivesubwavelengthgratings(ESWGs)ofP3HT,themaximummodulationofthecolorintensitywasobservedinthered-yellowquadrantintheCIEcolorplot,whereasfortheESWGsofP(ProDOT-Ph),themaximummodulationofthecolorintensitywasobservedintheyellow-greenandgreen-bluequadrants.BothESWGsshowedamemoryeffect,keepingtheircolorandintensityevenafterpowerwasturnedoffforlongerthan40hours.

Inthisreport,asimpleelectrochromic(EC)devicebasedontransparentelectroactivepolymergratings(EPG)willbepresentedasanewcolorgenerationaswellasmdulatonmethod.

8652-33, Session 9

Optimizing color fidelity for display devices using contour phase predictive coding for text, graphics, and video contentFritzLebowsky,STMicroelectronics(France)

High-endmonitorsandTVsbasedonLCDtechnologycontinuetoincreasetheirnativedisplayresolutionto4k2kandbeyond.Subsequently,uncompressedpixeldatatransmissionbecomescostlywhentransmittingovercableorwirelesscommunicationchannels.Formotionvideocontent,spatialpreprocessingfromYCbCr444toYCbCr420iswidelyaccepted.However,duetospatiallowpassfilteringinhorizontalandverticaldirection,qualityandreadabilityofsmalltextandgraphicscontentisheavilycompromisedwhencolorcontrastishighinchrominancechannels.Ontheotherhand,straightforwardYCbCr444compressionbasedonmathematicalerrorcodingschemesquiteoftenlacksoptimaladaptationtovisuallysignificantimagecontent.

Therefore,wepresenttheideaofdetectingsyntheticsmalltextfontsandfinegraphicsandapplyingcontourphasepredictivecodingforimprovedtextandgraphicsrenderingatthedecoderside.Usingapredictiveparametrictextcontourmodelandtransmittingcorrelatedphaseinformationacrossallthreecolorchannelscombinedwithforeground/backgroundcolorvectorsofalocalcolormappromisestoovercomeweaknessesincompressionschemesthatprocessluminanceandchrominancechannelsseparately.Theresidualerrorofthepredictivemodelismoreeasilybeingminimizedsincethedecoderisanintegralpartoftheencoder.Wealsointroducetheconceptofanauto-adaptivedifferentialcolormapthatimprovescolorqualityovertimebyapplyingstatisticalanalysisofcolorlociintextandgraphicsregions.Acomparativeanalysisbasedonsomecompetitivesolutions

Conference 8652


highlightstheeffectivenessofourapproachandidentifiesitscurrentlimitationswithregardtohighqualitycolorrenderinganditsremainingvisualartifacts.

8652-34, Session 9

Content-dependent contrast enhancement for displays based on cumulative distribution functionSeul-KiJang,Yoon-GyooLee,InhaUniv.(Korea,Republicof);Gyoung-SooPark,InhaUniv(Korea,Republicof);Choon-WooKim,InhaUniv.(Korea,Republicof)

Perceivedcontrastisoneofthemostimportantattributesaffectingimagequalityondisplays.Threedimensionaldisplaysrequirecontrastenhancementtechniquestocompensateforreductioninluminancelevels.Also,mobiledisplaysdemandefficientcontrastenhancementtechniquestoimprovedegreeofvisibilityunderoutdoorviewingenvironmentsandreducepowerconsumption.Thispaperpresentsanewcontentdependentcontrastenhancementtechniquefordisplays.Cumulativedistributionfunctionsofanimageareclassifiedintoanumberofclassesbasedonthecriteriondeterminedbytheresultsofimagequalityevaluation.Also,asimpletonemappingfunctionisdesignedforeachofclassofcumulativedistributionfunctions.Theproposedtechniqueissimpleenoughforonlineimplementationformovingimages.Inthispaper,criterionofclassificationofcumulativedistributionfunctionsisdescribed.Also,constructionoftonemappingfunctionisexplained.

8652-35, Session 10

Can trichromats really know what dichromats see? (Invited Paper)MichaelH.Brill,Datacolor(UnitedStates)

Cantrichromatsreallyknowwhatdichromatssee?

Presumptivelytheanswerwouldbe“yes”,judgingfromseveralalgorithmsandsoftwareapplicationsthatsimulatetheappearancetoadichromatofanygiventrichromaticimage.1-4Mypurposehereistochallengethatpresumption.

Weknowwhatsetsoftristimuliarematchesforeachkindofdichromat.Theconfusionlociareparallellinesegmentsthroughtristimulusspace.Butthatsaysnothingabouthowtomaptheappearanceofadichromaticcolorontheappearancefromatrichromaticspace.Itisnotevennecessaryforthedichromaticappearanceofalighttomatchthetrichromaticappearanceofoneofthelightsonaconfusionlocus.Sohowcanonemakethemap?

Ononelevel,thequestiondevolvestotheclassicphilosophicalconundrumofmynotbeingabletoknowifIseethesamebluethatyoudo.Thesituationissavedtosomeextentbytheexistenceofunilateraldichromats.Theretheappearancematchesbetweenthedichromaticeyeandthetrichromaticeyepromisetobealegitimate“RosettaStone”.Indeed,unilateraldichromatsdeposethenaïvemodelofadichromat’scoloralwayshavingtheappearanceofoneofitsconfusionaliasesintrichromaticvision.Buttobetrustworthy,color-appearancematchesmustbemadeceteraparibus---thatis,allothervariablesbeingequal.Thespatialcontextofascenealwaysaffectstheappearanceofacolorinthatscene,andthecontextsthemselvescannotbeequatedbetweenadichromatandatrichromat.Youwouldhavetoasktheunilateraldichromattomatchallthecolorsinallthepossiblescenesinyouruniversetobesurethatyouhadagoodsimulation.Anadditionalcomplicationisthatunilateraldichromatsaresorarethatwecannotbetoofussyaboutassuringthatthetrichromaticeyeisreally“normal.”Finally,thecolorsdichromatsseecanbeasunstableasGestalteffectsliketheNeckercube.Ifyoustillwanttopredictandsimulatewhatdichromatssee,youhavetrulypassed…totheDarkSideoftheColor.

8652-36, Session 10

Color scales for visualization: traveling though color space (Invited Paper)BerniceE.Rogowitz,VisualPerspectivesConsulting(UnitedStates)

Colorisoftenusedtorepresentdatainvisualizationandimaging.Acolormap,forexample,mapsnumericalvaluesontoacolorscale.Eachcolorscaleisatrajectorythroughathree-dimensionalcolorspace.But,whichtrajectoriesarethebest?Apopularideaisthatanytrajectorywhereequalstepsinthedatacorrespondtoequalstepsinthecolorspaceissufficienttoproduceagoodresult.Accordingtothishypothesis,equalstepsaroundthehuecircleorequalstepsinluminanceshouldproduceequallygoodcolormaps.Thistalkchallengesthisassumptionandprovidesinsightintootherperceptualfactorsatplay.

8652-37, Session 10

Color spaces (Invited Paper)JanJ.Koenderink,TechnischeUniv.Delft(Netherlands)

Colorspacesmaybedividedinto“colorimetric”and“eyemeasure”,althoughthereexistsomeinbetweencasesofindeterminateontologicalstatus.Colorimetricconstructionsuseonlyjudgmentsofindiscriminability,whereaseyemeasuremethodsrelyonfirstpersonreports.IconsidercolorimetricapproachesthatgobeyondthelinearHausdorffspace.Thesebecomepossiblewhenadditionalconstraintsareintroduced.Iconsiderthesimplestmodelofgenericobjectcolors,whichinvolvesspectrallyselectiveLambertiansurfacesunderaveragedaylightillumination.Thisreflectslikelyevolutionarypressuretooptimizetheabilityofdisciminatingsurfacesingenericscenes.Thereexistsauniquegeometricalrepresentationthatmaximizesthecolorgamutcapturedbytheunitcubeofthevectorbasis.Inthisrepresentationcolorvisionbehaveslike3-binspectroscopy,essentiallyyieldingRGB-coordinatesintherange0-1.Intermsofthenaturalmetric,andusingthesymmetriesoftheRGB-cube,oneobtainsacolorspacethatisverysimilartotheOstwaldspace.Althoughindeedpurelycolorimetric,thereisastrongsimilaritytoeyemeasuresystems.Becausethisspacehardlydependsuponarbitraryconventions(onlythechoiceofaveragedaylightis“arbitrary”,althoughreasonablebecauseofevolutionary/ecologicalconsiderations),ithasmanyadvantagesoverconventionalsystems.

8652-38, Session 10

You can’t rely on color, yet we all do (Invited Paper)FlorisL.vanNes,TechnischeUniv.Eindhoven(Netherlands)

Colorisoneofthemostsalientfeaturesofthevisualfield.Beingthrilledbycolor,manearlyinhisdevelopmentlearnedhowtouseit,asforbodypaint–whichcontinuedinfashion,art,forpaintingartefacts.Inaway,manreliedoncolor,forembellishingandlater,whencodingwasinvented,forattachingmeaningtoobjectshavingacertaincolor.

However,thismagnificentpropertyofallobjectsaroundusisalsoelusive,assoonasyouwanttoreallyspecifyit.Notwithstandingperceptualphenomenasuchascolorconstancyonehastocopewith(i)dependenceofallsurfacecolorsonthecolorofillumination;(ii)thefactthatitisnoteasyforonepersontocommunicatetheprecisecolorofanobjecttoanotherperson–especiallyoftheoppositesex;(iii)observingthatcolorsfromvisualdisplaysareoftenstronglydesaturatedbysurroundlight.Tomentionbutafew.Andallthisisratherpoorlyunderstood,ifatall,bythegeneralpublic.Sometimesevenbypeoplewhoarereasonablyknowledgableinvisualmatters,suchasvisualergonomistsordisplayengineers.Afterall,itisnotarareoccurrencethatmulti-coloredtexts,inprintoronslidesshownbyvisualscientistsareillegiblebecausethetextforegroundandbackgroundcolorswereill-chosen.

Conference 8652


Butcolorissoimportantthatpeoplecanhardlyavoidto‘relyoncolor’–sotheydo.Theyrelyonitspredictability;whentheybuyclothesviatheinternet,forinstance,orselectaholidaydestinationfromanalluringlandscapepictureonacalendar.

Astoprofessionalusers:designersofmodernelectronicvisualdisplaysneedtomastermanyfields;colorimetryofonekindoranotherbeingoneofthem.

Myexperienceofseveraldecadeshastaughtmethatdealingwithcolorsondisplaysalwaysturnedouttotaxmostwritersofergonomicsdisplaystandardsheavily–aswellastheprofessionalsindisplaymanufacturing.Because,theorganizationresponsiblefortheagreedstandardsforspecificationandmeasurementofalldimensionsofcolor,theCommissionInternationaledel’Éclairage,(CIE),hasforalongtimenowcreatedchromaticitydiagramsandcolorspacesthatarecomplexandfarfromeasytouseby‘practitioners’inthecolordisplayindustry,andthewritersofstandardsfortheirproducts.

Ofcourse,quantifyingfacetsofcolorisadifficultthingtodo.Yet,therearemanypeoplewhowouldapplaudCIEifthisillustriousorganizationwoulddevoteapartofitsenergytothedevelopmentofacolorspecificationandmeasurementsystemwithasimpleryeteasilyusablecolorspacethan,forinstance,CIECAM02–oritssuccessorthatmayalreadybewaitinginthewings.PerhapsCIEthenshouldresorttothewellestablishedergonomicsmethodofuserinvolvement,theusersbeingcolorsciencepractitioners.Furthermore,itwouldberevolutionarybutrecommendableifaninternationalorganizationsuchastheCIEwouldstartanefforttoeducatethepopulationatlargebetterincolormatters.Onegroupthatcouldbenefitfromsuchanactionarecolor-defectiveviewers.Timeandagaintheyarethevictimofinappropriatecolorchoicesforallsortsofproductsthathavebeencolor-coded,sometimesforimportantinformationintermsofhealthorsafety,bypeoplewhohavenormalcolorvisionbutlackknowledgeonitsintricasies,andwhothereforearenaiveastotheperceptionofthesechoicesbythosewithcolorvisionabnormalities.

8652-39, Session 10

How ‘high-level’ is human color perception? (Invited Paper)MichaelE.Rudd,Univ.ofWashington(UnitedStates)

Colorisoftenusedtorepresentdatainvisualizationandimaging.Acolormap,forexample,mapsnumericalvaluesontoacolorscale.Eachcolorscaleisatrajectorythroughathree-dimensionalcolorspace.But,whichtrajectoriesarethebest?Apopularideaisthatanytrajectorywhereequalstepsinthedatacorrespondtoequalstepsinthecolorspaceissufficienttoproduceagoodresult.Accordingtothishypothesis,equalstepsaroundthehuecircleorequalstepsinluminanceshouldproduceequallygoodcolormaps.Thistalkchallengesthisassumptionandprovidesinsightintootherperceptualfactorsatplay.

8652-40, Session 10

Complex spatiochromatic interactions in a real world art laboratoryScottJ.Daly,DolbyLabs.,Inc.(UnitedStates)

AndyGoldsworthyisanenvironmentalartistwhosethemeforseveraldecadeshasbeentheartisticmanipulationofinsitunaturalmaterialstoformimpermanentoutdoorsculptures.Onecommonthreadisthere-organizationofobjectsbytheircolor,turningnaturalandconsequentlymorerandomdistributionsintohighlyorderedarrangements.Indoingso,thecolorsofthenaturalmaterialsseemtochangeasaresult.Thereal-worldartworkinvokesvariousphysicalandperceptualphenomenon,servingaslaboratoryforcolorscience.Inthistalk,wewilltakeonetheseartworks,titledSt.Abbs,theBorders,1985,andqualitativelyinvestigatethephysicalandperceptualcontributorstotheintriguingeffect.Theseprocessesrangefromatmospherichaze,multiplereflections,spatiochromaticCSF,white-pointanchoring,chromaticinduction,etc.Theywillbeabstractedinanattempttocreateatwo-dimensionalsyntheticversionoftheessenceofGoldsworthy’sthree-dimensionalnaturalchromaticillusion.

Conference 8652


Conference 8653: Image Quality and System Performance XTuesday-Thursday5–7February2013

Part of Proceedings of SPIE Vol. 8653 Image Quality and System Performance X

8653-1, Session 1

A no-reference quality assessment algorithm for JPEG2000-compressed images based on local sharpnessPhongV.Vu,DamonM.Chandler,OklahomaStateUniv.(UnitedStates)

Inthispaper,wepresentano-referencequalityassessmentalgorithmforJPEG2000-compressedimagescalledEDIQ(EDge-basedImageQuality).ThealgorithmworksbasedontheassumptionthatthequalityofJPEG2000-compressedimagescanbeevaluatedbythesharpnessoftheedge/near-edgeregionsandthenon-edgeregionswherenoedgesarepresent.EDIQfirstseparatestheinputimageintoedge/near-edgeregionsandnon-edgeregionsbyapplyingCannyedgedetectionandedge-pixeldilation.Ourprevioussharpnessalgorithm,FISH[1],isusedtogenerateasharpnessmap.Thepartofthesharpnessmapcorrespondingtothenon-edgeregionsiscollapsedbyusingrootmeansquaretoyieldthesharpnessindexofthenon-edgeregions.Theotherpartofthesharpnessmap,whichcorrespondstotheedge/near-edgeregions,isweightedbythelocalrootmeansquare(RMS)contrastandthelocalslopeofmagnitudespectrumtoyieldanenhancedsharpnessmap,whichisthencollapsedintothesharpnessindexoftheedge/near-edgeregions.Thesetwoindicesarecombinedbyageometricmeantoyieldaqualityindicatoroftheinputimage.TestingontheJPEG2000subsetsoffourdifferentimage-qualitydatabasesdemonstratethatEDIQiscompetitivewithotherno-referenceimagequalityalgorithmsonJPEG2000-compressedimages.

8653-2, Session 2

Measurement and protocol for evaluating video and still stabilization systemsEtienneCormier,FrédéricCao,FrédéricGuichard,ClémentViard,DxOLabs(France)

Thisarticlepresentsasystemandaprotocoltocharacterizeimagestabilizationsystemsbothforstillimagesandvideos.Itusesonasixaxesplatform,threebeingusedforcamerarotationandthreeforcamerapositioning.Theplatformisprogrammableandcanreproducecomplexmotionsthathavebeentypicallyrecordedbyagyroscopemountedondifferenttypesofcamerasindifferentusecases.Themeasurementusesasinglechartforstillimageandvideos,thetexturedeadleaveschart.Althoughtheproposedimplementationoftheprotocolusesamotionplatform,themeasurementitselfdoesnotrelyonanyspecifichardware.Forstillimages,amodulationtransferfunctionismeasuredindifferentdirectionsandisweightedbyacontrastsensitivityfunction(simulatingthehumanvisualsystemaccuracy)toobtainanacutance.ThesharpnessimprovementduetotheimagestabilizationsystemisagoodmeasurementofperformanceasrecommendedbyaCIPAstandarddraft.Forvideo,somemarkersonthechartaredetectedwithsub-pixelaccuracytodetermineahomographydeformationbetweenthecurrentframeandareferenceposition.Thismodeldescribeswelltheapparentglobalmotionastranslations,butalsorotationsalongtheopticalaxisanddistortionduetotheelectronicrollingshutterequippingmostCMOSsensors.TheprotocolisappliedtoalltypesofcamerassuchasDSC,DSLRandsmartphones.

8653-3, Session 3

Hyperspectral image quality for unmixing and subpixel detection applications

JohnP.Kerekes,DanielS.Goldberg,RochesterInstituteofTechnology(UnitedStates)

Thequalityofremotelysensedhyperspectralimagesisnoteasilyassessedvisually,asthevalueoftheimageryisprimarilyinherentinthespectralinformationembeddedinthedata.Inthecontextofearthobservationordefenseapplications,hyperspectralimagesaregenerallydefinedashighspatialresolution(1to30meterpixels)imagerycollectedindozenstohundredsofcontiguousnarrow(?/??~100)spectralbandsfromairborneorsatelliteplatforms.Twoapplicationsofinterestareunmixingwhichcanbedefinedastheretrievalofpixelconstituentmaterials(usuallycalledendmembers)andtheareafractionrepresentedbyeach,andsubpixeldetection,whichistheabilitytodetectspatiallyunresolvedobjects.Ourapproachisacombinationofempiricalanalysesofairbornehyperspectralimagerytogetherwithsystemmodelingdrivenbyrealinputdata.Initialresultsofourstudyshowthedominanceofspatialresolutionindeterminingtheabilitytodetectsubpixelobjectsandthenecessityofsufficientnumbersofspectralbandsforunmixingaccuracy.Whiletheseresultsarenotunexpected,theresearchhelpstoquantifythesetrendsforthesituationsstudied.Futureworkisaimedatgeneralizingtheseresultsandtoprovidenewpredictiontoolstoassistwithhyperspectralimagingsensordesignandoperation.

8653-4, Session 3

Acceptable bit-rates for human face identification from CCTV imageryAnastasiaTsifouti,Ctr.forAppliedScienceandTechnology(UnitedKingdom)andUniv.ofWestminster(UnitedKingdom);SophieTriantaphillidou,EfthimiaBilissi,Univ.ofWestminster(UnitedKingdom);Mohamed-ChakerLarabi,Univ.ofPoitiers(France)

Theobjectiveofthisinvestigationistoproducerecommendationsforacceptablebit-ratesofCCTVfootageofpeopleonboardLondonbuses.ThemajorityofCCTVrecordersonbusesuseaproprietaryformatbasedontheH.264/AVCvideocodingstandard,exploitingbothspatialandtemporalredundancy.Lowbit-ratesarefavoredintheCCTVindustrybuttheycompromisetheimageusefulnessoftherecordedimagery.Inthiscontextusefulnessisdefinedbythepresenceofenoughfacialinformationremaininginthecompressedimagetoallowaspecialisttoidentifyaperson.Theinvestigationincludesfoursteps:1)Collectionofrepresentativevideofootage.2)Thegroupingofvideoscenesbasedoncontentattributes.3)Psychophysicalinvestigationstoidentifykeyscenes,whicharemostaffectedbycompression.4)Testingofrecordingsystemsusingthekeyscenesandfurtherpsychophysicalinvestigations.Theresultsarehighlydependentuponscenecontent.Forexample,verydarkandverybrightsceneswerethemostchallengingtocompress,requiringhigherbit-ratestomaintainusefulinformation.Theacceptablebit-ratesarealsofoundtobedependentuponthespecificCCTVsystemusedtocompressthefootage,presentingchallengesindrawingconclusionsaboutuniversal‘average’bit-rates.

8653-5, Session 3

Visual acuity and task-based video quality in public safety applicationsJoelDumke,InstituteforTelecommunicationSciences(UnitedStates)

Thispaperexplorestheutilityofvisualacuityasavideoqualitymetricforpublicsafetyapplications.Anexperimenthasbeenconductedtotracktherelationshipbetweenvisualacuityandtheabilitytoperformaforced-choiceobjectrecognitiontaskwithdigitalvideoorvarying


quality.VisualacuityismeasuredaccordingtothesmallestlettersreliablyrecognizedonareducedLogMARchart.

8653-6, Session 4

Improvement of quality metrics performance using binocular just-noticeable-difference for S3D imagesBilelSdiri,Mohamed-ChakerLarabi,XLIM-SIC(France);AzeddineBeghdadi,Univ.Paris13(France)

Inthispaper,weconsidertheBJNDforthedevelopmentofafullreferencemetricforqualityassessmentofstereoscopicimagesbasedonthewell-knownmetrics:PSNRandSSIM,inwhicheachinter-pixeldistortionisweightedbyitsvisibilitythresholdforabinocularvision.ThepreliminaryvalidationofsuchanapproachprovesthatusingtheBinocularJNDaddsaperceptualdimensiontomathematicalmetrics.

8653-7, Session 4

Evaluation of biological effect on luminance of stereoscopic displaysKyosukeTakahashi,ToshiyaNakaguchi,R.Okamoto,I.Shimoyama,Y.Miyake,ChibaUniv.(Japan)

Thestereoscopicimagesprevailinvariousfieldsandthebiologicaleffectofstereoscopicimagesisinvestigatedbyvariousresearches.However,theevaluationofbiologicaleffectonintensityofstereoscopicdisplaysisunknown.Thereforeweevaluatedbiologicaleffectonintensityofstereoscopicdisplaysanddifferenceinindicationmethodbyconductingsubjectiveevaluationandobjectiveevaluation.WemeasuredpupillarylightreflexandR-RintervalinECGasobjectiveevaluation,andconductedtwokindsofdoublestimulievaluationandsinglestimulusevaluation.Thesignificanteffectbytheintensitychangewasshownbydoublestimulievaluationwhereasitwasnotshownbysinglestimulusevaluation.Basedonthisresultitwasshownthatdoublestimulievaluationissuitedforevaluationofbiologicaleffectonintensityofstereoscopicdisplaysthansinglestimulusevaluation.Nosignificantrelationshipwasnotedintheresultsbetweenthepupillarylightreflexandtheintensity.AlthoughsignificantrelationshipwasobtainedintheresultsbetweentheR-RintervalinECGandtheelapsedtimefor30minstimulus,nosignificantrelationshipwasnotedintheresultsbetweentheR-RintervalinECGandtheintensity.Inaddition,weconfirmedexperimentalaccuracyandreproducibilitybyconductingrepetitiveexperiments.

8653-8, Session 4

Three factors that influence the overall quality of the stereoscopic 3D content: image quality, comfort, and realismRalucaVlad,PatriciaLadret,AnneGuérin-Dugué,GIPSA-lab(France)

Anincreasingamountofstereoscopic3Dcontentisnowavailable,butnostandardizedobjectivemethodologyexistswhichcouldindicatewhetherthiscontentis‘ofgoodquality’.Inthiscase,subjectivemethodologiescouldbeapreferredcompromiseforassessingquality.Theyimplythedeploymentofimportantresources,buttheyillustrateverywellhumanperception.However,nostandardqualitymodelthatdefinesthenotionof‘3Dquality’existstodate.Thus,forevaluatingthequalityof3Dstereoscopicdata,exploratorystudiesareneededbeforehandinordertodeterminewhichcharacteristicsofthe3Dcontentshouldbesubmittedtotest.

Inthiscontext,wecarriedoutapsycho-visualqualitativeexplorationofthevariousimpressionsthatstereoscopicimagesproducetoanobserver.Thetestprovedtoberichinresults.Amongitsconclusions,weconsiderofhighestimportancethefactthatwecouldthusdeterminethreedifferentfactors–quality,comfortandsenseofreality

–whichareimportantinstudyinghowtheoverallqualityof3Ddataisperceived.Theresultsofourtestshowthatallofthesethreefactorsshouldbeincludedinanindependentmannerintheassessmentoftheperceived3Dexperience.

8653-9, Session 4

Performance evaluation of HD camcorders: measuring texture distortions using Gabor filters and spatio-velocity CSF KongfengZhu,DietmarSaupe,Univ.Konstanz(Germany)

Thispaperpresentsanewmethodofmeasuringphysicaltexturedistortions(PhTD)toevaluatetheperformanceofHDcamcordersw.r.t.motionandlossycompression.Itisextendedtomeasureperceptualtexturedistortions(PeTD)bytakingintoaccountofthespatio-velocitycontrastsensitivityfunctionofhumanvisualsystem.PhTDillustratestowhatextenttexturestructuresarephysicallydistortedbyacamcorder,whilePeTDmeasurestheperceptualdistortionoftextures.Thedeadleaveschartinvarianttoscaling,translation,rotation,andcontrast,wasselectedasthetarget.ThePhTD/PeTDmetricsofthetargetdistortedbycamcordersweremeasuredbasedonabankofGaborfilterswitheightorientationsandthreescales.ExperimentalresultsonsixHDcamcordersfromthreevendorsshowed:1)thePhTDvaluedecreasesmonotonicallyw.r.t.themotionspeed,andincreasesmonotonicallyw.r.t.thelossycompressionbitrate;2)thePeTDvaluedecreasesmonotonicallyw.r.t.themotionspeed,butstaysalmostconstantw.r.t.thelossycompressionbitrate.Asaresult,theproposedPhTDmetriccanmeasurephysicaltexturesdistortions,andthePeTDmetriccanmeasureperceptualtexturedistortions.ThePeTDmeasurementresultsshowedthatlossycompressionoftestedHDcamerasleadstounperceivabletexturedistortionsinourexperimentalsetup.

8653-10, Session 5

Evaluation of differences in quality of experience features for test stimuli of good-only and bad-only overall audiovisual qualityDominikStrohmeier,TechnischeUniv.Berlin(Germany);KristinaKunze,KlemensGöbel,JudithLiebetrau,IlmenauUniv.ofTechnology(Germany)

AssessingaudiovisualQualityofExperience(QoE)isakeyelementtoensurequalityacceptanceoftoday’smultimediaproducts.TheuseofdescriptiveevaluationmethodsallowsevaluatingQoEpreferencesandtheunderlyingQoEfeaturesjointly.FromourpreviousevaluationsonQoEformobile3Dvideowefoundthatmainlyonedimension,videoquality,dominatesthedescriptivemodels.Largevariationsofthevisualvideoqualityinthetestsmaybethereasonforthesefindings.AnewstudywasconductedtoinvestigatewhethertestsetsoflowQoEaredescribeddifferentlythanthoseofhighaudiovisualQoE.Reanalysisofpreviousdatasetsseemstoconfirmthishypothesis.Ournewstudyconsistsofapre-testandamaintest,usingtheDescriptiveSortedNappingmethod.Datasetsofgood-onlyandbad-onlyvideoqualitywereevaluatedseparately.TheresultsshowthattheperceptionofbadQoEismainlydeterminedone-dimensionallybyvisualartifacts,whereastheperceptionofgoodqualityshowsmultipledimensions.Here,mainlysemantic-relatedfeaturesofthecontentandaffectivedescriptorsareusedbythenaïvetestparticipants.Theresultsshowthat,withincreasingQoEofaudiovisualsystems,contentsemanticsandusers’affectiveinvolvementwillbecomeimportantforassessingQoEdifferences.

8653-11, Session 5

Contrast sensitivity and discrimination of complex scenesSophieTriantaphillidou,JohnJarvis,GauravGupta,Univ.ofWestminster(UnitedKingdom)

Conference 8653


Theaimoftheresearchistospecifyexperimentallyandfurthermodelspatialfrequencyresponsefunctions,whichquantifyhumansensitivitytospatialinformationinrealcompleximages.Twovisualresponsefunctionsaremeasured:thecontextualVisualPerceptionFunction(VPF),whichdescribesvisualsensitivitytochangesinsuprathresholdcontrastinagivenimage;thecontextualContrastSensitivityFunction(CSF),whichdescribestheabilityofthevisualsystemtodetectanyspatialsignalinanimage.InthepaperwepresentourattemptstoderiveexperimentallyandfurthermodeltheVPFandCSFforarangeofscenetypesandconditions.

Progressispresentedinthefollowingareas:Apparatusandlaboratoryset-up;stimulusacquisitionandcharacterization;imagingsystemcharacterization;spatialdecompositionandcontrastmanipulation;methodologyforsubjectivetests.ContextualgroupsofVPFsandCSFsarepresentedandcomparedwith‘classical’findingsthathaveusedsimplesine-wavestimuli.AninitialsurveyofmodelsofhumancontrastsensitivitysuggeststhatthecontrastdiscriminationframeworkdevelopedbyBartenprovidesasoundstartingpositionforourownmodelingstudies.Weexplainourchoiceofmodelanditsrelevanceanduseinvariousimagequalitymetrics.

8653-12, Session 5

Grouping strategies to improve the correlation between subjective and objective image quality dataSilviaCorchs,FrancescaGasparini,RaimondoSchettini,Univ.degliStudidiMilano-Bicocca(Italy)

ImageQualityAssessment(IQA)methodscanbecategorizedassubjectiveversusobjectiveones.Theefficiencyofstudiesthatinvolvepeoplejudgmentsisverylowcomparedtoacomputerizedobjectivestudy.Howevertovalidateautomatedapproaches,subjectiveIQAisneeded.Greateffortisdevotedtopropercorrelateobjectiveandsubjectivedata.

Inthispaperweaddressthisproblemfocusingonnoreferencemetrics,andevaluatingpsycho-visualexperimentsonJPEGcorruptedimages.

IngeneralNRmetricsarenotabletomeasurewiththesameperformancethedistortionswithintheirpossiblefullrangeandwithrespecttodifferentimagecontents.

Inthispaperweemphasizethatthesubjectivedatadonotshowamonotonicbehavioreither.Infactthecrosstalkbetweencontentanddistortionsignalsinfluencesthehumanperception.

Tothisendwehereproposeastrategythatgroupstheimagesaccordingtotheircontentsothatwithineachsubgrouptheperceptualscorescanbeconsideredmonotonicwithrespecttothedistortionlevel.ThisgroupingstrategyisappliedtoimagesofLIVE,MICTandofanewdatasetproperlygeneratedbytheauthors.WewillalsoshowhowtheseresultscansignificantlyhelpinfindingbettercorrelationsamongNRmetricsandpsycho-visualdata.

8653-13, Session 5

Adapting the ISO 20462 softcopy ruler method for online image quality studiesPeterD.Burns,BurnsDigitalImaging(UnitedStates);JonathanB.Phillips,ImagingScientist(UnitedStates);DonaldR.Williams,ImagingScienceAssociates(UnitedStates)

TheISO20462methodforsubjectiveimagequalityevaluationreliesonasetofreferenceimages,whicharecalibratedintermsofknownabsolutequalitydifferences.ThismethodwasthenextendedsothatthescenecontentcouldbepresentedoncomputermonitorsasDigitalReferenceStimuli(DRS).Recently,ithasbecomemorecommontoconductimagingrelatedsurveysviaInternet-basedparticipation.Inthiscase,imagesarenotviewedundercontrolledconditions.However,itcanbedesirable,e.g.,whensimulatingconsumerdecisionsregardingimageediting.Inaddition,suchcrowdsourcinghastheadvantagethatlargenumbersofresponsescanbeacquiredefficiently.Our

objectivewastodevelopandtestamethodthatusesacommercialonlineservicethatallowspresentationofimages.Therewereseverallimitationstobeovercome,includingimagefilesize,andastaticinterfaceratherthanoneallowingdynamicupdatingofthereferenceimage.Themethodthatwetestedusedreferenceanchorimagesratherthantheslider-selectedreference.However,ouranchorsweredrawnfromtheISO20462set,andthereforeweretakenascalibratedreferenceimages,albeitviewedunderuncontrolledconditions.Wedescribetheverificationstudythatwascompleted,andcompareresultswiththecorrespondingSoftcopyRulerdata.Weconcludethatcrowdsourcingisusefulforthisapplication.Whencalibratedsubjectiveimagequalitymeasuresareneeded,ouradaptedmethodshouldbeconsideredanefficientalternativetotheISO20462standard,providedthatcommonreferenceimagesareused.


Generation of PDF with vector symbols from scanned documentIlyaV.Kurilin,IliaV.Safonov,MichaelN.Rychagov,SamsungElectronicsCo.,Ltd.(RussianFederation);HoKeunLee,SangHoKim,DonChulChoi,SamsungElectronicsCo.,Ltd.(Korea,Republicof)

ThepaperisdevotedtoalgorithmforgenerationofPDFwithvectorsymbolsfromscanneddocuments.Thecomplexmulti-stagetechniqueincludessegmentationofthedocumenttotext/drawingareasandbackground,conversionofsymbolstolinesandBeziercurves,storingcompressedbackgroundandforeground.Inthepaperweconcentrateonone-coloredregionsconversionthatcomprisessegmentationofsymbolbodieswithresolutionenhancement,contourtracingandapproximation.Ourmethodoutperformsexistingsolutionsaccordingtouseropinionsurveyandsecuresthebestcompressionrate/qualityratio.


Minimum image quality assessment based on saliency maps: a human visual approachJoãoBarreira,MaximinoBessa,LuísG.Magalhães,INESCTEC(Portugal)andUniv.deTrás-os-MonteseAltoDouro(Portugal)

magequalityassessmentasperceivedbyhumansisofcrucialimportanceinnumerousfieldsofimageprocessing.Transmissionandstorageofdigitalmediarequireefficientmethodstoreducethelargenumberofbitstostoreanimage,whilemaintainingsufficientlyhighqualitycomparedtotheoriginalimage.Sincesubjectiveevaluationscannotbeperformedinvariousscenarios,itisnecessarytohaveobjectivemetricsthatpredictimagequalityconsistentwithhumanperception.However,objectivemetricsthatconsidershighlevelsofthehumanvisualsystemarestilllimited.Inthispaper,weinvestigatethepossibilityofautomaticallypredict,basedonsaliencymaps,theminimumimagequalitythresholdfromwhichhumanscanperceivetheelementsonacompressedimage.Weconductedaseriesofexperimentalsubjectivetestswherehumanobservershavebeenexposedtocompressedimageswithdecreasingcompressionrates.Tomeasurethedifferencebetweenthesaliencymapofthecompressedandtheoriginalimageitwasusedthenormalizedabsoluteerrormetric.Ourresultsindicatethattheelementsontheimageareonlyperceivedbymostofthehumansubjectsnotataspecificcompressedimagequalitylevel,butdependingonasaliencymapdifferencethreshold.


Qualification process of CR system and quantification of digital image qualityPatriceGARNIER,LaurenceHUN,JeremieKlein,CatherineLEMERLE,CEA-VALDUC(France)

TheLCND(NonDestructiveInspectionLaboratory)usesseveral

Conference 8653


X-Raygeneratorstocarryoutmanyinspections:voidsearch,weldingexpertise,gapmeasurements,etc.Mostoftheseinspectionsarecarriedoutonsilverbasedplates.Forseveralyears,theLCNDhasdecidedtoqualifynewdevicessuchasdigitalplatesorCCD/flatpanelplates.Ontheonehand,thesetechnologicalintelligenceistoforecasttheassumedandeventualdisappearanceofsilverbasedplates;ontheotherhand,itisalsocompulsorythekeepourskillsmasteringup-to-dateandtobeadrivingforcebehindoursub-contractants.

Themainimprovementbroughtbynumericalplatesisthecontinuousprogressofthemeasurementaccuracy,especiallywithimagedataprocessing.Itisnowcommontomeasuredefectsthicknessordepthpositionwithinapart.Insuchapplications,dataimageprocessingisusedtoobtaincomplementaryinformationofscannedsilverbasedplates.Thisscanningprocedureisharmfultothemeasurementforitimpliesadatacorruptionoftheresolution,theaddingofnumericalnoiseandistimeexpensive.Digitalplatesenabletosuppressthescanningprocedureandtoincreasetheresolution.Itisnonethelessdifficulttodefine,fordigitalimages,singlecriteriafortheimagequality.Aprocedureistobedefinedtojudgethequalityofthedigitaldataitself;theimpactofthescanningdeviceandtheconfigurationparametersarealsotobetakenintoaccount.

ThispresentationdealswiththedetailedqualificationprocessdevelopedbytheLCNDfordigitalplates(DUR-NDT)basedonthestudyofquantitativecriteriachosentodefineadirectnumericalimagequalitythatcouldbecomparedwithscannedsilverbasedpicturesandtheclassicalopticaldensity.

TheversatilityoftheX-Rayparametersisalsodiscussed(X-raytension,intensity,timeexposure).TheaimistobeabletotransfertheyearlongexperienceoftheLCNDwithsilver-basedplatesinspectiontothesenewdigitalplatessupports.


Challenges of an automated spectral responsivity characterization systemMarkHelmlinger,GregA.McKee,ChristopherDurell,Labsphere,Inc.(UnitedStates)

Anessentialpartofimagingsystemperformancemeasurementandmodelingisthedeterminationofspectralresponsivity,namelythein-bandband-shapeandout-of-bandresponse.Thesecomplicatedmeasurementshaveheretoforebeendifficulttomakewithconsistency.Toaddressthisindustry-wideproblem,Labsphereisdevelopinganautomatedspectralresponsemeasurementstation,incorporatingseveraltechniquestoenhanceaccuracy.Thispresentationwillcoverthephysicsandconsiderationsbehindthescalingofthishighlyautomatedsystemandtheexperimentalmethodologyitincorporatestoassureabsolutetraceability,aswellasdetailsomeofthelessonslearnedalongtheway.

8653-14, Session 6

A line-based HDR sensor simulator for motion artifact predictionDonaldJ.Baxter,STMicroelectronicsLtd.(UnitedKingdom)

Modelingonlyacamera’slensblur,noiseandsensitivityisnotsufficienttopredictimagequality.Forafullerprediction,motionblur/artifactsmustbeincluded.AutomotiveapplicationsareparticularlychallengingforHDRmotionartifacts.Thispaperextendsaclassiccameranoisemodeltosimulatemotionartifacts.Themotivationistopredict,visualizeandevaluatethemotion/lightingflickerartifactsfordifferentimagesensorreadoutarchitectures.TheproposedmotionartifactHDRsimulatorhas3maincomponents;adynamicimagesource,asimplelensmodelandalinebasedimagesensormodel.Thelinebasednatureofimagesensorprovidesanaccuratesimulationofhowdifferentreadoutstrategiessamplemovementorflickeringlightsinagivenscene.Twosimulationstudiesillustratingthemodel’sperformancearepresented.ThefirstsimulationcomparesthemotionartifactsofframesequentialandlineinterleavedHDRreadoutwhilethesecondstudycomparesthemotionblurofan8MP1.4µm,5MP

1.75µmand3MP2.2µmimagesensorsunderthesameilluminationlevel.Goodalignmentisobtainedbetweentheexpectedandsimulatedresults.

8653-15, Session 6

On the spectral quality of scanner illumination with LEDsLukeC.Cui,LexmarkInternational,Inc.(UnitedStates)

Documentscannerilluminationhasevolvedalongwithgeneralilluminationtechnologies.LEDshavebecomemoreandmorepopularastheilluminationsourcesfordocumentscanning.LEDtechnologiesprovideawiderangeofchoicesbothintermsofstructuraldesignandspectralcomposition.Inthisreport,weexaminesomepopularLEDtechnologiesusedfordocumentscannersandtheirinherentspectralqualitywithcolorrenderingsimulationandsomeactualexamples.ItishopedthatthereportwillhelptoclarifytheimpactofdifferentLEDsonthecolorqualityofdocumentscanners.

8653-16, Session 6

Refined measurement of digital image texture lossPeterD.Burns,BurnsDigitalImaging(UnitedStates)

Imagetextureisthetermgiventotheinformation-bearingfluctuationssuchasthoseforskin,grassandfabrics.Sinceimageprocessingaimedatreducingunwantedfluctuations(noiseareotherartifacts)canalsoremoveimportanttexture,goodproductdesignrequiresabalancebetweenthetwo.Thetexture-lossMTFmethod,currentlyunderinternationalstandardsdevelopment,isaimedattheevaluationofdigitalandmobile-telephoneforthecaptureandretentionofimagetexture.Theseimagefieldsincludepseudo-randomobjects,suchasoverlappingdisks,oftenreferredtoas‘deadleaves’targets.WeinvestigatedseveralaspectsofthisPseudo-randomObjectTarget(PROT)methodthatareaimedatimprovingitspracticalapplicationandcomparisonwithotherimagingperformancemeasures.WeindicatehowthetextureMTFmethodcanbebasedonnoise-powerspectracomparablewithothernoisesourcesintheimagingsystemsuchassignalquantizationandprintmaterials.Inaddition,thecurrentmethodreliesonmodeling(knowledgeof)thespatialnoisecharacteristicsthetesttarget.Forevaluationofimageprocessingfunctionsthemethodcanbeimprovedbycomputingtheinputsignalspectrumdirectlyfromtheimagedata.Theadaptedanalysismethodcanresultinimprovedandlessrelianceondatasmoothingandfitting.Thishasaparticularlyadvantageatlowspatialfrequencies,wheretexture-MTFscalingisperformedwhencomparingvariousmethods.

8653-17, Session 6

F-MAD: a feature-based extension of the most apparent distortion algorithm for image quality assessmentPunitSinghBanga,DamonM.Chandler,OklahomaStateUniv.(UnitedStates)

Inthispaper,wepresentanimprovedversionoftheMostApparentDistortion(MAD)qualityassessmentalgorithm[LarsonandChandler,SPIEImageQualityandSys.Perf.,2009]whichimprovesuponitspredecessorbyusinglocalfeaturemeasurements.MAD’sappearance-basedstageisquitelow-levelrelyingonlyonlog-Gaborstatistics.Hereweshowthathigher-levelfactorssuchassharpness,clarityofedges,andcolorfidelityaregoodpredictorsofimagequality.Weusesimpleimagefeatures(sharpness,contrast,edgestrength,luminancedistance,andcolordistance)asameasurementofimagequalityandincorporatethemintoMAD’sappearance-basedmodel.WehavetitledtheresultingalgorithmF-MAD(Feature-BasedMAD).Testingonseveralimage-qualitydatabasesdemonstratesthatF-MADyieldsimprovedpredictiveperformancecomparedtoMADandothermodernquality-assessmenttechniques.

Conference 8653


8653-18, Session 6

An algorithm for no-reference image quality assessment based on log-derivative statistics of natural scenesYiZhang,DamonM.Chandler,OklahomaStateUniv.(UnitedStates)

Inthispaper,wepresentanalgorithmforno-referenceimagequalityassessment,calledDESIQUE(DErivativeStatistics-BasedImageQUalityEvaluator),whichoperatesbyusinglog-derivativestatisticsofnaturalscenes.Log-derivativestatisticsarepreservedacrossnaturalscenes,butarechangedinthepresenceofdistortions.Tocapturethesechanges,DESIQUEextractslog-derivativestatisticalfeaturesattwoimagescalesinboththespatialandfrequencydomains,fromwhichthesametwo-stageframework(distortionidentificationanddistortion-specificqualityassessment)usedintherecentDIIVINEandBRISQUEalgorithmsareemployedtoestimatequality.Testingonseveralimage-qualitydatabasesshowsthatDESIQUEisgenerallybetteratqualityassessmentthanDIIVINE,BRISQUE,andotherno-referencemethods.DESIQUEyieldsSROCCvaluesof0.918,0.913,and0.870,ontheCSIQ,TID,andToyamadatabases,respectively.Asacomparison,thenextbestcompetitoryieldsSROCCvaluesof0.900,0.898,and0.848onthesedatabases.Furthermore,intermsofcomputationalefficiency,DESIQUEiscompetitivewiththecurrentlyfastestperformancealgorithms.Additionalresultsandanalysescanbeviewedonlineathttp://vision.okstate.edu/yi/DESIQUE/.

8653-19, Session 7

On the analysis of wavelet-based approaches for print grain artifactsAhmedH.Eid,BrianE.Cooper,EdwardE.Rippetoe,LexmarkInternational,Inc.(UnitedStates)

GrainisoneofseveralattributesdescribedinISO/IECDTS24790,adrafttechnicalspecificationforthemeasurementofimagequalityformonochromeprintedoutput.Itdefinesgrainasaperiodicfluctuationsoflightnessgreaterthanabout0.4cyclespermillimeter,adefinitioninheritedfromthelatestofficialstandardonprintedimagequality,ISO/IEC13660.

Inapreviouspublication,weintroducedamodificationtotheISO/IEC13660grainmeasurementalgorithmthatincludesaband-pass,wavelet-based,filteringsteptolimitthecontributionofhigh-frequencyfluctuations.Thismodificationimprovesthealgorithm’scorrelationwiththesubjectiveevaluationofexpertswhoratedtheseverityofprintedgrainartifacts.

SeekingtoimproveuponthegrainalgorithminISO/IEC13660,theISO24790committeeevaluatedseveralgraininessmetrics.Thisledtotheselectionoftheabovewavelet-basedapproachasthetopcandidatealgorithmforinclusioninafutureISO/IECstandard.ComparedagainstsubjectiveevaluationsconductedondatafromtheISOcommittee,theproposedalgorithmoffersmuchbettercorrelationthantheISO/IEC13660approach.

Inaddition,weintroduceanalternativeapproachformeasuringgraindefectsbasedonspatialfrequencyanalysisofwavelet-filteredimages.Weestablishalinkbetweenthespatial-basedgrain(ISO/IECDTS24790)approachanditsequivalentfrequency-basedoneinlightofParseval’stheorem.

8653-20, Session 7

A general approach for assessment of print qualityXiaochenJing,PurdueUniv.(UnitedStates);StephenAstling,ReneeJessome,EricMaggard,TerryNelson,MarkQ.Shaw,Hewlett-PackardCo.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates)

Laserelectrophotographicprintersarecomplexsystemsthatcangenerateprintswithanumberofpossibleartifactsthatareverydifferentinnature.Itisachallengingtasktodetermineasingleprocessingalgorithmthatcaneffectivelyidentifysuchawiderangeofprintqualitydefects.

Inthispaper,wedescribeanimageprocessingandanalysispipelinethatcaneffectivelyassessthepresenceofawiderangeofartifacts,asageneralapproach.Inourpaper,wewilldiscussindetailthealgorithmsthatcomprisetheimageprocessingandanalysispipeline,andwillillustratetheefficacyofthepipelinewithanumberofexamples.

8653-21, Session 7

Verification of proposed ISO methods to measure resolution capabilities of printing systemsMilenaCisarova,GjøvikUniv.College(Norway)andUniv.ofPardubice(CzechRepublic);MariusPedersen,PeterNussbaum,GjøvikUniv.College(Norway);FransGaykema,OcéPLT(Netherlands)

Printerresolutionisanimportantattribute,andISOhasproposedastandard“ISO/IECDTS29112:2011(E):InformationTechnology–Officeequipment–TestchartsandMethodsforMeasuringMonochromePrinterResolution”includingtestchartsandmethodstomeasureprinterresolution.Fromacustomer’spointofviewqualityiswhatthecustomersperceive,andthereforeitisoftenagoalforobjectivemeasurestoreflectandcorrelatewithperceivedquality.WeinvestigateiftheobjectivemethodsproposedbyISOreflecthumanjudgement.Experimentshavebeenconductedinwhichweevaluatedtheproposedobjectivemethodsagainstperceptualdata.

8653-22, Session 7

Assessment of presence of isolated periodic and aperiodic bands in laser electrophotographic printer outputJiaZhang,PurdueUniv.(UnitedStates);StephenAstling,ReneeJessome,EricMaggard,TerryNelson,MarkQ.Shaw,Hewlett-PackardCo.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates)

NoAbstractAvailable

8653-23, Session 8

Printing artificial sweat using ink jet printers for the test set generation in forensics: an image quality assessment of the reproducibility of the printing resultsMarioHildebrandt,Otto-von-Guericke-Univ.Magdeburg(Germany);JenniferSturm,FachhochschuleBrandenburg(Germany);JanaDittmann,Otto-von-Guericke-Univ.Magdeburg(Germany)

Inordertousescientificexpertevidenceincourthearing,severalcriteriamustbemet.IntheUSjurisdictiontheDaubertdecisionhasdefinedseveralcriteriathatmightbeassessedifatestimonyischallenged.Inparticularthepotentialfortestingoractualtesting,aswellasknownorpotentialerrorratearetwoveryimportantcriteria.

Inordertobeabletocomparetheresultswitheachother,thereproduciblecreationofevaluationsamplesisnecessary.However,eachlatentfingerprintisuniqueduetoexternalinfluencefactorssuchassweatcompositionorpressureduringtheapplicationofatrace.Hence,Schwarzintroducedamethodtoprintlatentfingerprintsusingink-jetprintersequippedwithartificialsweat.

Inthispaperweassesstheimagequalityintermsofreproducibilityand

Conference 8653


clarityoftheprintedartificialsweatpatterns.Forthat,wedeterminetheintra-classvariancefromoneprinteronthesameandondifferentsubstratesbasedonasubjectiveassessment,aswellastheinter-classvariancebetweendifferentprintersofthesamemodelusingpatternrecognitiontechniques.Ourresultsindicatethattheintra-classvarianceisprimarilyinfluencedbythedryingbehavioroftheaminoacid.Theinter-classissurprisinglylargebetweenidenticalmodelsofoneprinter.Ourevaluationisperformedusing100samplesonanoverheadfoiland50samplesof5differentpatterns(alinestructure,afingerprintimage,twodifferentarrowsandalargerareawithaminoacid)onacompactdisksurfaceacquiredwithaKeyenceVK-X110laserscanningconfocalmicroscope.Theresultsshowasignificantdifferencebetweenthetwoidenticalprintersallowingfordifferentiatingbetweenthemwithanaccuracyofupto99%.

8653-25, Session 8

Figure of merit for macrouniformity based on image quality ruler evaluation and machine learning frameworkWeibaoWang,PurdueUniv.(UnitedStates);GaryOverall,TravisRiggs,RebeccaSilveston-Keith,JulieWhitney,LexmarkInternational,Inc.(UnitedStates);GeorgeT.C.Chiu,JanP.Allebach,PurdueUniv.(UnitedStates)

NoAbstractAvailable

8653-26, Session 8

Wavelet-based figure of merit for macro-uniformityXingLiu,PurdueUniv.(UnitedStates);GaryOverall,TravisRiggs,RebeccaSilveston-Keith,JulieWhitney,LexmarkInternational,Inc.(UnitedStates);GeorgeT.C.Chiu,JanP.Allebach,PurdueUniv.(UnitedStates)

Waveletsareapowerfultoolthatcanbeappliedtoproblemsinimageprocessingandanalysis.Theyprovideamultiscaledecompositionofanoriginalimageintoaveragetermsanddetailtermsthatcapturethecharacteristicsoftheimageatdifferentscales.Inthisproject,wedevelopafigureofmeritformacro-uniformitythatisbasedonwavelets.WeusetheHaarbasistodecomposetheimageofthescannedpageintoelevenlevels,whichwethengroupintofourseparatefrequencybands,eachofwhichoverlapsbytwolevelswithitsadjacentfrequencybands.Eachfrequencybandimageconsistsofthesuperpositionofthedetailimageswithinthatband.Wenextcompute1-Dhorizontalandverticalprojectionsforeachfrequencybandimage.Foreachfrequencybandimageprojection,wedevelopastructuralapproximationthatsummarizestheessentialvisualcharacteristicsofthatprojection.Forthecoarsestbandcomprisinglevels8,9,10,11,weuseageneralizedsquare-waveapproximation.Forthenextcoarsestbandcomprisinglevels6,7,8,9,weuseapiecewiselinearsplineapproximation.Forthetwofinestbandscomprisinglevels4,5,6,7,and2,3,4,5,respectively,weuseaspectraldecomposition.Foreach1-Dapproximationsignal,wedefineanappropriatesetofscalar-valuedfeatures.Thesefeaturesareusedtodesigntwopredictors–onebasedonlinearregressionandtheotherbasedonthesupportvectormachine,whicharetrainedwithdatafromourimagequalityrulerexperimentswithhumansubjects.

Conference 8653


Conference 8654: Visualization and Data Analysis 2013Monday-Wednesday4–6February2013

Part of Proceedings of SPIE Vol. 8654 Visualization and Data Analysis 2013

8654-1, Session 1

An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional dataJaegulChoo,HanseungLee,ZhichengLiu,JohnStasko,HaesunPark,GeorgiaInstituteofTechnology(UnitedStates)

Manyofthemoderndatasetssuchastextandimagedatacanberepresentedinhigh-dimensionalvectorspacesandhavebenefitedfromcomputationalmethodsthatutilizeadvancedtechniquesfromnumericallinearalgebra.Visualanalyticsapproacheshavecontributedgreatlytodataunderstandingandanalysisduetotheircapabilityofleveraginghumans’abilityforquickvisualperception.However,visualanalyticstargetinglarge-scaledatasuchastextandimagedatahasbeenchallengingduetolimitedscreenspaceintermsofboththenumbersofdatapointsandfeaturestorepresent.Amongvariouscomputationaltechniquesupportingvisualanalytics,dimensionreductionandclusteringhaveplayedessentialrolesbyreducingthesenumbersinanintelligentwaytovisuallymanageablesizes.Givennumerousdimensionreductionandclusteringtechniquesavailable,however,decisiononchoiceofalgorithmsandtheirparametersbecomesdifficult.Inthispaper,wepresentaninteractivevisualtestbedsystemfordimensionreductionandclusteringinalarge-scalehigh-dimensionaldataanalysis.Thetestbedsystemenablesuserstoapplyvariousdimensionreductionandclusteringmethodswithdifferentsettings,visuallycomparetheresultsfromdifferentalgorithmicmethodstoobtainrichknowledgeforthedataandtasksathand,andeventuallychoosethemostappropriatepathforacollectionofalgorithmsandparameters.Usingvariousdatasetssuchasdocuments,images,andothersthatarealreadyencodedinvectors,wedemonstratehowthetestbedsystemcansupportthesetasks.

8654-2, Session 1

Multi-focus and multi-level techniques for visualization and analysis of networks with thematic dataMicheleCossalter,OleJ.Mengshoel,TedSelker,CarnegieMellonUniv.(UnitedStates)

Information-richdatasetsbringseveralchallengesintheareasofvisualizationandanalysis,evenwhenassociatedwithnode-linknetworkvisualizations.Thispaperpresentsandvalidatesinauserstudyaninteractiveapproachtomakingmultipleandmulti-stepcomparisonsacrossdifferentpartsofsuchdatasets.WedescribeNetEx,avisualizationtoolprovidingmulti-focusandmulti-leveltechniquesthatenableuserstosimultaneouslyexploredifferentpartsofanetworkanditsunderlyingdataset.NetEx,implementedasaCytoscapeplug-in,hasbeenappliedtotheanalysisofelectricalpowernetworks,Bayesiannetworks,andtheEnrone-mailrepository.

InthispaperwefocusondatafromaNASAelectricalpowernetwork,representativeofthosefoundinaerospacevehicles,todemonstratehowNetExsupportstheanalyticaltaskofdiagnosis.Specifically,resultsfromauserstudywith25subjectssuggestNetExenablesmoreaccurateisolationofcomplexfaultscomparedtoanespeciallydesignedelectricalpoweranalysissoftwaretool.

8654-3, Session 2

Visual analytics of cyber physical data streams using spatio-temporal radial pixel visualizationMingC.Hao,M.Marwah,Hewlett-PackardLabs.(UnitedStates);SebastianMittelstaedt,HalldorJanetzko,DanielA.Keim,Univ.Konstanz(Germany);UmeshwarDayal,CullenBash,CarlosFelix,ChandrakantPatel,MeichunHsu,YuenChen,Hewlett-PackardLabs.(UnitedStates);MichaelHund,Univ.Konstanz(Germany)

Cyberphysicalsystems(CPS),suchassmartbuildingsanddatacenters,arerichlyinstrumentedsystemscomposedoftightlycoupledcomputationalandphysicalelementsthatgeneratelargeamountsofdata.ToexploreCPSdataandobtainactionableinsights,wepresentanewapproachcalledRadialPixelVisualization(RPV);whichusesmultipleconcentricringstoshowthedatainacompactcircularlayoutofpixelcells,eachringcontainingthevaluesforaspecificvariableovertimeandeachpixelcellrepresentinganindividualdatavalueataspecifictime.RPVprovidesaneffectivevisualrepresentationoflocalityandperiodicityofthehighvolume,multivariatedatastreams.RPVsmayhaveanadditionalanalysisringforhighlightingtheresultsofcorrelationanalysisorpeakpointdetection.Ourreal-worldapplicationsdemonstratetheeffectivenessofthisapproach.TheapplicationexamplesshowhowRPVcanhelpCPSadministratorstoidentifyperiodicthermalhotspots,findroot-causesofthecoolingproblems,understandbuildingenergyconsumption,andoptimizeIT-servicesworkloads.

8654-4, Session 2

Exploring large scale time-series data using nested timelinesZaixianXie,OracleAmericaInc.(UnitedStates);MatthewWard,ElkeRundensteiner,WorcesterPolytechnicInstitute(UnitedStates)

Whendataanalystsstudytime-seriesdata,animportanttaskistodiscoverhowdatapatternschangeovertime.

Ifthedatasetisverylarge,thistaskbecomeschallenging.Researchershavedevelopedmanyvisualizationtechniquestohelpaddressthisproblem.However,littleworkhasbeendoneregardingthechangesofmultivariatepatterns,suchaslineartrendsandclusters,ontime-seriesdata.Inthispaper,wedescribeasetofhistoryviewstofillthisgap.Thistechniqueworksundertwomodes:mergeandnon-merge.Forthemergemode,mergealgorithmswereappliedtoselectedtimewindowstogenerateachange-basedhierarchy.Contiguoustimewindowshavingsimilarpatternsaremergedfirst.Userscanchoosedifferentlevelsofmergingwiththetradeoffbetweenmoredetailsinthedataandlessvisualclutterinthevisualizations.Inthenon-mergemode,theframeworkcanusenaturalhierarchicaltimeunitsoronedefinedbydomainexpertstorepresenttimelines.Thiscanhelpusersnavigateacrosslongtimeperiods.Grid-basedviewsweredesignedtoprovideacompactoverviewforthehistorydata.

Inaddition,MDSpatternstarfieldsanddistancemapsweredevelopedtoenableuserstoquicklyinvestigatethedegreeofpatternsimilarityamongdifferenttimeperiods.Theusabilityevaluationdemonstratedthatmostparticipantscouldunderstandtheconceptsofthehistoryviewscorrectlyandfinishedassignedtaskswithahighaccuracyandrelativelyfastresponsetime.


8654-5, Session 3

Visibility-difference entropy for automatic transfer function generationPhilippC.Schlegel,RenatoB.Pajarola,Univ.ofZürich(Switzerland)

Directvolumerenderingallowsforinteractiveexplorationofvolumetricdataandhasbecomeanimportanttoolinmanyvisualizationdomains.Buttheinsightandinformationthatcanbeobtainedaredependentonthetransferfunctiondefiningthetransparencyofvoxels.Constructinggoodtransferfunctionsisoneofthemosttimeconsumingandcumbersometasksinvolumevisualization.Wepresentanovelgeneralpurposemethodforautomaticallygeneratinganinitialsetofbesttransferfunctioncandidates.Thegeneratedtransferfunctionsrevealthemajorstructuralfeatureswithinthevolumeandallowforanefficientinitialvisualanalysis,servingasabasisforfurtherinteractiveexplorationinparticularoforiginallyunknowndata.Thebasicideaistointroduceametricasameasureofthegoodnessofatransferfunctionwhichindicatestheinformationthatcanbegainedfromrenderedimagesbyinteractivevisualization.Incontrasttopriormethods,ourapproachdoesnotrequireauserfeedback-loop,operatesexclusivelyinimagespaceandtakesthecharacteristicsofinteractivedataexplorationintoaccount.Weshowhowournewtransferfunctiongenerationmethodcanuncoverthemajorstructuresofanunknowndatasetwithinonlyafewminutes.

8654-6, Session 3

Coherent view-dependent streamline selection for importance-driven flow visualizationJunMa,ChaoliWang,Ching-KuangShene,MichiganTechnologicalUniv.(UnitedStates)

Streamlinevisualizationcanbeformulatedastheproblemofstreamlineplacementorstreamlineselection.Inthispaper,wepresentanimportance-drivenapproachtoview-dependentstreamlineselectionthatguaranteescoher-entstreamlineupdatewhentheviewchangesgradually.Givenalargenumberofrandomlyoruniformlyseededandtracedstreamlinesandsampleviewpoints,ourapproachevaluates,foreachstreamline,theview-dependentimportancebyconsideringtheamountofinformationsharedbythe3Dstreamlineandits2Dprojectionaswellashowstereoscopicthestreamline’sshapeisreflectedundereachviewpoint.Weachievecoherentview-dependentstreamlineselectionfollowingatwo-passsolutionthatconsidersi)therelationshipsbetweenlocalviewpointsandtheglobalstreamlinesetselectedinaview-independentmannerandii)thecontinuitybetweenadjacentviewpoints.Wedemonstratetheeffectivenessofourapproachwithseveralsynthesizedandsimulatedflowfieldsandcompareourview-dependentstreamlineselectionalgorithmwithana?ıvealgorithmthatselectsstreamlinessolelybasedontheinformationatthecurrentviewpoint.

8654-7, Session 3

Single-pass GPU-raycasting for structured adaptive mesh refinement dataRalfKaehler,TomAbel,SLACNationalAcceleratorLab.(UnitedStates)

StructuredAdaptiveMeshRefinement(SAMR)isapopularnumericaltechniquetostudyprocesseswithhighspatialandtemporaldynamicrange.Itminimizesmemoryrequirementsbyadaptingthelatticeonwhichtheunderlyingdifferentialequationsaresolvedtomostefficientlyrepresentthesolution.Particularlyinastrophysicsandcosmologysuchsimulationsnowcancapturespatialscalestenordersofmagnitudeapartandmore.

TheirregularlocationsandextensionsoftherefinedregionsintheSAMRschemeandthefactthatdifferentresolutionlevels(partially)

overlap,posesachallengeforgraphicshardware-accelerateddirectvolumerenderingmethods.kD-treeshaveproventobewell-suitedto(re-)organizeSAMRdataintoblocksofequallysizedcells,butpreviousGPU-supportedapproachesforSAMRdatausingthisstructurerequiredaseparaterenderingpassforeachofitsnodes,limitingtheapplicabilityofadvancedlightingschemesthataccessmorethanoneblockofcellstooperateon.Inthispaperwepresentanefficientsingle-passGPU-raycastingalgorithmforSAMRdatathatistailoredtocurrentgraphicshardwarearchitectures.ItemploysankD-tree,efficientlyencodedbyasetof3D-textures,toadaptivelysamplecompleteraysentirelyontheGPUwithoutanyCPUinteraction.WefurtherdiscusstwodifferentdatastoragestrategiestoaccessthegriddataontheGPUandprovetheusefulnessofthepresentedapproachesbyapplyingthemtoseveralreal-worldSAMRdatasets.

8654-8, Session 4

Multi-user smartphone-based interaction with large high-resolution displaysLynnNguyen,JurgenSchulze,Univ.ofCalifornia,SanDiego(UnitedStates)

ThisabstractisfortheVisualizationandDataAnalysisconference,NumberEI107.

Thisconferenceexpectsfullpapersubmissions,whichwon’tfitintothisbox.ThePDFwiththefullpaperhasbeensubmitted.

8654-9, Session 4

Stereo frame decomposition for error-constrained remote visualizationStevenL.Martin,Han-WeiShen,TheOhioStateUniv.(UnitedStates)

Asgrowthindatasetsizescontinuestoexceedgrowthinavailablebandwidth,newsolutionsareneededtofacilitateefficientvisualanalysisworkflows.Remotevisualizationcanenablethecolocationofvisualanalysiscomputeresourceswithsimulationcomputeresources,reducingtheimpactofbandwidthconstraints.Whiletherearemanyoff-the-shelfsolutionsavailableforgeneralremotingneeds,thereissubstantialroomforimprovementintheinteractivitytheyoffer,andnonefocusonsupportingstereoremotevisualizationwithprogrammableerrorbounds.

Weproposeanovelsystemenablingefficientcompressionofstereovideostreamsusingstandardcodecsthatcanbeintegratedwithexistingremotingsolutions,whileatthesametimeofferingerrorconstraintsthatprovideuserswithfidelityguarantees.Bytakingadvantageofinterocularcoherence,theflexibilitypermittedbyerrorconstraints,andknowledgeofscenedepthandcamerainformation,oursystemoffersimprovedremotevisualizationframerates.

8654-10, Session K1

Why high performance visual data analytics is both relevant and difficult (Keynote Presentation)EdwardW.Bethel,PrabhatPrabhat,SurendraByna,OliverRuebel,KeshengWu,MichaelFWehner,LawrenceBerkeleyNationalLab.(UnitedStates)

NoAbstractAvailable

8654-11, Session 5

Three-dimensional volume analysis of vasculature in engineered tissues

Conference 8654


MohammedYousefhussien,RochesterInstituteofTechnology(UnitedStates);KelleyGarvin,DianeDalecki,Univ.ofRochester(UnitedStates);EliSaber,MaríaHelguera,RochesterInstituteofTechnology(UnitedStates)

Three-dimensionaltexturalandvolumetricimageanalysisholdsgreatpotentialinunderstandingtheimagedataproducedbymulti-photonmicroscopy.Inthispaper,analgorithmthatquantitativelyanalyzesthetextureandthemorphologyofvasculatureinengineeredtissuesisproposed.Theinvestigated3DartificialtissuesconsistofHumanUmbilicalVeinEndothelialCells(HUVEC)embeddedincollagenexposedtotworegimesofultrasoundstandingwavefieldsunderdifferentpressureconditions.TexturalfeatureswereevaluatedusingthenormalizedGray-ScaleCo-occurrenceMatrix(GLCM)combinedwithGray-LevelRunLengthMatrix(GLRLM)analysis.Tominimizeerrorresultingfromanypossiblevolumerotationandtoprovideacomprehensivetexturalanalysis,anaveragedversionofnineGLCMandGLRLMorientationsisused.Toevaluatevolumetricfeatures,anautomaticthresholdusingthegraylevelmeanvalueisutilized.Resultsshowthatouranalysisisabletodifferentiateamongtheexposedsamples,duetomorphologicalchangesinducedbythestandingwavefields.Furthermore,wedemonstratethatprovidingmoretexturalparametersthanwhatiscurrentlybeingreportedintheliterature,enhancesthequantitativeunderstandingoftheheterogeneityofartificialtissues.

8654-12, Session 5

3D surface reconstruction and visualization of the Drosophila wing imaginal disc at cellular resolutionLingeBai,DrexelUniv.(UnitedStates);ThomasWidmann,CentrodeGenómicaeInvestigaciónOncológica(Spain);FrankJülicher,Max-Planck-InstitutfürPhysikkomplexerSysteme(Germany);ChristianDahmann,DresdenUniv.ofTechnology(Germany);DavidE.Breen,DrexelUniv.(UnitedStates)

Quantifyingandvisualizingtheshapeofdevelopingbiologicaltissuesprovideinformationaboutthemorphogeneticprocessesinmulticellularorganisms.Thesizeandshapeofbiologicaltissuesdependonthenumber,size,shape,andarrangementoftheconstitutingcells.Tobetterunderstandthemechanismsthatguidetissuesintotheirfinalshape,itisimportanttoinvestigatethecellulararrangementwithintissues.Herewepresentadataprocessingpipelinetogenerate3Dvolumetricsurfacemodelsofepithelialtissues,aswellasgeometricdescriptionsofthetissues’apicalcellcross-sections.Thedataprocessingpipelineincludesimageacquisition,editing,processingandanalysis,2Dcellmeshgeneration,3Dcontour-basedsurfacereconstruction,cellmeshprojection,followedbygeometriccalculationsandcolor-basedvisualizationofmorphologicalparameters.Intheirfirstutilizationwehaveappliedtheseprocedurestoconstructa3DvolumetricsurfacemodelatcellularresolutionofthewingimaginaldiscofDrosophilamelanogaster.Theultimategoalofthereportedeffortistoproducetoolsforthecreationofdetailed3Dgeometricmodelsoftheindividualcellsinepithelialtissues.Todate,3Dvolumetricsurfacemodelsofthewholewingimaginaldischavebeencreated,andtheapicolateralcellboundarieshavebeenidentified,allowingforthecalculationandvisualizationofcellparameters,e.g.apicalcross-sectionalareaofcells.Thecalculationandvisualizationofmorphologicalparametersshowposition-dependentpatternsofcellshapeinthewingimaginaldisc.Ourproceduresshouldofferageneraldataprocessingpipelinefortheconstructionof3Dvolumetricsurfacemodelsofawidevarietyofepithelialtissues.

8654-13, Session 6

Visual exploration and analysis of human-robot interaction rulesHuiZhang,IndianaUniv.-PurdueUniv.Indianapolis(UnitedStates);MichaelJ.Boyles,IndianaUniv.(UnitedStates)

Wepresentanovelinteractionparadigmforthevisualexploration,manipulationandanalysisofhuman-robotinteraction(HRI)rules;ourdevelopmentisimplementedusingavisualprogramminginterfaceandexploitskeytechniquesdrawnfrombothinformationvisualizationandvisualdataminingtofacilitatetheinteractiondesignandknowledgediscoveryprocess.HRIisoftenconcernedwithmanipulationsofmulti-modalsignals,events,andcommandsthatformvariouskindsofinteractionrules.Depicting,manipulatingandsharingsuchdesign-levelinformationisacompellingchallenge.Furthermore,theclosedloopbetweenHRIprogrammingandknowledgediscoveryfromempiricaldataisarelativelylongcycle.This,inturn,makesdesign-levelverificationnearlyimpossibletoperforminanearlierphase.Inourwork,weexploitadrag-and-dropuserinterfaceandvisuallanguagestosupportdepictingresponsivebehaviorsfromsocialparticipantswhentheyinteractwiththeirpartners.Forourprincipaltestcaseofgaze-contingentHRIinterfaces,thispermitsustoprogramanddebugtherobots’responsivebehaviorsthroughagraphicaldata-flowcharteditor.Weexploitadditionalprogrammanipulationinterfacestoprovidestillfurtherimprovementtoourprogrammingexperience:bysimulatingtheinteractiondynamicsbetweenahumanandarobotbehaviormodel,weallowtheresearcherstogenerate,traceandstudytheperception-actiondynamicswithasocialinteractionsimulationtoverifyandrefinetheirdesigns.Finally,weextendourvisualmanipulationenvironmentwithavisualdata-miningtoolthatallowstheusertoinvestigateinterestingphenomenasuchasjointattentionandsequentialbehavioralpatternsfrommultiplemulti-modaldatastreams.WehavecreatedinstancesofHRIinterfacestoevaluateandrefineourdevelopmentparadigm.Asfarasweareaware,thispaperreportsthefirstprogrammanipulationparadigmthatintegratesvisualprogramminginterfaces,informationvisualization,andvisualdataminingmethodstofacilitatedesigning,comprehending,andevaluatingHRIinterfaces.

8654-14, Session 6

Emotion scents: a method of representing user emotions on GUI widgetsDanielCernea,TechnischeUniv.Kaiserslautern(Germany)andLinnaeusUniv.(Sweden);ChristopherWeber,AchimEbert,TechnischeUniv.Kaiserslautern(Germany);AndreasKerren,LinnaeusUniv.(Sweden)

Theworldofdesktopinterfaceshasbeendominatedforyearsbytheconceptofwindowsandstandardizeduserinterface(UI)components.Still,whilesupportingtheinteractionandinformationexchangebetweentheusersandthecomputersystem,graphicaluserinterface(GUI)widgetsareratherone-sided,neglectingtocapturethesubjectivefacetsoftheuserexperience.InthispaperweproposeasetofdesignguidelinesforvisualizinguseremotionsonstandardGUIwidgets(e.g.buttons,checkboxes,etc.)inordertoenrichtheinterfacewithanewdimensionofsubjectiveinformationbyaddingsupportforemotionawarenessaswellaspost-taskanalysisanddecisionmaking.WehighlighttheuseofanEEGheadsetforrecordingthevariousemotionalstatesoftheuserwhileheisinteractingwiththewidgetsoftheinterface.Weproposeavisualizationapproach,calledemotionscents,thatallowsuserstoviewemotionalreactionscorrespondingtodifferentGUIwidgetswithoutinfluencingthelayoutorchangingthepositioningofthesewidgets.Ausecaseandanevaluationenforcesourfindings,suggestingthattheseemotion-enhancedUIcomponentscanenrichdesktopinterfacesbyenablingemotionawarenessandsubjectivityanalysis.

8654-15, Session 7

Visual analysis of situationally aware building evacuationsJackGuest,ToddEaglin,KalpathiR.Subramanian,WilliamRibarsky,Univ.ofNorthCarolinaatCharlotte(UnitedStates)

NoAbstractAvailable

Conference 8654


8654-16, Session 7

Improving projection-based data analysis by feature space transformationsMatthiasSchaefer,LeishiZhang,TobiasSchreck,AndradaTatu,Univ.Konstanz(Germany);JohnA.Lee,MichelVerleysen,Univ.CatholiquedeLouvain(Belgium);DanielA.Keim,Univ.Konstanz(Germany)

Generatingeffectivevisualembeddingofhigh-dimensionaldataisdifficult-theanalystexpectstoseethestructureofthedatainthevisualization,aswellaspatternsandrelations.Giventhehighdimensionality,noiseandimperfectembeddingtechniques,itishardtocomeupwithasatisfactoryembeddingthatpreservesthedatastructurewell,whilsthighlightingpatternsandavoidingvisualcluttersatthesametime.Inthispaper,weintroduceagenericframeworkforimprovingthequalityofanexistingembeddingintermsofbothstructuralpreservationandclassseparationbyfeaturespacetransformations.Acompoundqualitymeasurebasedonstructuralpreservationandvisualclutteravoidanceisproposedtoaccessthequalityofembeddings.Weevaluatetheeffectivenessofourapproachbyapplyingittoseveralwidelyusedembeddingtechniquesusingasetofbenchmarkdatasetsandtheresultlookspromising.

8654-17, Session 7

Does interactive animation control improve exploratory data analysis of animated trend visualization?FelwaA.Abukhodair,SimonFraserUniv.(Canada)andKingAbdullahUniv.ofScienceandTechnology(SaudiArabia);BernhardE.Riecke,HalilI.Erhan,ChrisD.Shaw,SimonFraserUniv.(Canada)

Effectivelyanalyzingtrendsoftemporaldatabecomesacriticaltaskespeciallywhentheamountofdataislarge.Motiontechniques(animation)forscatterplotsmakeitpossibletorepresentlotsofdatainasingleviewandmakeiteasytoidentifytrendsandhighlightchanges.Thesetechniqueshaverecentlybecomeverypopularandtoanextentsuccessfulindescribingdatainpresentations.However,comparedtostaticmethodsofvisualization,scatterplotanimationsmaybehardtoperceivewhenthemotionsarecomplex.Thispaperstudiestheeffectivenessofinteractivescatterplotanimationasavisualizationtechniquefordataanalysisoflargedata.Wecomparedinteractiveanimationswithnon-interactive(passive)animationswhereparticipantshadnocontrolovertheanimation.Bothconditionswereevaluatedforspecificaswellasgeneralcomprehensionofthedata.Whileinteractiveanimationwasmoreeffectiveforspecificinformationanalysis,itledtomanymisunderstandingsintheoverallcomprehensionduetothefragmentationoftheanimation.Ingeneral,participantsfeltthatinteractivitygavethemmoreconfidenceandfounditmoreenjoyableandexcitingfordataexploration.Inconclusion,interactingwithanimationsoftrendvisualizationsprovedtobeaneffectivetechniqueforexploratorydataanalysisandwassignificantlymoreaccuratethannon-interactiveanimationalone.Withthesefindingsweaimatsupportingandguidingthedesignofinteractivitytoeffectivelyenhancedataexplorationinanimatedvisualizations.

8654-18, Session K2

Social media analysis and platform (Keynote Presentation)MeichunHsu,Hewlett-PackardLabs.(UnitedStates)

NoAbstractAvailable

8654-19, Session 10

iMap: a stable layout for navigating large image collections with embedded searchChaoliWang,JohnP.Reese,HuanZhang,JunTao,RobertJ.Nemiroff,MichiganTechnologicalUniv.(UnitedStates)

Effectivetechniquesfororganizingandvisualizinglargeimagecollectionsareingrowingdemandasvisualsearchgetsincreasinglypopular.Targetinganonlineastronomyarchivewiththousandsofimages,wepresentoursolutionforimagesearchandclusteringbasedontheevaluationimagesimilarityusingbothvisualandtextualinformation.Tolayoutimages,weintroduceiMap,atreemap-basedrepresentationforvisualizingandnavigatingimagesearchandclusteringresults.iMapnotonlymakeseffectiveuseofavailabledisplayareatoarrangeimagesbutalsomaintainsstableupdatewhenimagesareinsertedorremovedduringthequery.Wealsodevelopanembeddedvisualizationthatintegratesimagetagsforin-placesearchrefinement.Weshowtheeffectivenessofourapproachbydemonstratingexperimentalresultsandconductingacomparativeuserstudy.

8654-21, Session 10

uVis Studio: an integrated development environment for visualizationKostasPantazos,MohammadA.Kuhail,SorenLauesen,ShangjinXu,TheITUniv.ofCopenhagen(Denmark)

Atoolkitfacilitatesthevisualizationdevelopmentprocess.Theprocesscanbefurtherenhancedbyintegratingthetoolkitsindevelopmentenvironments.ThispaperdescribeshowtheuVistoolkit,aformula-basedvisualizationtoolkit,hasbeenextendedwithadevelopmentenvironment,calleduVisStudio.Insteadofprogramming,developersapplyaDrag-Drop-Set-View-Interactapproachtoenhancecognitiveabilities.Developerbindcontrolstodata,andtheStudiogivesimmediatevisualfeedbackinthedesignpanel.Thisisanovelfeature,calledWhat-You-Bind-Is-What-You-Get.TheStudioalsoprovidesModesthatallowdeveloperstointeractandviewthevisualizationfromadeveloper’sandanend-user’sperspectiveswithoutswitchingscreens,andData-Auto-Completion;afeatureofthePropertyGridthathelpsdeveloperswithsuggestionstowritethecorrectformulasthatrefertocontrolsanddatabaseinformation.

WeconductedausabilitystudywithsixdeveloperstoevaluateiftheStudioanditsfeaturesenhancecognitionandfacilitatethevisualizationdevelopment.TheresultsshowthatdevelopersappreciatedtheDrag-Drop-Set-View-Interactapproach,theWhat-You-Bind-Is-What-You-Getfeature,theData-Auto-CompletionandtheModes.Severalusabilityproblemswereidentified,andsomesuggestionsforimprovementinclude:newpanels,betterpresentationofModes,andbettererrormessages.

8654-22, Session 11

Interactive visual comparison of multimedia data through type-specific viewsRussE.BurtnerIV,ShawnBohn,DebbieA.Payne,PacificNorthwestNationalLab.(UnitedStates)

Whenanalyzingdatasourcessuchasnews,blogs,andsocialmediasitestodiscoveractionableintelligence,awealthofinformationisavailable.Thedatacollectedfromthesesourcescomprisesalltypesofmultimediadata:text,image,andvideo.Todate,visualanalytictoolsgenerallyprovideanalysisofonemodeofdata,orasasinglemodewhileattachingadditionalmodesasmetadata.Anexampleisatextanalysistoolthatoffersvisualizationsbasedonprocessingthetextandallowsimagestobedisplayedasauxiliaryinformationattachedtothetext.Whilesuchatoolallowsausertodivedeeplyintothedetailofaspecifictypeofdata,itdoesnotprovideaglobalperspective.

Conference 8654


Traditionalvideoandimageanalyticsincludedigitalimageprocessing,patternrecognition,edgefinding,boundaryandcurvedetection,regiongrowing,shapeidentification,featureextractionwithinindividualimagesorframesofvideo,content-basedimageretrieval,andcontent-basedvideoclipretrieval.Traditionaltextanalyticsincludeinformationretrieval,lexicalanalysis,patternrecognition,tagging/annotation,entityextraction,andlinkandassociationanalysis.Theseanalysistechniquesareusedinindividualvisualanalytictoolswithoutbringingthemtogethertoprovideadditionalinsightintomultimediacollections.Theneedformultimediaanalysisisparticularlyacutewhentheanalysisofasinglemodeofinformationfailstorevealthelargerpicture,forexample;imageanalysisdoesnotyetprovidesemanticunderstanding.

Canopy,isavisualanalytictooldevelopedtoaidinformationanalystsinunderstandingcollectionsofmultimedia.Tosupportthecomputationalneedsofmultimediaprocessing,Canopyhasbeendevelopedasanenterpriseapplication.Canopyhasaclientapplicationforvisualdisplayofmultipleinteractivevisualizationsandmultipleservercomponentsthatextractdata,processtext,processimageandvideodata,andcombineresults.Canopycanhelpwithanalyticproblemssuchas,“Ihavealargecollectionofdata;helpmeinvestigatethiscollectiontodeterminethemostrelevantfileswithoutmyhavingtowatcheverymovie,vieweveryimage,andreadallthetext”;“Ihaveapictureofinterest;helpmefindotherpicturesofthesamesubjectorvideostakeninthesamelocation”;and“Ihaveaname;helpmedetermineifIhaveanyimagesorvideosofthisperson.”Canopybootstrapstheanalysisprocessbyprovidingvisualcluestopotentialdatarelationshipsandhighlightsconnections,providingtheuserwithunderstandingofallthedataandadditionalstructuralcontext.This,inturn,facilitatesdiscoveryofpreviouslyunknowncontentaswellasunexpectedornon-obviousrelationships.

Canopyextractsthebasetypesfromacompounddocument.Forexample,aPowerPointfilewillbeextractedintothetextfromtheslides,metadatafieldssuchasauthor,andtheimagesonanyslide.Canopyappliestheappropriateanalysistoeachextractedportionofthefilebasedonitsmode:text,image,orvideo.Onceaprojecthasprocessed,theuserispresentedwithmultipleinterconnectedviewsthattietheextractedtypesthroughtherelationshipsfoundinthestructureofthedocuments.Theseviewsallowtheusertoexplorethedataandrelationshipswithinandacrossmodalities.

Canopyisdeployedinmultiplelocations,andinformationanalystsarediscoveringpreviouslyhiddenlinkagesamongthetext,image,andvideomedia.Canopyisanovelapplicationthattakesmultimediaanalysistoanewlevelbyapplyingcrossmediaunderstandingthroughvisualizationandallowinganalyststoseerelationshipswithintheirdataregardlessofsourceormediatype.

8654-23, Session 11

Evaluating multivariate visualizations on time-varying dataMarkA.Livingston,U.S.NavalResearchLab.(UnitedStates);JonathanWDecker,USNavalResearchLab(UnitedStates);ZhumingAi,U.S.NavalResearchLab.(UnitedStates)

Multivariatevisualizationtechniqueshavebeenappliedtoawidevarietyofvisualanalysistasksandabroadrangeofdatatypesandsources.Theirutilityhasbeenevaluatedinamodestrangeofsimpleanalysistasks.Inthiswork,weextendourprevioustasktoacaseoftime-varyingdata.Weimplementedfivevisualizationsofoursynthetictestdata:threepreviouslyevaluatedtechniques(Data-drivenSpots,OrientedSlivers,andAttributeBlocks),onehybridofthefirsttwothatwecallOrientedData-drivenSpots,andanimplementationofAttributeBlocksthatintegratesthetemporalslices.Weconductedauserstudyofthesefivetechniques.Ourpreviousfinding(withstaticdata)wasthatusersperformedbestwhenthedensityofthetarget(asmeasuredbythevisualization’ssymbols:spots,slivers,orblocks``on’’)waseitherhighestorhadthehighestratiotonon-targetfeatures.Wepresenttheresultsofourstudyandourconclusionsforhowthetime-varyingpresentationsaffectedthesefindings.

8654-24, Session 11

Multi-focus and multi-window techniques for interactive network explorationPriyaKrishnanSundarararajan,OleJ.Mengshoel,TedSelker,CarnegieMellonUniv.(UnitedStates)

Networksanalystsoftenneedtocomparenodesindifferentpartsofanetworkandkeepinmindthenetworkstructure,nodelabelsandinternaldetailsofnodes.Evenamoderately-sizednetworkonascreenshowscoarsestructure;unfortunatelyitmaymakedetailedstructureandnodelabelsunreadable.Zoomingincanbeusedtostudydetailsandreadnodelabelsbutindoingthisthenetworkanalystmaylosetrackofalreadystudieddetailselsewhereinthenetwork.Wepresentmulti-focusandmulti-windowtechniquestosupportinteractiveexplorationofnetworks.Ourworksupportstheuserinpartitioningandselectivelyzoominginthenetwork;dataassociatedwitheachnodecanbefurtherinspectedusingalignedorfloatingwindows.Basedonauser’sselectionoffocusnodes,thenetworkisenlargednearthefocusnodes.Thetechniqueallowstheusertosimultaneouslyzoominonupto10-20nodesandtheirneighborhoodwhileretainingthelargernetworkcontext.WedemonstrateourtechniqueandtoolbyshowinghowtheysupportinteractivedebuggingofaBayesiannetworkmodelofanelectricalpowersystem.Inaddition,weshowthatitcanvisuallysimplifycomparisonsacrossondifferenttypesofnetworks.


Effective color combinations in isosurface visualizationSussanEinakian,TimothyS.Newman,TheUniv.ofAlabamainHuntsville(UnitedStates)

Selectionofsuitablecolorcombinationsisachallengeforvisualizationsthatutilizecolorasacue,suchassimultaneousvisualizationofmultipleisosurfaces,eachinitsowncolor.Useofvariouscolormappingschemes,includingheatmaps,colorspectrummaps,etc.,havebeenexploredininformationvisualization[1],butlessattentionhasfocusedonemployingvariousartisticandpsychologicaltheoriesofcolor.Here,weconsiderapplicabilityofcolortheoriesofharmonious,disharmonious,andopponentcolorsformultipleisosurfacevisualization.Weconsiderthisviauserstudyofpreferencesforuseofeachsuchcolortheoryinthisvisualizationapplicationdomain.Littlepriorworkhasfocusedoncolorchoiceforisosurfacevisualization.Theworkheredoesbuildonsomeprioreffortstoapplytheoriesofcolor(dis)harmonyandopponencyininformationvisualization.

Inonepriorworks[1],wehaveconsideredsomeaspectsofusingcolortheories(harmony,disharmony,andopponentcolor)ininformationvisualization.Inthisworkhereweconsideruseofthesecolortheoriestodrivechoseofthecolorfordisplayofmultipleisosurfacesfromvolumetricdatasets.Weconsiderthischoiceviaauserstudythatconsidersthepracticaldeterminationofapplicabilityofthesecolortheoriesforthisisosurfacevisualizationtask.Ourfocusinthestudyishoweasilydifferentregionsofthedatasetsaredistinguishable.Inthisuserstudythedistinctnessandpreferencesofcolorswerediscoveredbasedonuserresponses.ThethreedatasetsconsideredherearefromtheVolumeLibrarywebsite[3,4],includingtheEngine,DTI(DiffusionTensorImaging),andH2Odatasets.

WeusedOpenDx,thevisualizationopensourcepackageonLinuxenvironmentforrenderingnestedisosurfaces.Acollectionofisovaluesandopacitywereinitiallyconsideredforeachdatasettoseeifitwaspossibletocreateclearanddistinctsurfaces.Intheend,wefounduseofacommonopacityforallsurfacesallowedpresentationofaclearresultfornestedisosurfacesinthesedatasetsthatcouldbeusedforreadycomparisonofthetheoriesandcolorchoices.Fortyparticipantswereconsideredinthisuserstudy.Participantswereaskedtomakerepeatedpair-wisecomparisonsofvisualizationrenderingtoreporttheirpreferencesforeachcomparison.Fulldetailsofthestepsofthestudywillbedescribedinthefullpaper.

Thestatisticalsignificanceofdistinctivenessofeachdisharmonious,opponent,andharmoniouscolorcombinationwasdeterminedbyapplicationofastatisticalsigntest.Basedonsuchtesting,wefound

Conference 8654


thereisastatisticallysignificantdifferencebetweendisharmonyandopponentcolorsfortheEngineandDTIisosurfacerenderings.Inaddition,ourtestsfoundastatisticallysignificantdifferencebetweendisharmoniousandopponentcolorsforallthreeimages.Athirdtestwastoconsiderthepreferencesfordisharmoniousversusharmoniouscolors,harmoniousversusopponentcolors,anddisharmoniousversusopponentcolorsforeachimage.AstatisticallysignificantdifferencewasfoundbetweenthedisharmoniousandharmoniouscolorcombinationsfortheEngineandH2Odatasets.Also,astatisticallysignificantdifferencewasfoundbetweenthedisharmoniousandtheopponentcolorcombinationsforallimages.FortheEngineandDTIimagesthereisalsoastatisticallysignificantdifferencebetweentheharmoniousandopponentcolors.TherewasnotastatisticallysignificantdifferencebetweenthesetwotheoriesinapplicationtotheH2Oimage.

Insummary,wefoundthatusersappeartofinddisharmoniouscolorcombinationandharmoniouscolorcombinationstohavestatisticallysignificantdifferences(atthe99%confidencelevel)inperceiveddistinctiveness.Moreover,harmoniouscolorswerefoundtobedistinctivetotheopponentcolorcombinations(alsoatthe99%confidencelevel).

Wealsohaveconsideredresultsbygenderandeducationallevelofparticipantsandfoundnostatisticallysignificantdifferencetoexist.Basedontheoutcomeoftheuserstudyitappearsthatthestructureandshapeofaninstanceisoneofthemainissuesaffectingtheperceptionofsuitabilityofharmonicversusdisharmonicversusopponentcolorcombinations.TheEnginehasahardstructureandthereisnointerferencebetweendifferentsurfaces.TheDTIdatasethastissuesofbrainandthereisoverlappingbetweentwoisosurfaces.TheH2Odatasetisbasedonthreeisosurfacesthathavemoreoverlapintheimagethereforegivingrisetomorecomplexcolorandopacityinteractionswhenviewedfromsomedirections,resultingintherebeingnosignificantdifferenceinuserratingofopponentandharmoniouscombinations.

Inconclusion,ourstudyresultssuggestthattheshapeofstructuresinavolumetricdataseteffectstheperceptionofwhichcombinationismostsuitablefornestedisosurfaces.However,itappearsthathardstructureswithnointerferencebetweenisosurfaceswerefounddisharmoniouscolorcombinationstatisticallysignificantdifferenceindistinctiveness.Ontheotherhand,forsoftstructures,likebodytissues,withoverlapbetweenisosurfacestheharmoniouscolorcombinationswerefoundtobemoredistinctthandisharmoniouscolorcombinations.

References:

[1]Einakian,S.andNewman,TimothyS.;ExperimentsonEffectiveColorCombinationinMap-BasedInformationVisualization.Proc.VisualizationandDataAnalysisConf.‘10(SPIEVol.7530),SanJoseJan.2010.

[2]TheVolumeLibrary.[Online]http://schorsch.efi.fh-nuernberg.de/data/volume/.Accessed(March2012).

[3]TheVolumeLibrary.[Online]http://www9.informatik.uni-erlangen.de/External/vollib/.Accessed(March2012).


Web tools for rapid experimental visualization prototypingJonathanW.Decker,MarkA.Livingston,U.S.NavalResearchLab.(UnitedStates)

Quiteoftenaresearcherfindsthemselveslookingatspreadsheetsofhigh-dimensionaldatageneratedbyexperimentalmodelsanduserstudies.Wecanuseanalysistochallengeorconfirmhypothesis,butunexpectedresultscaneasilybelostintheshuffle.Forthisreason,itwouldbeusefultovisualizetheresultssowecanexploreourdataandmakenewdiscoveries.

Webbrowsershavebecomeincreasinglycapableforcreatingcomplex,multi-viewapplications.Javascriptisquicklybecomingadefactostandardforscripting,onlineandoffline.Thisworkdemonstratestheuseofwebtechnologiesasapowerfultoolforrapidvisualizationprototyping.

Wehavedevelopedtwoprototypes:Oneforhigh-dimensionalresultsoftheabELICIT-multi-agentversionoftheELICITplatformtaskedwithcollaboratingtoidentifytheparametersofapendingattack.Anotherprototypedisplaysresponsestoauserstudyontheeffectivenessofmulti-layervisualizationtechniques.WecreatedcoordinatedmultipleviewsprototypesintheGoogleChromeBrowserwritteninJavascript,CSSandHTML.Wewilldiscussthebenefitsandshortcomingsofthisapproach.


Time-based user-movement pattern analysis from location-based social network dataHueyLingChuan,IsarapornKulkumjon,SurbhiDangi,CarnegieMellonUniv.(UnitedStates)

Virtualsocialinteractionsplayanincreasinglyimportantroleinthediscoveryofplaceswithdigitalrecommendations.Ourhypothesisisthatpeopledefinethecharacterofacitybythetypeofplacestheyfrequent.Withabriefdescriptionofourdataset,anomaliesandobservationsaboutthedata,thispaperdelvesintothreedistinctapproachestovisualizethedatasetaddressingourtwogoalsof:1.Arrivingatatime-basedregionspecificrecommendationlogicfordifferenttypesofusersclassifiedbytheplacestheyfrequent.2.Analyzingthebehaviorsofusersthatcheck-iningroupsoftwoormorepeople.Thestudyrevealedthatdistinctpatternsexistforpeoplethatareresidentsofthecityandforpeoplewhoareshort-termvisitorstothecity.Thefrequencyofvisits,however,isbothdependentonthetimeofthedayaswellastheurbanareaitself(e.g.eateries,offices,touristattractions).Theobservationscanbeextendedforapplicationinfoodandtravelrecommendationenginesaswellasforresearchinurbananalytics,smartcitiesandtownplanning.

Keywords:digitalrecommendations,socialinteractions,check-inbehavior,frequencyofvisits,travelrecommendationengines,urbananalytics,smartcities.


Visualizing vascular structures in virtual environmentsThomasWischgoll,WrightStateUniv.(UnitedStates)

Inordertolearnmoreaboutthecauseofcoronaryheartdiseasesanddevelopdiagnostictools,theextractionandvisualizationofvascularstructuresfromvolumetricscansforfurtheranalysisisanimportantstep.Bydeterminingageometricrepresentationofthevasculature,thegeometrycanbeinspectedandadditionalquantitativedatacalculatedandincorporatedintothevisualizationofthevasculature.Toprovideamoreuser-friendlyvisualizationtool,virtualenvironmentparadigmscanbeutilized.Thispaperdescribestechniquesforinteractiverenderingoflarge-scalevascularstructureswithinvirtualenvironments.Thiscanbeappliedtoalmostanyvirtualenvironmentconfiguration,suchasCAVE-typedisplays.Specifically,thetoolspresentedinthispaperweretestedonaBarcoI-Spaceandalarge62x108inchpassiveprojectionscreenwithaKinectsensorforusertracking.


A combined multidimensional scaling and hierarchical clustering view for the exploratory analysis of multidimensional dataPaulCraig,NénaRoa-Seïler,Univ.TechnológicadelaMixteca(Mexico)

Thispaperdescribestheanovelinformationvisualizationtechniquethatcombinesmultidimensionalscalingandhierarchicalclusteringtosupporttheexploratoryanalysisofmultidimensionaldata.Thetechniquedisplaystheresultsofmultidimensionalscalingusinga

Conference 8654


scatterplotwheretheclosenessofanytwoitems’representation’sareapproximatetotheirsimilarityaccordingtoaEuclideandistancemetric.Theresultsofhierarchicalclusteringareoverlaidontothisviewbydrawingsmoothedoutlinesaroundeachnestedcluster.Thedifferenceinsimilaritybetweensuccessiveclustercombinationsisusedtocolourcodeclustersandmakestrongernaturalclustersmoreprominentinthedisplay.Whenaclusterorgroupofitemsisselected,multidimensionalscalingandhierarchicalclusteringarere-appliedtoafilteredsubsetofthedata,andanimationisusedtosmooththetransitionbetweensuccessivefilteredviews.Asacasestudywedemonstratethetechniquebeingusedtoanalysesurveydatarelatingtotheappropriatenessofdifferentphrasestodifferentemotionallychargedsituations.


Visualization of decision processes using a cognitive architectureMarkA.Livingston,ArthiMurugesan,DerekBrock,WendeK.Frost,DennisPerzanowski,U.S.NavalResearchLab.(UnitedStates)

Cognitivearchitecturesarecomputationaltheoriesofreasoningthehumanmindengagesinasitprocessesfactsandexperiences.Acognitivearchitectureusesdeclarativeandproceduralknowledgetorepresentmentalconstructsthatareinvolvedindecisionmaking.Employingamodelofbehavioralandperceptualconstraintsderivedfromasetofoneormorescenarios,thearchitecturereasonsaboutthemostlikelyconsequence(s)ofasequenceofevents.Reasoningofanycomplexityanddepthinvolvingcomputationalprocesses,however,isoftenopaqueandchallengingtocomprehend.Arguably,fordecisionmakerswhomayneedtoevaluateorquestiontheresultsofautonomousreasoning,itwouldbeusefultobeabletoinspectthestepsinvolvedinaninteractive,graphicalformat.Whenachainofevidenceandconstraint-baseddecisionpointscanbevisualized,itbecomeseasiertoexplorebothhowandwhyascenarioofinterestwilllikelyunfoldinaparticularway.

Inthispaper,wepresentinitialworkonaschemeforvisualizingautonomousreasoningthatproducesgraphicalrepresentationsofmodelsruninthePolyschemecognitivearchitecture.First,wegiveabriefoverviewofthearchitectureandnotethekeytypesofdatathatarecriticalforvisualrepresentationsofcognitively-based,computationalreasoningmechanisms.Weproposeanalgorithmtogeneratevisualizationsofmodel-basedreasoning,anddiscusspropertiesofourtechniquethatposechallengesforourrepresentationgoals.Finally,wepresentexamplevisualizationsandsimpleinteractionswiththeunderlyingchainofreasoning.Weconcludewithasummaryoffeedbacksolicitedfromdomainexpertsandpractitionersinthefieldofcognitivemodeling.


Vortex core timelines and ribbon summarizations: flow summarization over time and simulation ensemblesAlexisYeeLynChan,JoohwiLee,RussellM.TaylorII,TheUniv.ofNorthCarolinaatChapelHill(UnitedStates)

Thispaperpresentstwonewvortex-summarizationtechniquesdesignedtoportrayvortexmotionoveranentiresimulationandoveranensembleofsimulationsinasingleimage.Linear“vortexcoretimelines”withconeglyphssummarizeflowoveralltimestepsofasinglesimulation,withcolorvaryingtoindicatetime.Simplified“ribbonsummarizations”withhuenominallyencodingensemblemembershipandsaturationencodingtimeenabledirectvisualcomparisonofthedistributionofvorticesintimeandspaceforasetofsimulations.Feedbackfromdomainexpertsindicatesthattheyfoundthesetechniquestobeimportantandeffectiveforsummarizingandcomparingtime-varyingflowsacrossanensembleofsimulations.


X3DBio2: A visual analysis tool for biomolecular structure comparisonHongYi,SidharthThakur,RenaissanceComputingInstitute(UnitedStates);LatsavongsakdaSethaphong,YaroslavaG.Yingling,NorthCarolinaStateUniv.(UnitedStates)

Amajorprobleminstructuralbiologyistherecognitionofdifferencesandsimilaritiesbetweenrelatedthreedimensional(3D)biomolecularstructures.Investigatingthesestructurerelationshipsisimportantnotonlyforunderstandingoffunctionalpropertiesofbiologicallysignificantmolecules,butalsofordevelopmentofnewandimprovedmaterialsbasedonnaturally-occurringmolecules.Wedevelopedanewvisualanalysistool,X3DBio2,for3Dbiomolecularstructurecomparisonandanalysis.Thetoolisdesignedforelucidationofstructuraleffectsofmutationsinproteinsandnucleicacidsandforassessmentoftimedependenttrajectoriesfrommoleculardynamicssimulations.X3DBio2isafreelydownloadableopensourcesoftwareandprovidestightlyintegratedfeaturestoperformmanystandardanalysisandvisualexplorationtasks.Weexpectthistoolcanbeappliedtosolveavarietyofbiologicalproblemsandillustratetheuseofthetoolontheexamplestudyofthedifferencesandsimilaritiesbetweentwoproteinsoftheglycosyltransferasefamily2thatsynthesizepolysaccharidesoligomers.ThesizeandconformationaldistancesandretainedcorestructuralsimilarityofSpsAtoK4CPrepresentsignificantepochsintheevolutionofinvertingglycosyltransferases.


Improvement of web-based data acquisition and management system for GOSAT validation lidar data analysisHiroshiOkumura,ShoichiroTakubo,TakeruKawasaki,IndraN.Abdullah,SagaUniv.(Japan);OsamuUchino,IsamuMorino,TatsuyaYokota,NationalInstituteforEnvironmentalStudies(Japan);TomohiroNagai,TetsuSakai,TakashiMaki,MeteorologicalResearchInstitute(Japan);KoheiArai,SagaUniv.(Japan)

1.BACKGROUND

Globalwarminghasbecomeaveryseriousissueforhumanbeings.Scientistshavesuggestedthat,attheratetheEarth’stemperatureisrising,anextremeformofglobalclimatechangecouldoccurinafewcenturies.In1997,theKyotoProtocolwasadoptedattheThirdSessionoftheConferenceofthePartiestotheUnitedNationsFrameworkConventiononClimateChange(COP3),makingitmandatoryfordevelopednationstoreducecarbondioxideemissionsbysixtoeightpercentoftheirtotalemissionsin1990,andtomeetthisgoalsometimebetween2008and2012.Furthermore,theGlobalClimateObservationSystem(GCOS)hasbeenproposedbytheWorldMeteorologicalOrganization(WMO)andtheUnitedNationsEnvironmentProgram(UNEP)inordertostrengthenobservationsofland,ocean,andspaceconductedbyeachcountry.Sofar,thenumberofground-basedcarbondioxideobservationpointshasbeenlimited,andtheyhavebeendistributedunequallythroughouttheworld.GreenhousegasObservationSATellite(GOSAT)enablestheprecisemonitoringofthedensityofcarbondioxidebycombiningglobalobservationdatasentfromspacewithdataobtainedonland,andwithsimulationmodels.Inaddition,observationofmethane,anotherGreenhousegas,hasbeenconsidered.ForvalidationofGOSATdataproducts,wehavecontinuedground-baseobservationwithFourierTransformSpectrometer(FTS),aerosollidarandozone-DIALlidaratSagaUniversity,JAPANsinceMarch,2011.

2.DETAILS

Forlidardataanalysis,notonlyacquiredlidardatabutalsometeorologicaldataarerequired.

Wedevelopanweb-basedataacquisitionandmanagementsystemforeffective

acquisitionofmeteorologicaldataandmanagementofacquiredlidardata.

Thesystemconsistsofdataacquisitionpartanddatamanagementpart.

DataacquisitionpartwritteninPerllanguageacquiresAMeDASground-

Conference 8654


levelmeteorologicaldata,RawinSondehighaltitudemeteorologicaldata,ground-leveloxidantdata,skyradiometerdata,skyviewcameraimages,meteorologicalsatelliteIRimagedataandGOSATvalidationlidardata.

AMeDAS(AutomatedMeteorologicalDataAcquisitionSystem)isahigh-resolutionsurfaceobservationnetworkdevelopedbyJapanMeteorologicalAgency(JMA)usedforgatheringregionalweatherdataandverifyingforecastperformance.Begunoperationon

November1,1974,thesystemconsistsofabout1,300stationswithautomaticobservationequipment.Thesestations,ofwhichmorethan1,100areunmanned,arelocatedatanaverageintervalof17kmthroughoutJapan.

TheseAMeDASmeteorologicaldataareacquiredfromSagacitystationeveryhour.

RawinSondeareessentialtoobtainhighaltitudetemperatureandairpressuredata.

TheseRawinSondemeteorologicaldataareacquiredfromFukuokacitystationtwice(00Zand12Z)aday.

Ground-leveloxidantdataarerequiredforanalysisofozone-DIALlidardata.

Theseground-leveloxidantdataareacquiredfromSagacitystationeveryhour.

Skyradiometerisrequiredtoobserveopticalthickness.

Skyradiometerdataareacquiredeveryhour.

SkyviewcameraimagesandmeteorologicalsatelliteIRimagedataarerequiredtoconfirmcloudsabovelidar.

Skyviewcameraimagesareacquiredevery5minutesindaytimeandmeteorologicalsatelliteIRimagedataareacquiredeveryhour.

GOSATvalidationlidardataaremanuallyuploadedviasecureFTP.

DatamanagementpartwritteninPHPlanguagedemonstratessatellite-passdateandallacquireddataandprovidesinteractivegraphicaluserinterface.

3.IMPROVEMENTOFTHESYSTEM

Forhigherperformanceandhigherdatausability,ozoneverticalprofileestimatedwithglobalozonetransportationmodel,opticalthicknesschartandautomaticcalculationfunctionofRayleighmoleculedistinctionandbackscattercoefficientfromactualRawinSondedataandUSstandardatmospheremodelareimplementedforaerosollidarandozone-DIALlidardataanalysis.


Optimizing threshold for extreme scale analysisRobertJ.H.Maynard,Kitware,Inc.(UnitedStates);KennethMoreland,SandiaNationalLabs.(UnitedStates);UtkarshAtyachit,BerkGeveci,Kitware,Inc.(UnitedStates);Kwan-LiuMa,Univ.ofCalifornia,Davis(UnitedStates)

AstheHPCcommunitystartsfocusingitseffortstowardsexascale,itbecomesclearthatwearelookingatmachineswithabillionwayconcurrency.Althoughparallelcomputinghasbeenatthecoreoftheperformancegainsachieveduntilnow,scalingover1,000timesthecurrentconcurrencycanbechallenging.Asdiscussedinthispaper,eventhesmallestmemoryaccessandsynchronizationoverheadscancausemajorbottlenecksatthisscale.Aswedevelopnewsoftwareandadaptexistingalgorithmsforexascale,weneedtobecognizantofsuchpitfalls.Inthispaper,wedocumentourexperiencewithoptimizingafairlycommonandparallelizablevisualizationalgorithm,thresholdofcellsbasedonscalarvalues,forsuchhighlyconcurrentarchitectures.Ourexperimentshelpusidentifydesignpatternsthatcanbegeneralizedforothervisualizationalgorithmsaswell.WediscussourimplementationwithintheDaxtoolkit,whichisaframeworkfordata

analysisandvisualizationatextremescale.TheDaxtoolkitemploysthepatternsdiscussedherewithintheframework’sscaffoldingtomakeiteasierforalgorithmdeveloperstowritealgorithmswithouthavingtoworryaboutsuchscalingissues.


Perceptualization of geometry using intelligent haptic and visual sensingJianguangWeng,ZhejiangUniv.(China);HuiZhang,IndianaUniv.-PurdueUniv.Indianapolis(UnitedStates)

Wepresentasetofparadigmsforinvestigatinggeometricstructuresusinghapticandvisualsensing.Ourprincipaltestcasesincludesmoothlyembeddedgeometryshapessuchasknottedcurvesembeddedin3Dandknottedsurfacesin4D,thatcontainmassiveintersectionswhenprojectedtoonelowerdimension.Onecanexploitatouch-responsive3Dinteractiveprobetohapticallyoverridethisconflictingevidenceintherenderedimages,byforcingcontinuityinthehapticrepresentationtoemphasizethetruetopology.Inourwork,weexploitedapredictivehapticguidance,a“computer-simulatedhand”withsupplementaryforcesuggestion,tosupportintelligentexplorationofgeometryshapesthatwillsmoothandmaximizetheprobabilityofrecognition.Thecognitiveloadcanbereducedfurtherwhenenablinganattention-drivenvisualsensingduringthehapticexploration.Finally,weproposedafamilyofhapticbased“rolling”methodstofacilitatethehapticmanipulation,forexample,justaswecanexploreeverypartofanordinaryknotbyrollingitbetweenourhands,wecanusethis“rollingmethod”tooptimizethecombinedviewableandtouchableaspectsofthegeometrybycleverlyexploitinginteractivecontinuity.Allthesemethodscombinetorevealthefullrichnessofthehapticexplorationofgeometricstructures,andtoovercomethelimitationsoftraditional4Dvisualization.


Review of chart recognition in document imagesYanLiu,XiaoqingLu,YeyangQin,ZhiTang,PekingUniv.(China);JianboXu,PekingUniv.FounderGroupCorp.(China)

Asaneffectiveinformationtransmittingway,chartiswidelyusedtorepresentscientificstatisticsdatuminbooks,researchpapers,newspapersetc.Thoughtextualinformationisstillthemajorsourceofdata,therehasbeenanincreasingtrendofintroducinggraphs,pictures,andfiguresintotheinformationpool.Textrecognitiontechniquesfordocumentshavebeenaccomplishedusingopticalcharacterrecognition(OCR)software.ChartrecognitiontechniquesasanecessarysupplementofOCRfordocumentimagesarestillanunsolvedproblemduetothegreatsubjectivenessandvarietyofchartsstyles.Thispaperreviewsthedevelopmentprocessofchartrecognitiontechniquesinthepastdecadesandpresentsthefocusesofcurrentresearches.Thewholeprocessofchartrecognitionispresentedsystematically,whichmainlyincludesthreeparts:chartsegmentation,chartclassification,andchartInterpretation.Ineachpart,thelatestresearchworkisintroduced.Inthelast,thepaperconcludeswithasummaryandpromisingfutureresearchdirection.

Conference 8654


Conference 8655: Image Processing: Algorithms and Systems XIMonday-Wednesday4–6February2013

Part of Proceedings of SPIE Vol. 8655 Image Processing: Algorithms and Systems XI

8655-2, Session 1

Robust face recognition algorithm for identifition of disaster victimsWouterJ.R.Gevaert,EindhovenUniv.ofTechnology(Netherlands)andUniv.CollegeWestFlanders(Belgium);PeterH.N.deWith,EindhovenUniv.ofTechnology(Netherlands)

Wepresentarobustfacerecognitionalgorithmfortheidentificationofoccluded,injuredandmutilatedfaceswithalimitedtrainingsetperperson.Insuchcases,theconventionalfacerecognitionmethodsfallshortduetospecificaspectsintheclassification.TheproposedalgorithminvolvesrecursivePrincipleComponentAnalysisforreconstructionofaffectedfacialparts,followedbyafeatureextractorbasedonGaborwaveletsanduniformmulti-scaleLocalBinaryPatterns.Asaclassifier,aRadialBasisNeuralNetworkisemployed.Intermsofrobustnesstofacialabnormalities,testsshowthattheproposedalgorithmoutperformsconventionalfacerecog-nitionalgorithmslike,theEigenfacesapproach,LocalBinaryPatternsandtheGabormagnitudemethod.Tomimicreal-lifeconditionsinwhichthealgorithmwouldhavetooperate,specificdatabaseshavebeenconstructedandmergedwithpartialexistingdatabasesandjointlycompiled.Experimentsontheseparticulardatabasesshowthattheproposedalgorithmachievesrecognitionratesbeyond95%.

8655-3, Session 1

Target re-identification in low-quality camera networksFedericaBattisti,MarcoCarli,GiovannaFarinella,AlessandroNeri,Univ.degliStudidiRomaTre(Italy)

Personre-identificationtroughcameranetworkdealwiththeproblemoffindingcorrectlinkbetweenconsecutiveobservationsofthesametargetamongdifferentcameraschoosingthemostprobablecorrespondenceamongasetofpossiblematches.Thispapercanbedividedintwoparts.Firstofall,thetypologyofilluminationchangesviewedfromatargetwhilecrossingournetworkislearned.Theresultsshowthatthechanginginvolvestheintensity.Afterwards,anewlinkspruningsystemcalledTCS(TargetColorStructure)isproposed:thelinkbetweentwoobservationsiskeptifthereiscoherenceinthecolorstructureofclotheswornbytargets.

ThecolorstructureisevaluatedusinganovelcolordescriptorcalledCSD(ColorStructureDescriptor)whichisintensityscaleandshiftinvariant.TheCSDdescribesthedifferenceintermofdominantcolorsbetweentworegionsofinterest:theshirtandthepantsofatarget.

TheproposedTCScontrolcanberunoverwhateverpersonre-identificationalgorithm;inthispaperweusedhuehistogramintersectiontoassignasimilarityscoretothesurvivedlinks.

ResultsshowthattheimprovementsachievedapplyingTCScontrolareupto4%forthetoprankandupto16%consideringthefirstelevenmoresimilarcandidates.

8655-5, Session 2

Improved image copyright protection scheme exploiting visual cryptography in wavelet domainAlessioMeneghetti,GiuliaBoato,FrancescoG.B.DeNatale,Univ.degliStudidiTrento(Italy)

Thelastfewyearshaveseenamassiveincrementintheuseoftheinternetasachannelforsharingandtransmittingdata,sothatseveralcopyrightprotectionschemeshavebeenproposed.Theideaofembeddingawatermarkdirectlyinthedataishoweverunacceptableinvariousfieldsofapplication,duetotheintrinsicdegradationintroducedbynonreversiblewatermarkingschemes.Hencesomezerowatermarkingschemeshavebeendeveloped.Inthisworkweproposeanoptimizationofarecentwatermarkingmethodbasedonvisualcryptography,byimprovingresultsagainstmostcommontypesofattacksandachievingahigherperceptualqualityoftheextractedmark.

8655-6, Session 2

HDR image multi-bit watermarking using bilateral-filtering-based maskingVassilisSolachidis,EmanueleMaiorana,PatrizioCampisi,Univ.degliStudidiRomaTre(Italy)

Thepresentpaperproposesamulti-bitwatermarkingmethodforHighDynamicRange(HDR)images.TheproposedapproachisdesignedinordertoguaranteethemarkimperceptibilityinboththeHDRmarkedimageanditsLowDynamicRange(LDR)counterpart,andtoberobustagainstsignificantnon-lineardistortionssuchasthoseperformedbytonemappingoperators(TMOs).Thepaperoffersadetailedreviewonthestate-of-the-artonHDRwatermarking,andthenpresentstheproposedschemewherethewatermarkisembeddedinthewavelettransformoftheJustNoticeableDifference(JND)-scaledspaceoftheoriginalHDRimage.AvisualmasktakingintoaccountspecificaspectsoftheHumanVisualSystem(HVS)regardingthesensibilitytothemodificationsinthedetailspartofanimage,aswellasthedependenceoftheperceivedcontrastonorientationandspatiallocation,isalsodefinedandemployedtomodulatethewatermarkintensityforeachwaveletcoefficient.Theproceduretodetectandextractamulti-bitmessageisthenhighlighted,andanextensivesetofexperimentalresultsisprovidedtotestifytheeffectivenessoftheproposedschemeinembeddingwatermarksintoHDRimageswithoutaffectingthevisualqualityoftheoriginalimage,whilebeingrobustagainstTMOs.

8655-7, Session 2

Body-part estimation from Lucas-Kanade tracked Harris pointsVladimirPribula,RoxanneL.Canosa,RochesterInstituteofTechnology(UnitedStates)

Skeletonestimationfromsingle-cameragrayscaleimagesisgenerallyaccomplishedusingmodel-basedtechniques.Multiplecamerasaresometimesused;however,skeletalpointsextractedfromasinglesubjectusingmultipleimagesareusuallytoosparsetobehelpfulforlocalizingbodyparts.Forthisproject,weuseasingleviewpointwithoutanymodel-basedassumptionstoidentifyacentralsourceofmotion,thebody,anditsassociatedextremities.HarrispointsaretrackedusingLucas-Kanaderefinementwithaweightedkernelfoundfromexpectationmaximization.Thealgorithmtrackskeyimagepointsandtrajectoriesandre-representsthemascomplexvectorsdescribingthemotionofaspecificbodypart.Normalizedcorrelationiscalculatedfromthesevectorstoformamatrixofgraphedgeweights,whichissubsequentlypartitionedusingagraph-cutalgorithmtoidentifydependenttrajectories.TheresultingHarrispointsareclusteredintorigidcomponentcentroidsusingmeanshift,andtheextremitycentroidsareconnectedtotheirnearestbodycentroidtocomplete


thebody-partestimation.Wecollectedgroundtruthlabelsfromsevenparticipantsforbodypartsthatarecomparedtotheclustersgivenbyouralgorithm.

8655-8, Session 2

Hue processing in tetrachromatic spacesAlfredoRestrepo,Univ.delosAndes(Colombia)

Inthetetrachromatichypercubetheluminanceisgivenbythemidrangeµofthetetradofcoordinatesofacolourpointwhilethechromaticsaturationisgivenbytherangeofthetetrad.

Theboundaryofthetetrachromatichypercubeconsistsof8solidcubes.Twotypesofclosedsurfacethatareunionsofthesquarefacesofthecubesinthisboundaryaretobefound:topologicalspheresandtopologicaltori.Bymakingthecoordinatesofthepointsdependantinspecialwaysonsuchsurfaces,tetrachromatichuesresult.Atetrachromatichueis2-dimensionaland,togetherwiththeluminanceandthechromaticsaturation,ituniquelyspecifiesatetradinthehypercube.Thesehuesurfacescanbeexploitedtovisualizetetrachromaticimagesinsuchawaythatcertainaspectsaremadeconspicuous.Thehuemaythenbeprocessedbyautomorphismseitherofahuesphereorofahuetorus.Itispossibletoidentify8(intersecting)dodecahedraandalsothree(intersecting)PLtori,soitispossibletodefineahueinmultipleways.Asthehuesurfacesarerotatedorotherwiseautomorphed,thecoloursofatetrachromaticimagearechangedininterestingways.Theautomorphismsrespectthecontinuity;therotationsrespecttheantipodicityorcomplementarycoloursaswell.Byprocessingthehueofatetrachromaticimage(forexample,a4-spectralsatelliteimage)andthenvisualizingitbyfeedingtheRGBchannelsofanormalvisualizingsystemwith3ofthe4bandsofthetetrachromaticimage,asiscommonlydoneinsatelliteimaginery,di?erentaspects

8655-9, Session 3

Decomposition of satellite derived images for the distinction of cloud types featuresJulesR.Dim,HiroshiMurakami,JapanAerospaceExplorationAgency(Japan)

Linearfilteringmethodsusingconvolutiontechniquesareappliedincomputervision,todetectspatialdiscontinuitiesintheintensityofluminanceinphotographsimages.Thesetechniquesarebasedontheprincipalthat,apixel’sneighborhoodinanimage,containsinformationaboutitsintensity.Thevariationofthisintensityprovidesinformationaboutthedistributionandpossibledecompositionoftheimageinspecificfeaturesbasedontherelativepositionofthepixelattheedgeornotintheimage.Theuseoftheseprincipalsonpolarorbitsatellitederivedthermalimagescanprovidealternativemethodstocloudtypes’classifications,oftenbasedonopticalorthermodynamicpropertiesoftheclouds.Aproductfromsatelliteinfraredimages,theCloudtoptemperatures(CTT),isusedtocomputelocalhistogramedgegradients.Thesegmentationoftheseedgegradientshistogramsandtheirinterpretationbasedonasatelliteremotesensinganalysesallowsthedistinctionofmorphologicalcloudfeatures,whichcombinedtothealtitudeofoccurrenceofthecloudyieldaseparationoftheimageindifferentcloudtypes.Theinterpretationofthedistributionofthesecloudmorphologicalfeatures,andtheirfrequencyisevaluatedwithanothercloudclassificationmethod(basedoncloudopticalproperties)inordertounderstandtheimplicationsoftheresultsobtainedonageneraldiscussionaboutclimatevariations.

8655-10, Session 3

Locally tuned inverse sine nonlinear technique for color image enhancementSaibabuArigela,VijayanK.Asari,Univ.ofDayton(UnitedStates)

Inthispaper,anovelinversesinenonlineartransformationbased

imageenhancementtechniqueisproposedtoimprovethevisualqualityofimagescapturedinextremelightingconditions.Thismethodisadaptivetothelocalinformationofapixelandsimpletoimplement.Theproposedtechniqueconsistsoffourmainstagesnamelyhistogramadjustment,dynamicrangecompression,contrastenhancementandnonlinearcolorrestoration.Histogramadjustmentoneachspectralbandisperformedtobelittletheeffectofillumination.Dynamicrangecompressionisaccomplishedbyaninversesinenonlinearfunctionwithalocallytunableimagedependentparameterbasedonthelocalstatisticsofeachpixel’sneighborhoodregionsoftheluminanceimage.Anonlinearcolorrestorationprocessbasedonthechromaticinformationandluminanceoftheoriginalimageisemployed.Astatisticalquantitativeevaluationisperformedwiththestateofthearttechniquestoanalyzeandcomparetheperformanceoftheproposedtechnique.Theproposedtechniqueisalsotestedonfacedetectionincomplexlightingconditions.Theresultsofthistechniqueonimagescapturedinhazy/foggyweatherenvironmentarealsopresented.Theevaluationresultsconfirmthattheproposedmethodcanbeappliedtosurveillance,securityapplicationsincomplexlightingenvironments.

8655-12, Session 3

Fusing electro-optic and infrared signals for high resolution visible images: part IXiaopengHuang,StevensInstituteofTechnology(UnitedStates);RaviNetravali,ColumbiaUniv.(UnitedStates);HongMan,VictorLawrence,StevensInstituteofTechnology(UnitedStates)

NoAbstractAvailable

8655-13, Session 4

Embedding high dynamic range tone mapping in JPEG compressionJianLiu,TheUniv.ofAkron(UnitedStates);FirasHassan,OhioNorthernUniv.(UnitedStates);JoanE.Carletta,TheUniv.ofAkron(UnitedStates)

Amethodthatintegratestonemappingforhighdynamicrange(HDR)gray-scaleimageswithJPEGcompressionisproposed.Thetonemappingoperator(TMO)isblock-based,andstructuredsothatthesamediscretecosinetransform(DCT)thatisusedfortheJPEGcompressionservestocompleteamajorpartofthetone-mappingoperation.SimulationshavebeendoneonhighdynamicrangeimagesfromtheDebeveclibrary.Experimentalresultsshowthetechniquesuccessfullytonemapsandcompressessimultaneously;thenumberofbitsperpixelisreducedfrom32toanaverageof0.67bythecompression,withanaveragePSNRof56.3dBforthecompressedtone-mappedimagescomparedtoimagesthathavebeenonlytone-mapped.Theoutputoftheproposedmethodisanimagethatrequiresonlylimitedstoragespace,andcanbedecompressedwithastandardJPEGdecoder.

8655-14, Session 4

Formulation, analysis, and hardware implementation of chaotic dynamics based algorithm for compression and feature recognition in digital imagesChanceM.Glenn,AlabamaA&MUniv.(UnitedStates)andRochesterInstituteofTechnology(UnitedStates);SrikanthMantha,SajinGeorge,DeeptiAtluri,AntonioMondragon,RochesterInstituteofTechnology(UnitedStates)

Inthispresentationwewilldiscusstheutilizationofasetofwaveformsderivedfromchaoticdynamicalsystemsforcompressionandfeaturerecognitionindigitalimages.Wewillalsodescribethedesignandtestingofanembeddedsystemsimplementationofthealgorithm.Wewillshowthatalimitedsetofcombinedchaoticoscillationsare

Conference 8655


sufficienttoformabasisforthecompressionofthousandsofdigitalimages.Wewilldemonstratethisintheanalysisofimagesextractedfromthesolarheliosphericobservatory(SOHO),showingthatweareabletodetectcoronalmassejections(CMEs)inquadrantsoftheimagedataduringaseveresolarevent.Weundertakehardwaredesigninordertooptimizethespeedofthealgorithm,takingadvantageofitsparallelnature.WecomparethecalculationspeedofthealgorithmincompiledC,enhancedMatlab,Simulink,andinhardware.

8655-15, Session 4

Quality constraint and rate-distortion optimization for predictive image codersKhouloudSamrouth,FrançoisPasteau,OlivierDeforges,InstitutNationaldesSciencesAppliquéesdeRennes(France)

Nextgenerationsofimageandvideocodingmethodsshouldofcoursebeefficientintermsofcompression,butalsoproposeadvancedfunctionalities.Amongthesefunctionalitiessuchasscalability,lossyandlosslesscoding,dataprotection,RateDistortionOptimization(RDO)andRateControl(RC)arekeyissues.RDOaimsatoptimizingcompressionperformances,whileRCmechanismenablestoexactlycompressatagivenrate.AlesscommonfunctionalitythanRC,butcertainlymorehelpful,isQualityControl(QC):theconstraintisheregivenbythequality.Inthispaper,weintroduceajointsolutionforRDOandQCappliedtoastillimagecodeccalledLocallyAdaptiveResolution(LAR),providingscalabilitybothinresolutionandSNRandbasedonamulti-resolutionstructure.Thetechniquedoesnotrequireanyadditionalencodingpass.Itreliesonamodelingandestimationofthepredictionerrorsobtainedinanearlywork.First,qualityconstraintisappliedandpropagatedthroughthewholeresolutionlevelscalledpyramid.Then,thequantizationparametersarededucedconsideringinterandintrapyramidlevelrelationships.Resultsshowthatperformancesoftheproposedmethodareveryclosetoanexhaustivesearchsolution.

8655-16, Session 5

Visual quality analysis for images degraded by different types of noiseNikolayPonomarenko,VladimirV.Lukin,OlegIeremeyev,NationalAerospaceUniv.(Ukraine);KarenO.Egiazarian,JaakkoAstola,TampereUniv.ofTechnology(Finland)

Imagescorruptedbydifferenttypesofnoiseareconsidered.VisualqualityforequalinputMSEbutthreetypesofthenoise,namely,Poisson,additiveandmultiplicative,isinspected(assessedinexperimentsbyobservers)andcharacterizedbyvisualqualitymetrics.ColorimagesfromthedatabaseTID2008areexploited.Itisshownthatmetricsdonotadequatelydescribeimagevisualqualityfortheconsideredsituation.Possiblereasonsforthisareanalyzed.Themainreasonsaremaskingeffectsintexturalregionsandobserverattentiontospecialobjectspresentinimages.

8655-17, Session 5

Graph cut and image intensity-based splitting improves nuclei segmentation in high-content screeningMuhammadFarhan,PekkaRuusuvuori,TampereUniv.ofTechnology(Finland);MarioEmmenlauer,PauliRämö,Univ.Basel(Switzerland);OlliYli-Harja,TampereUniv.ofTechnology(Finland);ChristophDehio,Univ.Basel(Switzerland)

Quantificationofphenotypesinhigh-contentscreeningexperimentsdependsontheaccuracyofsinglecellanalysis.Insuchanalysisworkflows,cellnucleisegmentationistypicallythefirststepandisfollowedbycellbodysegmentation,featureextraction,andcomplicateddataanalysisworkflows.Therefore,itisofutmost

importancethatthefirststepsofhigh-contentanalysisaredonecorrectlyinordertoguaranteecorrectnessofthefinalanalysisresults.Existingnucleisegmentationmethods,whenappliedindividually,tendtofuseaswellastosplitnucleialongwithgivingsuboptimalseparationoftouchingnuclei.Thisresultsinlossofmanyinterestingbiologicalphenotypes.Inabilitytofindallthephenotypescausestheirmisclassificationwhichleadstoinaccuratesubsequentbiologicalanalysis.Here,weproposedanovelcellnucleisegmentationframeworkwhichexploitsrobustnessofgraphcuttoobtaininitialsegmentationforimageintensity-basedclumpsplittingmethodtodelivertheaccurateoverallsegmentation.Byusingquantitativebenchmarksandqualitativecomparisonofreal-worldimageswithcomplicatedmultinucleatecells,weshowthatourmethodoutperformstheotherstate-of-the-artnucleisegmentationmethods.Moreover,weprovideamodularandeasy-to-useimplementationofthemethodforawidelyusedplatformtomakeitavailableforroutineuseinhigh-contentimageanalysis.

8655-18, Session 5

Near real-time skin deformation mappingSteveT.Kacenjar,LockheedMartinCorp.(UnitedStates);SuzieChen,Rutgers,TheStateUniv.ofNewJersey(UnitedStates);MadihaJafri,LockheedMartinCorp.(UnitedStates);BrianWall,RutgersUniversity(UnitedStates);RichardPedersen,LockheedMartinCorp.(UnitedStates);RichardBezozoM.D.,MoleSafeUSA(UnitedStates)

Anovelinvivoapproachisdescribedthatprovideslargeareamappingofthemechanicalpropertiesoftheskininhumanpatients.Suchinformationisimportantintheunderstandingofskinhealth,cosmeticsurgery[1],aging,andimpactsofsunexposure.

Currently,severalmethodshavebeendevelopedtoestimatethelocalbiomechanicalpropertiesoftheskin,includingtheuseofaphysicalbiopsyoflocalareasoftheskin(invitromethods)[2,3,and4],andalsotheuseofnon-invasivemethods(invivo)[5,6,and7].Allsuchmethodsexaminelocalizedareasoftheskin.

Ourapproachexaminesthelocalelasticpropertiesviathegenerationoffielddisplacementmapsoftheskincreatedusingtime-sequenceimaging[9]with2Ddigitalimagingcorrelation(DIC)[10].Inthisapproach,largeareasoftheskinarereviewedrapidly,andskindisplacementmapsaregeneratedshowingthecontourmapsofskindeformation.Thesemapsarethenusedtopreciselyregisterskinimagesforpurposesofdiagnosticcomparison.

Thispaperreportsonourmappingandregistrationapproach,anddemonstratesitsabilitytoaccuratelymeasuretheskindeformationthroughadescribednullinginterpolationprocess.TheresultoflocaltranslationalDICalignmentiscomparedusingthisinterpolationprocess.TheeffectivenessoftheapproachisreportedintermsofresidualRMS,imageentropymeasures,anddifferentialsegmentedregionalerrors.

8655-20, Session 6

Object segmentation using graph cuts based edges featuresYukiMasumoto,WeiweiDu,NobuyukiNakamori,KyotoInstituteofTechnology(Japan)

Thepaperpresentsasimplegraphcutsalgorithmbasededgesfeaturestoobjectsegmentationproblems.Theusergivessomescribblestobackgroundandforegroundofanimage.Gaussianmixturemodels(GMMs)arebuiltbasedonthescribbles.Thepixelwithoutscribblebelongstothebackgroundortheforegrounddependingontherelativeprobabilityofeachpixel.ThecontributionofourpaperistoaddedgesfeaturestoGMMs.TheapproachisappliedwithimagesfromtheGrabcutssegmentationdatabase.Theapproachissuitableforimageswithnoiseandintheforegroundandbackgroundwithsimilarcolors.

Conference 8655


8655-21, Session 6

A hybrid skull-stripping algorithm based on adaptive balloon snake modelsHung-TingLiu,TonyW.H.Sheu,Herng-HuaChang,NationalTaiwanUniv.(Taiwan)

Skull-strippingisoneofthemostimportantpreprocessingstepsinneuroimageanalysis.Weproposedahybridalgorithmbasedonanadaptiveballoonsnakemodeltohandlethischallengingtask.Theproposedframeworkconsistsoftwostages:first,thefuzzypossibilisticc-means(FPCM)isusedforvoxelclustering,whichprovidesalabeledimageforthesnakecontourinitialization.Atthesecondstage,thecontourisinitializedoutsidethebrainsurfacebasedontheFPCMresultandevolvesundertheguidanceoftheballoonsnakemodel,whichdrivesthecontourwithanadaptiveinwardnormalforcetocapturetheboundaryofthebrain.ThesimilarityindicesindicatesthatourmethodoutperformedtheBSEandBETmethodsinskull-strippingtheMRimagevolumesintheIBSRdataset.Experimentalresultsshowtheeffectivenessofthisnewschemeandpotentialapplicationsinawidevarietyofskull-strippingapplications.

8655-47, Session 6

Approximations to camera sensor noiseKeigoHirakawa,XiaodanJin,KeigoHirakawa,Univ.ofDayton(UnitedStates)

Noiseisa“necessaryevil”inimageacquisitionthatispresentinallimagesensordata.Poissondistributionissaidtomodelthestochasticnatureofthephotonarrivalprocess,whileitiscommontoapproximatereadout/thermalnoisebyadditivewhiteGaussiannoise(AWGN).Othersourcesofsignal-dependentnoisesuchasFanoandquantizationalsocontributetotheoverallnoiseprofile.Questionremains,however,abouthowbesttomodelthecombinedsensornoise.

ThoughadditiveGaussiannoisewithsignal-dependentnoisevarianceandPoissoncorruptionaretwowidelyusedmodelstoapproximatetheactualsensornoisedistribution,thethejustificationgiventothesetypesofmodelsarebasedonlimitedevidence.Thegoalofthispaperistoprovideamorecomprehensivecharacterizationofrandomnoise.

WeconcludedbypresentingconcreteevidencethatPoissonmodelisabetterapproximationtorealcameramodelthanAWGN.WesuggestfurthermodificationtoPoissonthatmayimprovethenoisemodel.

8655-22, Session 7

Fast noise variance estimation by principal component analysisStanislavPyatykh,LeiZheng,JürgenHesser,HeidelbergUniv.(Germany)

Noisevarianceestimationisrequiredinmanyimagedenoising,compression,andsegmentationapplications.Inthiswork,weproposeafastnoisevarianceestimationalgorithmbasedonprincipalcomponentanalysisofimageblocks.First,werearrangeimageblocksintovectorsandcomputethecovariancematrixofthesevectors.Then,weuseBartlett’stestinordertoselectthecovariancematrixeigenvalues,whichcorrespondonlytonoise.Thisallowsestimatingthenoisevarianceastheaverageoftheseeigenvalues.Sincethemaximumpossiblenumberofeigenvaluescorrespondingtonoiseisutilized,itisenoughtoprocessonlyasmallnumberofimageblocks,whichallowsreductionoftheexecutiontime.Theblockstoprocessareselectedfromimageregionswiththesmallestvariance.Duringourexperimentsinvolvingsevenstateoftheartmethods,theproposedapproachwassignificantlyfasterthanthemethodswithsimilarorhigheraccuracy.Meanwhile,therelativeerrorofourestimatorwasalwayslessthan15%.Wealsoshowthattheproposedmethodcanprocessimageswithouthomogeneousareas.

8655-23, Session 7

Spatial-temporal noise reduction method optimized for real-time implementationIlyaV.Romanenko,Apical(UnitedKingdom);EranEdirisinghe,LoughboroughUniv.(UnitedKingdom);DanielLarkin,Apical(UnitedKingdom)

Imagede-noisinginspatial-temporaldomainhasbeenaproblemstudiedin-depthinthefieldofdigitalimageprocessing.Howeveralgorithmiccomplexityoftenleadstohighhardwareresourceandmemoryusage,orincreasedcomputationalcomplexity,makingtheirpracticaluseimpossible.Inourresearchwehaveaddressedtheseproblemswithanimplementationofapracticalspatial-temporalde-noisingalgorithm.Spatial-temporalfilteringisperformedinBayerRAWdataspace,whichallowsustobenefitfrompredictablesensornoisecharacteristicsandreducememorybandwidthrequirements.Proposedalgorithmefficientlyremovesdifferenttypesofnoiseinawiderangeofsignaltonoiseratios.InouralgorithmthelocalmotioncompensationwasperformedinBayerRAWdataspace,whilepreservingtheresolutionandeffectivelyimprovingsignaltonoiseratiosofmovingobjects.

Themainchallengefortheuseofspatial-temporalnoisereductionalgorithmsinvideoapplicationsisthecompromisemadebetweenthequalityofthemotionpredictionandthecomplexityofthealgorithmandrequiredmemorybandwidth(andconsequentiallythecostofintegration).Inphotoandvideoapplicationsitisveryimportantthatmovingobjectsshouldstaysharp,whilethenoiseisefficientlyremovedinbothstaticbackgroundandmovingobjectareas.

TakingintoaccounttheachievableimprovementinPSNR(onthelevelofthebestknownnoisereductiontechniques)andlowalgorithmiccomplexity,enablingitspracticaluseincommercialvideoapplications,theresultsofourresearchcanbeveryvaluable.

8655-24, Session 7

Evolution of image regularization with PDEs toward a new anisotropic smoothing based on half kernelsBaptisteMagnier,PhilippeMontesinos,EcoledesMinesd’Alès(France)

ThispaperisdedicatedtoanewanisotropicdiffusionapproachforimageregularizationbasedonagradientandtwodiffusiondirectionsobtainedfromhalfGaussiankernels.

Thisapproachresultsinsmoothinganimagewhilepreservingedges.

Fromananisotropicedgedetector,builtofhalfGaussianderivativekernels,weintroduceanewsmoothingmethodpreservingstructureswhichdrivesthediffusionfunctionoftheanglebetweenthetwoedgedirectionsandthegradientvalue.

Duetothetwodirectionsdiffusionusedinthecontrolfunction,ourdiffusionschemeenablestopreserveedgesandcorners,contrarytootheranisotropicdiffusionmethods.

Moreover,parametersoftheGaussiankernelcanbetunedtobesufficientlythinextractingpreciselyedgeswhereasitslengthallowsdetectingincontourorientationswhichleadstoacoherentimageregularization.

Finally,wepresentsomeexperimentalresultsanddiscussaboutthechoiceofthedifferentparameters.

8655-25, Session 7

Poisson shot noise parameter estimation from a single scanning electron microscopy imageStephenKockentiedt,Otto-von-Guericke-Univ.Magdeburg(Germany)andFederalInstituteforOccupationalSafetyandHealth

Conference 8655


(Germany);KlausD.Tönnies,Otto-von-Guericke-Univ.Magdeburg(Germany);ErhardtGierke,NicoDziurowitz,CarmenThim,SabinePlitzko,FederalInstituteforOccupationalSafetyandHealth(Germany)

Scanningelectronmicroscopy(SEM)hasanextremelylowsignal-to-noiseratioleadingtoahighlevelofshotnoisewhichmakesfurtherprocessingdifficult.Unlikeoftenassumed,thenoisestemsfromaPoissonprocessandisnotGaussianbutdependsonthesignallevel.

Amethodtoestimatethenoiseparametersofindividualimagesshouldbefound.

UsingstatisticalmodelingofSEMnoise,arobustoptimalnoiseestimationalgorithmisderived.

Anon-localmeansnoisereductionfiltertunedwiththeestimatednoiseparametersonaverageachievesan18%lowerroot-mean-squareerrorthantheuntunedfilteronsimulatedimages.

Thealgorithmisstableandcanadapttovaryingnoiselevels.

8655-26, Session 8

Parallel algorithms for fast subpixel detection in hyperspectral imageryChungM.Wong,JohnShepanski,StephanieSandor-Leahy,NorthropGrummanAerospaceSystems(UnitedStates)

WepresentparallelalgorithmsforfastsubpixeldetectionoftargetsinhyperspectralimageryproducedbyourHyperspectralAirborneTacticalInstrument(HATI-2500).Theparalleldetectionalgorithmselectedforprocessingthehyperspectraldatacubesisbasedontheadaptivecoherence/cosineestimator(ACE).TheACEdetectorisarobustdetectorthatisbuiltuponthetheoryofgeneralizedlikelihoodratiotesting(GLRT)inimplementingthematchedsubspacedetectortounknownparameterssuchasthenoisecovariancematrix.Subspacedetectorsinvolveprojectiontransformationswhosematricescanbeefficientlymanipulatedthroughmultithreadedmassivelyparallelprocessorsonmoderngraphicsprocessingunits(GPU).TheGPUkernelsdevelopedinthisworkarebasedontheCUDAcomputingarchitecture.Weconstrainthedetectionproblemtoamodelwithknowntargetspectralfeaturesandunstructuredbackground.Theprocessingincludesthefollowingsteps:1)quickscaleandoffsetcorrectiontoreducesystemnoiseinthecollecteddata,2)updatethemeanandnoisecovariancematricesusingmostrecentlycollecteddata;and3)computethegeneralizedlikelihoodratioforeachpixelforbinaryhypothesistesting.WepresenttheperformanceimprovementsobtainedusingGPUanddiscusspossiblearchitecturesforimplementationofsuchprocessingsystemforgeneralon-boardairbornehyperspectralimagingapplications.

8655-27, Session 8

Vasculature segmentation using parallel multi-hypothesis template tracking on heterogeneous platformsDongpingZhang,LeeHowes,AdvancedMicroDevices,Inc.(UnitedStates)

Wepresentaparallelmulti-hypothesistemplatetrackingalgorithmonheterogeneousplatformsusingalayereddispatchprogrammingmodel.Thecontributionofthisworkistwo-fold:anovelapproachtosegmentthevascularlumennetworkfromvolumetricCTAimagesinreal-time;alayereddispatchprogrammingmodeltofreethedevelopersfromhand-craftingmappingstoparticularlyconstrainedexecutiondomainsonthehighthroughputarchitecture.Thisabstractionisdemonstratedthroughthevasculaturesegmentationapplicationandcanalsobeappliedinotherreal-worldapplications.

8655-28, Session 8

IMPAIR: massively parallel deconvolution on the GPUMichaelJ.Sherry,AndrewShearer,NationalUniv.ofIreland,Galway(Ireland)

IMPAIR:MassivelyparalleldeconvolutionontheGPU

TheIMPAIRsoftwareisahighthroughputimagedeconvolutiontoolforprocessinglargeout-of-coredatasetsofimages,varyingfromlargeimageswithspatiallyvaryingPSFstolargenumbersofimageswithspatiallyinvariantPSFs.IMPAIRimplementsaparallelisedversionofthetriedandtestedRichardsonLucydeconvolutionalgorithmregularisedviaacustomwaveletthresholdinglibrary.Itexploitstheinherantlyparallelnatureoftheconvolutionoperationtoachievequalityresultsonconsumergradehardware:throughtheNVIDIATeslaGPUimplementation,themulticoreOpenMPimplementation,andtheclustercomputingMPIimplementationofthesoftware.IMPAIRaddressestheproblemofparalleisationinbothtop-downandbottom-upapproaches:bymanagingtheinputdataattheimagelevel,andbymanagingtheexecutionattheinstructionlevel.Thesecombinedtechniquesleadtoascalablesolutionwithminimalresourceconsumptionandmaximalloadbalancing.IMPAIRisbeingdevelopedasbothastand-alonetoolforimageprocessing,andasalibrarywhichcanbeembeddedintonon-parallelisedcodetotransparentlyprovideparallelhighthroughputdeconvolution.

8655-29, Session 9

Fast texture and structure image reconstruction using the perceptual hashViacheslavV.Voronin,VladimirI.Marchuk,VladimirA.Frantc,DonStateTechnicalUniv.(RussianFederation);KarenO.Egiazarian,TampereUniv.ofTechnology(Finland)

Thispaperfocusesonthefasttextureandstructurereconstructionofimages.Theproposedmethod,appliedtoimages,consistsofseveralsteps.ThefirstdealswiththeextractedtexturalfeaturesoftheinputimagesbasedontheLaw’senergy.Thepixelsarounddamagedimageregionsareclusteredusingthesefeatures,thatallowtodefinethecorrespondencebetweenpixelsfromdifferentpatches.Second,cubicsplinecurveisappliedtoreconstructstructureandconnectedgeandcontourindamagedarea.Thechoiceofthecurrentpixeltoberecoveredisdecidedusingthefastmarchingapproach.TheTeleamethodormodificationsoftheexemplarbasedmethodareusedafterthisdependingontheclassificationoftheregionswhereto-be-restoredpixelislocated.Inmodificationtoquicklyfindpatchesweuseperceptualhash.Suchstrategyallowstogetsomedatastructurecontainingthehashesofsimilarpatches.Thisenablesustoreducethesearchproceduretotheprocedurefor“calculations”ofthepatch.Theproposedmethodistestedonvarioussamplesofimages,withdifferentgeometricalfeaturesandcomparedwithstate-of-the-artimageinpaintingmethods;theproposedtechniqueisshowntoproducebetterresultsinreconstructionofmissingsmallandlargeobjectsonthetestimages.

8655-30, Session 9

Improved multichannel up-sampling method for reconstruction based super-resolutionSeunghoonJee,MoonGiKang,YonseiUniv.(Korea,Republicof)

Inreconstructionbasedsuper-resolution,itisanimportantissuetoup-sampleandtomergethehigh-frequencyinformationcontainedinlow-resolutionimagesefficientlyandwithoutartifact.Theconventionalup-samplingmethods,whichusedforregisteringlow-resolutiondatatohigh-resolutiongrid,havethedifferencebetweenthemandidealup-samplinginobservationmodeling.Inthispaper,weanalyzethedifferenceandproposeanewup-samplingandmergingmethodwhichisabletoincorporatelow-resolutionimageswithoutlossofthehighfrequencydataandminimizestheartifactcausedbydatainsufficiency.

Conference 8655


Bythismethod,regionsthattheregisteredhighfrequencydatadonotcoverarenaturallyregularizedwithoutusingcomplexregularizers.Theexperimentalresultsshowthatthechoiceofup-samplingsignificantlyaffectsthequalityofresultingimagesandtheproposedup-samplingmethodgivesbetterresultscomparedwithconventionalsimpleup-samplingmethod.

8655-31, Session 9

Kernel-based image upscaling method with shooting artifact reductionChulHeePark,JoonyoungChang,MoonGiKang,YonseiUniv.(Korea,Republicof)

Thispaperdescribestheinterpolationalgorithmwhichcontainsshootingorringingartifactsuppressionbasedonwindowedsincinterpolator.Ingeneral,thewindowedsincinterpolatorcanachievebetterperformancebyusingwiderwindow.However,usingwidewindowcausesmoreripplesthatproduceunwanteddefectssuchasringingorshootingartifact.Therefore,shootingreductiontechniqueisproposedinthispaperforusingwiderwindowstoimprovetheperformancewithoutshootingartifact.Theproposedalgorithmcansuppressshootingartifactbyusingmediansincinterpolatoranditcanbealsousedasapostprocessorformanykernel-basedinterpolationmethods.Theresultedimageshowsthattheproposedalgorithmcanmaintainlocaldetailsandsuppressshootingartifactintheimagewell.

8655-32, Session 10

A stochastic approach for non-rigid image registrationIvanA.Kolesov,JehoonLee,PatricioVela,GeorgiaInstituteofTechnology(UnitedStates);AllenTannenbaum,BostonUniv.(UnitedStates)

Thisnotedescribesanon-rigidimageregistrationapproachthatparametrizesthedeformationfieldby

anadditivecompositionofarigidtransformationandasetofGaussianradialbasisfunctions.Thebases’centers,variances,andweightsaredeterminedwithaglobaloptimizationapproachthatisintroduced.Thisapproachiscomposedofsimulatedannealingwithaparticlefilterbasedgeneratorfunctiontoperformtheoptimization.Theregistrationprocessisimplicitlyregularizedbylimitingthenumberofbasesmakingupthedeformation.Further,aconstraintonthedeformationisenforcedwhoseroleistoensurephysicallymeaningfulfields(i.e.,invertible).Resultson2Dand3Ddatademonstratethealgorithm’srobustnesstolargedeformations.

8655-33, Session 10

Video inpainting using scene model and object trackingVladimirA.Frantc,VyatcheslavV.Voronin,VladimirI.Marchuk,DonStateTechnicalUniv.(RussianFederation);KarenO.Egiazarian,TampereUniv.ofTechnology(Finland)

Thispaperdescribesanovelapproachforvideorestoration.Ourmethodisbasedonscenemodelandobjecttracking.Italsousescameraegomotionestimationforcompensatecamerachangeofcamerapositionandparameters.Proposedapproachallowtoremoveobjectsorrestoremissingortaintedregionspresentinavideosequencebyutilizingspatialandtemporalinformationfromneighboringscenes.Thealgorithmiterativelyperformsfollowingoperations:achieveframe;updatethescenemodel;updatepositionsofmovingobjects(thisstepusethecondensationalgorithm);replacepartsoftheframeoccupiedbytheobjectsmarkedforremovewithuseofabackgroundmodel.Asapossiblemodelofbackgroundweconsideredrunningaverage,codebookandsomeothers.Themainadvantageofthisapproachistheabilitytoworkinstreamingmode.Thereisnoneedtoknowthe

wholesequencetorestoreit.Wedemonstratetheperformanceofanewapproachviaseveralexamples,showingtheeffectivenessofouralgorithmandcomparedwithstate-of-the-artvideoinpaintingmethods.

8655-34, Session 10

Fast DCT-based algorithm for signal and image accurate scalingLeonidBilevich,LeonidP.Yaroslavsky,TelAvivUniv.(Israel)

AnewDCT-basedalgorithmforsignalandimagescalingbyarbitraryfactorispresented.Thealgorithmisvirtuallyfreeofboundaryeffectsandimplementsthediscretesinc-interpolation,whichpreservesthespectralcontentofthesignal,andthereforeisfreefrominterpolationerrors.BeingimplementedthroughthefastFFT-typeDCTalgorithm,thescalingalgorithmhascomputationalcomplexityofO(N*log(sigma*N))operationsperoutputsample,whereNandsigma*Narenumberofsignalinputandoutputsamples,correspondingly.

8655-1, Session IPI

A new set of wavelet- and fractals-based features for Gleason grading of prostate cancer histopathology imagesClaraM.MosqueraLopez,SosS.Agaian,TheUniv.ofTexasatSanAntonio(UnitedStates)

Prostatecancerdetectionandstagingisanimportantsteptowardspatienttreatmentselection.Advancementsindigitalpathologyallowtheapplicationofnewquantitativeimageanalysisalgorithmsforcomputer-assisteddiagnosis(CAD)ondigitizedhistopathologyimages.Inthispaper,weintroduceanewsetoffeaturestoautomaticallygradepathologicalimagesusingthewell-knownGleasongradingsystem.ThegoalofthisstudyistoclassifybiopsyimagesbelongingtoGleasonpatterns3,4,and5byusingacombinationofwaveletandfractalfeatures.ForimageclassificationweusepairwisecouplingSupportVectorMachine(SVM)classifiers.Theaccuracyofthesystem,whichiscloseto97%,isestimatedthroughthreedifferentcross-validationschemes.Theproposedsystemoffersthepotentialforautomatingclassificationofhistologicalimagesandsupportingprostatecancerdiagnosis.


Method and architecture for quantification of bone structure using microscopic image slices SunderamKrishnan,SosS.Agaian,TheUniv.ofTexasatSanAntonio(UnitedStates);DanaMecke,SergioMontelongo,WangXiaodu,Universityoftexassanantonio(UnitedStates)

Thispaperpresentsanewsystemthatreconstructs,visualizesandclassifiestrabecularbonestructurebyusingmicroscopicimageslices.Inthisstudy,weevaluatedthestructureofatrabecularboneusing3DX-rayimagingafterpassingthroughthespecialimageenhancementandde-nosingalgorithms.Weproposeanewsimpleimagingtechniquetoolforthequantificationofstructuralchangeswithinthemicroarchitectureofhumanbonesbyenhancingthecharacteristicsattributesofthebonearchitecturefromµ-CTscans.Computersimulationillustratesthatthepresentedimagingtechniquehasthepotentialtobecomeapowerfultooltoinvestigatethestructureoftrabeculaeduringinvivomeasurements.

Conference 8655



Active shape models and depth for image registrationColinBellmore,RoxanneL.Canosa,RochesterInstituteofTechnology(UnitedStates)

Mostcurrentregistrationtechniquesrelyonmatchingpatchesorpointsdirectlyfrominformationobtainedfrommultipleimages.Thistechniquebecomesinfeasiblewhenimagesourcesareindependentofoneanotherandcommonlandmarkscannotbeguaranteed.Toovercomethedifficultiesinherentincorrelatingimagepatchesobtainedfrommultiplesensorsandintheabsenceofreallandmarks,weproposeusinganon-rigiddeformationtechniquebasedonstatisticalshapeinformation.Thistechniqueisappliedtorealtimevideoandusesanactiveshapemodel(ASM)toassesscurvatureandlearndiscriminatingpointsonaper-channelbasis.Thecombinationofindependentchannelsforregistrationisusefulforothertechniquesthatrequirewell-registeredimagesbutdonothaveaccesstoreliablelandmarks.ThealgorithmwastestedonASMstrainedonlow-resolutionfacialimagesgeneratedfromaninexpensivecommerciallyavailableRGB+Dsensor.Themeanfittingtimeofour14-pointASMmodelwassignificantlyreducedascomparedtoothercurrentfacefindingmodels.Modelannotation,modelbuilding,andmodelfittingallfunctionedcorrectlytogethertoregisterthecoloranddepthimages.ThesystemsuccessfullygeneratedastreamoffilteredcoloranddepthimageswithfaciallandmarkpointsfittedbythetrainedASM.


Principal component analysis for through wall image enhancementMuhammadM.Riaz,AbdulGhafoor,NationalUniv.ofSciencesandTechnology(Pakistan)

NoAbstractAvailable


Parallel GPGPU stereo matching with an energy-efficient cost function based on normalized cross correlationSaralaArunagiri,JaimeJaloma,PatriciaJ.Teller,TheUniv.ofTexasatElPaso(UnitedStates)

Stereomatchingisaheavilyinvestigatedtopicincomputervisionbecauseofitswiderangeofapplicabilityanditshighcomputationalcomplexity.Inthispaperweexploreanenergy-efficienttechniquethatissuitableforparallel(GPGPU)stereomatchingofremotely-sensedradarimages.Amajorissuewithimagescapturedbyremotesensing,e.g.,byairborneradars,isthattheyarelikelytohavespecklenoise,whichisundesiredinformationthatcontaminatesanddistortstheimage.Althoughtherearefiltersthatalleviatetheeffectsofspecklenoise,theyintroduceacertainamountofimagedistortion.IthasbeendemonstratedthatcostfunctionsbasedonNormalizedCrossCorrelation(NCC)canbeusedforaccuratestereomatchinginthepresenceofspecklenoise,thus,weexploredsuchafunctionforpassivestereomatching.Accordingly,thispaperpresentsanewNCC-basedcostfunctionthatminimizesthenumberoffloating-pointoperationsutilizedforstereomatchingandcomparesitsperformancetothatoftheoriginalNCCcostfunction.ThisevaluationisachievedviaexperimentsinwhichthesecostfunctionsareemployedbyGPGPUstereomatchingcodesthatusethesimulatedannealingalgorithm.Performancecomparisonisconductedintermsofpowerandenergyconsumption,executiontime,andoutputquality.


Hyperspectral bands prediction based on inter-band spectral correlation structureAymanAhmed,NARSS(Egypt);MohamedE.El.Sharkawy,Egypt-JapanUniv.ofScienceandTechnology(Egypt);SalwaH.Elramly,AinShamsUniv.(Egypt)

Hyperspectralimaginghasbeenwidelystudiedinmanyapplications;notablyinclimatechanges,vegetation,anddesertstudies.However,suchkindofimagingbringsahugeamountofdata,whichrequirestransmission,processing,andstorageresourcesforbothairborneandspaceborneimaging.Compressionofhyperspectraldatacubesisaneffectivesolutionfortheseproblems.Losslesscompressionofthehyperspectraldatausuallyresultsinlowcompressionratio,whichmaynotmeettheavailableresources;ontheotherhand,lossycompressionmaygivethedesiredratio,butwithasignificantdegradationeffectonobjectidentificationperformanceofthehyperspectraldata.Moreover,mosthyperspectraldatacompressiontechniquesexploitsthesimilaritiesinspectraldimensions;whichrequiresbandsreorderingorregrouping,tomakeuseofthespectralredundancy.Inthispaper,weanalyzethespectralcrosscorrelationbetweenbandsforAVIRISandHyperionhyperspectraldata;spectralcrosscorrelationmatrixiscalculated,assessingthestrengthofthespectralmatrix,weproposenewtechniquetofindhighlycorrelatedgroupsofbandsinthehyperspectraldatacubebasedon“interbandcorrelationsquare”,andfinally,weproposeanewtechniqueofbandregroupingbasedoncorrelationvaluesweightsfordifferentgroupofbandsasnetworkofcorrelation.


A GPU based implementation of direct multi-bit search (DMS) screen algorithmBarryM.Trager,IBMThomasJ.WatsonResearchCtr.(UnitedStates);KartheekChandu,InfoPrintSolutionsCo.(UnitedStates);ChaiWahWu,IBMThomasJ.WatsonResearchCtr.(UnitedStates);MikelJ.Stanich,InfoPrintSolutionsCo.(UnitedStates)

Inthispaper,westudythefeasibilityforusingprogrammableGraphicsProcessingUnit(GPU)technologyforimagehalftoning,inparticularimplementingthecomputationallyintenseDirectMulti-bitSearch(DMS)Screenalgorithm.Multi-bitscreeningisanextensionofbinaryscreening,inwhicheverypixelincontinuous-toneimagecanberenderedtooneamongmultipleabsorptancelevels.Forexamplea2bitprinteriscapableofprintingwithfourdifferentdropsizes.Inourpreviouswork,wehaveextendedtheDirectBinarySearch(DBS)tothemulti-bitcaseusingDirectMulti-bitSearch(DMS)whereateverypixelthealgorithmchoosesthebestdropabsorptanceleveltocreateavisuallypleasinghalftonepatternwithoutanyuserdefinedguidance.Thisprocessisrepeatedthroughouttheentirerangeofgraylevelswhilesatisfyingthestackingconstrainttocreateahighqualitymulti-bitscreen(dithermask).Inthispaper,weillustratehowemployingGraphicsProcessingUnits(GPU)canspeed-upintensiveDMSimageprocessingoperations.Particularly,weillustratehowdifferentmodulescanbebeenparallelized.ThemaingoalofmanyofthepreviousarticlesregardingDBSistodecreasetheexecutiontimeofthealgorithm.Oneofthemostcommonapproachesistodecreasetheneighborhoodsizeorfiltersize.Theproposedparallelapproachallowsustousealargeneighborhoodandfiltersize,toachievethehighesthalftonequality,whilehavingminimalimpactonperformance.Inaddition,wealsodemonstrateprocessingseveralnon-overlappingneighborhoodsinparallel,byutilizingtheGPU’sparallelarchitecture,tofurtherimprovethecomputationalefficiency.


Ore minerals textural characterization by hyperspectral imagingGiuseppeBonifazi,Univ.degliStudidiRomaLaSapienza(Italy);NicolettaPicone,UnivdegliStudidiRomaLaSapienza(Italy);

Conference 8655


SilviaSerranti,Univ.degliStudidiRomaLaSapienza(Italy)

Theutilisationofhyperspectraldetectiondevices,fornaturalresourcesmapping/exploitationthroughremotesensingtechniques,datesbacktotheearly1970s.Fromthefirstdevicesutilisingaone-dimensionalprofilespectrometerHyperSpectralImaging(HSI)deviceshavebeendeveloped.Thus,fromspecific-customiseddevices,originallydevelopedbyGovernmentalAgencies(e.g.NASA,specialisedresearchlabs,etc.),alotofHSIbaseddevicesaretodayavailableatcommerciallevel.Severalcompanies,infact,develophyperspectralsensors.Paralleltothishugeincreaseofhyperspectraldevicesdevelopment/manufacturingaddressedtoairborneapplication,astrongincreasealsooccurredindevelopingHSIbaseddevicesfor“ground”utilisationthatissensingunitsabletoplayinsidealaboratory,aprocessingplantand/orinanopenfield.Thankstothisdiffusionmoreandmoreapplicationshavebeendevelopedandtestedinthislastyearsalsointhematerialssectors.Suchanapproach,whensuccessful,isquitechallengingbeingusuallyreliable,robustandcharacterisedbylowercostsifcomparedwiththoseusuallyassociatedtocommonlyappliedanalyticaloff-and/oron-lineanalyticalapproaches.Inthispapersuchanapproachispresentedwithreferencetooremineralscharacterization.Accordingtothedifferentphasesandstagesoforemineralsandproductscharacterization,andstartingfromtheanalysesofthedetectedhyperspectralfirms,itispossibletoderiveusefulinformationaboutmineralflowstreampropertiesandtheirphysical-chemicalattributes.Thislastaspectcanbeutilizedtodefineinnovativeprocessmineralogystrategiesandtoimplementon-lineproceduresatprocessinglevel.Thepresentstudydiscussestheeffectsrelatedtotheadoptionofdifferenthardwareconfigurations,theutilizationofdifferentlogicstoperformtheanalysisandtheselectionofdifferentalgorithmsaccordingtothedifferentcharacterization,inspectionandqualitycontrolactionstoapply.


Colour modification and colour combination in double-cone colour spaceAlfredoRestrepo,Univ.delosAndes(Colombia)

Itisusefultohaveameanstopredictthecolourthatwillresultwhentwobeamsoflightconvergeonagivensurface,forsimulationpurposesforexample.GiventheRGBcoordinatesofthecoloursofthelightbeams,weproposeanarithmeticmodeltogetthecolourcoordinatesofthecombination.Weworkindouble-conespace,whichisreadiyderivedfromRGBcubicspace.Double-conespaceisofthetypehue-saturation-luminanceandthemodelgivesthehue,thesaturationandtheluminanceofthecombination,providedthoseoftheraysthatarebeingcombined.

Double-conespaceresultsfromthespinningoftherange-midrangetriangleaboutitsbase(themidrangeaxis).Therange?oftheRGBtriple,beingameasureofthedistancetotheachromaticsegmentinthecube,measuresthechromaticsaturationwhilethemidrange?,ameasureofthedistancetotheblackpointinthecube,measurestheluminance.Inaddition,weassumethereisarelationbetweenluminanceandintensityofthebeamofthesaturationtype,suchasahyperbolictangentoraNaka-Rushtoncurve;thisisusedtogettheluminanceofthecombinedbeamandtoappropriatelyweightthehueandsaturationofthecombiningbeams.

Alsoindouble-conespace,wegiveformulaeforthemodificationoftheluminanceandthesaturation.


Automatic image and video denoising based on Ising theory EliahuCohen,TelAvivUniv.(Israel);MayaCarmi,RonHeiman,OferHadar,AsafCohen,Ben-GurionUniv.oftheNegev(Israel)

Statisticalmodels,suchastheIsingmodel,havebeenproventobeveryusefulindescribingsolidstatesystemsinphysics.Althoughtheirresultsrelayonprobabilisticbasis,averygoodagreementwasfound

betweenthemandexperimentalresultsobtainedfromexaminingphysicalsystems.Furthermore,Thesemodelscanbeappliedtoavarietyofproblemsinengineering,chemistry,biologyandmore,withoutlosingtheireffectiveness,simplicityandintuitiveness.

Inapreviouswork[1]wehavegreatlyimprovedKandes’model[2]andusedanIsing-likemodelinordertorestorecoloredimagesandvideosdamagedfromvariouskindsofnoise.

Inourrecentworkwewishtopresentimportantalgorithmicdevelopments.Mostofthemobtainedasacombinationofbetterphysicalmodelsandwellknownimagerestorationtechniques.Theproposedmodelanalyzesautomaticallythenoisyimageandchoosestheappropriatemodelparametersaccordingly,withouttheneedofmanualsupport.

Finally,anaverageimprovementof3dBsincomparisontotheoldmodel’sPSNRisachievedandanoverallimprovementofupto7dBsincomparisontoastandarddenoisingperformedbymedianfilter.


A study of non-diagonal models for image white balance Ching-ChunHuang,De-KaiHuang,NationalKaohsiungUniv.ofAppliedSciences(Taiwan)

Whitebalanceisanalgorithmproposedtomimicthecolorconstancymechanismofhumanperception.However,asshownbyitsname,currentwhitebalancealgorithmsonlypromisetocorrectthecolorshiftofgraytonestocorrectpositions;forothercolorvalues,whitebalancealgorithmsprocessthemasgraytonesandthereforeproduceundesiredcolorbiases.Toimprovethecolorpredictionofwhitebalancealgorithms,inthispaper,weproposea3-parameternon-diagonalmodel,namedasPCA-CLSE,forwhitebalance.UnlikemanypreviousresearcheswhichusethevonKriesdiagonalmodelforcolorprediction,weproposedapplyinganon-diagonalmodelforcolorcorrectionwhichaimedtominimizethecolorbiaseswhilekeepingthebalanceofwhitecolor.Inourmethod,toreducethecolorbiases,weproposedaPCA-basedtrainingmethodtogainextrainformationforanalysisandbuiltamappingmodelbetweenilluminationandnon-diagonaltransformationmatrices.Whileacolor-biasedimageisgiven,wecouldestimatetheilluminationanddynamicallydeterminetheillumination-dependenttransformationmatrixtocorrectthecolor-biasedimage.OurevaluationshowsthattheproposedPCA-CLSEmodelcanefficientlyreducethecolorbiases.


A comparison between space-time video descriptorsLucaCostantini,LiciaCapodiferro,FondazioneUgoBordoni(Italy);AlessandroNeri,Univ.degliStudidiRomaTre(Italy)

Videorepresentationtechniquesareusedindatabaseretrieval,indexing,orsceneidentification.Inthecontextofvideorepresentationandindexing,twomainapproachesareadopted.Thefirstoneisbasedonthekeyframedetection:afteratemporalsegmentationofthevideo,thedetectedkeyframesareprocessedasstillimages.Oncethefeaturesareextractedfromeachkeyframe,retrieval,indexingorrecognitionoperationscanbeperformed.Inthesecondapproach,thevideoisrepresentedbyspace-timedescriptors.Inthiscase,thevideoisrepresentedasasetofpointsinahighdimensionalspace,inwhicheverypointsistherepresentationofaspace-timepatch.Thissecondapproachisespeciallyemployedinthehumanactionsrecognitionalgorithms.Thesealgorithmsareveryimportantinmanycomputervisionapplicationssuchasvideosurveillanceorhumancomputerinterfaces.

Conference 8655


The3DZernikepolynomialsareemployedtocomputethe3DZernikemoments,thatcanbeusedasspace-timedescriptorsinsphericalcoordinates.Inthispapertheauthorpresentaspace-timedescriptorincylindricalcoordinates.Ouraimistoinvestigatewhichdescriptor,betweenthesphericalandthecylindrical,ismoreappropriatetobuildanefficientspace-timedescriptor.Ourideaisthatusingtwodifferentspace-timedescriptorbothbasedontheZernikepolynomials,butthefirstonedescribesasphere,andtheotheronedescribesacylinder,allowsustoinvestigateonlythedifferencesbetweensphericaldescriptorsandcylindricaldescriptors.


Face recognition based on logarithmic local binary patternsDebashreeMandal,KarenA.Panetta,TuftsUniv.(UnitedStates);SosS.Agaian,TheUniv.ofTexasatSanAntonio(UnitedStates)

ThispaperpresentsanovelapproachtotheproblemoffacerecognitionthatcombinestheclassicalLocalBinaryPattern(LBP)featuredescriptorswithimageprocessinginthelogarithmicdomainandthehumanvisualsystem.Particularly,wehaveintroducedparameterizedlogarithmicimageprocessing(PLIP)operatorsbasedLBPfeatureextractor.Wealsousethehumanvisualsystembasedimagedecomposition,whichisbasedontheWeber’slawtoextractfeaturesfromthedecomposedimagesandcombinethosewiththefeaturesextractedfromtheoriginalimagestherebyenrichingthefeaturevectorsetandobtainingimprovedratesofrecognition.Comparisonswithothermethodsarealsopresented.ExtensiveexperimentsclearlyshowthesuperiorityoftheproposedschemeoverLBPfeaturedescriptors.Recognitionratesashighas99%canbeachievedascomparedtotherecognitionrateof96.5%achievedbytheclassicalLBPusingtheAT&TLaboratoriesfacedatabase.


Hyperspectral images lossless compression using the 3D binary EZW algorithm Kai-jenCheng,JeffreyDill,OhioUniv.(UnitedStates)

Thispaperproposesatransform

basedimagelosslesscompressionforhyperspectralimageswhichisinspiredbyShapiro(1993)’sEZWalgorithm.TheproposedcompressionmethodusesahybridtransformwhichincludesanintegerKarhunrn-Loevetransform(KLT)anddiscretewavelettransform(DWT).TheintegerKLTisemployedtoeliminatethepresenceofcorrelationsamongthebandsofthehyperspectralimage.Theinteger2Ddiscretewavelettransform(DWT),whichisimplementedbyliftingscheme,isappliedtoeliminatethecorrelationsinthespatialdimensionsandproducewaveletcoefficients.ThesecoefficientsarethencodedbyabinaryEZWalgorithm.ThebinaryEZWcaneliminatethesubordinatepassofconventionalEZWbycodingresidualvaluesandproducingbinarysequences.ThebinaryEZWalgorithmcombinesthemeritsfromtheconventionalEZWandSPIHT,anditiscomputationallysimplerforlosslesscompression.TheproposedmethodisappliedtoAVIRISimagesandcomparedtootherstateofartimagecompressiontechniques.Theresultsshowthattheproposedlosslessimagecompressionismoreefficientanditalsohashighercompressionratiothanotheralgorithms.

Conference 8655


Conference 8656: Real-Time Image and Video Processing 2013Wednesday-Thursday6–7February2013

Part of Proceedings of SPIE Vol. 8656 Real-Time Image and Video Processing 2013


Achieving real-time capsule endoscopy (CE) video visualization through panoramic imagingStevenYi,JeanXie,PeterMui,Xyken,LLC(UnitedStates);JonathanLeighton,MayoClinic(UnitedStates)

Inthispaper,wepresentanovelandreal-timecapsuleendoscopy(CE)videovisualizationalgorithmbasedonpanoramicimaging.TypicalCEvideosrunabout8hoursandaremanuallyreviewedbyphysicianstolocatediseasessuchasbleedingsandpolyps.Todate,thereisnocommerciallyavailabletoolcapableofprovidingstabilizedandprocessedCEvideothatiseasytoanalyzeinrealtime.Theburdenonphysicians’diseasefindingeffortsisthusbig.Infact,sincetheCEcamerasensorhasalimitedforwardlookingviewandlowimageframerate(typical2framespersecond),andcapturesverycloserangeimagingontheGItractsurface,itisnosurprisethattraditionalvisualizationmethodbasedontrackingandregistrationoftenfailstowork.Thispaperpresentsanovelconceptforreal-timeCEvideostabilizationanddisplay.InsteadofdirectlyworkingontraditionalforwardlookingFOV(fieldofview)images,weworkonpanoramicimagestobypassmanyproblemsfacingtraditionalimagingmodalities.Methodsonpanoramicimagegenerationbasedonopticallensprincipleleadingtoreal-timedatavisualizationwillbepresented.Inaddition,non-rigidpanoramicimageregistrationmethodwillbediscussed.


Analysis and characterization of embedded vision systems for taxonomy formulationMuhammadImran,MidSwedenUniv.(Sweden);KhaledBenkrid,TheUniv.ofEdinburgh(UnitedKingdom);KhursheedKhursheed,NaeemAhmad,MattiasO’Nils,NajeemLawal,MidSwedenUniv.(Sweden)

Inthispaper,wepresentedasystemtaxonomywiththeaimofprovidinganabstractreferencemodelofvisionsystems.Centraltoourproposedsystemtaxonomyisalimitednumberofvisionfunctionssufficetocoveralargenumberofrealvisionssystemsreportedintheliterature.Wehavetested20visionssystemsfromtheliteratureagainstourproposedtaxonomy.Theevaluationcriterionfortheproposedsystemtaxonomywasbasedontwoparameters:1)functionspresence,and2)theorderingoffunctionsinthetaxonomyinrelationtotheactualvisionsystemordering.Indeed,oursystemtaxonomyfocusesonthefunctionalityofvisionsystemsratherthantheexactorderingoffunctionswhichdependsonthedetailsoftheapplicationathand.Wehopethatourproposedtaxonomywillprovidearealisticmodelofembeddedvisionsystemswhichcanbedevelopedinthefuturebythecommunityatlarge.Thiswillfacilitatefairerbenchmarkingandevaluationofembeddedvisionsystems,aswellasthedevelopmentofgenericandefficientsolutionsfordifferentgradesorclassesofembeddedvisionsystems,reducingdevelopmentcostoffutureembeddedvisionsystems.


Design and implementation of a real-time image registration in an infrared search and track systemFu-yuanXu,GuohuaGu,NanjingUniv.ofScienceandTechnology(China);TiekunZhao,XI’ANSICONGCHUANGWEIOPTO-

ELECTRONICCO.,LTD.(China);QianChen,WeixianQian,NanjingUniv.ofScienceandTechnology(China)

Inthispaper,anInfraredimageregistrationmethodwhichcombinethephasecorrelationregistrationandtheCornerregistration.Thisregistrationmethodisappliedtoinfraredsearchandtracksystemandrealizemotioncompensationoftheinfraredimage.Thisalgorithmhassmallamountofcalculationandhighaccuracyofmatching.Theexperimentsshowthatthisalgorithmcanprovidehigherconfidencemotioncompensationparametersinreal-timeinfraredsearchandtracksystemandeffectivelyreducethefalsealarmrateofthegroundSmallTargetDetectiononthemotionplatform.Therefore,theproposedalgorithmwithastrongpracticalandwidelyusedinimagestabilization,panoramastitching,andimagemosaic.


Binary video codec for data reduction in wireless visual sensor networksKhursheedKhursheed,NaeemAhmad,MuhammadImran,MattiasO’Nils,MidSwedenUniv.(Sweden)

EfficiencyinbothcomputationandcompressionrateoftheVisualSensorNode(VSN)isrequiredfortheenergyconstrainedoutdoorapplicationsofWirelessVisualSensorNetworks(WVSN).Weinvestigatedthecompressionefficiencyoftheinformationreductiontechniquessuchasimagecoding,changecodingandRegionofInterest(ROI)codingfordatareductioninWVSN.ThecompressionefficiencyofbothchangecodingandROIcodingisbetterthanthatofimagecodingforapplicationsinvolvingfewchangesinacontinuoussetofframes.But,thecompressionefficiencyofboththechangecodingandROIcodingbecomesworsethanthatofimagecodingifthechangeframecontainstoomanyobjects.Inthispaper,weanalysedthecompressionefficiencyoftheBinaryVideoCodec(BVC)forvariouskindsofchangessuchasdifferentshapes,sizes,locationsandnumberofobjectsinasetofframes.WeobservedthatthecurverepresentingtheaveragecompressedfilesizeoftheBVCisalwayslowerthanthatofchangecodingandROIcoding.WeconcludedthatthecompressionefficiencyofBVCisalwaysbetterthanthatofchangecodingandisalwaysbetterthanorequaltothatofROIcodingandimagecoding.


Determinant of homography-matrix-based multiple-object recognitionNagachetanBangalore,MadhuKiran,AnilSuryaprakash,VisioIngeniiLtd.(UnitedKingdom)

Findingagivenobjectinanimageorasequenceofframesisoneofthefundamentalcomputervisionchallenges.Humanscanrecognizeamultitudeofobjectswithlittleeffortdespitescale,lightingandperspectivechanges.Arobustcomputervisionbasedobjectrecognitionsystemisachievableonlyifaconsiderabletolerancetochangeinscale,rotationandlightisachieved.Partialocclusiontoleranceisalsoofparamountimportanceinordertoachieverobustobjectrecognitioninreal-timeapplications.Inthispaper,weproposeaneffectivemethodforrecognizingagivenobjectfromaclassoftrainedobjectsinthepresenceofpartialocclusionsandconsiderablevarianceinscale,rotationandlightingconditions.Theproposedmethodcanalsoidentifytheabsenceofagivenobjectfromtheclassoftrainedobjects.Unliketheconventionalmethodsforobjectrecognitionbasedonthekeyfeaturematchesbetweenthetrainingimageandatestimage,theproposedalgorithmutilizesastatisticalmeasurefromthehomographytransformbasedresultantmatrixtodetermineanobjectmatch.Themagnitudeofdeterminantofthe


homographymatrixobtainedbythehomographytransformbetweenthetestimageandthesetoftrainingimagesisusedasacriteriontorecognizetheobjectcontainedinthetestimage.Themagnitudeofthedeterminantofhomographymatrixisfoundtobeveryneartozero(i.e.lessthan0.005)andrangesbetween0.05and1,fortheout-of-classobjectandin-classobjectsrespectively.Hence,anout-of-classobjectcanalsobeidentifiedbyusinglowthresholdcriteriaonthemagnitudeofthedeterminantobtained.Theproposedmethodhasbeenextensivelytestedonahugedatabaseofobjectscontainingabout100similaranddifficultobjectstogivepositiveresultsforbothout-of-classandin-classobjectrecognitionscenarios.Theoverallsystemperformancehasbeendocumentedtobeabout95%accurateforavariedrangeoftestingscenarios.


Investigating the structure preserving encryption of high efficiency video coding (HEVC)ZafarShahid,WilliamPuech,Univ.Montpellier2(France)

HEVCistheemergingvideocodingstandardofITU-TandISO/IEC.HEVCachievessimilarvisualqualitytoitsprecedentH.264/AVCHighProfile,witharound30%bit-ratereductionforlowdelaymode,andwitharound20%bit-ratereductionforrandomaccessmodeonaverage,butwithlowercomplexitythanH.264/AVCBaselineProfile.HEVCperformsbetterbecauseofsomeadditionaltools.Sincedigitalcontentcanbeeasilycopiedandmodified,itispertinenttoanalyzethisstandardregardingitsprotectionandauthentication.Selectiveencryption(SE)isusedtorestrictaccessofvideodatatoonlyauthenticatedusers.Thispaperpresentsanovelmethodforthereal-timeprotectionofHEVCvideostandard.StructurepreservingselectiveencryptionisbeingperformedinCABACentropycodingmoduleofHEVC,whichissignificantlydifferentfromCABACentropycodingofH.264/AVC.InCABACofHEVC,exponentialGolombcodingisreplacedbytruncatedRice(TR)uptoaspecificvalueforbinarizationoftransformcoefficients.SelectiveencryptionisperformedusingAEScipherincipherfeedbackmodeonaplaintextofbinstringsinacontextawaremanner.Theencryptedbitstreamhasexactlythesamebit-rateandisformatcomplaint.Experimentalevaluationandsecurityanalysisoftheproposedalgorithmisperformedonseveralbenchmarkvideosequencescontainingdifferentcombinationsofmotion,textureandobjects.


A computationally efficient approach to 3D point cloud reconstructionChih-HsiangChang,NasserKehtarnavaz,TheUniv.ofTexasatDallas(UnitedStates)

Thispaperaddressesimprovingthecomputationalefficiencyofthe3Dpointcloudreconstructionpipelineusingun-calibratedimagesequences.Intheexistingpipelines,aglobalbundleadjustmentiscarriedoutwhichisquitetimeconsuming.Thecomputationalcomplexityofthismoduleaspartofthe3Dpointcloudreconstructionpipelineis,wheredenotesnumberofimagesinanentiresequenceandnumberofmatchedpoints.Also,thesearchingandsortingaspectsoftheglobalbundleadjustmentisofthecomputationalcost[3].Alocalrefinementprocessisintroducedinthispaperinordertoreducethecomputationalcomplexitybybypassingthesearchingandsortingaspectsofthebundleadjustmentvialimitingtoonly3.


TDC-based readout electronics for real-time acquisition of high resolution PET bio-imagesNahemaMarino,SergioSaponara,Univ.diPisa(Italy)andINFN

sez.Pisa(Italy);G.Ambrosi,INFNsez.Perugia(Italy);FedericoBaronti,Univ.diPisa(Italy);MariaGiuseppinaBisogni,Univ.diPisa(Italy)andINFNsez.Pisa(Italy);PiergiorgioCerello,INFNsez.Torino(Italy);F.Ciciriello,FrancescoCorsi,PolitecnicodiBari(Italy)andINFNsez.Bari(Italy);LucaFanucci,Univ.diPisa(Italy)andINFNsez.Pisa(Italy);MariaIonica,Univ.degliStudidiPerugia(Italy);F.Licciulli,CristoforoMarzocca,PolitecnicodiBari(Italy)andINFNsez.Bari(Italy);M.Morrocchi,Dept.ofPhysics,UniversityofPisa(Italy)andINFNsez.Pisa(Italy);F.Pennazio,UniversitàdiTorino(Italy)andINFNsez.Torino(Italy);RobertoRoncella,Univ.diPisa(Italy);C.Santoni,INFNsez.Perugia(Italy);R.Wheadon,INFNsez.Torino(Italy);AlbertoDelGuerra,Univ.diPisa(Italy)andINFNsez.Pisa(Italy)

Positronemissiontomography(PET)isaclinicalandresearchtoolforreal-timemetabolicimaging.ThedemandforbetterimagequalityentailscontinuousresearchtoimprovePETinstrumentation.Inclinicalapplications,PETimagequalitybenefitsfromtheTimeofFlight(TOF)feature.Indeed,bymeasuringthephotonsarrivaltimeonthedetectorswitharesolutionlessthan100ps,theannihilationpointcanbeestimatedwithsub-centimeterresolution.Thisleadstobetternoiselevel,contrastandclarityofdetailintheimageseitherusinganalyticaloriterativereconstructionalgorithms.ThisworkdiscussesaSiPM-basedmagnetic-fieldcompatibleTOF-PETmodulewithDepthofInteraction(DOI)correction.Thedetectorfeaturesa3DarchitecturewithtwotilesofSiPMscoupledtoasingleLYSOscintillatoronbothitsfaces.Thereal-timefront-endelectronicsisbasedonacurrent-modeASICwherealowinputimpedance,fastcurrentbufferallowsachievingtherequiredtimeresolution.ApipelinedTimetoDigitalConvertermeasuresanddigitizesthearrivaltimeandtheenergyoftheeventswithatimestampof100psand400ps,respectively.AnFPGAclustersthedataandevaluatestheDOI,withasimulatedzresolutionofthePETimageof1.4mmFWHM.


A visibility improvement technique for fog images suitable for real-time applicationYoshitakaToyoda,DaisukeSuzuki,KoichiYamashita,TakashiIto,NarihiroMatoba,TetsuyaKuno,HiroakiSugiura,MitsubishiElectricCorp.(Japan)

Camerasusedinoutdoorscenesrequirehighvisibilityperformanceundervariousenvironmentalconditions.Wepresentavisibilityimprovementtechniquewhichcanimprovethevisibilityofimagescapturedinbadweathersuchasfogandhaze,andalsoapplicabletoreal-timeprocessinginsurveillancecamerasandvehiclecameras.Ouralgorithmenhancescontrastpixelbypixelaccordingtothebrightnessandsharpnessofneighboringpixels.Inordertoreducecomputationalcosts,wepreliminaryspecifytheadaptivefunctionswhichdeterminecontrastgainfrombrightnessandsharpnessofneighboringpixels.Weoptimizethesefunctionsusingthesetsoffogimagesandexaminehowwelltheycanpredictthefog-degradedareausingbothqualitativeandquantitativeassessment.Wedemonstratethatourmethodcanpreventexcessivecorrectiontotheareawithoutfogtosuppressnoiseamplificationinskyorshadowregion,whileapplyingpowerfulcorrectiontothefog-degradedarea.Incomparisonwithotherreal-timeorientedmethods,ourmethodcanreproduceclear-dayvisibilitywhilepreservinggradationinshadowsandhighlightsandalsopreservingnaturalnessoftheoriginalimage.Ouralgorithmwithlowcomputationalcostscanbecompactlyimplementedonhardwareandthusapplicabletowide-rangeofvideoequipmentsforthepurposeofvisibilityimprovementinsurveillancecameras,vehiclecameras,anddisplays.


Fast non-blind deconvolution based on 2D point spread function database for real-time ultrasound imagingJooyoungKang,Sung-ChanPark,KyuhongKim,Jung-HoKim,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof)

Conference 8656


Inthispaper,weintroduceanewmethodfornon-blindde-convolutionwithpre-measuredandestimated2DPSFsdatabasethatacquiredfromtheactualtransducerbeingused.Ouralgorithmisbasedontwo-stagereconstructionscheme,inwhich2DPSFselectionfirstandimagerestorationsecond.InthePSFselectionblock,minimumvariancevalueofappliedeachdifferentPSFinalldepthofimagearecomparedtoderivetheinformationofdepthandspeedofsound.Inthede-convolutionblock,oncetheappropriatedPSFofeachdepthselected,itrestoresabeam-formeduncompressedradio-frequencydatausingseveralde-convolutiontechniquelikeWeinerandfastde-convolutiontechniqueusinghyper-Laplacianpriors.Therefore,usingtherealPSFfromactualtransducerbeingused,ouralgorithmproducesabetterrestorationofultrasoundimagethande-convolutionbysimulatedPSF,andhaslowcomplexityforreal-timeultrasoundimaging.

8656-1, Session 1

Real-time, robust target tracking in videos via graph-cuts (Invited Paper)BarakFishbain,IsraelInstituteofTechnology(Israel);DoritS.Hochbaum,YanT.Yang,Univ.ofCalifornia,Berkeley(UnitedStates)

Real-time,robustandfast,polynomialtime-integer-programmingalgorithmfortargettrackinginvideosequences,thatdelineatesatargetofinterestinavideofromitsbackground,ispresentedhere.Thetrackingtaskiscastasagraph-cutproblem,incorporatingintensityandmotiondataintotheformulation.Previouslyreportedtrackingalgorithmsusedcontinuousmodelsthatarenotappropriatefordigitalvideos.Otheralgorithmsutilizedstochastic,iterativemodelswhicharecomputationallyintenseanddonotguaranteeoptimalsolutionorconsistencyoversequentialrunsonthesameinputdata.Furthermore,previousmethods,whichconsideredmotioninthetrackingprocess,didsobyregardingitasasetofconstraintsratherthanpixels’features.Thisapproachresultedincomplex,non-polynomial(NP-hard)problems.Finally,inprevioustrackingalgorithms,motionisestimatedbyopticalflowtechniques.Theseopticalflowmethodsarecomputationallyintense,whichmakesthemoftentimesunsuitableforreal-timeapplications.OurmethodishighlyrobustinthatitallowsexploitingfastbutnoisyandcoarsemotiondatasuchasMPEG-4motionestimationschemes.Theevaluationofthemethodonstandardandnon-standardbenchmarkvideosclearlyshowsthatthemethodismoreefficientthanexistingtechniques,andthatitdeliversgoodqualityresults.

8656-2, Session 1

Tracking yarns in high resolution fabric images: a real-time approach for online fabric flaw detectionDorianSchneider,RWTHAachen(Germany)

Analgorithmicframeworkforreal-timelocalizationofsingleyarnswithinindustrialfabricimagesispresented.Theinformationaboutpreciseyarnlocationsformsthefoundationforafabricawdetectionsystemwhichisbasedonindividualyarnmeasurements.Matchingacameraframerateof15fps,wedefinetheterm“real-time”bythecapabilityoftrackingallyarnswithina5megapixelimageinlessthan35ms,leavingatimeslotof31msforfurtherimageprocessinganddefectdetectionalgorithms.Theprocessingpipelinecomprisesadaptivehistogramequalization,Wienerdeconvolution,normalizedtemplatematchingandanovelfeaturepointsortingscheme.Tomeetreal-timerequirements,extensiveuseoftheNVIDIACUDAframeworkismade.Implementationdetailsaregivenandsourcecodeforselectedalgorithmsisprovided.Evaluationresultsshowthatweftsandwarpscanbetrackedreliablyandindependentlyofthefabricmaterialorbinding.Videoandimagefootageisprovidedontheprojectwebsitetoexpandthepapercontent.

8656-3, Session 1

Real-time bicycle detection at signalized intersections using thermal imagingRobinCollaert,TraficonN.V.(Belgium)

Moreandmoregovernmentsandauthoritiesaroundtheworldarepromotingtheuseofbicyclesincities,asthisishealthyforthebicyclistandimprovesthequalityoflifeingeneral.Safetyandefficiencyofbicyclistshasbecomeamajorfocus.Toachievethis,thereisaneedforasmarterapproachtowardsthecontrolofsignalizedintersections.Varioustraditionaldetectiontechnologies,suchasvideo,microwaveradarandelectromagneticloops,canbeusedtodetectvehiclesatsignalizedintersections,butnoneofthesecanconsistentlyseparatebikesfromothertraffic,dayandnightandinvariousweatherconditions.

Asbikesshouldgetahigherpriorityandalsorequirelongergreentimetosafelycrossthesignalizedintersection,trafficmanagersarelookingforalternativedetectionsystemsthatcanmakethedistinctionbetweenbicyclesandothervehiclesnearthestopbar.Inthispaper,wepresentthedrawbacksofavideo-basedapproachandthebenefitsofathermal-video-basedapproachforvehiclepresencedetectionwithseparationofbicycles.Also,wehighlightthespecifictechnicalchallengesindevelopingasystemthatcombinesthermalimagecapturing,imageprocessingandoutputtriggeringtothetrafficlightcontrollerinnearreal-timeandinonehousing.

8656-4, Session 1

How fast can one arbitrarily and precisely scale images?LeonidBilevich,LeonidP.Yaroslavsky,TelAvivUniv.(Israel)

Imagescalingisafrequentoperationinvideoprocessingforopticalmetrology.Inthepaper,resultsofcomparativestudyoftheaccuracyandcomputationalcomplexityofdifferentalgorithmsforarbitraryscalingofdigitalimagesarepresentedanddiscussed.Thefollowingalgorithmswerecompared:differenttypesofspatialdomainprocessing(bilinear,bicubic,cubicspline)algorithmsandDFT-basedonesandanewDCT-basedalgorithm,whichimplementsperfect(interpolationerrorfree)scalingthroughdiscretesinc-interpolationThecomparisonresultsenableevaluationofthefeasibilityofreal-timeimplementationofthealgorithmsforarbitraryimagescaling.

8656-5, Session 1

Digital ruler: real-time object tracking and dimension measurement using stereo cameras (Invited Paper)JamesNash,KalinAtanassov,SergioR.Goma,VikasRamachandra,QualcommInc.(UnitedStates);HasibSiddiqui,QualcommInc(UnitedStates)

Stereometrologyisamethodforobtainingspatialmeasurementsofanobject’sperimeterusingthedisparitybetweenboundarypoints.True3Dsceneinformationisnecessarytoextractlengthmeasurementsofanobject’sprojectionontothe2Dimageplane.Instereometrologythemeasurementishighlysensitivetoobjectdistance,baselinedistance,calibrationerrors,andrelativemovementoftheleftandrightdemarcationpointsbetweensuccessiveframes.Real-timeoperationatvideoratesisdifficultbecausetraditionalfeatureextractionandself-calibrationroutinesarecomputationallyexpensive.Inthispaperwepresentareal-timeimplementationofadigitalrulerusingcell-phonestereocalibratedcamerasonamobileplatform.

Conference 8656


8656-6, Session 2

FPGA design of a real-time edge enhancing smoothing filter (Invited Paper)ChangChoo,NimitPandya,BhavikaPatel,SanJoséStateUniv.(UnitedStates)

Noiseremovalfiltershaveanundesirablesideeffectofblurringedges,whichisnottolerableformanyimageprocessingapplications.Toovercomethisproblem,wedesignedanedgeenhancingsmoothingfilterandimplementeditonanFPGAtoreducenoisewhilesharpeningedgesforreal-timeimageprocessing.ItconsistsofacombinationofthebilateralfilterforedgepreservingsmoothingandtheShockfilterforedgeenhancementtoachievethedesiredresult.Thebit-accurateMatlabmodelfortheedge-enhancingsmoothingfilterwasconvertedtotheVeriloghardwaredesignonAlteraStratixIIIandCycloneIIIdevices.TheFPGAimplementationutilizes,forexample,21%oflogiccellsonStratixIIIEP3SE50F484C2deviceandrunsat126MHz.Theresultingimageshavemuchsharperedgesintermsofitsgradientandreducednoise.

8656-7, Session 2

Large object extraction for binary images on the GPUGregoryC.Huchet,SamsungInformationSystemsAmerica,Inc.(UnitedStates)

Objectfilteringbysizeisabasictaskincomputervision.Acommonwaytoextractthelargestconnectedcomponentsinabinaryimageistoruntheconnected-componentlabeling(CCL)algorithmandtocomputetheareaofeachcomponent.Selectingthecomponentswiththelargestareasisthenstraightforward.SeveralCCLalgorithmsfortheGPUhavealreadybeenimplementedbutfewoftherelatedpapersmentionthetimerequiredtocomputethecomponentarea.Thisextrastepcanbecriticalforreal-timeapplicationssuchasreal-timevideosegmentation.Theaimofthispaperistopresentafastalgorithmfortheextractionofvisuallylargeobjectsinabinaryimage.ItisimplementedusingCUDA(ComputeUnifiedDeviceArchitecture),aparallelcomputingarchitecturedevelopedbyNVIDIA.

8656-8, Session 2

Real-time structured light intraoral 3D measurement pipelineRaduGheorghe,AndreiTchouprakov,RomanSokolov,D4DTechnologies,L.P.(UnitedStates)

Computeraideddesignandmanufacturing(CAD/CAM)isincreasinglybecomingastandardfeatureandserviceprovidedtopatientsindentistofficesanddenturemanufacturinglaboratories.Althoughthequalityofthetoolsanddatahasslowlyimprovedinthelastyears,duetovarioussurfacemeasurementchallenges,practical,accurate,in-vivo,real-time3Dhighqualitydataacquisitionandprocessingstillneedsimproving.AdvancesinGPUcomputationalpowerhaveallowedforachievingnearreal-time3Dintraoralin-vivoscanningofpatient’steeth.Weexploreinthispaper,fromareal-timeperspective,ahardware-software-GPUsolutionthataddressesalltherequirementsmentionedbefore.Moreoverweexemplifyandquantifythehardandsoftdeadlinesrequiredbysuchasystemandillustratehowtheyaresupportedinourimplementation.

8656-9, Session 2

Three-dimensional fuzzy filter in color video sequence denoising implemented on DSPVolodymyrPonomaryov,InstitutoPolitécnicoNacional(Mexico);HectorMontenegro,InstitutoPolitécnicoNacional(Mexico)and

ESIME-CULHUAN(Mexico);RicardoPeralta-Fabi,Univ.NacionalAutónomadeMéxico(Mexico)

Noiseproducessystemdeficienciesduringacquisition,broadcastorstorageofcolorimagesequences.Inthispaper,weproposeanefficientfuzzy3Dapproachforimpulsivenoisesuppressionincolorvideosequences.Incontrasttocurrentstate-of-the-artfuzzyfilters,theproposedframeworkgathersred,greenandbluechannelssequencedata,usesfuzzylogictoanalyzethebasicpixelgradientvalueandseveralrelatedpixelgradientvaluesineightdirections,andprocessestwoneighboringframesconcurrently.SeveralvideocolorsequencesMissAmerica,Salesman,Flowers,Stefan,etc.withdifferentcolor,texture,finefeaturesanddifferentmovementswereusedtoevaluate3Dfuzzyalgorithms.Theframesofthecolorvideosequenceswerecontaminatedartificiallybyrandomimpulsivenoiseofdifferentintensities(0%to30%)ineachcolorchannelindependently.

Theresultsofnumeroussimulationsdemonstratethattheproposed3Dfilteringframeworkperformswellinobjectivecriteria(PSNR,MAE,NCD,andSSIM)andahumansubjectiveanalysisoftheframesinthecolorvideosequences.Inaddition,theresultsofimplementationandefficiencyanalysisoftheproposedandbetterdenoising3DalgorithmsinrealtimemodeonDSPTMS320DM642ofTI,usingMATLAB,TC6software,CCS,IDEwiththeReal-TimeWorkshopsoftware,andSimulinkarepresented.

8656-12, Session 3

Design of a pseudo-log image transform IP in an HLS-based memory management frameworkShahzadAhmadButt,PolitecnicodiTorino(Italy);StéphaneMancini,FrédéricRousseau,TIMALab.(France);LucianoLavagno,PolitecnicodiTorino(Italy)andCadenceDesignSystems,Inc.(UnitedStates)

Thepseudo-logimagetransformisessentiallyalogarithmictransformationthatsimulatesthedistributionoftheeye’sphotoreceptorsandfindsapplicationinmanyimportantareasofrealtimeimageprocessingsuchasmotiondetectionandestimationinrobots,foveatedspacevariantcameras,timeofimpactcalculation,etc.Itbelongstoafamilyofnon-linearimageprocessingkernelsinwhichreferencesmadetomemoryarenon-linearfunctionofloopindices.

Non-linearkernelsrequiresomeformofmemorymanagementinordertoachievetherequiredthroughput,minimizeon-chipmemoryrequirementandmaximizepossibledatare-use.

Inthispaperwepresentthedesignofapseudo-logimageprocessingIPblock,integratedwithdifferentinterpolationfilteringtechniques,usingaframeworkthatcanautomaticallygenerateamemoryhierarchyaroundtheIP.Thismemoryhierarchyreduceson-chipmemoryrequirements,optimizesthroughputandincreasesdata-reuse.ThedesignoftheIPisfullyperformedatthealgorithmiclevelinC/C++.Thealgorithmicdescriptionisprofiledwithaframeworkthatcreatesacustomizedmemoryhierarchy,alsodescribedatalgorithmiclevel,whichcanbesynthesizedusinghighlevelsynthesistools.Finally,highlevelsynthesisisusedtoperformhardwaredesignspaceexplorationandperformanceestimation.ThegeneratedmemoryhierarchyisabletofeedtheIPwiththehighestbandwidtheveninpresenceofhighexternalmemorylatencies.

8656-13, Session 3

Real-time color/shape-based traffic signs acquisition and recognition systemSergioSaponara,Univ.diPisa(Italy)

Areal-timesystemisproposedtoacquirefromanautomotivefish-eyeCMOScamerathetrafficsigns,andprovidetheirautomaticrecognitiononthevehiclenetwork.Differentlyfromthestate-of-the-art,inthisworkcolor-detectionisaddressedexploitingtheHSVcolorspacewhichisrobusttolightingchanges.Hencethefirststageoftheprocessingsystemimplementsfish-eyecorrectionandRGBto

Conference 8656


HSVtransformation.Aftercolor-baseddetectionanoisedeletionstepisimplementedandthen,fortheclassification,atemplate-basedcorrelationmethodisadoptedtoidentifypotentialtrafficsigns,ofdifferentshapes,fromacquiredimages.Startingfromasegmented-imageamatchingwithtemplatesofthesearchedsignsiscarriedoutusingadistancetransform.Thesetemplatesareorganizedhierarchicallytoreducethenumberofoperationsandhenceeasingreal-timeprocessingforseveraltypesoftrafficsigns.Finally,fortherecognitionofthespecifictrafficsign,atechniquebasedonextractionofsignscharacteristicsandthresholdingisadopted.ImplementedonDSPplatformthesystemrecognizestrafficsignsinlessthan150msatadistanceofabout15metersfrom640x480-pixelacquiredimages.Testscarriedoutwithhundredsofimagesshowadetection&recognitionrateofabout90%,withfalserecognitionslessthan4%

8656-14, Session 3

DSPACE hardware architecture for on-board real-time image/video processing in European space missions (Invited Paper)SergioSaponara,MassimilianoDonati,LucaFanucci,Univ.diPisa(Italy);MaximilianOdendahl,ReinerLeupers,RWTHAachen(Germany);WalterErrico,SITAELS.p.A.(Italy)

Theincreasingdemandofon-boardreal-timeimage/videoprocessingrepresentsoneofthecriticalissuesinforthcomingscientificandcommercialEuropeanspacemissions.Toaccomplishplanetaryobservation,surveillance,SyntheticApertureRadarimagingandtelecommunicationfasterandfastersignalandimageprocessingalgorithmsarerequired.Theonlyexistingspace-qualifiedEuropeanDigitalSignalProcessor(DSP)freeofInternationalTrafficinArmsRegulationsrestrictions(ATMELTSC21020)facesapoorperformanceof60MFLOPSpeak.

TheDSPACEspace-qualifiedDSParchitecturefillsthegapbetweenthecomputationalrequirementsandtheavailabledevice.Itscoreleveragesapipelinedandmassivelyparallelarchitecturebasedontheverylonginstructionword(VLIW)paradigm:64registers,4arithmeticlogicunits,2multipliersand2addressgenerationunitsarearrangedinto2identicaldata-pathswithcross-pathcapabilities.

BoththesynthesizableVHDLandthesoftwaredevelopmenttoolsaregeneratedfromtheLISAhigh-leveldescriptionandthenrefinedatRTLlevel.AXilinx-XC5VLX110FPGAischosentorealizeanengineeringprototype,providinganeasymigrationtothespace-qualifiedXilinx-XQR5VFX130FPGA.Finally,firstsynthesisresultsonATMEL180nmstandardcellASICtechnologyshowanareaofaround300kgatesandapeakperformanceof1GOPSand750MFLOPSat125MHz.

8656-15, Session 4

Priority-based methods for reducing the impact of packet loss on HEVC encoded video streams

(Invited Paper)JamesM.Nightingale,QiWang,ChristosGrecos,Univ.oftheWestofScotland(UnitedKingdom)

NoAbstractAvailable

8656-16, Session 4

Low complexity DCT engine for image and video compressionMaherJridi,YousriOuerhani,AymanAlfalou,ISENBrest(France)

Inthispaper,wedefinedanewscalableandreconfigurable2D-DCTarchitecture.Thelatterisabletotransformspatialpixelstospectral

pixelswhiletakingintoaccounttheconstraintsoftheconsideredcompressionstandard.Duetoournewmatrixdecomposition,wecoulddefineonecommon2D-DCTarchitecture.TheconstantmultiplierscanbeconfiguredtohandlethecaseofRealDCTand/orIntDCT(multiplicationby±2).Theproposedalgorithmnotonlyprovidesareductionofcomputationalcomplexity,butalsoleadstoscalablepipelineddesigninsystolicarrays.Indeed,the8*8StdDCTcanbecomputedbyusing4*4StdDCTwhichcanbeobtainedbycalculating2*2StdDCT.Besides,theproposedstructurecanbeextendedtodealwithhighernumberofN(i.e.16*16and32*32).TheFPGAimplementationresultsshowtheperformanceoftheproposedarchitecturewhencomparedwithconventionaldesigns.Inparticular,forN=4,itisfoundthattheproposeddesignhavenearlythirdthearea-timecomplexityoftheexistingDCTstructures.Thisgainisexpectedtobehigherforagreatersizeof2D-DCT.

8656-17, Session 4

A CABAC codec of H.264AVC with secure arithmetic codingNihelNeji,MaherJridi,AymanAlfalou,ISENBrest(France);NouriMasmoudi,Univ.ofSfax(Tunisia)

Thispaperpresentsanoptimized1080pH.264/AVCdecodersystemforHDTVdisplays.Forhighqualitystreaming,thedecoderusesBinaryArithmeticDecodingalgorithmwithhighcomplexityandaJVCEschemewhichwasnotusedbefore.Theproposedoptimizedallowsustoencrypttheinformationusingapseudorandomnumbergenerator(PRNG).Thusweachievedthetowoperations(compressionandencryption)simultaneouslyandinadependedmannerwhichisanoveltyinthiskindofarchitecture.Moreover,weinvestigatedthehardwareimplementationofCABACdecoder.Theproposedarchitectureisbasedonparallelismtohandlesignificantpixelrates(i.e.1080p)videoswithlowcostandhighperformanceformostfrequentSEs.Thiswascheckedusing5videoframes.TheexperimentalresultsweobtainedusinganFPGA(Xilinx’sISE)showthatourdesignisrelevanttodecodemainprofileHDvideostream.

8656-18, Session 4

A modified prediction scheme of the H.264 multiview video coding to improve the decoder performanceAymanM.Hamdan,HusseinA.Aly,MohamedM.Fouad,MilitaryTechnicalCollege(Egypt);RichardMDansereau,CarletonUniversity(Canada)

Inthispaper,wemodifytheinterviewpredictionschemeofthestandardMultiviewVideoCoding(MVC)toreducethebit-ratewithcomparablevideoqualitymeasures.TheproposedschemeiscomparedtotheMVCstandardanditspredecessor,namely,H.264/MPEG-4AVCusingrealdatasets.Clearimprovementsareshownusingtheproposedschemeovercompetingapproachesintermsofbit-ratewithaslightincreaseintheaveragePSNRvalues(dB),aswellasaslightincreaseinthecomputationalcomplexity.

Conference 8656


Conference 8657: Computational Imaging XITuesday-Thursday5–7February2013

Part of Proceedings of SPIE Vol. 8657 Computational Imaging XI

8657-1, Session 1

A unifying retinex model based on non-local differential operatorsDominiqueZosso,GiangTran,StanleyJ.Osher,Univ.ofCalifornia,LosAngeles(UnitedStates)

Inthispaper,wepresentaunifyingframeworkforretinexthatisabletoreproducemanyoftheexistingretineximplementationswithinasinglemodel.Thefundamentalassumption,assharedwithmanyretinexmodels,isthattheobservedimageisamultiplicationbetweentheilluminationandthetrueunderlyingreflectanceoftheobject.StartingfromMorel’s2010PDEmodelforretinex,whereilluminationissupposedtovarysmoothlyandwherethereflectanceisthusrecoveredfromahard-thresholdedLaplacianoftheobservedimageinaPoissonequation,wedefineourretinexmodelinsimilarbutmoregeneraltwosteps.

First,lookforafilteredgradientthatisthesolutionofanoptimizationproblemconsistingoftwoterms:Thefirsttermisasparsitypriorofthereflectance,suchastheTVorH1norm,whilethesecondtermisaquadraticfidelitypriorofthereflectancegradientwithrespecttotheobservedimagegradients.Inasecondstep,sincethisfilteredgradientalmostcertainlyisnotaconsistentimagegradient,wethenlookforareflectancewhoseactualgradientcomesclose.

Beyondunifyingexistingmodels,weareabletoderiveentirelynovelretinexformulationsbyusingmoreinterestingnon-localversionsforthesparsityandfidelityprior.Hencewedefinewithinasingleframeworknewretinexinstancesparticularlysuitedfortexture-preservingshadowremoval,cartoon-texturedecomposition,colorandhyperspectralimageenhancement.

8657-2, Session 1

Subspace methods for computational relightingHaQ.Nguyen,SiyingLiu,MinhN.Do,Univ.ofIllinoisatUrbana-Champaign(UnitedStates)

WeproposeavectorspaceapproachforrelightingaLambertianconvexobjectwithdistantlightsource,whosecrucialtaskisthedecompositionofthereflectancefunctionintoalbedos(orreflectioncoefficients)andlightingsbasedonasetofimagesofthesameobjectandits3-Dmodel.Makinguseofthefactthatreflectancefunctionsarewellapproximatedbyalow-dimensionallinearsubspacespannedbythefirstfewsphericalharmonics,thisinverseproblemcanbeformulatedasamatrixfactorization,inwhichthebasisofthesubspaceisencodedinthesphericalharmonicmatrixS.AnecessaryandsufficientconditiononSforuniquefactorizationisderivedwithanintroductiontoanewnotionofmatrixrankcallednonseparablefullrank.AnSVD-basedalgorithmforexactfactorizationinthenoiselesscaseisintroduced.Inthepresenceofnoise,thealgorithmisslightlymodifiedbyincorporatingthepositivityofalbedosintoaconvexoptimizationproblem.Implementationsoftheproposedalgorithmsaredoneonasetofsyntheticdata.

8657-3, Session 1

Bayesian demosaicing using gaussian scale mixture priors with local adaptivity in the dual tree complex wavelet packet transform domainBartGoossens,JanAelterman,HiepLuong,AleksandraPi?urica,WilfriedPhilips,Univ.Gent(Belgium)

Indigitalcamerasandmobilephones,thereisanongoingtrendtoincreasetheimageresolution,decreasethesensorsizeandtouselowerexposuretimes.Becausesmallersensorsinherentlyleadtomorenoiseandaworsespatialresolution,digitalpost-processingtechniquesarerequiredtoresolvemanyoftheartifacts.Colorfilterarrays(CFAs),whichusealternatingpatternsofcolorfiltersappliedateverypixelposition,areverypopularbecauseofpriceandpowerconsumptionreasons.However,colorfilterarraysrequiretheuseofapost-processingtechniquesuchasdemosaicingtorecoverfullresolutionRGBimages.Recently,therehasbeensomeinterestintechniquesthatjointlyperformthedemosaicinganddenoising.Thishastheadvantagethatthedemosaicinganddenoisingcanbeperformedoptimally(e.g.intheMSEsense)fortheconsiderednoisemodel,whileavoidingartifactsintroducedwhenusingdemosaicinganddenoisingsequentially.

Inthispaper,wewillcontinuetheresearchlineofthewavelet-baseddemosaicingtechniques.Theseapproachesarecomputationallysimpleandverysuitedforcombinationwithdenoising.Therefore,wewillderiveBayesianMinimumSquaredError(MMSE)jointdemosaicinganddenoisingrulesinthecomplexwaveletpacketdomain,takinglocaladaptivityintoaccount.Asanimagemodel,wewilluseGaussianScaleMixtures,therebytakingadvantageofthedirectionalityofthecomplexwavelets.Ourresultsshowthatthistechniqueiswellcapableofreconstructingfinedetailsintheimage,whileremovingallofthenoise,atarelativelylowcomputationalcost.Inparticular,thecompletereconstruction(includingcolorcorrection,whitebalancingetc)ofa10megapixelRAWimagetakes4.4seconarecentmid-rangeGPU.

8657-4, Session 1

Demosaicing for RGBZ sensorLilongShi,IliaOvsiannikov,SamsungSemiconductor,Inc.(UnitedStates);Dong-KiMin,YohwanNoh,SAMSUNGElectronicsSemiconductor(Korea,Republicof);WanghyunKim,SunhwaJung,JoonhoLee,DeokhaShin,HyekyungJung,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof);GregoryWaligorski,MichelleWang,WendyWang,SamsungSemiconductor,Inc.(UnitedStates);YoondongPark,ChilheeChung,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof)

Inthispaper,weproposedanewtechniquefordemosaicingauniqueRGBZcolor-depthimagingsensor,whichcapturescoloranddepthimagessimultaneously,withaspeciallydesignedcolor-filter-array(CFA)wheretwooutofsixRGBcolorrowsarereplacedby“Z”pixelsthatcapturedepthinformationbutlittlecolorinformation.Therefore,inanRGBZimage,thered,greenandbluecolorsaremoresparselysampledthaninanormalBayerimage.Duetothemissingrowsinthedataimage,commonlyuseddemosaicingalgorithmsforthestandardBayerCFAcannotbeapplieddirectly.Tothisend,ourmethodfirstfills-inthemissingrowstoreconstructafullBayerCFA,followedbyacolor-selectiveadaptivedemosaicingalgorithmthatinterpolatesmissingcolorcomponents.Inthefirststep,unlikecommonbilinearinterpolationapproachesthattendtobluredgeswhenestimatingthemissingpixels,ouredge-baseddirectionalinterpolationapproach,derivedfromdeinterlacingtechniques,emphasizesonreconstructingmorestraightandsharpedgeswithfewerartifactsandtherebypreservestheverticalresolutioninthereconstructedtheimage.Toavoidusingthenewlyestimatedpixelsfordemosaicing,thebilateral-filter-basedapproachinterpolatesthemissingcolorsamplesbasedonweightedaverageofadaptivelyselectedknownpixelsfromthelocalneighborhoods.Testsshowthattheproposedmethodreconstructsfullcolorimageswhilepreservingedgesdetails,avoidingartifacts,andremovingnoisewithhighefficiency.


8657-5, Session 1

Auto zoom crop from face detection and facial featuresRaymondPtucha,DavidRhoda,BrianMittelstaedt,EastmanKodakCo.(UnitedStates)

Weproposeanautomaticalgorithmthatrecomposesanoriginalcapturesuchthatitrepresentsthecompositionthatthephotographerhadwishedhe/shehadtakeninthefirstplace.Additionally,thisalgorithmenablestrimmingtodifferingaspectratiosforemail,smartphone,tablet,orprintviewing.

Ourfacialbasedalgorithmutilizesautomaticzoomandcropcompositionrulesthatarebaseduponheadsize,numberoffaces,positionoffaces,ahierarchicalgroupingoffaces,andinputtooutputaspectratio.Weutilizesymmetriccroppingandruleofthirdsforfinalcomposition.Wefurtherintroducemethodsofautozoomcropthatutilizefacialpose,blink,eyegaze,expression,andsubjectinterrelationships.Forexample,ifasubjectislookingtotheleft,thecropregionisbiasedtowardstheleft.Ifadetectedfaceexhibitseyeblink,itsweightisdecreased.

Wedemonstratethroughcontrolledpsychophysicalstudiesthatournewlyintroducedfacebasedalgorithmsignificantlyoutperformssimplerrulebasedzoom/cropalgorithmscommonlyinusetoday,sophisticatedstateoftheartmainsubjectdetectors,andevenmainsubjectdetectorsaidedbyskinandfacedetection.

8657-6, Session 2

Optimal filters for high-speed compressive detection in spectroscopyGregeryT.Buzzard,BradleyJ.Lucier,PurdueUniv.(UnitedStates)

RecentadvancesallowfortheconstructionoffilterswithpreciselydefinedfrequencyresponseforuseinRamanchemicalspectroscopy.Inthispaperwegiveaprobabilisticinterpretationoftheoutputofsuchfiltersandusethistogiveanalgorithmtodesignoptimalfilterstominimizethemeansquarederrorintheestimatedphotonemissionratesformultiplespectra.Experimentsusingthesefiltersdemonstratethatdetectingasfewas10Ramanscatteredphotons(inaslittletimeas30microseconds)canbesufficienttopositivelydistinguishchemicalspecies.Thisspeedshouldallow``chemicalimaging’’ofsamples.

8657-8, Session 2

Neutron Imaging with Coded Sources: Design Pitfalls and the Implementation of a Simultaneous Iterative Reconstruction TechniqueHectorJ.Santos-Villalobos,PhilipR.Bingham,OakRidgeNationalLab.(UnitedStates);JensGregor,TheUniv.ofTennessee(UnitedStates)

Thelimitationsinneutronfluxandresolution(L/D)ofcurrentneutronimagingsystemscanbeaddressedwithaCodedSourceImagingsystemwithmagnification(xCSI).Moreprecisely,themultiplesourcesinanxCSIsystemcanexceedthefluxofasinglepinholesystemforseveralordersofmagnitude,whilemaintainingahigherL/Dwiththesmallsources.Moreover,designingforanxCSIsystemreducesnoisefromneutronscattering,becausetheobjectisplacedawayfromthedetectortoachievemagnification.However,xCSIsystemsareadverselyaffectedbycorrelatednoisesuchasnon-uniformilluminationoftheneutronsource,incorrectsamplingofthecodedradiograph,misalignmentofthecodedmasks,masktransparency,andtheimperfectionofthesystemPointSpreadFunction(PSF).Wearguethatamodel-basedreconstructionalgorithmcanovercometheseproblemsanddescribetheimplementationofaSimultaneousIterativeReconstructionTechniquealgorithmforcodedsources.Designpitfallsthatprecludeasatisfactoryreconstructionaredocumented.

8657-10, Session 3

Physics-based regularizationJeffreyP.Simmons,AirForceResearchLab.(UnitedStates)

NoAbstractAvailable

8657-11, Session 3

Reconstruction of bright field STEM with Bragg diffraction correctionCharlesA.Bouman,PurdueUniv.(UnitedStates)

NoAbstractAvailable

8657-35, Session K1

Petapixel photography and the limits of camera information capacity (Keynote Presentation)DavidJBrady,DanielLMarks,StevenFeller,DukeUniv.(UnitedStates);MichaelEGehm,DathonGolish,EstebanVera,UniversityofArizona(UnitedStates);DavidKittle,DukeUniv.(UnitedStates)

Themonochromaticsingleframepixelcountofacameraislimitedbydiffractiontothespace-bandwidthproduct,roughlytheapertureareadividedbythesquareofthewavelength.Wehaverecentlyshownthatitispossibletoapproachthislimitusingmultiscalelensesforcameraswithspacebandwidthproductbetween1and100gigapixels.Whencolor,polarization,coherenceandtimeareincludedintheimagedatacube,camerainformationcapacityincreasesbyover6ordersofmagnitude.ThistalkreviewsprogressintheconstructionofDARPAAWAREgigapixelcamerasanddescribescompressivemeasurementstrategiesthatmaybeusedincombinationwithmultiscalesystemstopushcameracapacitytonearphysicallimits.

8657-12, Session 4

Sparse imaging for fast electron microscopyHyrumS.Anderson,JovanaHelms,BrandonRohrer,JasonWheeler,KurtW.Larson,SandiaNationalLabs.(UnitedStates)

Scanningelectronmicroscopes(SEMs)areusedinneuroscienceandmaterialssciencetoimagecentimetersofsampleareaatnanometerscales.SinceimagingratesareinlargepartSNR-limited,largecollectionscanleadtoweeksofaround-the-clockimagingtime.Toincreasedatacollectionspeed,weproposeanddemonstrateonanoperationalSEMafastmethodtosparselysampleandreconstructsmoothimages.Toaccuratelylocalizetheelectronprobepositionatfastscanrates,wemodelthedynamicsofthescancoils,andusethemodeltorapidlyandaccuratelyvisitarandomlyselectedsubsetofpixellocations.Imagesarereconstructedfromtheundersampleddatabycompressedsensinginversionusingimagesmoothnessasaprior.Wereportimagefidelityasafunctionofacquisitionspeedbycomparingtraditionalrastertosparseimagingmodes.Ourapproachisequallyapplicabletootherdomainsofnanometermicroscopyinwhichthetimetopositionaprobeisalimitingfactor(e.g.,atomicforcemicroscopy),orinwhichexcessiveelectrondosesmightotherwisealterthesamplebeingobserved(e.g.,scanningtransmissionelectronmicroscopy).

8657-13, Session 4

Building and enforcing shape priors for segmentation of alloy micrographsLandisM.Huffman,TheMITRECorp.(UnitedStates);JeffreyP.Simmons,AirForceResearchLab.(UnitedStates);MarcDeGraef,

Conference 8657


CarnegieMellonUniv.(UnitedStates);IlyaPollak,PurdueUniv.(UnitedStates)

Computersimulationofmetalalloysisanemergingtrendinmaterialsdevelopment.Simulatedreplicasoffabricatedalloysarebasedonthesegmentationsofalloymicrographs.Therefore,accuratesegmentationofvisibleprecipitatesisparamounttosimulationaccuracy.Sincetheshapeandsizeofprecipitatesarekeyindicatorsofphysicalalloyproperties,automatedsegmentationalgorithmsmustaccountforabundantpriorinformationofprecipitateshape.Wepresentanewmethodforconstructingapriorenforcingrectangularshapewhichcanbeappliedwithinamin-cutframeworkformaximuma-posteriorisegmentation.

8657-14, Session 5

Real-time dynamic range and signal to noise enhancement in beam-scanning microscopy by integration of sensor characteristics, data acquisition hardware, and statistical methodsGarthJ.Simpson,PurdueUniv.(UnitedStates)

Despitetheubiquitoususeofmulti-photonandconfocalmicroscopymeasurementsinbiology,thecoretechniquessufferfromfundamentalcompromisesbetweensignaltonoise(S/N)andlineardynamicrange(LDR).Inthisstudy,directsynchronousdigitizationofvoltagetransientscoupledwithstatisticalanalysisisshowntoallowS/NapproachingthetheoreticalmaximumthroughoutanLDRspanning6decades,limitedonlybythedarkcountsofthedetectoronthelowendandbytheintrinsicnonlinearitiesofthephotomultipliertube(PMT)detectoronthehighend.Synchronousdigitizationofeachvoltagetransientrepresentsafundamentaldeparturefromestablishedmethodsinconfocal/multi-photonimaging,whicharecurrentlybasedoneitherphotoncountingorsignalaveraging.Highinformation-densitydataacquisition(upto3.2GB/sofrawdata)enablesthesmoothtransitionbetweenthetwomodalitiesonapixel-by-pixelbasisandtheultimatewritingofmuchsmallerfiles(fewkB/s).ModelingofthePMTresponseallowsextractionofkeysensorparametersfromthehistogramofvoltagepeak-heights,whichinturncanbeusedforautomatedoptimizationofthethresholdsandontheflycorrectionof1/fnoiseinthebaseline.Applicationsinsecondharmonicgeneration(SHG)microscopyaredescribedrequiringquantitationoverlargedynamicranges.

8657-15, Session 5

Multi-modal electron tomographyLawrenceF.Drummy,UES,Inc.(UnitedStates)

NoAbstractAvailable

8657-16, Session 5

Segmentation of materials images using 3D electron interaction modelingMaryL.Comer,DaeWooKim,PurdueUniv.(UnitedStates)

NoAbstractAvailable

8657-17, Session 5

A forward modeling approach to electron back-scatter diffraction patternsMarcDeGraef,CarnegieMellonUniv.(UnitedStates)

NoAbstractAvailable

8657-18, Session 5

Interactive grain image segmentation using graph cut algorithmsJarrellWWaggoner,YoujieZhou,Univ.ofSouthCarolina(UnitedStates);JeffreyP.Simmons,AirForceResearchLab.(UnitedStates);AymanSalem,MaterialsResourcesInternational(UnitedStates);MarcDeGraef,CarnegieMellonUniv.(UnitedStates);SongWang,Univ.ofSouthCarolina(UnitedStates)

Segmentingmaterialsimagesisalaboriousandtime-consumingprocessandautomaticimagesegmentationalgorithmsusuallycontainimperfectionsanderrors.Interactivesegmentationisagrowingtopicintheareasofimageprocessingandcomputervision,whichseekstofindabalancebetweenfullyautomaticmethodsandfully-manualsegmentationprocesses.Byallowingminimalandsimplisticinteractionfromtheuserinanotherwiseautomaticalgorithm,interactivesegmentationisabletosimultaneouslyreducethetimetakentosegmentanimagewhileachievingbettersegmentationresults.Giventhespecializedstructureofmaterialsimagesandlevelofsegmentationqualityrequired,weshowaninteractivesegmentationframeworkformaterialsimagesthathastwokeycontributions:1)amulti-labelingframeworkthatcanhandlealargenumberofstructureswhilestillquicklyandconvenientlyallowingmanualinteractioninreal-time,and2)aparameterestimationapproachthatpreventstheuserfromhavingtomanuallyspecifyparameters,increasingthesimplicityoftheinteraction.Weshowafullformulationofeachofthesecontributionsandexampleresultsfromtheirapplication.


Sub-pixel depth camera registration using gradient circle patternSeungkyuLee,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Recently,consumerdepthcamerashavewidelyappliedtorobotics,computervisionandgraphicsapplications.Furthermoresensorfusionwithcolorcamerahasprovidedmultimodal3-dimensionaldataformanyresearchproblems.Asanaturaladvanceinthisfield,nowresearcherstartedtocombinemultipledepthsensorsformanyreasons;additionalnoiseelimination,morecomplete3Dreconstructionandrobotnavigation.

Inthiswork,weproposeanewsub-pixelexternalcalibrationandregistrationmethodofmultipledepthcameras.Weproposeanewchessboardpatternfitforlowresolutiondepthimages.Insteadofusingcornerpointforcorrespondencedetectionbetweenmultipledepthimages,weuseourchessboardpatternstructurematchingschemeforhigheraccuracyofcorrespondencedetectioninsuchchallenginglowresolutioncondition.Ratherthandirectmatchingbetweenthecuesextractedfrommultipleimages,webuildaparameterizedmodelofournewchessboardpatternandperformthematchingbetweenthemodelandeachimage.Bydoingthiswecanfinelytunethematchinglocationforsub-pixelexternalcalibration.Simulationandexperimentalresultsonrealdepthimagesprovetheproposedsub-pixelexternalcalibrationandregistrationshowingaround40%ofaccuracygain.Theproposedmethodwillcontributefurtheruseofmultipledepthimagesofotherresearchersprovidingmergeddepthpointcloudfrommultipledepthsensors.


Efficient synthetic refocusing method from multiple coded aperture images for 3D user interactionSungjooSuh,ChangkyuChoi,DusikPark,Chang-YeongKim,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Inthispaper,weproposeanefficientsyntheticrefocusingmethodfrommultiplecodedapertureimagesfor3Duserinteraction.Theproposed

Conference 8657


methodisappliedtoaflatpaneldisplaywithasensorpanelwhichformslens-lessmulti-viewcameras.Tocapturethesceneinfrontofthedisplay,themodifieduniformlyredundantarrays(MURA)patternsaredisplayedontheLCDscreenwithoutthebacklight.ThroughtheimagingpatternsontheLCDscreen,MURAcodedimagesarecapturedinthesensorpanel.Insteadofdecodingallcodedimagestosyntheticallygeneratearefocusedimage,theproposedmethodonlydecodesonecodedimagecorrespondingtotherefocusingimageatacertaindistanceaftercircularlyshiftingandaveragingallcodedimages.Further,basedontheproposedrefocusingmethod,thedepthofanobjectinfrontofthedisplayisestimatedbyfindingthemostfocusedimageforeachpixelthroughastackoftherefocusedimagesatdifferentdepthlevels.Experimentalresultsshowthattheproposedmethodcapturesanobjectinfrontofthedisplay,generatesrefocusedimagesatdifferentdepthlevels,andaccuratelydeterminesthedepthofanobjectincludingrealhumanhandsnearthedisplay.


Multiscale based adaptive contrast enhancement MuhammadI.Abir,MissouriUniv.ofScienceandTechnology(UnitedStates)andIdahoNationalLab.(UnitedStates);FahimaF.Islam,MissouriUniv.ofScienceandTechnology(UnitedStates);DanielM.Wachs,IdahoNationalLab.(UnitedStates);HyoungKooLee,MissouriUniv.ofScienceandTechnology(UnitedStates)

Acontrastenhancementalgorithmisdevelopedforenhancingthecontrastofx-rayimages.ThealgorithmisbasedonLaplacianpyramidimageprocessingtechnique.Theimageisdecomposedintothreefrequencysub-bands-low,medium,andhigh.Eachsub-bandcontainsdifferentfrequencyinformationoftheimage.Thedetailstructureoftheimageliesonthehighfrequencysub-bandandtheoverallstructureliesonthelowfrequencysub-band.Apparentlyitisdifficulttoextractdetailstructurefromthehighfrequencysub-bands.Enhancementofthedetailstructuresisnecessaryinordertofindoutthecalcificationsonthemammograms,cracksonanyobjectsuchasfuelplate,etc.Inourproposedmethodcontrastenhancementisachievedfromhighandmediumfrequencysub-bandimagesbydecomposingtheimagebasedonmulti-scaleLaplacianpyramidandenhancingcontrastbysuitableimageprocessing.StandardDeviation-basedModifiedAdaptivecontrastenhancement(SDMACE)techniqueisappliedtoenhancethelow-contrastinformationonthesub-bandswithoutovershootingnoise.Analpha-trimmedmeanfilterisusedinSDMACEforsharpnessenhancement.Aftermodifyingallsub-bandimages,thefinalimageisderivedfromreconstructionofthesub-bandimagesfromlowerresolutionleveltoupperresolutionlevelincludingtheresidualimage.Todemonstratetheeffectivenessofthealgorithmanx-rayofafuelplateandtwomammogramsareanalyzed.Subjectiveevaluationisperformedtoevaluatetheeffectivenessofthealgorithm.Theproposedalgorithmiscomparedwiththewell-knowncontrastlimitedadaptivehistogramequalization(CLAHE)algorithm.Experimentalresultsprovethattheproposedalgorithmoffersimprovedcontrastofthex-rayimages.

8657-19, Session 6

An enhanced grid-based Bayesian array for target trackingQianSang,ZongliLin,ScottT.Acton,Univ.ofVirginia(UnitedStates)

Agrid-basedBayesianapproach(GBA)forrobustvisualtrackinghasrecentlybeendeveloped,whichproposesanovelmethodofdeterministicsamplegenerationandsampleweightingforpositionestimation.Inparticular,atargetmotionmodelisconstructed,predictingtargetpositioninthenextframebasedonestimationsinpreviousframes.Samplesaregeneratedbygriddingwithinanellipsoidcenteredattheprediction.Forlocalization,radialedgedetectionisappliedforeachsampletodetermineifitisinsidethetargetboundary.Sampleweightsarethenassignedaccordingtothenumberoftheedgepointsdetectedaroundthesampleanditsdistancefromthepredicted

position.Thepositionestimationiscomputedastheweightedsumofthesampleset.

Inthispaper,weenhancethecapacityoftheGBAtrackerinaccommodatingthetrackingoftargetsinvideowitherraticmotion,byintroducingadaptationinthemotionmodelanditerativepositionestimation.TheimprovedtrackingperformanceovertheoriginalGBAtrackeraredemonstratedintrackingasingleleukocyteinvivoandgroundvehicletargetobservedfromUAVvideos,bothundergoingabruptchangesinmotion.TheexperimentalresultsshowthattheenhancedGBAtrackeroutperformstheoriginalbytrackingmorethan10%ofthetotalnumberofframes,andincreasesthenumberofvideosequenceswithallframestrackedbygreaterthan20%.

8657-20, Session 6

A fourth-order active contour shape prior for multiple-instance object detection in imagesIkhlefBechar,INRIASophiaAntipolis-Méditerranée(France);IanH.Jermyn,DurhamUniv.(UnitedKingdom);JosianeB.Zerubia,INRIASophiaAntipolis-Méditerranée(France)

NoAbstractAvailable

8657-21, Session 6

Efficient occlusion reasoning for articulated tracking in monocular viewsLandisM.Huffman,TheMITRECorp.(UnitedStates);IlyaPollak,PurdueUniv.(UnitedStates)

Poseestimationandtrackingofarticulatedobjectslikehumansisparticularlydifficultduetothecomplexocclusionsamongthearticulatedparts.Withoutthebenefitofmultipleviews,resolutionofocclusionsbecomesbothincreasinglyvaluableandchallenging.Weproposeamethodforarticulated3Dposeestimationfrommonocularvideowhichusesnonparametricbeliefpropagationandemploysanovelandefficientapproachtoocclusionreasoning.Wepresentahumantrackingapplication,andevaluateresultsusingthetheHumanEvaIIdataset.

8657-22, Session 6

An efficient optimizer for simple point process modelsAhmedGamalEldin,INRIARhône-Alpes(France);GuillaumeCharpiat,XavierDescombes,JosianeB.Zerubia,INRIASophiaAntipolis-Méditerranée(France)

NoAbstractAvailable

8657-23, Session 7

Texture mapping 3D planar models of indoor environments with noisy camera posesPeterCheng,MichaelAnderson,StewartHe,AvidehZakhor,Univ.ofCalifornia,Berkeley(UnitedStates)

Automated3Dmodelingofbuildinginteriorsisusefulinapplicationssuchasvirtualrealityandenvironmentmapping.Applyingtexturestothesemodelsresultsinusefulphotorealisticvisualizationsoftheindoorenvironment.Thequalityofsuchtexturesislargelydeterminedbythecameraposesusedtoprojectsourceimagesontosurfaceswhiletexturing.Incaseswherecameraposesarehighlyimprecise,strongmisalignmentsarevisibleinareaswheresuccessiveimagesareprojected.Weproposetwoapproachestoreducediscontinuitiesintexturemapping3Dmodelsmadeofplanarsurfaces.Thefirstoneisrobusttoimagestakenfromgreatlyvaryingangles,andcanbeused

Conference 8657


forallmannerofimagesandplanes.Thesecondapproachproducesmoreseamlesstexturesbytakingadvantageofscenarioswherecameraheadingsarelargelyperpendiculartotheirrespectivesurfacetobetextured.Thisapproachtypicallyresultsinmoreartifact-freetexturemapping.Weproposeaschemewherebythefirstmethodisusedtotexturemapfloorsandceilings,andthesecondoneisusedonwalls.Wedemonstratetheeffectivenessofthisschemetoanumberofdatasetscapturedbyanambulatory3Dmodelingsystemequippedwithlaserscannersandcameras.

8657-24, Session 7

Optical touch sensing: practical bounds for design and performanceAlexanderBlaessle,UBCOkanagan(Canada);BebartJanbek,SimonFraserUniv.(Canada);LifengLiu,Univ.ofPittsburgh(UnitedStates);KannaNakamura,Univ.ofMaryland,CollegePark(UnitedStates);KimberlyNolan,DrexelUniv.(UnitedStates);VictorParaschiv,Univ.ofVictoria(Canada)

Touchscreensarecommonlyusedinmanyconsumerapplications,ranginginsizefromsmartphonesandtabletstotouch-wallsandcollaborative-surfaces.Opticaltouchsensingtechnologyisespeciallywellsuitedforlarge-sizetouchsurfaces,sincecamerasandlightsourcesareusuallyplacedontheperimeterofthetoucharea.Thus,thecostofthesolutionisproportionaltotheperimeterratherthantheareaofthescreen.Thenumberofcamerasandlightsourcesplaysanimportantroleindeterminingthecostandperformanceofthesystem.Inthiswork,weanalyzeanddeterminevariousboundsandtradeoffsrelatingthenumberofsensorsusedandtheresultingresolutionanddiscrimination-capabilityofthesystem.

Westartwithanalyzingthesystem’sabilitytodiscriminatetwofingerstouchingthescreen.Specifically,wedeterminetheminimumnumberofcamerasrequiredtoguaranteetheabilitytodiscerntwocircularobjectsonthescreen.Thenextproblemweconsideristhatofidentifyingpolygonstouchingthescreenanddescribeamethodforplacingthecamerasthatensuresthisability.Weconcludewithadirectnumericaltooltorelatethenumberofcamerasandlightsourcesandtheresultingresolutionofthetouchscreenusingsomepracticalcases.

8657-25, Session 8

Light field image denoising using a linear 4D frequency-hyperfan all-in-focus filterDonaldG.Dansereau,DanielL.Bongiorno,OscarPizarro,StefanB.Williams,TheUniv.ofSydney(Australia)

Imaginginlowlightisproblematicassensornoisecandominateimagery,andincreasingilluminationoraperturesizeisnotalwayseffectiveorpractical.Computationalphotographyoffersapromisingsolutionintheformofthelightfieldcamera,whichbycapturingredundantinformationoffersanopportunityforelegantnoiserejection.WeshowthatthelightfieldofaLambertianscenehasa4Dhyperfan-shapedfrequency-domainregionofsupport,andbydesigningandimplementingafilterwithappropriatelyshapedpassbandweaccomplishdenoisingwithasingleall-in-focuslinearfilter.DrawingexamplesfromtheStanfordLightFieldArchivewithsyntheticnoise,andreal-worldimagerycapturedusingtheLytroconsumer-gradeplenopticcamera,wedemonstratethatthehyperfanoutperformscompetingmethodsincludingsyntheticfocus,fan-shapedantialiasingfilters,andarangeofmodernnonlinearimageandvideodenoisingtechniques.Weshowthefilterpreservesdepthoffield,makingitasingle-stepall-in-focusdenoisingfiltersuitableforgeneral-purposelightfieldrendering.Weincluderesultsfordifferentnoisetypesandlevels,overavarietyofmetrics,andinreal-worldscenarios.Finally,weshowthatthehyperfan’sperformancescaleswithaperturecount.

8657-26, Session 8

Computational imaging approach for fanbeam x-ray scatter imagingJosephAO’Sullivan,DavidG.Politte,WashingtonUniv.inSt.Louis(UnitedStates);KennethMacCabe,KalyaniKrishnamurthy,DukeUniv.(UnitedStates);IkennaOdinaka,WashingtonUniv.inSt.Louis(UnitedStates);AnujKapadia,DavidJ.Brady,DukeUniv.(UnitedStates)

Inx-rayscatterimaging,tomographicmeasurementsoftheforwardscatterdistributionareusedtoinferscatterdensitieswithinavolume.Amaskplacedbetweentheobjectandthedetectorarrayprovidesinformationaboutscatterangles.AnefficientcomputationalimplementationoftheforwardandbackwardmodelfacilitateiterativealgorithmsbaseduponaPoissonlog-likelihood.ResultsarepresentedonsimulatedandMonteCarlodata.

8657-27, Session 8

Robust registration of electron tomography projections without fiducial makersVietDungTran,MaximeMoreaud,IFPEnergiesNouvelles(France);ÉricM.Thiébaut,Ctr.deRechercheAstronomiquedeLyon(France);LoïcDénis,JeanMarieBecker,Lab.HubertCurien,CNRS(France)

Amajorissueinelectrontomographyisthemisalignmentoftheprojectionscontributingtothereconstruction.Thecurrentalignmenttechniquesusuallyemployfiducialmarkerssuchasgoldparticlesforacorrectalignmentoftheimages.Whentheuseofmarkersisnotpossible,thecorrelationbetweenadjacentprojectionsisusedtoalignthem.However,thismethodsometimesfails.Inthispaper,weproposeanewmethodforthealignmentoftransmissionelectronmicroscopy(TEM)imagesserieswithouttheneedoffiducialmarkers.Theproposedapproachiscomposedoftwosteps.Thefirststepconsistsofaninitialalignmentprocess,wherereliesontheminimizationofacostfunctionbasedonrobuststatisticsmeasuringthesimilarityofaprojectiontoitspreviousprojectionsintheseries.Itreducestrongshiftsresultingfromtheacquisitionbetweensuccessiveprojections.Thesecondstepsalignstheprojectionsfinely.Pre-registeredprojectionsareusedtoinitializeaniterativerefinementalignmentprocess,whichalignseachprojectionwithitssimulatedversionobtainedfromreconstructedvolume.WehavesuccessfullytestedourmethodwithrealprojectionsofazeolitesupportcatalystwithTEMimages.Itrecoversinanaccuratemannerthechangesintranslation,rotationandscalingparameters.

8657-28, Session 9

Low signal noise modeling for statistical CT reconstructionJean-BaptisteThibault,GEHealthcare(UnitedStates);KenD.Sauer,Univ.ofNotreDame(UnitedStates);CharlesA.Bouman,PurdueUniv.(UnitedStates)

NoAbstractAvailable

8657-29, Session 9

Joint reconstruction and segmentation of electron tomography dataAhmetTuysuzoglu,WilliamC.Karl,DavidA.Castañón,M.SelimÜnlü,BostonUniv.(UnitedStates)

NoAbstractAvailable

Conference 8657


8657-30, Session 9

Analysis of image color and effective bandwidth as a tool for assessing air pollution at urban spatiotemporal scaleYaelEtzion-Cohen,DavidM.Broday,BarakFishbain,Technion-IsraelInstituteofTechnology(Israel)

Sizeandconcentrationofairborneparticulatematter(PM)areimportantindicatorsforairpollutioneventsandpublichealthrisks,sinceexposuretofineparticlessmallerthan2.5?m(PM2.5)hasbeenassociatedwithadversehealtheffects.ItisthereforeimportanttomonitorthespatiotemporalvariationsofPMsizeresolvedconcentrationsintheambientairnearthegroundatanurban-scaleresolution.ThesuggestedmethodhereutilizesquantitativemeasuresofurbansceneimagecharacteristicsforpredictingPMconcentrations.OurstudyexploresdifferentquantitativecharacteristicsoftheimagecontrastaspotentialpredictionvariablesforPMconcentrations,includingFishbain’setal.(2008)imageeffectivebandwidth(IEB)andyellowintensitylevel,whichisphysicallyrelatedtoscatteringofradianceinthevisiblerangebyfineparticlessmallerthan1?m.Forvalidatingthesuggestedmethod,wehaveassembledalargedatasetthatconsistsoftimeseriesimagingofmixedscenery(industrialfacilities,roadsandresidentialneighborhoods)andmeasurementsfromairqualitymonitoringstationsinthestudiedregionthatmonitorPMconcentrationsandmeteorologicaldata.QuantitativeandqualitativestatisticalevaluationofthesuggestedmethodshowsthatPMdynamicsandconcentrationscanbeinferredandstudiedfromtheacquiredimages.

Conference 8657


Conference 8658: Document Recognition and Retrieval XXTuesday-Thursday5–7February2013

Part of Proceedings of SPIE Vol. 8658 Document Recognition and Retrieval XX

8658-1, Session 1

Semi-structured document image matching and recognitionOlivierAugereau,NicholasJournet,Jean-PhilippeDomenger,LaBRI(France)

Thisarticlepresentsamethodtorecognizeandtolocalizesemi-structureddocumentssuchasIDcards,tickets,invoices,etc.Standardobjectrecognitionmethodsbasedoninterestpointsworkwellonnaturalimagesbutfailondocumentimagesbecauseofrepetitivepatternsliketext.Inthisarticle,weproposeanadaptationofobjectrecognitionforimagedocuments.Theadvantagesofourmethodisthatitdoesnotusecharacterrecognitionorsegmentationanditisrobusttorotation,scale,illumination,blur,noiseandlocaldistortions.Furthermore,testsshowthatanaverageprecisionof97.2%andrecallof94.6%isobtainedformatching7differentkindsofdocumentsinadatabaseof2155documents.

8658-2, Session 1

Rotation-robust math symbol recognition and retrieval using outer contours and image subsamplingSiyuZhu,LeiHu,RichardZanibbi,RochesterInstituteofTechnology(UnitedStates)

Thispaperpresentsanunifiedrecognitionandretrievalsystemforisolatedofflineprintedmathematicalsymbolsforthefirsttime.ThesystemisbasedonnearestneighborschemeandusesmodifiedTurningFunctionandGridFeaturestocalculatethedistancebetweentwosymbolsbasedonSumofSquaredDifference(SSD).AnunwrapprocessandanalignmentprocessareappliedtomodifyTurningFunctiontodealwiththehorizontalandverticalshiftscausedbythechangingofstaringpointandrotation.ThismodifiedTurningFunctionmakesoursystemrobustagainstrotationofthesymbolimage.Thesystemobtainstop-1recognitionrateof96.90%and47.27%AreaUnderCurve(AUC)ofPrecision/RecallplotontheInftyCDB-3dataset.ExperimentresultshowsthatthesystemwithmodifiedTurningFunctionperformsmuchbetterthanthesystemwithoriginalTurningFunctionontherotatedInftyCDB-3dataset.

8658-3, Session 1

NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word imagesDeepakKumar,M.N.AnilPrasad,A.G.Ramakrishnan,IndianInstituteofScience(India)

Inthispaper,wereportabreakthroughresultonthedifficulttaskofsegmentationandrecognitionofcolouredtextfromthewordimagedatasetofICDARrobustreadingcompetitionchallenge2:readingtextinsceneimages.Wesplitthewordimageintoindividualcolour,grayandluminousplanes;Wethenenhancethecontrastofeachoftheseplanesindependentlybyapower-lawtransform.Wecomputethediscriminationfactorofeachplane.Theplanethathasmaximumdiscriminationfactorisselectedforsegmentation.WeusethetrialversionofOmnipageOCRonthebinarizedwordsforrecognition.WecompareourrecognitionresultswiththosereportedonICDAR

2011worddatasetandalsoonICDAR2003worddataset.ThewordrecognitionrateofNESPmethodis72.77%and66.22%forICDAR2011and2003worddatasets,respectively.Wehavecreatedground-truthforeachimageatthepixelleveltobenchmarkthesedatasets.WehaveusedourMASTtoolkitforcreationofground-truthimages.Therecognitionrateofbenchmarkedimagesis86.73%and83.87%forICDAR2011and2003datasets,respectively.

8658-4, Session 2

Combining evidence using likelihood ratios in writer verificationSargurN.Srihari,Univ.atBuffalo(UnitedStates);DimitryKovalenko,YiTang,GregoryBall,Univ.atBuffalo,SUNY(UnitedStates)

Forensicidentificationisthetaskofdeterminingwhetherornotobservedevidencearosefromaknownsource.Itinvolvesdeterminingalikelihoodratio(LR)--theratioofthejointprobabilityoftheevidenceandsourceundertheidentificationhypothesis(thattheevidencecamefromthesource)andundertheexclusionhypothesis(thattheevidencedidnotarisefromthesource).InLR-baseddecisionmethods,particularlyhandwritingcomparison,avariablenumberofinputevidencesisused.AdecisionbasedonmanypiecesofevidencecanresultinnearlythesameLRasonebasedonfewpiecesofevidence.Weconsidermethodsfordistinguishingbetweensuchsituations.Oneoftheseistoprovideconfidenceintervalstogetherwiththedecisionsandanotheristocombinetheinputsusingweights.WesuggestanewmethodthatgeneralizesBayesianapproachandusesanexplicitlydefineddiscountfunction.Empiricalevaluationwiththreedatasetsincludinglarge-scalesynthesizeddataandspecificexamplesofhandwritingcomparisonshowsgreaterflexibilityofthedesignedmethod.

8658-5, Session 2

Handwritten word preprocessing for database adaptationCristinaA.Oprean,LaurenceLikforman-Sulem,TelecomParisTech(France);ChaficMokbel,UniversityofBalamand(Lebanon)

Handwritingrecognitionsystemsaretypicallytrainedusingpubliclyavailabledatabases,wheredatahavebeencollectedincontrolledconditions(imageresolution,paperbackground,noiselevel,..).Sincethisisnotoftenthecaseinreal-worldscenarios,classificationperformancecanbeaffectedwhennoveldataispresentedtothewordrecognitionsystem.Toovercomethisproblem,wepresentinthispaperanewapproachcalleddatabaseadaptation.Itconsistsofprocessingoneset(trainingortest)inordertoadaptittotheotherset(testortraining,respectively).Weconsideraspecificcaseofprocessing,namelystrokethicknessnormalizationandpixelintensityuniformization.Theadvantageofsuchapproachisthatwecanre-usetheexistingrecognitionsystemtrainedoncontrolleddata.WeconductseveralexperimentswiththeRimes2011worddatabaseandareal-worlddatabase.Weadapteitherthetestsetorthetrainingset.Resultsshowthattrainingsetadaptationachievesslightlybetterresultsthantestsetadaptation,atthecostofasecondtrainingstageontheadapteddata.Accuracyofdatasetadaptationisincreasedby2%to3%inabsolutevalueovernoadaptation.


8658-6, Session 2

Optimal policy for labeling training samplesLesterLipsky,Univ.ofConnecticut(UnitedStates);DanielLopresti,LehighUniv.(UnitedStates);GeorgeNagy,RensselaerPolytechnicInstitute(UnitedStates)

Confirmingthelabelsofautomaticallyclassifiedpatternsisgenerallyfasterthanenteringnewlabelsorcorrectingincorrectlabels.Mostlabelsassignedbyaclassifier,eveniftrainedonlyonrelativelyfewpre-labeledpatterns,arecorrect.Thereforetheoverallcostofhumanlabelingcanbedecreasedbyinterspersinglabelingandclassification.Givenaparameterizedmodeloftheerrorrateasaninversepowerlawfunctionofthesizeofthetrainingset,theoptimalsplitscanbecomputedrapidly.Projectedsavingsinoperatortimeareover60%forarangeofempiricalerrorfunctionsforhand-printeddigitclassificationwithtendifferentclassifiers.

8658-7, Session 2

Evaluation of lexicon size variations on a verification and rejection system based on SVM, for accurate and robust recognition of handwritten wordsYannRicquebourg,BertrandCouasnon,IRISA/INSAdeRennes(France);LaurentGuichard,E2ISAS(France)

Thetranscriptionofhandwrittenwordsremainsastillchallenginganddifficulttaskremains.Whenprocessingfullpages,approachesarelimitedbythetrade-offbetweenautomaticrecognitionerrorsandthetediousaspectofhumanuserverification.Inthisarticle,wepresentourinvestigationstoimprovethecapabilitiesofanautomaticrecognizer,soastobeabletorejectunknownwords(nottotakewrongdecisions)whilecorrectlyrejecting(i.e.torecognizeasmuchaspossiblefromthelexiconofknownwords).

Thisistheactiveresearchtopicofdevelopingaverificationsystemthatoptimizethetrade-offbetweenperformanceandreliability.Tominimizetherecognitionerrors,averificationsystemisusuallyusedtoacceptorrejectthehypothesesproducedbyanexistingrecognitionsystem.Thus,were-useournovelverificationarchitecture1here:therecognitionhypothesesarere-scoredbyasetofthesupportvectormachines,andvalidatedbyaverificationmechanismbasedonmultiplerejectionthresholds.Inordertotunethese(class-dependent)rejectionthresholds,analgorithmbasedondynamicprogramminghasbeenproposedwhichfocusonmaximizingtherecognitionrateforagivenerrorrate.

ExperimentshavebeencarriedoutontheRIMESdatabaseinthreesteps.Thefirsttwoshowedthatthisapproachresultsinaperformancesuperiororequaltootherstate-of-the-artrejectionmethods.Wefocushereonthethirdoneshowingthatthisverificationsystemalsogreatlyimprovesresultsofkeywordsextractioninasetofhandwrittenwords,withastrongrobustnesstolexiconsizevariations(21lexiconshavebeentestedfrom167entriesupto5,600entries)whichisparticularlyrelevanttoourapplicationcontextcooperatingwithhumans,andonlymadepossiblethankstotherejectionabilityofthisproposedsystem.Theproposedverificationsystem,comparedtoaHMMwithsimplerejection,improvesonaveragetherecognitionrateby57%(resp.33%and21%)foragivenerrorrateof1%(resp.5%and10%).

Keywords:Handwrittenwordrecognizer,Verificationsystem,SVMre-scoring,Rejectionmethod,Lexiconsizevariation,RIMESdatabase

8658-8, Session 3

Comic image understanding based on polygon detectionLuyuanLi,YongtaoWang,ZhiTang,DongLiu,PekingUniv.(China)

Comicimageunderstandingaimstoautomaticallydecomposescannedcomicpageimagesintostoryboardsandthenidentifythereadingorderofthem,whichisthekeytechniquetoproducedigital

comicdocumentsthataresuitableforreadingonmobiledevices.Weproposeanovelcomicimageunderstandingmethodbasedonpolygondetectioninthispaper.First,wesegmentacomicpageimagesintostoryboardsbyfindingthepolygonalenclosingboxofeachstoryboard.Then,eachstoryboardcanberepresentedbyapolygon,andthereadingorderofthemisdeterminedbyanalyzingtherelativegeometricrelationshipbetweeneachpairofpolygons.Theproposedmethodistestedon2000comicimagesfromtenprintedcomicseries,andtheexperimentalresultsdemonstratethatitworkswellondifferenttypesofcomicimages.

8658-9, Session 3

Context modeling for text/non-text separation in free-form online handwritten documentsAdrienDelaye,Cheng-LinLiu,InstituteofAutomation(China)

Free-formonlinehandwrittendocumentscontainahighdiversityofcontent,organizedwithoutconstraintsimposedtotheuser.Thelackofpriorknowledgeaboutcontentandlayoutmakesthemodelingofcontextualinformationofcrucialimportanceforinterpretationofsuchdocuments.Inthiswork,wepresentacomprehensiveinvestigationofthesourcesofcontextualinformationthatcanbenefitthetaskofdiscerningtextualfromnon-textualstrokesinhandwrittenonlinedocuments.Anin-depthanalysisofinteractionsbetweenstrokesisconductedthroughthedesignofvariouspairwisecliquesystemsthatarecombinedwithinaConditionalRandomFieldformulationofthestrokelabellingproblem.Ourresultsdemonstratethebenefitsofcombiningcomplementarysourcesofcontextforimprovingthetext/non-textrecognitionperformance.

8658-10, Session 3

Annotating image ROIs with text descriptions for multimodal biomedical document retrievalDaekeunYou,NationalLibraryofMedicine(UnitedStates);MatthewSimpson,NationalLibraryofMedicine/NIH(UnitedStates);SameerK.Antani,DinaDemner-Fushman,GeorgeR.Thoma,NationalLibraryofMedicine(UnitedStates)

NoAbstractAvailable

8658-11, Session 3

Graphic composite segmentation for PDF documents with complex layoutsCanhuiXu,ZhiTang,XinTao,CaoShi,PekingUniv.(China)

ConvertingthePDFbookstore-flowableformathasrecentlyattractedvariousinterestsintheareaofe-bookreading.RobustgraphicsegmentationishighlydesiredforincreasingthepracticabilityofPDFconverters.Tocopewithvariouslayouts,amulti-layerconceptisintroducedtosegmentthegraphiccompositesincludingphotographicimages,drawingswithtextinsetsorsurroundedwithtextelements.Bothimagebasedanalysisandinherentdigitalborndocumentadvantagesareexploitedinthismulti-layerbasedlayoutanalysismethod.Bycombininglow-levelpageelementsclusteringappliedonPDFdocumentsandconnectedcomponentanalysisonsyntheticallygeneratedPNGimagedocument,graphiccompositescanbesegmentedforPDFdocumentswithcomplexlayouts.TheexperimentalresultsongraphiccompositesegmentationofPDFdocumentpageshaveshownsatisfactoryperformance.

Conference 8658


8658-12, Session 4

A classification-free word-spotting systemNikosVasilopoulos,ErginaKavallieratou,Univ.oftheAegean(Greece)

Inthispaper,aclassification-freeWord-Spottingsystem,appropriatefortheretrievalofprintedhistoricaldocumentimagesisproposed.Thesystemskipsmanyoftheproceduresofacommonapproach.Itdoesnotincludesegmentation,featureextractionorclassification.Insteadittreatsthequeriesascompactshapesandusesimageprocessingtechniquesinordertolocalizeaqueryinthedocumentimages.OursystemwastestedonahistoricaldocumentcollectionwithmanyproblemsandaGooglebook,printedin1675.

8658-13, Session 4

Combining geometric matching with SVM to improve symbol spottingNibalNayef,ThomasM.Breuel,TechnischeUniv.Kaiserslautern(Germany)

Symbolspottingisimportantforautomaticinterpretationoftechnicallinedrawings.Currentspottingmethodsarenotreliableenoughforsuchtasksduetolowprecisionrates.Inthispaper,wecombineageometricmatching-basedspottingalgorithmwithanSVMclassifiertoimprovethespotting’sprecision.Insymbolspotting,aquerysymbolistobelocatedwithinalinedrawing.Candidatematchescanbefound,however,thefoundmatchesmaybetrueorfalse.Todistinguishafalsematch,anSVMclassifierisused.Theclassifieristrainedontrueandfalsematchesofaquerysymbol.Thematchesarerepresentedasvectorsthatindicatethequalitiesofhowwellthequeryfeaturesarematched,thosequalitiesareobtainedviageometricmatching.Usingtheclassification,theprecisionofthespottingimprovedfromanaverageof76.6%toanaverageof97.2%onadatasetoftechnicallinedrawings.

8658-14, Session 4

Segmentation-free keyword spotting framework using dynamic background modelGauravKumar,SafwanR.Wshah,VenuGovindaraju,Univ.atBuffalo(UnitedStates);RamachandrulaSitaram,Hewlett-PackardLabs.India(India)

WeproposeasegmentationfreewordspottingframeworkusingDynamicBackgroundModel.Theproposedapproachisanextensiontoourpreviousworkwheredynamicbackgroundmodelwasintroducedandintegratedwithasegmentationbasedrecognizerforkeywordspotting.Thedynamicbackgroundmodelusesthelocalcharactermatchingscoresandglobalwordlevelhypothesesscorestoseparatekeywordsfromnon-keywords.WeintegrateandevaluatethismodelonHiddenMarkovModel(HMM)basedsegmentationfreerecognizerwhichworksatlinelevelwithoutanyneedforwordsegmentation.WeoutperformthestateoftheartlinelevelwordspottingsystemonIAMdataset.

8658-15, Session K1

History of the Tesseract OCR engine: what worked and what didn’t (Keynote Presentation)RayWSmith,Google(UnitedStates)

ThedevelopmenthistoryandcurrentstateoftheTesseractopensourceOCREngineislaidoutandcomparedtochangesintheOCRfieldoverthelast20yearsorso.Theuseandabandonmentof

statisticalclassificationapproachesisexaminedinparticular.Therearesignificantsimilaritiesbetweenthenon-statisticalapproachesusedinTesseractandthemethodsusedinHiddenMarkovModelOCRapproaches,andthepaperdrawsattentiontotherecentuseoftheterms“ad-hoc”and“principled”thatattempttoseparatethesestrangelysimilarapproaches.ArecurringthemethroughoutthepaperisthelessonslearnedfromdevelopinganindustrialstrengthOCRsystemoveraperiodof20years,notjustcoveringfeatureandclassifierdesign,butalsotheveryimportanttopicsoftestingandlanguagecoverage.ThediscussionoflanguagesprovidesaminimalsetthatcoversthesignificantproblemsthatatrulyinternationalindustrialstrengthOCRsystemmustaddress.Resultsareprovidedon9differentlanguageswithatotaltestsetsizeexceeding350millioncharacters.

8658-16, Session 5

Data acquisition from cemetery headstonesCameronS.Christiansen,WilliamA.Barrett,BrighamYoungUniv.(UnitedStates)

Dataextractionfromengravedtextisdiscussedrarely,andnothingintheopenliteraturediscussesdataextractionfromcemeteryheadstones.Headstoneimagespresentuniquechallengessuchasengravedorembossedcharacters(causinginner-charactershadows),lowcontrastwiththebackground,andsignificantnoiseduetoinconsistentstonetextureandweathering.Currentsystemsforextractingtextfromoutdoorenvironments(billboards,signs,etc.)makeassumptions(i.e.cleanand/orconsistently-texturedbackgroundandtext)thatfailwhenappliedtothedomainofengravedtext.Theabilitytoextractthedatafoundonheadstonesisofgreathistoricalvalue.Thispaperdescribesanovelandefficientfeature-basedtextzoningandsegmentationmethodfortheextractionofnoisytextfromahighlytexturedengravedmedium.Thispaperalsodemonstratestheusefulnessofconstrainingaproblemtoaspecificdomain.Thetranscriptionsofimageszonedandsegmentedthroughtheproposedsystemhaveaprecisionashighas87%comparedto2%precisionwithoutzoninganda70%recallcomparedto50%.

8658-17, Session 5

Automated recognition and extraction of tabular fields for the indexing of census recordsRobertClawson,KevinBauer,GlenChidester,MilanTyler-Pohontsch,DouglasJ.Kennard,JonghaRyu,WilliamA.Barrett,BrighamYoungUniv.(UnitedStates)

Wedescribeasystemforindexingofcensusrecordsintabulardocumentswiththegoalofrecognizingthecontentofeachcell,includingbothheadersandhandwrittenentries.Eachdocumentisautomaticallyrectified,registeredandscaledtoaknowntemplatefollowingwhichlinesandfieldsaredetectedanddelimitedascellsinatabularform.Whole-wordorwhole-phraserecognitionofnoisymachine-printedtextisperformedusingaglyphlibrary,providinggreatlyincreasedefficiencyandaccuracy(approaching100%),whileavoidingtheproblemsinherentwithtraditionalOCRapproaches.Constrainedhandwritingrecognitionresultsforasingleauthorreachashighas98%and94.5%fortheGenderfieldandBirthplacerespectively.Multi-authoraccuracy(currently82%)canbeimprovedthroughanincreasedtrainingset.Activeintegrationofuserfeedbackinthesystemwillacceleratetheindexingofrecordswhileprovidingatightlycoupledlearningmechanismforsystemimprovement.

8658-18, Session 5

Old document image segmentation using the autocorrelation function and multiresolution analysisMarouaMehri,Univ.deLaRochelle(France)andUniv.ofRouen

Conference 8658


(France);PetraGomez-Krämer,Univ.deLaRochelle(France);PierreHéroux,Univ.ofRouen(France);RémyMullot,Univ.deLaRochelle(France)

Recentprogressinthedigitizationofheterogeneouscollectionsofhistoricaldocumentshasrekindlednewchallengesininformationretrievalindigitallibrariesanddocumentlayoutanalysis.Therefore,inordertocontrolthequalityofhistoricaldocumentimagedigitizationandtomeettheneedofacharacterizationoftheircontentusingintermediatelevelmetadata,weproposeanautomaticpixel-basedlayoutsegmentationmethodofhistoricaldocumentimagescontentbasedonfeaturescomputedwiththeautocorrelationfunctionandperformedusingmultiresolutionanalysis.Thealgorithmproposedinthisarticlehasadvantagetobeperformedwithoutexpressinganyhypothesis:eitheraboutthedocumentmodel(physicalstructure),ortypographicalparameters(logicalstructure).Itisalsoparameter-freesinceitautomaticallyadaptstotheimagecontent.Wedemonstratethatispossibletoautomaticallyfindthehomogeneousregionsdefinedbysimilarindicesofautocorrelationwithoutaprioriknowledgeusingadaptedclusteringapproach.Toassessourmethod,weapplyouralgorithmonheterogeneouscorpuscontent.Moreover,wedefineanewevaluationmetric,thehomogeneitymeasure,whichaimsatevaluatingthesegmentationandcharacterizationaccuracyofourproposal.Theoverallresultsarequitesatisfying.Suchanalysiswouldhelpprovidingasimilaritymeasurebetweenpagesandconstructingacomputer-aidedcategorizationtoolofpages.

8658-19, Session 5

Lexicon-supported OCR of eighteenth century Dutch books: a case studyJessedeDoes,KatrienDepuydt,LeidenUniv.(Netherlands)

IMPACT(2008-2012)isaprojectfundedbytheEuropeanCommission.ItsaimwastosignificantlyimproveaccesstohistoricaltextandtotakeawaythebarriersthatstandinthewayofthemassdigitizationoftheEuropeanculturalheritage.ForthatpurposeIMPACTaimedtoimprovethequalityofOCR(OpticalCharacterRecognition)forhistoricaldocumentsandtoenhancetheiraccessibility.TherearemanyaspectsinvolvedindealingwiththeproblemsaddressedbyIMPACT.Imageprocessing,whichtriestoremedytypicalproblemslikeskewed,warpedorotherwisenoisydata;bettersegmentationproceduresandadaptiveOCRaimtoovercometheirregularitiesofhistoricaltypography.ThepresentcontributionfocusesonenhancementofOCRresultsbyusingtheappropriatehistoricallexica.WereportonacasestudyonOCRofeighteenthcenturybooksconductedintheIMPACTproject.AfterintroducingtheIMPACTprojectanditsapproachtolexiconbuildinganddeployment,wezoomintotheapplicationofIMPACTtoolsanddatatotheDutchEDBOcollection.TheresultsareexemplifiedbydetaileddiscussionofvariouspracticaloptionstoimprovetextrecognitionbeyondabaselineofrunninganuncustomizedFinereader10.Inparticular,wediscussrecognitionoflongs.

8658-20, Session 6

Character feature integration of Chinese calligraphy and fontCaoShi,JianguoXiao,WenhuaJia,PekingUniv.(China);CanhuiXu,PekingUniv.(China)andBeijingFounderApabiTechnologyLtd.(China)

AframeworkisproposedinthispapertoeffectivelygenerateanewhybridcharactertypebymeansofintegratinglocalcontourfeatureofChinesecalligraphywithstructuralfeatureoffontincomputersystem.Toexploretraditionalartmanifestationofcalligraphy,multi-directionalspatialfilterisappliedforlocalcontourfeatureextraction.Thenthecontourofcharacterimageisdividedintosub-images.Thesub-imagesintheidenticalpositionfromvariouscharactersareestimatedbyGaussiandistribution.Accordingtoitsprobabilitydistribution,thedilationoperatoranderosionoperatoraredesignedtoadjusttheboundaryoffontimage.AndthennewChinesecharacterimagesaregeneratedwhichpossessbothcontourfeatureofartistical

calligraphyandelaboratestructuralfeatureoffont.Experimentalresultsdemonstratethenewcharactersarevisuallyacceptable,andtheproposedframeworkisaneffectiveandefficientstrategytoautomaticallygeneratethenewhybridcharacterofcalligraphyandfont.

8658-21, Session 6

A segmentation-free approach to Arabic and Urdu OCRNazlySabbour,TheGermanUniv.inCairo(Egypt);FaisalShafait,DFKIGmbH(Germany)

WepresentagenericOpticalCharacterRecognitionsystemforArabicscriptlanguagescalledNabocr.NabocrusesOCRapproachesspecificforArabicscriptrecognition.PerformingrecognitiononArabicscripttextisrelatively

moredifficultthanLatintextduetothenatureofArabicscript,whichiscursiveandcontextsensitive.Moreover,Arabicscripthasdifferentwritingstylesthatvaryincomplexity.NabocrisinitiallytrainedtorecognizebothUrduNastaleeqandArabicNaskhfonts.However,itcanbetrainedbyuserstobeusedforotherArabicscriptlanguages.Wehaveevaluatedoursystem’sperformanceforbothUrduandArabic.InordertoevaluateUrdurecognition,wehavegeneratedadatasetofUrdutextcalledUPTI(UrduPrintedTextImageDatabase),whichmeasuresdifferentaspectsofarecognitionsystem.

TheperformanceofoursystemforUrducleantextis91%.ForArabiccleantext,theperformanceis86%.Moreover,wehavecomparedtheperformanceofoursystemagainstTesseract’snewlyreleasedArabicrecognition,andtheperformanceofbothsystemsoncleanimagesisalmostthesame.

8658-22, Session 6

Local projection-based character segmentation method for historical Chinese documentsLinjieYang,LiangruiPeng,TsinghuaUniv.(China)

DigitizationofhistoricalChinesedocumentsincludestwokeytechnologies,charactersegmentationandcharacterrecognition.ThispaperfocusesondevelopingCharacterSegmentationalgorithm.Astheoriginalhistoricaldocumentimagescontainvariousnoises,weproposeanEMclusteringalgorithmtoremovethenoise-likeannotationsandseals.Weuseprojectiveinformationtoremovethepageborderlines.Wealsoapplyalinearfittingmethodinalocalwindowtoremovetherulelinesbetweenthecharacterrows.ThispaperadoptstheOtsu’smethodtobinarizetheimage.ThispaperproposesanewcharactersegmentationalgorithmbasedonprojectionofCostImageinalocalwindowwhichaimsatsegmentingsinglecharacterfromthebinarizedimage.First,weusemorphologicaloperationtoextracttheskeletonimage;Thenwederivethestrokeinformationfromtheskeletonimageandgettheboundingboxofeverystroke.ThisalgorithmcombinestheinformationofstrokeboundingboxandtheskeletonimagetoformaCostImage,anduseittosegmentthecharacters.Weevaluatetheproposedalgorithmbasedonmatchingdegreeofcharacterboundingboxesbetweentheanalysisresultandtheground-truth,whichprovestheeffectivenessofthealgorithm.


A super resolution framework for low resolution document image OCRDiMa,GadyAgam,IllinoisInstituteofTechnology(UnitedStates)

NoAbstractAvailable

Conference 8658



A robust pointer segmentation in biomedical images toward building a visual ontology for biomedical article retrievalDaekeunYou,NationalLibraryofMedicine(UnitedStates);MatthewSimpson,NationalLibraryofMedicine/NIH(UnitedStates);SameerK.Antani,DinaDemner-Fushman,GeorgeR.Thoma,NationalLibraryofMedicine(UnitedStates)

NoAbstractAvailable


Combining multiple thresholding binarization values to improve OCR outputWilliamB.Lund,DouglasJ.Kennard,BrighamYoungUniv.(UnitedStates);EricK.Ringger,BrighamYoungUniv.(UnitedStates)andComputerScienceDept.(UnitedStates)

Fornoisy,historicaldocuments,ahighopticalcharacterrecognition(OCR)worderrorrate(WER)canrendertheOCRtextunusable.Sinceimagebinarizationisoftenthemethodusedtoidentifyforegroundpixels,abodyofresearchseekstoimproveimage-widebinarizationdirectly.Thispaperpresentsnewresearchandanovelapproachbynotrelyingonanyoneadaptivebinarizationbutleveragesinformationfrommultiplesimplethresholdingbinarizationsofthesameimagetoimprovetextoutput.Usinganewcorpusof19thcenturynewspapergrayscaleimagesforwhichthetexttranscriptionisknown,weobserveWERsof13.8%andhigherusingcurrentbinarizationtechniquesandastate-of-the-artOCRengine.OurnovelapproachcombinestheOCRoutputsfrommultiplethresholdedimagesbyaligningthetextoutputandproducingalatticeofwordalternativesfromwhichalatticeworderrorrate(LWER)iscalculated.OurresultsshowaLWERof7.6%whenaligningtwothresholdimagesandaLWERof6.8%whenaligningfive.FromthewordlatticewecommittoonehypothesisbyapplyingthemethodsofLundetal.(2011)achievinganimprovementovertheoriginalOCRoutputanda8.41%WERresultonthisdataset.


Goal-oriented evaluation of binarization algorithms for historical document imagesTayoObafemi-Ajayi,Univ.ofMissouri-Columbia(UnitedStates);GadyAgam,IllinoisInstituteofTechnology(UnitedStates)

Binarizationisofsignificantimportanceindocumentanalysissystems.Itisanessentialfirststep,priortofurtherstagessuchasOpticalCharacterRecognition(OCR),documentsegmentation,orenhancementofreadabilityofthedocumentaftersomerestorationstages.Hence,properevaluationofbinarizationmethodstoverifytheireffectivenessisofgreatvaluetothedocumentanalysiscommunity.Inthiswork,weperformadetailedgoal-orientedevaluationofimagequalityassessmentofthe18binarizationmethodsthatparticipatedintheDIBCO2011competitionusingthe16historicaldocumenttestimagesusedinthecontest.WeareinterestedintheimagequalityassessmentoftheoutputsgeneratedbythedifferentbinarizationalgorithmsaswellastheOCRperformance,wherepossible.WecompareourevaluationofthealgorithmsbasedonhumanperceptionofqualitytotheDIBCOevaluationmetrics.TheresultsobtainedprovideaninsightintotheeffectivenessofthesemethodswithrespecttohumanperceptionofimagequalityaswellasOCRperformance.


Document segmentation via oblique cutsJeremyP.Svendsen,AlexandraBranzan-Albu,Univ.ofVictoria(Canada)

ThispaperpresentsanovelsolutionforthelayoutsegmentationofgraphicalelementsinBusinessIntelligencedocuments.WeproposeageneralizationoftherecursiveX-Ycutalgorithm,whichallowsforcuttingalongarbitraryobliquedirections.Anintermediateprocessingstepconsistingoflineandsolidregionremovalisalsonecessaryduetopresenceofdecorativeelements.Theoutputoftheproposedsegmentationisahierarchicalstructurewhichallowsfortheidentificationofprimitivesinpieandbarcharts.Thealgorithmwastestedonadatabasecomposedofchartsfrombusinessdocuments.Resultsareverypromising.


Preprocessing document images by resampling is error prone and unnecessaryGeorgeNagy,RensselaerPolytechnicInstitute(UnitedStates)

Integritytestsareproposedforimageprocessingalgorithmsthatshouldyieldessentiallythesameoutputunder90degreerotations,edge-paddingandmonotonicgray-scaletransformationsofscanneddocuments.Thetestsaredemonstratedonbuilt-infunctionsoftheMatlabImageProcessingToolbox.Onlytheroutinethatreportstheareaoftheconvexhullofforegroundcomponentsfailstherotationtest.Ensuringerror-freepreprocessingoperationslikesizeandskewnormalizationthatarebasedonresamplinganimagerequiresmoreradicaltreatment.Eveniffaultlesslyimplemented,resamplingisgenerallyirreversibleandmayintroduceartifacts.Fortunately,advancesinstorageandprocessortechnologyhaveallbuteliminatedanyadvantageofpreprocessingorcompressingdocumentimagesbyresamplingthem.Usingfloatingpointcoordinatetransformationsinsteadofresamplingimagesyieldsaccuraterun-length,moment,slope,andothergeometricfeatures.


Multilingual artificial text detection and extraction from still imagesAhsenRaza,AliAbidi,NationalUnivofSciencesandTechnology(Pakistan);ImranA.Siddiqi,BahriaUniv.(Pakistan)

Wepresentanovelmethodformultilingualartificialtextextractionfromstillimages.Weproposealexiconindependent,blockbasedtechniquethatemploysacombinationofspatialtransforms,texture,edgeandgradientbasedoperationstodetectunconstrainedtextualregionsfromstillimages.Finally,somemorphologicalandgeometricalconstraintsareappliedforfinelocalizationoftextualcontent.TheproposedmethodwasevaluatedontwostandardandthreecustomdevelopeddatasetscomprisingawidevarietyofimageswithartificialtextoccurrencesinfivedifferentlanguagesnamelyEnglish,Urdu,Arabic,ChineseandHindi.


A proposal system for historic Arabic manuscript transcription and retrievalAbdelazizLabben,AfefK.Echi,UnitédeRechercheenTechnologiesdel’InformationetdelaCommunication(Tunisia);AbdelBelaïd,LORIA(France)

Inthispaper,weproposeacomputer-assistedtranscriptionsystemofoldregisters,handwritteninArabicfromthe19thcenturyonwards,heldintheNationalArchivesofTunisia(NAT).Theproposedsystemassiststhehumansupervisortocompletethetranscriptiontaskasefficientlyaspossible.Thisassistanceisgivenatalldifferentrecognitionlevels.Oursystemaddressesdifferentapproachesfortranscriptionofdocumentimages.Italsoimplementsanalignmentmethodtofindmappingsbetweenwordimagesofahandwrittendocumentandtheirrespectivewordsinitsgiventranscription.

Conference 8658



Evaluation of document binarization using eigen value decompositionDeepakKumar,M.N.AnilPrasad,A.G.Ramakrishnan,IndianInstituteofScience(India)

Anecessarystepfortherecognitionofscanneddocumentsisbinarization,whichisessentiallythesegmentationofthedocument.Inordertobinarizeascanneddocument,wecanfindseveralalgorithmsintheliterature.Whatisthebestbinarizationresultforagivendocumentimage?Toanswerthisquestion,auserneedstocheckdifferentbinarizationalgorithmsforsuitability,sincedifferentalgorithmsmayworkbetterfordifferenttypeofdocuments.Manuallychoosingthebestfromasetofbinarizeddocumentsistimeconsuming.Canwecomeupwithametricwhichevaluatesthesebinarizeddocuments?Toautomatetheselectionofthebestsegmenteddocument,eitherweneedtouseground-truthofthedocumentorproposeanevaluationmetric.Ifground-truthisavailable,thenprecisionandrecallcanbeusedtochoosethebestbinarizeddocument.Whatisthecase,whenground-truthisnotavailable?Hence,weproposeametrictoevaluatebinarizeddocumentimagesusingeigenvaluedecomposition.WehaveevaluatedthismeasureonDIBCOandH-DIBCOdatasets.Theproposedmethodchoosesthebestbinarizeddocumentthatisclosetotheground-truthofthedocument.


Efficient symbol retrieval by building a symbol index from a collection of line drawingsNibalNayef,ThomasM.Breuel,TechnischeUniv.Kaiserslautern(Germany)

Symbolretrievalisimportantforcontent-basedsearchindigitallibrariesandforautomaticinterpretationoflinedrawings.Inthiswork,wepresentacompletesymbolretrievalsystem.Theproposedsystemhasanoff-linecontent-analysisstage,wherethecontentsofadatabaseoflinedrawingsarerepresentedasasymbolindex,whichisacompactindexablerepresentationofthedatabase.Suchrepresentationallowsefficienton-linequeryretrieval.Withintheretrievalsystem,threemethodsarepresented.First,afeaturegroupingmethodforidentifyinglocalregionsofinterest(ROIs)inthedrawings.ThefoundROIsrepresentsymbols’parts.Second,aclusteringmethodbasedongeometricmatching,isusedtoclusterthesimilarpartsfromallthedrawingstogether.Asymbolindexisthenconstructedfromtheclusters’representatives.Finally,theROIsofaquerysymbolarematchedtotheclusters’representatives.Thematchingsymbols’partsareretrievedfromtheclusters,andspatialverificationisperformedonthematchingparts.Byusingthesymbolindexweareabletoachieveaquerylook-uptimethatisindependentofthedatabasesize,anddependentonthesizeofthesymbolindex.Theretrievalsystemachieveshigherrecallandprecisionthanthestate-of-the-artmethods.

8658-23, Session 8

Structural analysis of online handwritten mathematical symbols based on support vector machinesFoteiniSimistira,VassilisM.Papavassiliou,VassilisKatsouros,InstituteforLanguageandSpeechProcessing(Greece);GeorgeCarayannis,NationalTechnicalUniv.ofAthens(Greece)

Mathematicalexpressionrecognitionisstillaverychallengingtaskfortheresearchcommunitymainlybecauseofthetwo-dimensional(2d)structureofmathematicalexpressions(MEs).Inthispaper,wepresentanovelapproachforthestructuralanalysisbetweentwoon-linehandwrittenmathematicalsymbolsofaME,basedonspatialfeaturesofthesymbols.Weintroducesixfeaturestorepresentthespatialaffinityofthesymbolsandcomparetwomulti-class

classificationmethodsthatemploysupportvectormachines(SVMs):onebasedonthe“one-against-all”techniqueandonebasedonthe“one-against-all”,inidentifyingtherelationbetweenapairofsymbols(i.e.subscript,numerator,etc).Adatasetcontaining1906spatialrelationsderivedfromtheCompetitiononRecognitionofOnlineHandwrittenMathematicalExpressions(CROHME)2012trainingdatasetisconstructedtoevaluatetheclassifiersandcomparethemwiththeclassifieroftheILSP-1systemparticipatedinthecontest.Theexperimentalresultsgiveanoverallerrorrateof2.61%forthe“one-against-one”SVMapproach,6.57%forthe“one-against-all”SVMtechniqueand12.31%errorratefortheILSP-1classifier.

8658-24, Session 8

Using online handwriting and audio streams for mathematical expressions recognition: a bimodal approachSofianeMedjkoune,Univ.deNantes(France)andLeMansUniv,LIUMLab.(France);HaroldMouchère,Univ.deNantes(France);SimonPetitrenaud,Univ.duMaine(France);ChristianViard-Gaudin,Univ.deNantes(France)

Thispaperdealswiththeproblemofmathematicalexpressionsrecognition.Themaingoalistosetupamultimodalsystemdedicatedtothistask.Intheproposedsystem,thehypothesesformulatedinbothspecializedsystemsarecombinedtoincreaseaspossiblethereliabilityofthefinalhypotheseswhichwouldbeconsideredduringthestructuralanalysistoformulatethemathematicalexpressionsolution.Variousfusionmethodsareexplored.PerformancesevaluatedontheHAMEXdatasetshowasignificantimprovementcomparedtoasinglemodality(handwriting)basedsystem.

8658-25, Session K2

What does the future hold for search user interfaces? (Keynote Presentation)MartiHearst,Univ.ofCalifornia,Berkeley(UnitedStates)

NoAbstractAvailable

8658-26, Session 9

Using clustering and a modified classification algorithm for automatic text summarizationAbdelkrimeAries,HoudaOufaida,EcoleNationaleSupérieued’Informatique(Algeria);OmarNouali,Ctr.derecherchesurl’InformationScientifiqueetTechnique(Algeria)

Inthispaperwedescribeamodifiedclassificationmethoddestinedforextractivesummarizationpurpose.Theclassificationinthismethoddoesn’tneedalearningcorpus;itusestheinputtexttodothat.First,weclusterthedocumentsentencestoexploitthediversityoftopics,thenweusealearningalgorithm(hereweusedNaiveBayes)oneachclusterconsideringitasaclass.Afterobtainingtheclassificationmodel,wecalculatethescoreofasentenceineachclass,usingascoringmodelderivedfromclassificationalgorithm.Thesescoresareused,then,toreorderthesentencesandextractthefirstonesastheoutputsummary.

Weconductedsomeexperimentsusingacorpusofscientificpapers,andwehavecomparedourresultstoanothersummarizationsystemcalledUNIS.Also,weexperimenttheimpactofclusteringthresholdtuning,ontheresultedsummary,aswellastheimpactofaddingmorefeaturestotheclassifier.Wefoundthatthismethodisinteresting,andgivesgoodperformance,andtheadditionofnewfeatures(whichissimpleusingthismethod)canimprovesummary’saccuracy.

Conference 8658


8658-27, Session 9

Evaluating supervised topic models in the presence of OCR errorsDanielD.WalkerIV,EricK.Ringger,KevinD.Seppi,BrighamYoungUniv.(UnitedStates)

Supervisedtopicmodelsarepromisingtoolsfortextanalyticsthatsimultaneouslymodeltopicalpatternsindocumentcollectionsandrelationshipsbetweenthosetopicsanddocumentmetadata,suchastimestamps.WetoexamineempiricallytheeffectofOCRnoiseontheabilityofsupervisedtopicmodelstoproducehighqualityoutputthroughaseriesofexperimentsinwhichweevaluatethreesupervisedtopicmodelsandanaivebaselineonsyntheticOCRdatawithvariouslevelsofdegradationandonrealOCRdatafromtwodifferentdecades.Theevaluationincludesexperimentswithandwithoutfeatureselection.Ourresultssuggestthatsupervisedtopicmodelsarenobetter,oratleastnotmuchbetterintermsoftheirrobustnesstoOCRerrors,thanunsupervisedtopicmodelsandthatfeatureselectionhasthemixedresultofimprovingtopicqualitywhileharmingmetadatapredictionquality.

8658-28, Session 9

Rule-based versus training-based extraction of index terms from business documents: how to combine the resultsDanielSchuster,MarcelHanke,KlemensMuthmann,DanielEsser,TechnischeUniv.Dresden(Germany)

Currentsystemsforautomaticextractionofindextermsfrombusinessdocumentseithertakearule-basedortraining-basedapproach.Asbothapproacheshavetheiradvantagesanddisadvantagesitseemsnaturaltocombinebothmethodstogetthebestofbothworlds.Wepresentacombinationmethodwiththestepsselection,normalization,andcombinationbasedoncomparablescoresproducedduringextraction.Furthermore,novelevaluationmetricsaredevelopedtosupporttheassessmentofeachstepinanexistingextractionsystem.Ourmethodswereevaluatedonanexampleextractionsystemwiththreeindividualextractorsandacorpusof12,000scannedbusinessdocuments.

8658-29, Session 9

Post processing with first- and second-order hidden Markov modelsKazemTaghva,Univ.ofNevada,LasVegas(UnitedStates);SrijanaPoudel,Univ.ofNevada(UnitedStates)

Inthispaper,wepresenttheimplementationandevaluationoffirstorderandsecondorderHiddenMarkovModelstoidentifyandcorrectOCRerrorsinthepostprocessingofbooks.Ourexperimentsshowthatthefirstordermodelapproximatelycorrects10%oftheerrorswith100%precision,whilethesecondordermodelcorrectsahigherpercentageoferrorswithmuchlowerprecision.

8658-30, Session 9

Combining discriminative SVM models for the improved recognition of investigator names in medical articles XiaoliZhang,JieZou,DanielX.Le,GeorgeR.Thoma,NationalLibraryofMedicine(UnitedStates)

Investigatorsarepeoplewhoarelistedasmembersofcorporateorganizationsbutnotenteredasauthorsinanarticle.Beginningwithjournalspublishedin2008,investigatornamesarerequiredtobeincludedinanewbibliographicfieldinMEDLINEcitations.Automatic

extractionofinvestigatornamesisnecessaryduetotheincreaseincollaborativebiomedicalresearchandthelargenumberofsuchnames.WeimplementedtwodiscriminativeSVMmodels,i.e.,SVMandstructuralSVM,toidentifynamedentitiessuchasthefirstandlastnamesofinvestigatorsfromonlinemedicaljournalarticles.Bothapproachesachievegoodperformanceatthewordandnamechunklevels.WefurtherconductederroranalysisandfoundthatSVMandstructuralSVMcanoffercomplementaryinformationaboutthepatternstobeclassified.Hence,wecombinedthetwoindependentlytrainedclassifierswhereSVMischosenasabaselearnerwithitsoutputfurtherenhancedbythepredictionsfromstructuralSVM.TheoverallperformanceespeciallytherecallrateofinvestigatornameretrievalimprovesascomparedtostandaloneSVMmodel.

8658-31, Session 10

Adaptive detection of missed text areas in OCR outputs: application to the automatic assessment of OCR quality in mass digitization projectsAhmedBenSalah,BibliothèquenationaledeFrance(France)andLITIS-Univ.deRouen(France);NicolasRagot,Univ.deTours(France);ThierryPaquet,Univ.deRouen(France)

TheFrenchNationalLibrary(BnF)hasstartedmanymassdigitizationprojectsinordertogiveaccesstoitscollectionofdocuments.TheindexationofdigitaldocumentsonGallica(thedigitallibraryoftheBnF)isdonethroughtheirtextualcontentobtainedthankstoserviceprovidersthatuseOpticalCharacterRecognitionsoftwares(OCR).

OCRsoftwareshavebecomeincreasinglycomplexsystemscomposedofseveralsubsystemsdedicatedtotheanalysisandtherecognitionoftheelementsinapage.However,thereliabilityofthesesystemsisalwaysanissueatstake.Indeed,insomecases,wecanfinderrorsinOCRoutputsthatoccurbecauseofanaccumulationofseveralerrorsatdifferentlevelsintheOCRprocess.OneofthefrequenterrorsinOCRoutputsisthemissedtextcomponents.Thepresenceofsucherrorsmayleadtoseveredefectsindigitallibraries.

Inthispaper,weintendtoexaminethedetectionofmissedtextcomponentstocontroltheOCRresultsinthecollectionsoftheFrenchNationalLibrary.OurverificationapproachuseslocalinformationwithinthepagesbasedonRadontransformdescriptorsandLocalBinaryPatternsdescriptors(LBP)coupledwithOCRresultstocontroltheirconsistency.Theexperimentalresultsshowthatourmethoddetects84.15%ofthemissedtextualcomponents,bycomparingtheOCRALTOfilesoutputsproducedbytheserviceproviderstotheinputimageofthedocument.

8658-32, Session 10

Evaluating structural pattern recognition for handwritten math via primitive label graphsRichardZanibbi,RochesterInstituteofTechnology(UnitedStates);HaroldMoucheere,ChristianViard-Gaudin,Univ.deNantes(France)

Currentlyevaluationofpatternrecognitionsystemsthatproducegraphs(i.e.parsers)isoftenperformedusingmetricscomparingobjectsandobjectrelationshipsingroundtruthandthosedetectedbyanalgorithm,e.g.recognitionrates,recallandprecision.Wepresentastructurerepresentationusinglabelsforallinputprimitives(e.g.handwrittenstrokes)andallprimitivepairsbasedonabipartitegraphoverprimitives.WedefineanumberofnewevaluationmetricsusingHammingdistancesoverstructurestrings.Wedemonstratethesemetricsthroughcomparingandcontrastingastroke-levelevaluationwithasymbol-levelevaluationfortheCROHME2012handwrittenmathrecognitioncompetitionsystems.Benefitsofstructurestringsincludetheabilitytorepresentclassification,segmentationandparsingerrorswithinproperdistancemetrics,andtoeasilyidentifyandcompletelycharacterizesourcesoferror.Alibraryprovidingimplementationofprimitiveandobject-level(e.g.recallandprecision)metricswillbe

Conference 8658


madepubliclyavailable.Structurestringsmayalsobeusefulforinformationretrievalandmachinelearningapplications.

8658-33, Session 10

WFST-based ground truth alignment for difficult historical documents with text modification and layout variationsMayceAlAzawi,TechnischeUniv.Kaiserslautern(Germany);MarcusLiwicki,DeutschesForschungszentrumfürKünstlicheIntelligenzGmbH(Germany);ThomasM.Breuel,TechnischeUniv.Kaiserslautern(Germany)

ThisworkproposesseveralapproachesthatcanbeusedforgeneratingcorrespondencesbetweenrealscannedbooksandtheirtranscriptionswhichmighthavedifferentmodificationsandlayoutvariationsunderconsiderationofOCRerrors.Ourapproachesforthealignmentbetweenthemanuscriptandthetranscriptionarebasedonweightedfinitestatetransducers(WFST).Inparticular,weproposeadaptedWFSTstorepresentthetranscrip-tiontobealignedwiththeOCRlattices.Thecharacter-levelalignmenthaseditrulestoalloweditoperations(insertion,deletion,substitution).Thoseeditoperationssupportthetranscriptionmodeltobeadaptedregard-ingtotheOCRsegmentationandrecognitionerrorsandalsotheabilityofaligningwithdifferenttexteditions.Weimplementedanalignmentmodelwhichhasahyphenationmodelsoitcanadaptthenonhyphenatedtran-scription.OurmodelscanalsoworkwithFrakturligatureswhichistypicalforhistoricalFrakturdocuments.WeevaluatedtheourapproachonFrakturdocumentsfrom“WanderungendurchdieMarkBrandenburg”volumes(1862-1889)andobservedtheperformanceofthosemodelsunderOCRerrors.Wecomparetheperformanceofourmodelforthreedifferentscenarios:havingnoinformationaboutthecorrespondenceattheword(i),line(ii),sentence(iii)orpage(iv)level.

Conference 8658


Conference 8659: Sensors, Cameras, and Systems for Industrial/Scientific Applications XIVWednesday-Thursday6–7February2013

Part of Proceedings of SPIE Vol. 8659 Sensors, Cameras, and Systems for Industrial and Scientific Applications XIV

8659-1, Session 1

Fundamental performance differences of CMOS and CCD imagers: part VJamesR.Janesick,SarnoffCorp.(UnitedStates)

ThispaperisacontinuationofseveralprevioussequentialpapersfocusedonfundamentalperformancedifferencesofscientificCMOSandCCDimagers.Discussionsinthispaperinclude:1).dataforcustomultralownoisescientificCMOS5T/6TPPDpixelimagersfabricatedonvariousnewSandboxfabruns,2).fundamentalionizationandbulkCMOSpixelradiationdamagecharacteristicsatdoselevelsupto10MrdalongwithCCDdatafordamagecomparisons(protons,electrons,Co-60),3).progressreportonultralargestitchedCMOSimagersdevelopedfortheMkxNkCMOSimagerfamily,4).fabricationofa4kx4kCMOSimagerforNRL’sSoloHiflightinstrumentontheSolarOrbiterMission,5).highperformanceburiedchannelCCDsbasedonCMOSprocessing,6).RTNandradiationhardnesscomparisonofpandnpixels,7).variousmiscellaneousfundamentalpixeltopicsand11).upcomingdevelopmentworkexpected.

8659-2, Session 1

Kirana: a solid-state megapixel uCMOS image sensor for ultrahigh speed imagingJamieCrooks,BenMarsh,RenatoTurchetta,STFC-RutherfordAppletonLab.(UnitedKingdom);KeithTaylor,WaiChan,SpecialisedImagingLtd.(UnitedKingdom);AssafLahav,AmosFenigstein,TowerJazzSemiconductorLtd(Israel)

Thispaperdescribesasolid-statesensorforultrahigh-speed(UHS)imaging.The‘Kirana’sensorwasdesignedandmanufacturedina180nmCMOStechnologytoachievefull-frame0.7Megapixelvideocaptureatspeedsgreaterthan1MHz.The30umpixelscontainapinnedphotodiode,asetof180low-leakagestoragecells,afloating-diffusion,andasourcefolloweroutputstructure.Boththeindividualcellsandthewaytheyarearrangedinthepixelarenovel.Thepixelarchitectureallowscorrelateddoublesamplingforlownoiseoperation.Inthefastmode,thestoragecellsareoperatedasacircularbuffer,where180consecutiveframesarestoreduntilreceiptofatrigger;upto5video-burstspersecondcanbereadout.Inthe‘slow’mode,thestoragecellsactlikeapipeline;thesensorcanbereadoutlikeaconventionalsensoratacontinuousframerateof1,000fps.Thesensorarchitectureisfullyscalableinresolutionsincememorycellsarelocatedinsideeachpixel.Thepixelarchitectureisscalableinmemorydepth(numberofframes)asatrade-offwithpixelsize,dependentonapplication.Thepresentimplementationof0.7Mpixelshasanarrayfocalplanewhichisoptimisedforstandard35mmoptics,whilstofferingacompetitive180-framerecordingdepth.Thesensordescribedhasbeenmanufacturedandiscurrentlybeingcharacterised.Operationofthesensorinthefastmodeat2millionframespersecondhasbeenachieved.Detailsonthecamera/sensoroperationwillbepresentedtogetherwithfirstexperimentalresults.

8659-3, Session 1

Back-side-illuminated image sensor with burst capturing speed of 5.2 Tpixel per second

ToshikiArai,JunYonai,TetsuyaHayashida,HiroshiOhtake,JapanBroadcastingCorp.(Japan);HarryvanKuijk,TeledyneDALSA(Netherlands);TakeharuG.Etoh,RitsumeikanUniv.(Japan)

Wedevelopedaback-sideilluminatedimagesensorwithburstcapturingspeedof5.2Tpixelpersecond.Theimagesensorhasaspecialcharge-coupleddevice(CCD)within-situCCDmemories.Thetotalpixelcountsare411pixelsverticallyand760pixelshorizontally.Theareaofeachpixelwas46.8?m?46.8?m.Theoppositeanglelengthoftheimageareawas40.4mm.Thesizeofeachmemoryelementwas3.0?m?3.6?m.Thenumberofrecordableimageswas139frames.Theresultsofadrivingexperimentindicatedthatthesensitivityofthesensorwas13timeshigherthanthatoffront-sideilluminatedimagesensor.Thesensitivityoftheevaluationcamerasystemusingthemonochromesensorwas2000luxF90.Theresolutionofthesensorwas410TVlines.Asaturationsignallevelof100%wasmaintainedover10Mfps.Themaximumframeratewas16.7Mfps.Theproductofthepixelcountandmaximumframerateisoftenusedasafigureofmeritinhigh-speedimagingdevices.Inthiscase,312,360pixelstimesthemaximumframerateof16.7Mfpsyields5.2Tpixelpersecond.Theburstcapturingspeedofthisimagesensorwasthus5.2Tpixelpersecond.

8659-4, Session 2

A custom CMOS imager for multi-beam laser scanning microscopy and an improvement of scanning speedMin-WoongSeo,KeiichiroKagawa,KeitaYasutomi,ShojiKawahito,ShizuokaUniv.(Japan)

Multi-beamconfocalmicroscopywitha256?256-pixelcustomCMOSimagerperformingfocal-planepinholeeffect,inwhichanyrotatingdiskisnotrequired,isdemonstrated.Aspecimenisilluminatedby32?32diffraction-limitedlightspotswhosewavelengthandpitchare532nmand8.4?m,respectively.Thespotarrayisgeneratedbyamicrolensarray,whichisscannedbytwo-dimensionalpiezoactuatoraccordingtothescanningoftheimagesensor.Theframerateoftheprototypeis0.26Hz,whichislimitedbytheactuator.Theconfocaleffecthasbeenconfirmedbycomparingthedepthoffocusintheconfocalimagingmodewiththatofthenormalimagingmode.Thedepthoffocusintheconfocalmodebythefullwidthathalfmaximum(FWHM)isreducedby61.8%comparedwiththatofthenormalmode.Throughthisresult,theconfocaleffectoftheproposedimagerisconfirmed.Thefocal-planepinholeeffectintheconfocalmicroscopywiththeproposedCMOSimagerhasbeendemonstratedatlowframerate.AnimprovementofthescanningspeedandaCMOSimagerwithphoto-sensitivitymodulationpixelssuitableforhigh-speedscanningarealsodiscussed.

8659-5, Session 2

An ultrafast ultracompact sensor for diffuse correlation spectroscopyKiniaBarjean,EricTinet,DominiqueEttori,Univ.Paris13(France);AntoineDupret,Commissariatàl’ÉnergieAtomique(France);MariusVasiliu,Univ.Paris-Sud11(France);Jean-MichelTualle,Univ.Paris13(France)

DiffuseCorrelationSpectroscopy(DCS)isbasedontime-resolvedcorrelationsofthelightthathavediffusedthroughabiologicalmedia.


Measurementsmustbemadeonasmallcoherenceareaofthesizeofaspecklegrain.SummingindependentmeasurementenablestoincreasetheSNRasthesquarerootofthenumberofdetectors.WepresentabidimensionnalpixelCMOSdetectorarrayspeciallydesignedforthistask,withparallelin-pixeldemodulationandtime-resolvedcorrelationcomputation.Opticalsignalcanbeprocessedataratehigherthan10,000samplespersecondwithdemodulationfrequenciesintheMHzrange.

8659-6, Session 2

A 3D image sensor with adaptable charge subtraction scheme for background light suppressionJungsoonShin,ByongMinKang,KeechangLee,JamesD.K.Kim,SamsungElectronics(Korea,Republicof)

Wepresenta3DToF(Time-of-Flight)imagesensorwithadaptivebackgroundlightstorageandsubtractionschemeforbackgroundlightsuppression.Theproposedsensorcanalternatelycapturehighresolutioncolorimageandhighqualitydepthmapineachframe.Indepth-mode,thesensorrequiresenoughintegrationtimeforaccuratedepthacquisition,butsaturationwilloccurinhighbackgroundlightillumination.WeproposetodividetheintegrationtimeintoNsub-integrationtimesadaptively.Duringthefirstsub-integrationtime,oursensorcapturesanimageonlyfrombackgroundlightwithoutIRmodulatedlightandstoresitinframememory.ThestoredimageissubtractedfromsubsequentcapturedimageframeswithIRmodulatedlightandsamesub-integrationduration.ThesubtractionresultsarecumulatedNtimesobtainingafinalresultimagewithoutbackgroundilluminationatfullintegrationtime.ExperimentalresultswithourownToFsensorshowhighbackgroundsuppressionperformance.Wealsoproposein-pixelstorageandcolumn-levelsubtractioncircuitforchip-levelimplementationoftheproposedmethod.Webelievetheproposedschemewillenable3Dsensorstobeusedinout-doorenvironment.

8659-7, Session 2

High performance 7.4-micron interline transfer CCD platform for applied imaging marketsDouglasA.Carpenter,JamesA.DiBellaSr.,RobertKaser,TruesenseImaging,Inc.(UnitedStates);BrentJ.Kecskemety,TruesenseImaging,Inc.(Andorra);StephenL.Kosman,JohnP.McCarten,ChristopherParks,TruesenseImaging,Inc.(UnitedStates)

Technologydevelopedfora5.5µmpixelinterlinetransferCCDfamilyhasbeenincorporatedintoanewfamilyofhigh-performance7.4µmpixelCCDs,providingsignificantimprovementsinseveralkeyperformanceparameterscomparedtoboththe5.5µmfamilyaswellasthepreviousgenerationof7.4µmpixelproducts.Smearinthenewplatformhasbeenreducedto-115dB,andframeratehasbeendoubledrelativetothepreviousgenerationof7.4µmpixelproducts.Dynamicrangeinnormaloperationhasbeenimprovedto70dB,andtheplatformsupportsanewextendeddynamicrangemodewhichprovides82dBwhenbinning2x2.Thenewfamilyleveragesthepackageandpin-outconfigurationsusedinthe5.5µmpixelfamily,allowingeasyintegrationintoexistingcameradesigns.

8659-8, Session 3

A 33M-pixel wide color gamut image capturing system using four CMOS image sensors at 120 HzTakujiSoeno,NHKScience&TechnologyResearchLabs.(Japan);KoheiOmura,NHKEngineeringServices,Inc.(Japan);TakayukiYamashita,RyoheiFunatsu,MasayukiSugawara,MakotoOkui,NHKScience&TechnologyResearchLabs.(Japan)

Wehavebeendevelopinganextremelyhighresolutionvideosystemwith33Mpixels.Experimentalresultsregardinghumanperceptionshowthattheframefrequencyforthissystemshouldbeatleast120Hztoachieveasignificantimprovementinmotionportrayal.Wehavethereforeinvestigateda33M-pixelimage-capturingsystemat120Hz.ThesystemconsistsoffourCMOSimagesensorsthatcannotonlyreadout33Mpixelat60Hzbutalsoreadoutdataat120Hzfromonlyoddorevenlines.Twoimagesensorsareusedforthegreenchannel(G1andG2),andoneisusedforeachoftheredandbluechannel.G1sensoroutputstheoddlines,whileG2sensoroutputstheevenlines.33M-pixelgreenimagesareproducedbycombiningG1andG2.Theredandbluesensorsscantheoddlinesandtheevenlines,respectively.Subsequently,thelineswhosedatahavenotbeenreadareinterpolatedintheverticaldirection.Wehavebuildprototype,andevaluateditsresolutionandimagelag.Theperformanceoftheimage-capturingsystemhasbeenfoundtobesufficienttocapture33M-pixelimagesatthefrequencyof120Hz.

8659-9, Session 3

A 3 mpixel ROIC with 10um pixel pitch and 120 Hz frame rate digital outputEladIlan,NivShiloah,ShimonElkind,RomanDobromislin,WillieFreiman,AlexZviagintsev,ItzikNevo,OrenCohen,FannyKhinich,AmnonAdin,RonTalmor,YaakovMilstain,SCDSemiconductorDevices(Israel)

A1920x1536matrixreadoutIC(ROIC)for10x10um2P-on-NInSbphotodiodearrayisreported.TheROICfeaturesseveralconversiongainoptionsimplementedatthepixellevel.A2-by-2pixelbinningfeatureisimplementedatthepixellevelaswell,improvingSNRandenablinghigherframeratesbyafactoroffour.AdedicatedcolumnADCisdesignedforlownoiseandlowpowerconsumption,whilereaching95kSpssamplingrate.Since3840columnADCsareintegratedonchip,thetotalconversionrateisover360Mpxl/sec.TheROICachieves120Hzframerateatthefullformat,withpowerconsumptionoflessthan400mW.Ahighspeedserialvideointerfaceisdevelopedtooutputtherequireddatabandwidthatareasonablepincount.

8659-11, Session 3

Dynamic capability of sensors with nonlinear pixels utilized by security camerasAndersJohannesson,HenrikEliasson,SebastianFors,AxisCommunicationsAB(Sweden)

Wehavepreviouslyproposedaframeworkcontainingatypicalsecuritycamerausecaseandhavediscussedhowwellthisishandledbylinearimagesensorswithvariouscharacteristics.Thefindingswerevisualizedgraphically,usingasimplecamerasimulatorgeneratingimagesunderwell-definedconditions.Inordertosuccessfullyrenderlow-contrastobjectstogetherwithlargeintra-scenevariationsinilluminance,thesensorrequirementsmustincludeahighdynamicrangecombinedwithacomparablyhighsignal-to-noiseratio.Inthispaperwereusetheframeworkandextendthediscussionbyincludingalsosensorswithnon-linearpixelresponses.

Theobviousbenefitofanon-linearpixelisthatitgenerallycancopewithahigherscenedynamicrangeandthatinmostcasestheexposurecontrolcanberelaxed.Knowndrawbacksare,forexample,thatthenoiselevelcanbefairlyhigh.Morespecifically,thespatialnoiselevelsarehighduetovariablepixel-to-pixelcharacteristicsandlackofon-chipcorrections,likecorrelateddoublesampling.

Inthispaperweignorethespatialnoise,sincesomeoftherelatedissueshavebeenaddressedrecently.Insteadwefocusonthetemporalnoiseanddynamicresolutionissuesinvolvedinnon-linearimagingonasystemlevel.Sincetherequirementsaredefinedbyourselectedusecase,andsincewehavedefinedavisualframeworkforanalysis,itisstraightforwardtocompareourfindingswiththeresultsforlinearimagesensors.Asinthepreviouspaper,theimagesimulationsarebasedonsensordataobtainedfromourownmeasurements.

Conference 8659



Optical high-altitude balloon tracking and imaging for optical communicationsJeremyStraub,RonaldA.Fevig,Univ.ofNorthDakota(UnitedStates)

AnopticalcommunicationssystemhasbeenproposedwhichutilizesanLEDoneachfaceofaCubeSatfordownlinkcommunicationsfromdeepspace.Testingofthissystemviahighaltitudeballoon(HAB)willbeconductedtovalidatekeyassumptionsandsystemfunctionality.Thereceivingsystemincorporatestwocomponents.Ahigh-resolution,low-spatial-coveragetelescopeandCCDreceivesthedata,whichisdecodedbyacomputerandcomparedtothepre-programmedmessagesthattheHABtransmittersendsduringthetesting.Alower-resolution,wide-field-of-viewsensorisusedtotrackthepoint-light-sourceandissuecommandstothetelescope’sorientationsystemtokeepthetransmittercenteredintheframeandtodeterminewhatareaoftheCCDshouldbereadtomaximizedatathroughput.

Apredictivealgorithm,basedonpathsofmovement,isusedtoattempttokeepaheadofthemovementfacedbytheHAB.Thisalgorithmutilizesknowledgeoftheexactpixelsoccupiedbythepayloadonthehigh-resolutionCCDandcomputedframe-to-framemovementvectorstopredictthepositionoftheHABpayloadinthenextframe,andrepositionthetelescopeaccordingly.Eachsubsequentpredictionisbasedontheactualmovementrecordedandthenewlycomputedmovementvector.Correctionformiss-predictionsisalsoincorporated.


Characterization of a solid state air corona charging deviceMichaelYoung,BaominXu,SteveBuhler,PaloAltoResearchCenter,Inc.(UnitedStates);KarlLittau,StanfordUniv.(UnitedStates)

TwonewsolidstatedeviceswhichproducedanatmosphericaircoronadischargeforgeneratinganddepositingalayerofstaticchargeforXerographicimaginghavebeenfabricatedandcharacterized.Onetypewasaparallelplatecapacitivestructureandtheotherwasaninterdigitatedcapacitivestructure.Itwasdeterminedthattheinterdigitatedcapacitivestructureperformedbetterthantheparallelplatecapacitivestructureintermsofpowerconsumption,chargingcurrentlevelstabilityanddevicereliability.Severalmetalelectrodematerialalternativeswereinvestigatedandgoldelectrodesperformedthebest.Theaircorona’slightemissionpeaksweremeasuredtobeinthe350nmto500nmrange.Ozonegasgenerationto~13ppmwasdetectedforanactivesurfaceareaof5cm^2.Chargedepositionontoanimagingdrumwithasignificantchargingcurrentdensityof1.6E-4A/cm^2hasbeensuccessfullydemonstrated.


Creation of North-East Indian face database for human face identificationKankanSaha,PriyaSaha,MrinalK.Bhowmik,TripuraUniv.(India);DebotoshBhattacharjee,MitaNasipuri,JadavpurUniv.(India)

ThispaperdescribesthedesignanddevelopmentoftheNorth-EastIndian(NEI)FaceDatabase.Italsogivesadetaileddescriptionoftheimagecapturingequipments’(i.e.lights,camerasetc.)setupandtheirdifferentanglesanddistancesfromthesubject.Thedatabaseimageshavebeencapturedwithfourilluminationtypes,eightexpressions(includingneutral),andfaceswithglasses,inacontrolledenvironment.TheNEIFaceDatabasecontainsfaceimagesof292individualswith95imagesperindividual.Alltheparticipantsinthisdatabasearestudents,facultymembersandotherstaffsofvariouseducationalinstitutes.ParticipantsincludebothIndiantribesandnon-tribesofMongolianorigincollectedfromthethreenorth-easternstatesofIndia,agingfrom

18to63.SomebaselinealgorithmslikePrincipalComponentAnalysis(PCA)andIndependentComponentAnalysis(ICA)havealsobeentestedusingtheSupportVectorMachines(SVM)classifieronourowndatabase,whichmaybeusedasthecontrolalgorithmperformancescorebyotherresearchers.Acomparativestudyofthevarious2Dfaceimagedatabasesbasedonthepropertiesoftheimages,conditionsusedforcapturingthefaces,andbasedontheperformancescoreisalsoprovided.


Optical characterization parameters by study and comparison of subwavelength patterns for color filtering and multispectral purposeJacquesMantanga,YvonLacroute,PierreGouton,EricBourillot,Le2i(France)

Acomparativestudyofrealizationprocessesofcolorfiltersforsolidstateimagerhasbeendeveloped.Theunprecedentedabilityofnanometallicstructurestoconcentratelightintodeepsubwavelengthvolumeshaspropelledtheiruseinavastarrayofnanophotonicstechnologiesandresearchendeavors.Plasmoniclightconcentratorscanelegantlyinterfacediffraction-limiteddielectricopticalcomponentswithnanophotonicstructures.SolargeinterestwasfocusedonSurfacePlasmonResonant(SPR)basedsensorsandapplications.Recently,ResonantWaveguideMetallicGrating(RWMG)andNanoholesFilteringArray(NFA)havebeendevelopedwithhightransmissionfactor.Differentkindof2Dperiodicstructuresisobservedandquantifiedinthispaper,withvariousmetalstochecktheconsistencyofopticalparametersinordertoachieveaseriesofon-chippaper,suitablespectralfilterwithmultispectralpurpose.Simulation,andexperimentallytestsusingtheRigorousCoupledWaveAnalysis(RCWA)givessignificantresultsin2Dnanostructureshapesandthinfilms,spectrumofeachstructurebaseRCWAparameters.Wecomparedamorphousandcrystallizedstructuresandanimprovementoccurredbycrystallinestate.MostofSPRbasedsensorsconsistinastackofsuperposedlayer.Theseresults,thereof,willpotentiallybeusedintherealizationoftheupperlayerofaccurateresonantspectralfilter.

8659-13, Session 4

Empirical formula for rates of hot pixel defects based on pixel size, sensor area, and ISOGlennH.Chapman,RohitThomas,SimonFraserUniv.(Canada);ZahavaKoren,IsraelKoren,Univ.ofMassachusetts,Amherst(UnitedStates)

Imagesensorsmeasurementsshowacontinuousdevelopmentofin-fieldpermanenthotpixeldefectsovertime.Experimentaltestshavebeenaccumulatedoncamerasrangingfromlargearea(>300sqmm)DSLR’s,mediumsized(~40sqmm)pointandshoot,andsmall(20sqmm)cellphonecameras.Theseresultsshowthatthedefectratedependsonthetechnology(APSorCCD)andondesignparameterslikeimagerarea,pixelsize(from1.5-7um),andgain(fromISO100to1600).Comparingdifferentsensorsizeswithsimilarpixelsizesshowedthatdefectratesscalelinearlywithsensorarea,suggestingthemetricofdefects/year/sqmm.AsearchwasmadetomodelthatdefectrateusingtheparametersofbothpixelsizeandISO.ThebestempiricalcurvefitwasforCCDimagerswithapowerlawwithdefectratesproportionaltothepixelsizetothepowerof-2.24times,theISOraisedto0.68,timesaconstant.ForAPS(CMOS)sensorsthepowerlawhadpixelsizesto-3.6times,theISOraisedto0.56.ExtendingourempiricalformulatoincludeISOallowsustopredicttheexpecteddefectdevelopmentrateforawidesetofsensorparameters.

Conference 8659


8659-14, Session 4

A statistical evaluation of low frequency noise of in-pixel source follower-equivalent transistors with various channel types and body biasRihitoKuroda,AkihiroYonezawa,AkinobuTeramoto,Tsung-LingLi,YasuhisaTochigi,ShigetoshiSugawa,TohokuUniv.(Japan)

Forafurtherperformanceimprovementoflow-light-levelimagingCMOSimagesensors,areductionoflowfrequencynoiseofin-pixelsourcefollowersisindispensable.Forthispurpose,weevaluatedstatisticallowfrequencynoisecharacteristicsofin-pixelsourcefollower-equivalenttransistorswithvariouschanneltypesandbodybiasconditions.Theevaluatedtransistortypesareburiedchannel(BC)andsurfacechannel(SC)transistorswithorwithoutanisolatedwell.TThegatewidth/lengthoftheevaluatedtransistorswas0.32/0.32?m.Thegateoxidethicknesswas7.6nm.

TheBCtransistorswithoutisolatedwellexhibitsanoisedistributionhavingamuchlowernoiselevelinthetailpartscomparedtotheSCtransistorsaspreviouslyreported.DifferentimpactsoftheisolatedwellwereobservedforBCandSCtransistors.ForSCtransistors,boththresholdvoltagevariationandnoisedistributiondidnotchangemuchamongtransistorswithorwithoutisolatedwell.FortheBCtransistorswithisolatedwell,thresholdvoltagevariationwasgraterthatthatofoneswithoutisolatedwell.Inaddition,noisedistributionbecamealargerlevelforBCtransistorswithisolatedwellthanwithoutit.

8659-15, Session 4

New analog readout architecture for low noise CMOS image sensors using column-parallel forward noise-canceling circuitryTsung-LingLi,YasuyukiGoda,ShunichiWakashima,RihitoKuroda,ShigetoshiSugawa,TohokuUniv.(Japan)

Inthispaper,anewtypeofanalogpixelreadoutarchitectureforlow-noiseCMOSimagesensorshasbeenpresented.Comparingwithconventionalreadoutarchitecture,aproposedforwardnoise-cancellerisappliedinourreadoutarchitecturetoprovideasharpernoisefiltering.Themaincolumncircuitsconsistofx16low-noiseamplifierswithcorrelated-double-samplingoperation,proposedforwardnoise-cancellers,andanalogmemoriesforstorageofreadoutresetandsignalvoltage.Throughthehigh-gaincolumnamplifiertogetherwiththecolumnforwardnoise-canceller,thisreadoutarchitecturesignificantlyimprovesrandomnoiseofin-pixelsourcefollowerandcolumnamplifieraswellastemporallinenoisefromsuppliesandpulselines.A400(H)x250(V)CMOSimagesensorusingthenewreadoutarchitecturehasbeenimplementedin0.18um1P3MCMOStechnologywithpinned-photodiodes.Boththepixelpitchandthecolumncircuitpitchare4.5um.Theareaoftheproposedforwardnoise-cancellerisonly4.5umx136umwhichisarea-efficient.Thepixelconversiongainis67uV/e-.Withtheproposedarchitecture,therandomnoiseofthein-pixelamplifierandcolumnamplifierisreducedby32%and40%,respectively.Thepowersupplyrejectionratioimprovesnearly90%.Themeasuredrandomnoiseisreducedbyabout35%comparedtotheconventionalarchitecture.

8659-16, Session 4

A novel pixel design with hybrid type isolation scheme for low dark current in CMOS image sensorSungHoChoi,SungkyunkwanUniv.(Korea,Republicof)andSamsungElectronicsCo.,Ltd.(Korea,Republicof);YiTaeKim,MinSeokOh,SamsungElectronicsCo.,Ltd.(Korea,Republicof);YoungHwanPark,SamsungAdvancedInstituteofTechnology(Korea,Republicof);JeongJinCho,YoungHeubJang,Hyung

JunHan,JongWonChoi,HoWooPark,SangIlJung,HoonSangOh,JungChakAhn,HiroshigeGoto,ChiYoungChoi,SamsungElectronicsCo.,Ltd.(Korea,Republicof);YonghanRoh,SungkyunkwanUniv.(Korea,Republicof)

NewisolationschemeforCMOSimagesensorpixelisproposedandtheimproveddarkcurrentperformanceisreported.ItiswellknownSTI(ShallowTrenchIsolation),whichiscommonlyusedforelectricalisolationofsemiconductordevices,isoneofthemajorsourcesofdarkcurrentinimagerpixelduetotheexistenceofinterfacialdefectsatSTI/Siinterface.

TosuppressthedarkcurrentgeneratedattheSTI/Sisurface,theimplantationofbornatomsneartheSTI/Sisurfaceisgenerallyusedtoprovidethesufficientamountofholesandtoannihilatedarkelectrons.

Inthiswork,wedidexperimenttoremovetheSTIitselffromtheperipheralofphotodiodeinordertoremovefundamentalsourceofdarkcurrent,whileremainingSTIaroundin-pixeltransistors.Experimentalresultsarereviewedinthispaperincludingisolationabilityofin-pixeltransistors.

8659-17, Session 5

Continuous fabrication technology for improving resolution in RGB-stacked organic image sensorToshikatsuSakai,HokutoSeo,SatoshiAihara,MisaoKubota,NHKScience&TechnologyResearchLabs.(Japan);MamoruFuruta,KochiUniv.ofTechnology(Japan)

Todevelopcompact,high-resolutioncolorcameras,wehavebeenresearchinganovelimagesensorwiththreestackedorganicphotoconductivefilmssensitivetoonlyoneoftheprimarycolorcomponents,eachofwhichhasasignalreadoutcircuit.Wehavepreviouslyfabricatedthreeseparateelementsbyformingred,greenandblueorganiclayerswithaZnOthin-filmtransistor(TFT)arrayasareadoutcircuitonthreeglasssubstratesandstackedtheseelementstoproduceanimagesensor(128?96pixels).Intheprototypesensor,however,whentheopticalimageisfocusedonthemiddleofthethreelayers,theimagesformedonotherlayersareblurredbecause0.7-mm-thickglasssubstratesexistbetweenorganiclayers.Thisreducestheresolutioninoutputimages.

Wedevelopedacontinuousfabricationtechnologythatentailsplacingthreeorganiclayersclosetoeachother,separatedby1-2-?m-thickinterlayerinsulators,onasingleglasssubstrate.Topreventthermaldamagetotheorganiclayers,wefabricatedaTFTandaninterlayerinsulatoratatemperaturelowerthan150°C.Wealsoexplainedatestproductionofacontinuously-stackedorganicimagesensorusingtheselow-temperaturefabricationprocesses.

8659-18, Session 5

Biological tissue identification using a multispectral imaging systemCélineDelporte,Univ.PierreetMarieCurie(France);SylvieSautrot,Univ.DenisDiderot(France)andUniv.PierreetMarieCurie(France);MohamedBenChouikha,Univ.PierreetMarieCurie(France);FrançoiseViénot,Muséumnationald’Histoirenaturelle(France);GeorgesAlquié,Univ.PierreetMarieCurie(France)

Thispaperpresentsamultispectralimagingsystem(MIS)enablingbiologicaltissueidentifyinganddifferentiation.Thesystemwouldbeinstalledinsurgicalroomandshouldprovidethesurgeonanintraoperativediagnosisaid.Inapreviousworkweintroducedcalibrationofthemultispectralimagingsystemandwevalidateitsmeasurementoftissuespectralluminance,B(L).Eventhoughthesystemwascalibrated,wefoundwidevariabilityof?B(L)valuesinthecaseofbiologicaltissues.Thedispersionwasstudiedandevaluatedforseveraltissues.Asthesemediaarecomplexandnonhomogenous,B(L)?variabilityreaches20%.Thismaybeabiglimitationfortissueidentifyinganddifferentiation.Thiswokfocusesonthestudyofthe

Conference 8659


MISabilitytoidentifybiologicaltissuesandtodifferentiatethem.Severalidentificationmethodsareproposedandevaluatedinordertoassesstheirrelevance.Eachmethodusesspecificcriteriaappliedtothemultispectralimages.TheMISproducesaB(L)?spectralcubeofbiologicaltissuesinthespectralrangeof500nmto980nm.Thetissueimageswereobtainedaccordingtothe0°/45°CIEstandardgeometry.AnimageprocessingalgorithmextractateachpixeloftheimagethecorrespondingB(L)???A?databaseofB(L)forvarioustissuescanbeestablished.WecarriedoutmeasurementofB(L)spectralcube,forfourtissuetypespresentinthesamescene:beefmuscle,porkmuscle,turkeymuscleandbeef.Theexperimentwasrepeatedwiththesametypesoftissuefromdifferentanimals.Theidentificationmethodswereappliedtothemultispectralimagesofthescenes.Themethodrelevanceisevaluatedbycalculatingidentificationrateandover-detectionrateforeachtissueateachscene.ThefirstmethodwestudiedisbasedonthecorrelationofspectralluminancefactorsB(L).Detectionratesrangingfrom64%to89%,dependingontissuetype,wereobtained.Thiscorrelationcriterionmaintainslowratesofover-detection,ie0.5%to5.5%.Asecondmethodwasdefinedbycalculatingthederivativeofeachofthespectralluminancefactors.Thesingularcharacteristicsoftissueswerethushighlighted.ThecorrelationfactoriscalculatedontheB(L)derivatives.Detectionrateshaveimprovedandvaryfrom79%to89%.Averylowover-detectionraterangingfrom0.1%to0.2%isobtained.Toimprovedetectionrates,athirdidentificationmethodisproposedandimplemented.Somerelevantwavelengthshavebeenselectedtodisplay

8659-19, Session 5

A CMOS image sensor using floating capacitor load readout operationShunichiWakashima,YasuyukiGoda,Tsung-LingLi,RihitoKuroda,ShigetoshiSugawa,TohokuUniv.(Japan)

Inthispaper,aCMOSimagesensorusingfloatingcapacitorloadreadoutoperationhasbeendiscussed.Thefloatingcapacitorloadreadoutoperationisusedduringpixelsignalsreadout.Andthisoperationhastwofeatures;one,in-pixelsourcefollowersaredrivenbyloadcapacitorwithoutcurrentsources,two,parasiticcapacitorofverticalsignallineisusedasstoragecapacitor.Thisoperationproducesthreeadvantages.Smallerchipsize,loweroutputnoiseandlowerpowerconsumptionthanconversionalCMOSimagesensorsareachieved.TheprototypeCMOSimagesensorwasfabricatedusinga0.18um1P3MCMOStechnologywithpinnedphotodiodes.Thediesizeis2.5mm(H)x2.5mm(V),thepixelsizeis4.5um(H)x4.5um(V),andthenumberofpixelsis400(H)x300(V).Theconversiongainis67uV/e-.Thisimagesensorconsistsofonlypixelarray,scanningcircuits,columnsourcefollowersofwhichheightisaslowasthatofthepixelandtwooutputbuffers.Thesizeofperipheralcircuitisreducedby81%ofconventionalCMOSimagesensors.Thedarkrandomnoiseofpixelsexceptresetnoise(afterCDS)is160uVrms.Thenoiseofconventionalimagesensoris340uVrms;therefore,53%reductionisachieved.Thepowerconsumptionforpixeloutputreadoutreduced97%ofconventionalCMOSimagesensor.

8659-20, Session 5

A UV Si-photodiode with almost 100% internal Q.E. and high transmittance on-chip multilayer dielectric stackYasumasaKoda,RihitoKuroda,TaikiNakazawa,YukihisaNakao,ShigetoshiSugawa,TohokuUniv.(Japan)

Ultravioletlight(UV-light)sensing,especiallyinabout200-300nmwavelength,isrequired.Sensitivity(externalquantumefficiency)isexpressedbyinternalquantumefficiencytimestransmittance.Recently,atechnologytofabricatephotodiodewithalmost100%internalquantumefficiencyandhighstabilitytoUV-lighthasbeenreported[1][2].Weintroduceahightransmittanceon-chipmultilayerdielectricstackabovethistypeofphotodiodeanddemonstratetheimprovedsensitivityandhighstabilitytoUV-light.

WefocusedonSiO2andSi3N4whichhavebeenusedasinterlayerdielectricinLSI.WeemployedamicrowaveexcitedhighdensityplasmaCVDequipmenttoformhighqualitySi3N4.[3]BydecreasinggasflowratesofSiH4andH2,weobtainedalowextinctioncoefficientof0.013at215nmwavelength.

ByusingthedevelopedSi3N4film,wefabricatedphotodiodewithon-chipmultilayerdielectricstack;Si/SiO2(8nm)/Si3N4(13nm)/SiO2(44nm)/Si3N4(25nm)withbandpassfilter-typehightransmittancetoUV-light.Weobtainedthepeakof86.9%externalquantumefficiencyat238nmwavelengthandsensitivity-degradationinUV-lightregiondidnotappear.

Inconclusion,byoptimizingthestructureandthicknessofthestackusingSiO2andlowextinctioncoefficientSi3N4withthehighUV-lightsensitivityphotodiodetechnology,highexternalquantumefficiencyandhighstabilitytoUV-lightwaresuccessfullyobtained.

[1]R.Kurodaetal.,“HighUltravioletLightSensitiveandHighlyReliablePhotodiodewithAtomicallyFlatSiSurface”InternationalImageSensorWorkshop,pp.38-41,2011

[2]T.Nakazawaetal.,“PhotodiodedopantstructurewithatomicallyflatSisurfaceforhighsensitivitytoUV-light”TheInternationalSocietyforOpticalEngineering,2012

[3]Y.Nakaoetal.,“HighQualityandLowThermalBudgetSiliconNitrideDepsitionUsingPECVDforGateSpacer,SilicideBlockandContactEtchStopper”SolidStateDevicesandMaterials,2011

8659-21, Session 6

High sensitivity analysis of speckle patterns: a technological challenge for biomedical optics (Invited Paper)Jean-MichelTualle,KiniaBarjean,EricTinet,DominiqueEttori,Univ.Paris13(France)

Diffuselightintissuecanbeaveryinterestingtoolformedicaldiagnosis,especiallyifoneconsidersthefluctuationsofthespecklepattern.Ofcourse,speckleanalysissuffersfromthelowspatialcoherenceofspecklepatterns,andmulti-pixeldetectionisrequiredinordertoincreasethesignaltonoiseratio.Thereisthereforeaneedofasetupwithhighsensitivity,capableofoutputtingasignalfromnoisethroughaveragingonahighnumberofpixels,asthesignalcanbelowerthanthephotonlevelforoneimageandoneframe.Furthermore,suchaprocessinghastobedoneataveryhighacquisitionrate.“Smart-pixels”arrayscanrepresentamajorbreakthroughinthisfield.

8659-22, Session 6

Gesture recognition on smart camerasAzizDziri,MehdiDarouich,StéphaneChevobbe,CEA(France)

NoAbstractAvailable

8659-23, Session 6

3DS-colorimeter based on a mobile phone camera for industrial applicationsJariMiettinen,BirgittaMartinkauppi,VTTElectronics(Finland);PekkaSuopajaervi,VTTTechnicalResearchCenterofFinland(Finland)

Wehavedevelopedanovelmobilephonebasedmeasurementaccessorydevicewhichiscalledas3DScolorimeter.Thisisanew,low-costconsumer-orindustrial-levelproduct.Itissuitablealmostanymobilephonewithacamera.Whenthe3DScolorimeterisattachedonthemobilephone,thenthissystemiscapabletodocolorimetricmeasurementandevaluation.Asthe3Dpartofthenameindicates,thesystemcandoalso3Dsurfaceevaluation.Thispaperstartswithanintroductionofthedeviceoperationinwhichwedemonstrateitsuitabilityespeciallyforindustrialuse.Thedeviceisalsosuitable

Conference 8659


foraconsumerbecausethepriceofthedeviceisquitelowanditsuseisveryeasy.Thegoalofthispaperis,however,tostudytherepeatabilityandreliabilityofthe3Dandcolorimetricmeasurementsforindustrialuse.WealsoevaluatetheaccuracyofcolorimetricresultswithaMacbethchart.Thesetwofactors,repeatabilityandaccuracyareimportantforanydeviceintendedforreliablemeasurements.Itisalsopossibletosave,compareandsenddataandthusdocumentthemeasurements.

8659-24, Session 6

A single lens with no moving parts for rapid high-resolution 3D image capture DanielC.Gray,HongquiangChen,JosephCzechowski,KangZhang,JilinTu,FrederickWheeler,MasakoYamada,JuanPabloCilia,RussellDeMuth,EsmaeilHeidari,GilAbramovich,KevinG.Harding,GEGlobalResearch(UnitedStates)

Presentlyavailablemethodstocapture3Ddata(stereocameras,mechanicalfocusshifting,andstructuredillumination),arelimitedinspeed,complexity,andtransverseresolution.Wepresentanovellenssystemwithhigh-resolution,widefield-of-view,andrapidmulti-focuscapturefor3Dimaging.Theimaingesystemwasdesignedandmanufacturedwithover1000pixel-per-inchresolutionacrossa300mmfield-of-viewata700mmworkingdistance.Thelensincorporatesasinglediffractionlimitedimagingobjectivecoupledwithanelectro-opticsunittoproviderapidfocalshiftsacrossa30mmvolume.Usinga12megapixel,highspeedCMOScamera,high-resolution,volumetricimagesarecapturedinlessthan0.5seconds.Anoveldepth-from-defocusalgorithmwasdevelopedtogenerate3Dpointcloudsfromthefocalstack.Theperformanceofthelenssystemischaracterizedusingstandardopticalmetrologymethodsandcomparedagainstthetheoreticaldesign.Threedimensionaltesttargetsareimagedtodemonstratetheutilityandperformanceoftheassembledsystem.ThismaterialwaspreparedwiththesupportoftheU.S.ArmyunderAwardNo.W911NF-11-C-0256.However,anyopinions,findings,conclusionsorotherrecommendationsexpressedhereinarethoseoftheauthor(s)anddonotnecessarilyreflecttheviewsoftheU.S.Army.

8659-25, Session 6

Measurement and description method for image stabilization performance of digital camerasNorihiroAoki,NikonCorp.(Japan);HiroyaKusaka,PanasonicCorp.(Japan);HiroyukiOtsuka,CanonInc.(Japan)

Imagestabilizationfunctionalityiswidelyacknowledgedasanautomatedfunctionofdigitalcameras.However,becauseunifiedmethodsofmeasurementofstabilizationperformancehadnotbeendeveloped,theCamera&ImagingProductsAssociation(CIPA)standardizedthemeasurementanddescriptionmethodsforimagestabilizationperformance.

Forstandardization,CIPAformulatedtwovibrationwaveformsthatalmostfaithfullysimulatethecharacteristicsofcamerashakethatoccurwhenusersactuallyshootwiththedigitalcamerainhand.Usingthesevibrationwaveforms,vibrationisappliedtothedigitalcamerawhichismountedonavibratoryapparatus.Thespecifiedchartisphotographedinthisstate,andfromthemotionbluramountoftheimage,imagestabilizationperformanceismeasured.Imagestabilizationperformanceiscalculatedin“stops”convertedintoshutterspeeds.

Byusingvibrationwaveforms,measurementman-hoursaregreatlyreduced,andmeasurementisalsomadepossiblefordigitalcamerasinwhichtheimagestabilizationfunctioncannotbedeactivated.Moreover,byintroducingamethodthatremovestheeffectsofbokehcausedbytheopticalperformanceofthelensfrommotionblur,ahighaccuracyofmeasurementisrealized.Theimagestabilizationperformanceattainedwhenusersshootisconfirmedtobefavorablymeasuredinverificationexperimentsofcommercialdigitalcameras.

8659-26, Session 6

Interconnected network of camerasMahdadHosseiniKamal,HosseinAfshari,YusufLeblebici,AlexSchmid,PierreVandegheynst,EcolePolytechniqueFédéraledeLausanne(Switzerland)

Thereal-timedevelopmentofmulti-camerasystemsisagreatchallenge.Synchronizationandlargedataratesofthecamerasaddstothecomplexityofthesesystemsaswell.Thecomplexityofsuchsystemalsoincreasesasthenumberoftheirincorporatingcamerasincreases.Thecustomaryapproachtoimplementationofsuchsystemisacentraltype,wheretheentirerawstreamfromthecameraarefirststoredthenprocessedfortheirtargetapplication.Analternativeapproachistoembedsmartcamerastothesesystemsinsteadofordinarycameraswithlimitedornoprocessingcapability.Smartcameraswithintraandintercameraprocessingcapabilityandprogrammabilityatthesoftwareandhardwarelevelwilloffertherightplatformfordistributedandparallelprocessingformulti-camerasystemsreal-timeapplicationdevelopment.Intercameraprocessingrequirestheinterconnectionofsmartcamerasinanetworkarrangement.Anovelhardware-emulatingplatformisintroducedfordemonstratingtheconceptoftheinterconnectednetworkofcameras.Amethodologyisdemonstratedfortheinterconnectionnetworkofcameraconstructionandanalysis.Asampleapplicationisdevelopedanddemonstrated.

Conference 8659


Conference 8660: Digital Photography IXMonday-Wednesday4–6February2013

Part of Proceedings of SPIE Vol. 8660 Digital Photography IX

8660-1, Session 1

Multichannel camera calibrationWeiLi,JulieKlein,RWTHAachen(Germany)

Forthelatestcomputervisionapplications,itbecomesmoreandmorepopulartotakeadvantageofmultichannelcameras(RGBcameras,etc.)toobtainnotonlygrayvaluesbutalsocolorinformationofpixels.Thecurrentlymostcommonapproachformultichannelcameracalibrationisthestraightforwardapplicationofmethodsdevelopedforcalibrationofsinglechannelcameras.Theseconventionalcalibrationmethodsmaygivequitepoorperformancesincludingcolorfringesanddisplacementoffeatures,especiallyforhigh-resolutionmultichannelcameras.Inthispaper,tosuppresstheundesiredeffects,anovelmultichannelcameracalibrationapproach,consideringeachsinglechannelindividuallyandinvolvingdifferenttransversalchromaticaberrationmodels,isintroducedandevaluated.Incomparisontothestandardapproach,theproposedapproachprovidesmoreaccuratecalibrationresultsinmostcasesandshouldleadsubsequentlytomorereliableestimationresultsforcomputervisionissues.Moreover,besidestheexistingtransversalchromaticaberration(TCA)model,furtherTCAmodelandcorrectionmethodsareintroducedwhicharesuperiortotheexistingones.Sincetheproposedapproachisbasedonthemostpopularcalibrationroutine,onlyminimalmodificationshavetobemadetotheexistingapproachestoobtaintheimprovedcalibrationquality.

8660-2, Session 1

Direct spatio-spectral datacube reconstruction from raw data using a spatially adaptive spatio-spectral basisYusukeMonno,MasayukiTanaka,MasatoshiOkutomi,TokyoInstituteofTechnology(Japan)

Spectralreflectanceisaninherentpropertyofobjectsthatisusefulformanycomputervisiontasks.Thespectralreflectanceofascenecanbedescribedasaspatio-spectral(SS)datacube,inwhicheachvaluerepresentsthereflectanceataspatiallocationandawavelength.Inthispaper,weproposeanovelmethodthatreconstructstheSSdatacubedirectlyfromrawdata,inwhichspectralmeasurementsarespatiallysampled.WefirstdescribetheSSdatacubeasalinearcombinationofSSbasisvectors.Then,basedontherawdata,weestimatethelinearcoefficientsoftheSSbasisvectors.IncontrasttothepreviousmethodsinwhichspatiallyinvariantSSbasisvectorsareusedforreconstructingtheSSdatacube,ourmethodadaptivelygeneratestheSSbasisvectorsforeachspatiallocation.ExperimentalresultsdemonstratethatourproposedmethodcanaccuratelyreconstructtheSSdatacubecomparedtothemethodsusingthespatiallyinvariantSSbasisvectors.

8660-3, Session 1

Joint focus stacking and high dynamic range imagingQinchunQian,BahadirGunturk,LouisianaStateUniv.(UnitedStates);AzizUmitBatur,TexasInstrumentsInc.(UnitedStates)

Focusstackingandhighdynamicrange(HDR)imagingaretwowell-knowncomputationalphotographyparadigms;theformeraimstocreateanextendeddepthoffieldimagefrommultipleimageswithlimitedfocusdepth,whereasthelatteraimstocreateahighdynamicrangeimagefrommultipleimageswithlowdynamicrange.Inthis

paper,wediscusstheideaofcombiningfocusstackingandHDRcreationsothattheresultingimagehasbothhigherdynamicrangeandextendeddepthoffield.WedevelopedanAndroidsmartphoneappthatallowsselectingmultipleregionsofinteresttodeterminetheoptimalfocusandexposuresettings.ImagescapturedwithdifferentfocusregionsandexposuretimesarethenmergedtoformanextendeddepthoffieldandHDRimage.Wehaveconsideredbothlocalsharpnessandexposureinformationduringthemergeprocess.Wehavealsodevelopedanalgorithmtoestimatebothphotometricmappingfunctionanddenseopticalflow.ExperimentalresultsobtainedonanAndroidsmartphoneshowthattheproposedmethodisreliableandefficienttobeimplementedasamobileimagingapplication.

8660-4, Session 1

FUJIFILM X10 white orbs and DeOrbItHenryG.Dietz,Univ.ofKentucky(UnitedStates)

TheFUJIFILMX10isahigh-endenthusiastcompactdigitalcamerausinganunusualsensordesign.Unfortunately,uponitsFall2011release,thecameraquicklybecameinfamousfortheuniquelydisturbing“whiteorbs”thatoftenappearedinareaswherethesensorwassaturated.FUJIFILM’sfirstattemptatafixwasfirmwarereleasedonFebrurary25,2012--ithadlittleeffect.InApril2012,asensorreplacementreducedtheproblemtoanacceptablelevel.

Thispaperexploresthe“whiteorb”phenomenonindetail.AfterFUJIFILM’sattemptatafirmwarefixfailed,theauthordecidedtocreateapost-processingtoolthatautomaticallycouldrepairexistingimages.DeOrbItwasreleasedasafreetoolonMarch7,2012.Tobetterunderstandtheproblemandhowtofixit,theWWWformversionofthetoollogsimages,processingparameters,andevaluationsbyusers.Thecurrentpaperdescribesthetechnicalproblem,thenovelcomputationalphotographymethodsusedbyDeOrbittorepairaffectedimages,andthepublicperceptionsrevealedbythisexperiment.

8660-5, Session 1

Automated calibration and optical testing of the AWARE-2 gigapixel multiscale cameraDavidS.Kittle,DanielL.Marks,DavidJ.Brady,DukeUniv.(UnitedStates)

Gigapixel-classcameraspresentnewchallengesincalibration,mechanicaltesting,andopticalperformanceevaluation.TheAWARE-2gigapixelcamerahasnearlyone-hundredmicro-camerasspanninga120degreewideby40degreetallfieldofview,withonepixelspanningan8arcsecfieldangle.The98individualmicro-cameraimagesarecapturedtowithinone-tenthofasecondforallthecamerasinthearraythenstitchedintoacompositepanoramicimage.ThismakestheAWARE-2systemidealforcapturingsnapshot,high-resolutionimageryofdynamicevents.

Stitchingtheimagesrequirespixel-levelaccuratemappingovertheentireFOV,thereforeatestbedhasbeendevelopedtoautomaticallycalibrateandtesteachmicro-camerainthearray.MTFmeasurements,pointingvectorsofthemicro-camerasforcompositing,focuscalibration,distortionmapping,andilluminationprofilemeasurementsaredetailedforthe1-gigapixelAWARE-2prototypecamerasystem.Thisdatathenbuildsalookuptablefornearreal-timecompositingandviewing,significantlyimprovingthesystemmodelandoverallreconstructionquality.


8660-6, Session 2

Across-resolution adaptive dictionary learning for single-image super-resolutionMasayukiTanaka,AyumuSakurai,MasatoshiOkutomi,TokyoInstituteofTechnology(Japan)

Thispaperproposesanoveladaptivedictionarylearningapproachforasingle-imagesuper-resolutionbasedonasparserepresentation.Theadaptivedictionarylearningapproachofthesparserepresentationisverypowerful,forimagerestorationsuchasimagedenoising.Theexistingadaptivedictionarylearningrequirestrainingimagepatcheswhichhavethesameresolutionastheoutputimage.Becauseofthisrequirement,theadaptivedictionarylearningforthesingle-imagesuper-resolutionisnottrivial,sincetheresolutionoftheinputlow-resolutionimagewhichcanbeusedfortheadaptivedictionarylearningisessentiallydifferentfromthatoftheoutputhigh-resolutionimage.Itisknownthatnaturalimageshavehighacross-resolutionpatchredundancywhichmeansthatwecanfindsimilarpatcheswithindifferentresolutionimages.Ourexperimentalcomparisonsdemonstratethattheproposedacross-resolutionadaptivedictionarylearningapproachoutperformsstate-of-the-artsingle-imagesuper-resolutions.

8660-7, Session 2

Aesthetic scoring of digital portraits for consumer applicationsSebastianoBattiato,MarcoMoltisanti,FabrizioRavì,Univ.degliStudidiCatania(Italy);ArcangeloRBruna,FilippoNaccari,STMicroelectronics(Italy)

ComputationalAestheticsappliedondigitalphotographyisbecominganinterestingissueindifferentframeworks(e.g.,photoalbumsummarization,imagingacquisitiondevices).Althoughitiswidelybelievedandcanoftenbeexperimentallydemonstratedthataestheticsismainlysubjective,weaimtofindsomeformalormathematicalexplanationsofaestheticsinphotographs.Weproposeascoringfunctiontogiveanaestheticevaluationofdigitalportraitsandgrouppictures,takingintoaccountfacesaspectratio,theirperceptualgoodnessintermsoflightingoftheskinandtheirposition.Alsowellknowncompositionrules(e.g.,ruleofthirds)areconsideredespeciallyforsingleportrait.Bothsubjectiveandquantitativelyexperimentshaveconfirmedtheeffectivenessoftheproposedmethodology.

8660-8, Session 2

Correcting saturated pixels in images based on human visual characteristicsJunFu,HuiPeng,Xi’anJiaotongUniv.(China);XiChen,Xi’anJiaotongUniversity(China);XuanqinMou,Xi’anJiaotongUniv.(China)

ThispaperpresentsacorrectingmethodforsaturatedimageswhichisoperatedintheYCbCrcolorspace.Thealgorithmisbasedontwohumanvisualcharacteristics,oneisthevisualsensitivitiestocolordifferencesandtheotheristheHunteffect.Duringtheprocessofcorrectingcolors,MacAdamellipsemodelmappedtotheYCbCrcolorspaceisusedtosearchthenearestcolor.Sothatcorrectedpixels’chromaticaberrationislesstothatoftheirneighborhood.AndduringtheprocessofthequantificationoftheYCbCrcomponentsfordigitalimplementation,theregionswithhighluminancearesettohavelesssaturationbasedontheHunteffect.Experimentalresultsshowthattheproposedmethodiseffectiveincorrectingfacesaturatedandantagonisticcolorimages,andthecolorandtheluminanceofunsaturatedpixelsinoriginalimagearemaintainedasmuchaspossible,especiallyfortheoptimizationoftheregionwithlessluminanceandmorecolorfulness.

8660-9, Session 3

Spectrally tunable pixel sensorsGiacomoLangfelder,CesareBuffa,AntonioF.Longoni,FedericoZaraga,PolitecnicodiMilano(Italy)

Afterthedemonstrationofthebasicworkingprinciple,ofpotentialapplicationsandofasuitableactivepixeltopology,theyareherereportedthedevelopmentandexperimentalresultsoffullyoperatingmatricesofspectrallytunablepixelsbasedontheTransverseFieldDetector(TFD).

Unlikeseveraldigitalimagingsensorsbasedoncolorfilterarraysorlayeredjunctions,theTFDhasthepeculiarfeatureofhavingelectricallytunablespectralsensitivities.Inthiswaythesensorcolorspaceisnotfixedaprioributcanbereal-timeadjusted,e.g.forabetteradaptationtothescenecontentorformultispectralcapture.

Theseadvantagescomeatthecostofanincreasedcomplexitybothforthephotosensitiveelementsandforthereadoutelectronics.ThechallengesintherealizationofamatrixofTFDpixelswillbeanalyzedinthiswork.Firstexperimentalresultsonan8x8(x3colors)andona64x64(x3colors)matrixwillbepresentedandanalyzedintermsofcolorimetricandnoiseperformance,andcomparedtosimulationpredictions.

8660-10, Session 3

Design, simulation, and evaluation of imaging oximetersSteveDai,YeTian,JoyceE.Farrell,StanfordUniv.(UnitedStates)

Computersimulationshaveplayedanimportantroleinthedesignandevaluationofimagingsensorswithapplicationsinremotesensing[1]andconsumerphotography[2-7].Inthispaper,weprovideanexampleofcomputersimulationsusedtoguidethedesignofimagingsensorsforabiomedicalapplication:Weconsiderhowsensordesign,illumination,measurementgeometry,andskintypeinfluencetheabilitytodetectbloodoxygensaturationfromnon-invasivemeasurementsofskinreflectance.Themethodologywedescribeinthispapercanbeusedtodesign,simulateandevaluatethedesignofotherbiomedicalimagingsystems.

8660-11, Session 3

A collection of hyperspectral images for imaging systems researchTorbjornSkauli,NorwegianDefenseResearchEstablishment(Norway);JoyceE.Farrell,StanfordUniv.(UnitedStates)

Asetofhyperspectralimagedataaremadeavailable,intendedforuseinmodellingofimagingsystems.Thesetcontainsimagesoffaces,landscapes,buildingsandpaintings.Thedatacoverwavelengthsfrom0.4to2.5micrometers,spanninginthevisible,NIRandSWIRelectromagneticspectralranges.TheimageshavebeenrecordedwithtwoHySpexline-scanimagingspectrometerscoveringthespectralranges0.4to1micrometersand1to2.5micrometers.Therespectiveacross-trackpixelcountofthetwosensorsis1600and320.Thehyperspectraldatasetincludesmeasuredilluminantsandsoftwareforconvertingtheradiancedatatoestimatedreflectance.Fullsensorcalibrationdataareavailable,enablingestimationofsignal-dependentnoisefromthephysicalsensormodel.TheimagesarebeingmadeavailablefromtheStanfordCenterforImageSystemsEngineering(SCIEN).

8660-12, Session 3

Evaluation of multispectral plenoptic cameraLingfeiMeng,TingSun,RichKosoglow,KathrinBerkner,RicohInnovations,Inc.(UnitedStates)

Conference 8660


Plenopticcamerasenablecaptureofa4Dlightfield,allowingdigitalrefocusinganddepthestimationfromdatacapturedwithacompactportablecamera.Whereasmostoftheworkonplenopticcameradesignhasbeenbasedasimplisticgeometric-optics-basedcharacterizationoftheopticalpathonly,littleworkhasbeendoneofoptimizingend-to-endsystemperformanceforaspecificapplication.Suchdesignoptimizationrequiresdesigntoolsthatneedtoincludecarefulparameterizationofmainlenselements,aswellasmicrolensarrayandsensorcharacteristics.Inthispaperweareinterestedinevaluatingtheperformanceofamultispectralplenopticcamera,i.e.acamerawithspectralfiltersinsertedintotheapertureplaneofthemainlens.Suchacameraenablessingle-snapshotspectraldataacquisition.Wefirstdescribeindetailanend-to-endimagingsystemmodelforaspectrallycodedplenopticcamera.Differentperformancemetricsaredefinedtoevaluatethespectralreconstructionquality.WethenpresentaprototypethatisdevelopedbasedonamodifiedDSLRcameracontainingalensletarrayonthesensorandafilterarrayinthemainlens.Finallyweevaluatethespectralreconstructionperformanceofamultispectralplenopticcamera.

8660-13, Session 4

Median filtering in multispectral filter array demosaickingXingboWang,Univ.deBourgogne(France)andGjøvikUniv.College(Norway);Jean-BaptisteThomas,Univ.deBourgogne(France);JonY.Hardeberg,GjøvikUniv.College(Norway);PierreGouton,Univ.deBourgogne(France)

NoAbstractAvailable

8660-14, Session 4

A novel blind image quality assessment metric and its feature selection strategyYingChu,Xi’anJiaotongUniv.(China)andShenzhenUniv.(China);XuanqinMou,WeiHong,Xi’anJiaotongUniv.(China);ZhenJi,ShenzhenUniv.(China)

Werecentlyproposedanaturalscenestatisticsbasedimagequalityassessment(IQA)metricnamedSTAIND,whichextractsnearlyindependentcomponentsfromnaturalimage,i.e.,thedivisivenormalizationtransform(DNT)coefficients,andevaluatesperceptualqualityofdistortionimagebymeasuringthedegreeofdependencybetweenneighboringDNTcoefficients.ToimprovetheperformanceofSTAIND,itsfeatureselectionstrategyisthoroughlyanalyzedinthispaper.

ThebasicneighborrelationshipsinSTAINDincludescale,orientationandspace.ByanalyzingthejointhistogramsofdifferentneighborshipsandcomparingtheIQAmodalperformancesofdiversefeaturecombinationschemesinthepubliclyavailabledatabasessuchasLIVE,CSIQandTID2008,wedrawthefollowingconclusions:1)Spatialneighborrelationshipcontributesmosttothemodaldesign,scaleneighborshiptakessecondplace,andorientationneighborsmightintroducenegativeeffects;2)Inspacedomain,secondorderspatialneighborsarebeneficialsupplementstofirstorderspatialneighbors;3)CombiningscaleandspaceneighborshipsandintroducingspatialparentneighborscouldbehelpfulforblindIQAmetricdesign.

8660-15, Session 4

Edge patterns extracted from natural images and their statistics for reduced-reference image quality assessmentWentingShao,XuanqinMou,Xi’anJiaotongUniv.(China)

Naturalimagesareofplentyofedgeswhichresultsinthescalinglaw.Thisfacthintsthatthespatialdistributionofedgepointsofnaturalimagesareofsomespecificpatterns.Basedonthisthought,we

proposedtolearnanumberofgroupsofthenamededgepatternstorepresentthedistributionofzero-crossings(ZC)mapsofboththenaturalimagesandtheirdistortioncounterpartsandthensuccessfullyproposedaRR(ReducedReference)IQA(ImageQualityAssessment)metric.Inthispaper,wefocusontheissuesthatwhetherthereexistsomespecificedgepatternsbeinggoodatrepresenttheZCmapsofnaturalimages,whattheruleisbywhichtheedgepatternsareselected,andiftheaboveistrue,howwecouldusethoseedgepatternstoperformanIQAmodel.Inthisinvestigation,weraiseaprocessfortheedgepatternsselectionbyproposingthesmoothnessconstraintandtheerrortoleranceinthestatisticofedgepatternsrepresentation,andcollectRRfeatures.ExperimentsshowthattheselectededgepatternscanwellrepresenttheZCmapsofnaturalimages,aswellasthedistortionimageswitherrortolerance.TheIQAmodelevaluationshowsthattheproposedmetricworksfairlygoodcomparedtoitscompetitor.

8660-16, Session 5

Mobile-phone-based clinical microscopy (Invited Paper)DanielA.Fletcher,Univ.ofCalifornia,Berkeley(UnitedStates)

Microscopyisacriticaltoolfordiseaseresearch,screening,anddiagnosis.Presently,useofmicroscopyforhealthcareisoftenlimitedtowell-equippedmedicallaboratorysettingsstaffedbyqualifiedpersonnel.Inthedevelopingworldandotherunderservedregions,thelackofequipmentandexpertiserequiredfordiagnosticmicroscopycontributestopoorhealth,spreadoftreatablediseases,andemergenceofdrug-resistantdiseasestrains.Whilemedicalresourcesarescarceinmanydevelopingcountriesandruralcommunities,thewidespreadavailabilityofwirelesscommunicationandcamera-enabledmobilephoneshasthepotentialtofundamentallychangethewaymedicaldiagnosesareperformed.Acompactandportablemicroscopysystembasedonamobilephoneandcapableofimagecapture,imageprocessing,andcommunicationwithmedicalexpertscoulddramaticallyincreaseaccesstobasichealthcarebydeliveringservicesclosertowherepatientsliveandwork.Thistalkwilldescriberecentprogressdevelopingandimplementingsuchadevice,whichwecallCellScope,toimprovediagnosisofinfectiousdiseases.

8660-17, Session 5

Computational microscopy, sensing, and diagnostics on a cellphone (Invited Paper)AydoganOzcan,Univ.ofCalifornia,LosAngeles(UnitedStates)

Todaytherearecloseto6billioncell-phoneusersintheworld,andthemajorityofthesecellphonesarebeingusedinthedevelopingpartsoftheworld.Thismassivevolumeofwirelessphonecommunicationbringsanenormouscost-reductiontocellphonesdespitetheirsophisticatedhardwareandsoftwarecapabilities.Utilizingthisadvancedstateoftheartofthecellphonetechnologytowardpoint-of-carediagnostics,sensingandmicroscopicimagingapplicationscanoffernumerousopportunitiestoimprovehealthcareespeciallyinthedevelopingworldwheremedicalfacilitiesandinfrastructureareextremelylimitedorevendonotexist.Forthisend,herewereviewourrecentprogressontheuseofcellphonesascomputationalmicroscopes,bio-chemicalsensorsaswellasmedicaldiagnostictools.

8660-18, Session 5

Optics in pathology: from superresolution microscopy to point-of-care devices (Invited Paper)SebastianWachsmann-Hogiu,NSFCtr.forBiophotonicsScienceandTechnology(UnitedStates)

Associetystrugglestocontinuetomakestridesinbasicbiologicalresearchandtocontinuetomeetglobaldemandformedicalcareat

Conference 8660


decreasingcost,opticaltechnologiesoffercompellingsolutionsbyprovidingdetailedchemicalandstructuralinformationaboutbiologicalsamples,oftenwithouttheneedforcostlychemicalconsumablesorlaborandexpertise-intensivebiopsyprocedures.InthistalkIwillpresentourgroup’sworktodevelopandapplynovelopticaltechnologiesforpathology.

Oneexampleisthedevelopmentoftwoattachmentstoacommercialcellphonethattransformthephone’sintegratedlensandimagesensorintoa350xmicroscopeandvisible-lightspectrometer.Themicroscopeiscapableoftransmissionandpolarizedmicroscopymodesandisshowntohave1.5micronresolutionandausablefield-of-viewof~150x150micronswithnoimageprocessing,andapproximately350x350micronswhenpost-processingisapplied.Thespectrometerhasa300nmbandwidthwithalimitingspectralresolutionofcloseto5nm.Weshowapplicationsofthedevicestomedicallyrelevantproblems.Inthecaseofthemicroscope,weimagebothstainedandunstainedblood-smearsshowingtheabilitytoacquireimagesofsimilarqualitytocommercialmicroscopeplatforms,thusallowingdiagnosisofclinicalpathologies.Withthespectrometerwedemonstrateacquisitionofawhite-lighttransmissionspectrumthroughdiffusetissueaswellastheacquisitionofafluorescencespectrum.

8660-19, Session 6

SPCA: a no-reference image quality assessment based on the statistic property of the PCA on nature imagesYunZhang,ChaoWang,XuanqinMou,Xi’anJiaotongUniv.(China)

Despitetheacceptableperformanceofcurrentfull-referenceimagequalityassessment(IQA)algorithms,theneedforareferencesignallimitstheirapplication,andcallsforreliableno-referencealgorithms.Mostno-referenceIQAapproachesaredistortionspecific,aimingtomeasureimageblur,JPEGblockingorJPEG2000ringingartifactsrespectively.Inthispaper,weproposedano-referenceIQAalgorithmbasedonthepropertyofprincipalcomponentanalysisonnatureimage,whichdoesnotassumeanyspecifictypeofdistortionoftheimage.Themethodgetsstatisticsofdiscretecosinetransformcoefficientsfromthedistortimage’sprincipalcomponents.Thosefeaturesaretrainedby?-supportvectorregressionmethodandfinallytestonLIVEdatabase.Theexperimentalresultsshowahighcorrelationwithhumanperceptionofquality(averagelyover90%byscoresofSROCC),whichisfairlycompetitivewiththeexistingno-referenceIQAmetrics.

8660-20, Session 6

Local binary pattern statistics feature for reduced reference image quality assessmentMinZhang,GifuUniv.SchoolofMedicine(Japan);XuanqinMou,Xi’anJiaotongUniv.(China);HiroshiFujita,GifuUniv.SchoolofMedicine(Japan);LeiZhang,TheHongKongPolytechnicUniv.(China);XiangrongZhou,GifuUniv.SchoolofMedicine(Japan);WufengXue,Xi’anJiaotongUniv.(China)

Measurementofvisualqualityisoffundamentalimportancefornumerousimageandvideoprocessingapplications.Thispaperpresentedanovelandconcisereducedreference(RR)imagequalityassessmentprototype.Statisticsoflocalbinarypattern(LBP)isintroducedasasimilaritymeasuretoformanovelRRimagequalityassessment(IQA)methodforthefirsttime.First,thetestimageisdecomposedwithamulti-scaletransform.Second,LBPencodingmapsareextractedforeachsubbandimages.Third,thehistogramsareextractedfromtheLBPencodingmaptoformtheRRfeatures.Inthisway,imagestructureprimitiveinformationforRRfeaturesextractioncanbereducedgreatly.Hence,newRRIQAmethodisformedwithonlyatmost56RRfeatures.TheexperimentalresultsontwolargescaleIQAdatabasesshowthatthestatisticofLBPsisfairlyrobustandreliabletoRRIQAtask.Theproposedmethodshowsstrongcorrelationswithsubjectivequalityevaluations.

8660-21, Session 7

Evaluation of a high dynamic range video camera with non-regular sensorMichaelSchöberl,Friedrich-AlexanderUniv.Erlangen-Nürnberg(Germany)andFraunhofer-InstitutfürIntegrierteSchaltungen(Germany);JoachimKeinert,MatthiasZiegler,Fraunhofer-InstitutfürIntegrierteSchaltungen(Germany);JürgenSeiler,Friedrich-AlexanderUniv.Erlangen-Nürnberg(Germany);MarcoNiehaus,GeraldSchuller,TechnischeUniv.Ilmenau(Germany);AndréKaup,Friedrich-AlexanderUniv.Erlangen-Nürnberg(Germany);SiegfriedFößel,Fraunhofer-InstitutfürIntegrierteSchaltungen(Germany)

Althoughthereissteadyprogressinsensortechnology,imagingwithahighdynamicrange(HDR)isstilldifficultformotionimagingwithhighimagequality.Thispaperpresentsournewapproachforvideoacquisitionwithhighdynamicrange.Theprincipleisbasedonopticalattenuationofsomeofthepixelsofanexistingimagesensor.Thiswellknownmethodtraditionallytradesspatialresolutionforanincreaseindynamicrange.Incontrasttoexistingwork,weuseanon-regularpatternofopticalNDfiltersforattenuation.Thisallowsforanimagereconstructionthatrecovershighresolutionimages.Thereconstructionisbasedontheassumptionthatnaturalimagescanberepresentednearlysparseintransformdomains,whichallowsforrecoveryofsceneswithhighdetail.Theproposedcombinationofnon-regularsamplingandimagereconstructionleadstoasystemwithanincreaseindynamicrangewithoutsacrificingspatialresolution.Inthispaper,afurtherevaluationispresentedontheachievableimagequality.Inourprototypewefoundthatcrosstalkispresentandsignificant.Thediscussionthusshowsthelimitsoftheproposedimagingsystem.


Spectral characterization of COTS RGB cameras using a linear variable edge filterDanielL.Bongiorno,MitchBryson,DonaldG.Dansereau,StefanB.Williams,TheUniv.ofSydney(Australia)

Thespectralresponseofthecolorchannelsofacommercial-off-the-shelf(COTS)Red/Green/Blue(RGB)cameraisoftenunknown.Knowledgeofthisresponsecanbeusefulforawidevarietyofapplicationssuchassimulatinganimalvision,colorcorrectionandcolorspacetransformations.COTScamerasarewidelyusedduetotheirlowcostandeaseofimplementation.WeinvestigateamethodusingaLinearVariableEdgeFilter(LVEF)andalow-costspectrometertocharacterizeanRGBcamera;thismethodhastheadvantageoverpreviousmethodsinthesimplicityandhighspeedofthespectralcharacterization.Resultsarepresentedforthreecameras:aconsumer-leveldigitalSLR;theothertwoarepoint-and-shootconsumergradecameraswithoneofthembeinganunderwatercamera.


Focus finding using scale invariant patternsDavidP.Morgan-Mar,MatthewR.Arnison,CanonInformationSystemsResearchAustraliaPty.Ltd.(Australia)

Wepresentatest-chartbasedfocusfindingsystemforuseinimagequalitymeasurementwithdigitalSLRcameras.Weestablishanabsolutemeasurementoffocuspositionindependentofanymechanicalinstabilityinthefocusdrivemotorbycombiningopticalfocusbreathing,highaccuracyscaleinvariantalignmentpatterns,andblur-invariantphasecorrelation.Thiscombinationallowsaccuratemeasurementoffocuspositionunderconditionsofchartscaling,testrigmisalignment,anddefocus.Theresultingabsolutefocusscaleallowsanefficientclosed-loopsearchforthebestfocusposition,followedbyareturntothefoundpositionforsubsequentimagequalitymeasurement.Thismethodrapidlydrivesthelenstothebestfocuspositiontowithinonefocusmotorpulse,whichissignificantlymoreaccurateandreliablethancameraauto-focussystems,andanorderofmagnitudefasterthananexhaustivefocussweep.Weevaluated

Conference 8660


ourmethodwith8771focusfindingoperationsoverarangeoffocallengthsandfoundthatitallowsthepeakfocusMTFtobemeasuredconsistentlytowithin1.4%RMS.


Frequency division multiplexed imagingBahadirGunturk,MartinFeldman,LouisianaStateUniv.(UnitedStates)

Inthispaper,wedescribefrequencydivisionmultiplexedimaging(FDMI),wheremultipleimagesarecapturedsimultaneouslyinasingleshotasamultiplexedimageandlaterextractedfromthemultiplexedimagethroughfrequencydomainfiltering.ThisisachievedbyspatiallymodulatingtheimagessothattheyareplacedatdifferentlocationsintheFourierdomain.Thetechniqueassumesthattheimagesareband-limitedandplacedatnon-overlappingfrequencyregionsthroughaspatiallightmodulationprocess.TheFDMItechniquecouldbeusedtoinavarietyofapplications,includingstereoscopicimagecapturewithasinglesensorandsub-exposuremotionanalysis.Wepresentopticalsetupsandprovideexperimentalresultswithrealandsimulateddatatoillustratetheidea.


Bayesian estimation of device spectral sensitivities and its application for improvement of color accuracy using color balancing filterYusukeMurayama,JayArreO.Toque,PengchangZhang,AriIde-Ektessabi,KyotoUniv.(Japan)

WeproposedaBayesianmethodtorecoverthesystemspectralsensitivitiesofacolorimagingdevicefromanacquiredcolorchartimage,andaschemetopredictacolorbalancingfilterthatneedstobeattachedtothedevicetoimprovecoloraccuracybasedontherecoveredsensitivities.Therecoverymethodisderivedbyintroducingthenon-negativity,thesmoothnessandthezeroboundaryofthesensitivitycurvesaspriorinformation,andisranwithoutmanualtuningbyadoptingthemarginalizedlikelihoodcriterion.Thesensitivitiesaremodifiedbyavailablefilterscomputationallyandtheircoloraccuracyareestimated.Anexperimentwascarriedouttotesttheperformanceoftheproposedmethodforpredictingthecoloraccuracyimprovementusingascannerandtwokindsofcommercialcolorcharts,IT8targetasatrainingchart,andColorCheckerClassicasatestchart.Theaveragecolordifferencewasimprovedfrom4.27to2.57inthetestchartbyattachingthepredictedoptimalfilter.Theproposedmethodtorecoverforrecoveringthesystemsensitivitieswillbeabletobeappliedtochoosethecameraandilluminant,ortooptimizecalibrationmethod.


Achieving equal image quality at lower bit rates using evolved image reconstruction transformsBrendanJ.Babb,FrankW.Moore,Univ.ofAlaskaAnchorage(UnitedStates)

SeveralrecentNASAmissionshaveusedthestate-of-the-artwavelet-basedICERProgressiveImageCompressorforlossyimagecompression.Inthispaper,wedescribeamethodologyforusingevolutionarycomputationtooptimizewaveletandscalingnumbersdescribingreconstruction-onlymultiresolutionanalysis(MRA)transformsthatarecapableofacceptingasinputtestimagescompressedbyICERsoftwareatareducedbitrate(e.g.,0.99bitsperpixel[bpp]),andproducingasoutputimageswhoseaveragequality,intermsofmeansquarederror(MSE),equalsthatofimagesproducedbyICER’sreconstructiontransformwhenappliedtothesametestimagescompressedatahigherbitrate(e.g.,1.00bpp).Thisimprovementcan

beattainedwithoutmodificationtoICER’scompression,quantization,encoding,decoding,ordequantizationalgorithms,andwithverysmallmodificationstoexistingICERreconstructionfiltercode.Asaresult,futureNASAmissionswillbeabletotransmitgreateramountsofinformation(i.e.,agreaternumberofimages)overchannelswithequalbandwidth,thusachievingano-costimprovementinthesciencevalueoffutureNASAmissions.


Computer graphics solutions for pottery colors specificationFilippoD.Stanco,AnnaM.Gueli,Univ.degliStudidiCatania(Italy)

Amainissueinthearchaeologicalresearchistodealwithcolorsofsoilsandartefacts,especiallypottery.ToavoidrisksofatoosubjectiverecognitionMunsellsystemiscommonlyusedinarchaeology.Inthispaperasemi-automaticmethodofcolordetectiononselectedregionsofdigitalimagesofancientpotteryispresented.ThestartingpointoftheexperimentwastotakedigitalpicturesofspecimenstogetherwiththeGretag-MacbethColorCheckerChart.ThedigitalimageisprocessedwithcolorbalancingtechniquesaimedtorestoretheoriginalvalueofMacbethpatches,inordertoeliminatedistortionscomingfromacquisitionprocess.Afterthecolorcorrection,severalregionsofinterestareselectedfortheidentificationofsurfacecolor,thealgorithmconvertsRGBvaluesinMunselldata.ThereliabilityofourtoolisalsoverifiedcomparingthesechromaticvalueswiththecolorspecificationofpotterysherdsperformedwithaspectrocolometerusingtheCIELABspacetoevaluatethedifferences.


Investigating the lateral resolution in a plenoptic capturing system using the SPC modelMitraDamghanian,RogerOlsson,MårtenSjöström,MidSwedenUniv.(Sweden);HectorNavarro,ManuelMartinez-Corral,Univ.deValència(Spain)

Complexmultidimensionalcapturingsetupssuchasplenopticcameras(PC)introduceatrade-offbetweenvarioussystemproperties.Establishedcapturingproperties,likeimageresolution,need,therefore,tobedescribedthoroughlyforthesesystems.Hencemodelsandmetricsthatassistexploringandformulatingthistrade-offarehighlybeneficialforstudyingaswellasdesigningofcomplexcapturingsystems.Thisworkdemonstratesthecapabilityofourpreviouslyproposedsamplingpatterncube(SPC)modeltoextractthedepthdependentlateralresolution.TheSPCcarriesbothrayinformationaswellasfocalpropertiesofthecapturingsystemitmodels.TheproposedoperatorextractsthelateralresolutionfromtheSPCmodelthroughoutanarbitrarynumberofdepthplanesgivingadepth-resolutionprofile.Wehavevalidatedtheresolutionoperatorbycomparingtheresultswiththosefromothermodelsandfromempiricaldata.ThelateralresolutionpredictedbytheSPCmodelagreeswiththeresultsfromexperimentalsetupsandstrengthenstheconclusionthattheSPCfillsthegapbetweenray-basedmodelsandrealsystemmeasurements,byincludingthefocalinformationofthesystemasamodelparameter.TheSPCisprovenasimpleyetefficientmodelforextractingdepth-basedlateralresolutionasahigh-levelpropertyofcomplexplenopticcapturingsystem.

Conference 8660


Conference 8661: Image Processing: Machine Vision Applications VITuesday-Wednesday5–6February2013

Part of Proceedings of SPIE Vol. 8661 Image Processing: Machine Vision Applications VI

8661-1, Session 1

A polynomial phase-shift algorithm for high precision three-dimensional profilometryFuqinDeng,TheUniv.ofHongKong(HongKong,China);ChangLiu,Wui-FungSze,JiangwenDeng,ShunmingK.Fung,ASMPacificTechnologyLtd.(China);EdmundY.Lam,TheUniv.ofHongKong(China)

Theperspectiveeffectiscommoninrealopticalsystemsformachinevisionapplications.Inconventionalphase-shiftingalgorithms,thefrequenciesoftheprojectedsinusoidalpatternsareassumedtobeuniformatdifferentheightlevels,therefore,theerrorcausedbytheperspectiveeffectfromprojectionsystembecomespronouncedinphase-measuringprofilometry,especiallyforsomehighprecisionmetrologyapplicationssuchasmeasuringthesurfacesofthetinysemiconductorsatmicrometerlevelinthesemiconductorindustry.Inthisresearch,weinvestigatetheperspectiveeffectonphase-measuringprofilometrywhenreconstructingthesurfacesofmovingobjects.Togeneralizethephase-measuringprofilometrymodelandsolvethephasereconstructionproblemeffectively,weuseapolynomialtoapproximatethephasedistributionunderaperspectiveprojectionsystemandnameitpolynomialphase-measuringprofilometry(P-PMP)model.Underthismodel,wecharacterizehowthefrequencyoftheprojectedpatternchangesaccordingtoheightvariationsandhowthephaseoftheprojectedpatterndistributesinthemeasuringspace.Aftercalibratingthephasedistribution,weproposeapolynomialphase-shiftingalgorithm(P-PSA)tocorrecttheerrorduetoimperfectprojectionsystemduringphasereconstruction.Simulationexperimentsshowthattheproposedmethodcanimprovethereconstructionqualitybothvisuallyandnumerically.

8661-2, Session 1

High-temperature dual-band thermal imaging by means of high-speed CMOS camera systemWolfgangHauer,GeraldZauner,FHOÖForschungs&EntwicklungsGmbH(Austria)

Inthispaperwepresentanovelapproachtodeterminesurfaceradiationtemperatures(upto4000°K)ataveryhightimeresolution(~10µs)bymeansofaCMOShighspeedcamera.AsgovernedbyPlank’slawofradiation,theamountoflightemissioninthevisibleandnearinfraredrangeoftheelectromagneticspectrumattemperaturesabove1500°KissufficienttobedetectedwithaCMOSsensorevenatveryshortintegrationtimes(highspeedimaging).

Withtheaidofa3Dlensandtwodistinctspectralnarrowbandpassfilters,weapplythesocalleddual-bandmethod(ortwo-colorpyrometry),wheretheintensityratioofthermallightemissionintwodifferentspectralbandsisusedtodeducethesurfacetemperatureofanobject.Duetoslightlydifferentcameraviewingangelsofeachlens,astereocalibrationprocessisappliedinordertocorrectlyregistercorrespondingimagepixels.Additionally,weapplyimagedenoisingalgorithms,basedonanoisecharacterizationofthecamera,tofurtherimprovethemeasurementquality.

Withthepresentedsystemwesuccessfullyimagedandmeasuredthecontactsurfacetemperaturedecayoflow-voltagecircuitbreakers-averydemandingmetrologicaltask,astemperaturesof4000°Kcanoccurattimescalesoflessthan100µs.

8661-3, Session 1

A state observer for using a slow camera as a sensor for fast control applicationsReinhardGahleitner,Univ.ofAppliedSciencesUpperAustria(Austria);MartinSchagerl,UniversityofAppliedSciencesUpperAustria(Austria)

Thecontributiondealswithaproblemthatoftenarisesinvisionbasedcontrol,ifacameraisusedasasensorforfastcontrolapplications,ormoreprecise,whenthesamplerateofthecontrolloopishigherthantheframerateofthecamera.Incontrolapplicationsformechanicalaxes,e.g.inroboticsorautomatedproduction,acameraandsomeimageprocessingcanbeusedasasensortodetectpositionsorangles.Thesampletimeinthisapplicationstypicallyisbelow10msoreven1msandthisdemandstheuseofacamerawithaframerateabove100fpsor1000fps,respectively.

Thepresentedsolutionisaspecialstateobserverthatcanworkwithaslowerandthereforecheapercameratoestimatethestatevariablesatthehighersamplerateofthecontrolloop.TosimplifytheimageprocessingforthedeterminationofpositionsoranglesandmakeitmorerobustsomeLEDmarkersareappliedtotheplant.Simulationandexperimentalresultsshowthattheconceptcanbeusedeveniftheplantisunstableliketheinvertedpendulum.

8661-4, Session 2

Multiple-level patch-based object tracking using the MLBP-based integral histogramJiruiYuan,KarenO.Egiazarian,TampereUniv.ofTechnology(Finland)

Thispaperpresentsanovelmulti-levelpatch-basedapproachforobjecttrackingusingModifiedLocalBinaryPattern(MLBP)histograms.Theinitialtemplateisdividedintooverlappingrectangularpatches,andeachofthesepatchesistrackedindependentlybyfindingthemostsimilarmatchwithinasearchregion.Everypatchvotesonthepossiblelocationsoftheobjectinthecurrentframe,bycomparingitsMLBPhistogramwiththecorrespondenceinthetargetframe.Toreducetheindividualtrackingerrorofagivenpatchduetopartialocclusions,theideaofmulti-levelpatchpartitioningisfurtherdeveloped.Andthesimilaritybetweentemplateandtargetobjectiscomparedpatch-by-patch,level-by-level.Thecomparisonstartsfromthehighestlevelandprogressivelyfeedstothelowestlevelthroughamedianoperation.

Theproposedalgorithmprovidesadditionalrobustnessandeffectivenessinseveralways.First,thespatialrelationshipamongpatchesisimprovedbythisoverlappingpartitioningmanner.Second,byintroducingMLBPoperator,thetrackingaccuracyissignificantlyimproved.Third,themedianoperationutilizedinthemulti-levelvote-combiningprocessprovidesadditionalrobustnesswithrespecttooutliersresultingfromoccludedpatchesandposechanges.Theproposedmethodisevaluatedusingbothfaceandpedestriansequences,andcomparisonismadew.r.t.severalstate-of-arttrackingalgorithms,includingfrag-trackandmean-shiftmethods.Experimentalresultsshowthattheproposedmethodsignificantlyoutperformsincaseofocclusionsandposechanges.Besides,thetrackingincaseofscalechangesadditionallyprovestheeffectivenessandefficiencyoftheproposedmethod.


8661-5, Session 2

Periodicity estimation of nearly regular textures based on discrepancy normGernotStübl,PeterHaslinger,VolkmarWieser,SoftwareCompetenceCtr.Hagenberg(Austria);JosefScharinger,JohannesKeplerUniv.Linz(Austria);BernhardMoser,SoftwareCompetenceCtr.Hagenberg(Austria)

Slidingwindowbasedprocessingofimagesisacrucialstepinvariousimageprocessingapplicationsforexampleintemplatematchingbasedmethods.

Thechoiceofanoptimalwindowsizeisnotalwaysstraightforward.Particularly,inthecontextofnearlyregulartexturedimagesthisquestionturnsouttodeservespecialconsideration.Typicallysuchtexturesplayanimportantroleinqualityinspectionoftextilefabrics.

Thispaperproposesanovelapproachtodeterminethetextureperiodicity,thetextureelementsizeandfurthercharacteristicsliketheareaofthebasinofattractioninthecaseofcomputingthesimilarityofatestimagepatchwithareference.

Thepresentedmethodutilizesthepropertiesofanovelmetric,theso-calleddiscrepancynorm.IncontrasttoMinkowskinormsthisnormisbasedontheevaluationofpartialsumsbywhichthediscrepancynormbecomesorderingdependentyieldingahighlyasymmetricunitball.ThismetricdistinguishesbymonotonicityaswellasaLipschitzcontinuitypropertythatallowrobustcomputationatthepresenceofnoiseandvariationsinappearance.

Thegeneralformoftheproposedapproachreliesonthegenerationofdiscrepancynorminducedsimilaritymapsatrandompositionsintheimage.ThroughapplyingstandardimageprocessingoperationslikeWatershedandblobanalysisonthesimilaritymapsarobustestimationofthecharacteristicperiodicitycanbecomputed.Asabyproductofthisimageanalysisonegetsasegmentationwhichspecifiesconvergencepropertiesfortemplatematching.

Eventhoughthediscrepancynormusesonlyadd/maxoperationsandcanbeimplementedwithO(n)withnasnumberofpixels,computingafullsimilaritymapisstillcomputationallyexpensive.Thereforeafasterversionofthegeneralapproachtailoredtoorthogonalalignedtexturesispresented.

DuetotheLipschitzandthemonotonicitypropertythediscrepancynormdistinguishesitselffromothermetricsbywell-formedandstableconvergenceregions.Boththeperiodicityandtheconvergenceregionsarecloselyrelatedandhaveanimmediateimpactontheperformanceofasubsequenttemplatematchingandevaluationstep.

Inanexperimentalsetuptheestimationperformanceistestedonsamplesofstandardizedimagedatabasesandiscomparedwithstate-of-the-artmethods.Resultsshowthattheproposedmethodisapplicabletoawiderangeofnearlyregulartexturesandshowsrobustnesstonoisedisturbedimages.

Summingupitcanbesaidthatthroughtheuseofanovelmetricthepresentedapproachneedsonlybasicimageprocessingtechniquestoestimatethecharacteristicperiodicityfornearregulartexturesandfurthermoregivesimportantconfigurationinformationforsubsequentoptimizationortemplatematchingsteps.Thisisdemonstratedonrealworldexamplesandanimplementationconceptforindustrialusageisgiven.

8661-6, Session 2

Gradient feature matching for in-plane rotation invariant face sketch recognitionAnnThejaAlex,VijayanK.Asari,AlexMathew,Univ.ofDayton(UnitedStates)

Inthisworkweproposeanewmethodforfacesketchrecognitionthatdoesnotrequireanysynthesisstepandallowsrecognitionacrossmodalities.Theproposedmethodusestheedgefeaturesofafacesketchandfacephotoimagetocreateafeaturestringcalled‘edge-string’.Themethodusesedgedetectionalgorithmstoextracttheedgefeatures.Thethresholdsusedforedgedetectionaredetermined

automaticallyusingtheOtsu’salgorithm.Thenextstepistorepresentedgefeaturesinpolarcoordinaterepresentation.Torepresentfeaturesaspolarcoordinates,weneedtodeterminethecentroidandareferenceline.Thereferencelineisdefinedasthelinepassingthrougheyecenters.ThemethodusesCircularHoughtransformtoeffectivelylocalizetheirislocationandtheeyecenterisdeterminedasthecenteroftheiris.Thecenterofthisreferencelineisdefinedasthecentroid.Theedge-stringsofthefacephotoandfacesketcharecomparedusingtheSmith-Watermanalgorithmforlocalstringalignments.Thefacephotothatgavethehighestsimilarityscoreisthephotothatmatchesthetestfacesketchinput.PreliminaryresultsontheCUHKdatabasegivepromisingresultsandprovetheeffectivenessoftheproposedapproach.

8661-7, Session 2

An iris segmentation algorithm based on edge orientation for off-angle iris recognitionMahmutKarakaya,DelBarstow,HectorJ.Santos-Villalobos,ChristopherBoehnen,OakRidgeNationalLab.(UnitedStates)

Irisrecognitionisknownasoneofthemostaccurateandreliablebiometricmeasure.However,theaccuracyofirisrecognitionsystemreliesonawell-controlledsetupandisaffectedbyseveralfactorssuchasoff-angle,occluded,anddilatedimages.Therefore,non-idealirisrecognitionisanewemergingresearchfieldandgotattentionsfrommanyresearchgroups.Inthispaper,wepresentasegmentationalgorithmforoff-angleirisimagesbyusingedgedetection,edgeelimination,edgeclassification,andellipsefittingtechniques.Inourapproach,wefirstdetecttheallpossibleedgesintheirisimagebyusingthecannyedgedetectorwhichmaycontainedgesfromnotonlyirisandpupilbutalsoeyelash,eyelids,iristextureetc.Therefore,weeliminatetheedgesthatcannotbepartoftheirisorpupilbyusingtheedgeorientation.Third,weclassifytheremainingedgepointsintotwosetsaspupiledgesandirisedges.Finally,werandomlygeneratesubsetsofineachset,fitellipsesforsubsetofeachirisandpupiledgepointsandselectellipseswithsimilarparametersandaverageresultantellipses.Basedontheresultsfromrealexperiments,theproposedmethodshowseffectivenessinsegmentationforoff-angleirisimages.

8661-8, Session 3

Dense sampling of shape interiors for improved representationVittalPremachandran,RamakrishnaKakarala,NanyangTechnologicalUniv.(Singapore)

Inthispaper,weidentifysomeofthelimitationsofcurrent-dayshapematchingtechniques.Weprovideexamplesofhowcontour-basedshapematchingtechniquescannotprovideagoodmatchforcertainvisuallysimilarshapes.Toovercomethislimitation,weproposeaperceptuallymotivatedvariantofthewell-knownshapecontextdescriptor.Weidentifythattheinteriorpropertiesoftheshapeplayanimportantroleinobjectrecognitionanddevelopadescriptorthatcapturestheseinteriorproperties.Ourdescriptorcapturesthepropertiesoftheshapeinitsentirety.Weshowthatourmethodcaneasilybeaugmentedwithanyothershapematchingalgorithm.Wealsoshowfromourexperimentsthattheuseofourdescriptorcansignificantlyimprovetheretrievalrates.

8661-9, Session 3

Efficient defect detection with sign information of Walsh Hadamard transformQiangZhang,ArizonaStateUniv.(UnitedStates);PeterJ.L.vanBeek,ChangYuan,CindyXu,Hae-jongSeo,SharpLabs.ofAmerica,Inc.(UnitedStates);BaoxinLi,ArizonaStateUniv.(UnitedStates)

Conference 8661


WeproposeamethodfordefectdetectionbasedontakingthesigninformationofWalshHadamardTransform(WHT)coefficients.Thecorepartoftheproposedalgorithmonlyinvolvesthreesteps,applyingforwardWHT,computingsignoftransformcoefficientsandtakinganinverseWHTonthesigninformation,whichcanallbeimplementedveryefficiently.Ourimplementationtakesonly7millisecondsfora512x512imageonaPCplatform.Asaresult,theproposedmethodismoreefficientthanPHaseOnlyTransform(PHOT)andothermethodsinliterature.Inaddition,theproposedapproachiscapableofdetectingdefectsofvaryingshapes,bycombining2-DimensionalWHTand1-DimensionalWHT;andcandetectdefectsinimageswithstrongobjectsboundariesbyutilizingareferenceimage.Theproposedalgorithmisrobustoverdifferentimagepatternsandvaryingilluminationconditions.Goodresultsoftheproposedmethodsaredemonstratedondefectdetectionapplications.

8661-10, Session 3

Improving the performance of interest point detectors with contrast stretching functionsPrabakarGunashekhar,BahadirGunturk,LouisianaStateUniv.(UnitedStates)

Interestpointdetectionthatisrobustagainstvariationsinillumination,blur,compression,andgeometrictransformationsisanimportantresearchproblemincomputervision.Inarecentpaper,weproposedamethod,calledilluminationrobustfeatureextractiontransform(IRFET),toimprovetheilluminationrobustnessofHarriscornerdetector.Inthispaper,wedemonstratethatIRFETimprovesnotonlytheilluminationrobustnessbutalsotherobustnessagainstblur,compression,andgeometrictransformations.TheIRFETmethodisnotlimitedtoHarriscornerdetector;andtheperformanceboost-upisdemonstratedwithmulti-scaleHarrisandmulti-scaleHessianinterestpointdetectorsinstandarddatasets.

8661-11, Session 3

Object detection using feature-based template matchingSimoneBianco,MarcoBuzzelli,RaimondoSchettini,Univ.degliStudidiMilano-Bicocca(Italy)

Patternmatching,alsoknownastemplatematching,isacomputationallyintensiveproblemaimedtolocalizetheinstancesofagiventemplatewithinaqueryimage.

Inthisworkwepresentafastexhaustivetechniquefortemplatematching,abletousehistogram-basedsimilaritymeasures.InparticularwewillfocusonColorHistograms(CH),HistogramsofOrientedGradients(HOG),andBagofvisualWordshistograms(BOW).ToobtainaBOWdescriptionofanimage,weextractSIFTlocalimagedescriptors,andtheresultingdescriptorsarevectorquantizedusingaK-entrycodebookofvisualwordprototypes.

Inordertointroducespatialinformationamongthesub-partsofthetemplate,wehavealsointegratedthehistogram-basedrepresentationwiththespatialpyramidrepresentation,andotherspatialrepresentationwithinourtemplatematchingapproach.

Theproposedsolutioniscomparedwithstateoftheartapproachesintermsofbothaccuracyandexecutiontimeondifferentstandarddatasets.Adetailedanalysisoftheresultswillbealsoprovidedtoputinevidencethestrengthsoftheproposedsolutionswithrespecttotheobjectscharacteristicsandscenecomplexityandvariability.

8661-12, Session 4

Touch sensing analysis using multi-modal acquisition systemJeffreyS.King,ZachiI.Baharav,CorningIncorporated(UnitedStates)

Touchsensingisquicklybecomingaubiquitousfeatureinconsumerproducts,andinmanypracticalimplementationstheuser’sfingerphysicallyinteractswithaglasslayerthatprotectsthedisplayanddevice.Thislayerhasasignificantimpactonthetouchsensingmechanismandperformance,beitprojectedcapacitivesensingorothermethods.Inthiswork,wepresentasystemforthemeasurementandevaluationofvariousfactorsinthefinger-glassinteraction.Thissystemenablessynchronouscaptureofdifferentmodalitiesoftheinteraction,andpost–processingallowscorrelationamongthem.Wecomparewithavailablemodelsofthefinger-glassinteractions,anddescribebothqualitativeandquantitativevaluableresults.Forexample,high-resolutionvisiblelightimagesoftheinterfaceallowstudyoftheeffectoffingerwettinganditsrelatedtimeconstantontheperformanceofatouchscreen.Wedescribethesystemandthedesignconsiderationsinvolved,analyzetheresults,anddiscussinsightsgainedandimplicationstopracticalsystemdesign.

8661-13, Session 4

OrphanpPoint relocation and 3D super-resolutionSeungkyuLee,SamsungAdvancedInstituteofTechnology(Korea,Republicof)

Recently,consumerdepthcamerashavewidelyappliedtorobotics,computervisionandgraphicsapplications.Furthermoresensorfusionwithcolorcamerahasprovidedmultimodal3-dimensionaldataformanyresearchproblems.3Dpointclouddataprocessingbecomesmoreimportanttocorrectlydealwiththedepthimage,howevermanyconventional2Dimageprocessingmethodhaveappliedondepthimages.Eventhoughitissimpleandeffectiveinsomelimitedconditiontousesuch2Dimageprocessingmethod,mostof3Dapplicationssuchas3Dreconstruction,interactionshowsmanyartifactsafter2Dimageprocessingapplication.

Inthispaper,weproposedtwomain3DimageprocessingmethodsforlowresolutionandlowaccuracyToFdepthimages.Firstwerelocatetheorphanpointsbacktoeitherforegroundorbackgroundratherthaneliminatingthemasmostofpriorworkdo.Basedontheproximityscorecalculatedfromcalibratedcolorcamerasandlocaldepthpointdistance,wedecidethe3Drelocationdirection.Secondly,wedecidesub-pixellevelforegroundandbackgroundboundariesbeforeweperformoursuperresolution.Extensiveonvariouschallenging3Dreconstructionexperimentalresultsateachstepshowthattheproposedmethodshowssignificantimprovementsindealingwith3Dpointclouddata.

8661-14, Session 4

Structural deformation measurement via efficient tensor polynomial calibrated electro-active glass targetsChristophGugg,MatthewJ.Harker,PaulL.O’Leary,MontanUniv.Leoben(Austria)

Thispaperdescribesthephysicalsetupandmathematicalmodellingofadeviceforthemeasurementofstructuraldeformationsoverlargescales,e.g.,aminingshaft.Imageprocessingtechniquesareusedtodeterminethedeformationbymeasuringthepositionofatargetrelativetoareferencelaserbeam.Itisnecessarytocompensateforopticaldistortionifhighaccuracyistobeachievedinacompacthardwaredesignwherelenseswithshortfocallengthsareused.Thispaperpresentsanewcoordinatemappingprocedurebasedonatensorproductofdiscreteorthogonalpolynomials.Bothlensdistortionandtheprojectionarecompensatedbyasinglelineartransformation.Wehavedevelopedanewhierarchicalapproachbasedonaquad-treesubdivisiontoincreasethemeasurement’sperformance.Cross-validationtestsverifythevalidity,demonstratingthattheproposedmethodaccuratelymodelsboththeopticaldistortionaswellastheprojection.Theachievableaccuracyise<+/-0.01[mm]inafieldofviewof150[mm]x150[mm]atadistanceofthelasersourceof120[m].

Conference 8661


8661-15, Session 4

Machine vision system for the control of tunnel boring machinesMichaelHabacher,MatthewJ.Harker,PaulL.O’Leary,MontanUniv.Leoben(Austria)

ThispaperpresentsamachinevisionsystemofthecontrolofDual-ShieldTunnelBoringMachines(TBM).Arobustsystemhasbeenimplementedbothfromamechanicalasalsofromacomputingpointofview.Thisisnecessarytoguaranteereliablefunctionalityintheextremelyharshenvironmentinvolvedintunneling.Aredundantconfigurationwithfourretro-reflectortargetshasbeenchosen.Afullmeasurementcanstillbepreformedwiththelossofonetarget,andareducedmeasurement(aEuclideantransformation)canbeachievedevenwiththelossoftwotargets.Acontouringalgorithmdetectstheedgesofthetargets,eachcontouristhensegmentedintoNoverlappingsub-segments,towhichNcirclesarefit.Thecirclesfitsareclassifiedwiththeaimofidentifyingpointsonthecontourwhichbelongtotheexpectedcircle.Thisdeliversarobustcircleidentificationalgorithm.AnewSVDbasedimplementationofacovarianceweightedcirclefittingalgorithmispresented,thenewmethodisefficientandexhibitslowerfittingbiasthanpreviousmethodswhenfittingcirculararcs.Duringtheservicephasetheexactpositionofthemachineismeasuredbeforeandaftermovingthegrippershield.Inthismannertwofullsetsofdataareavailableforthepositionsofthetargetsandtheseen-positionsofthetargets.Thisenablesacomplete3Dcameracalibration.Thesystemwastestedunderlaboratoryconditionsusinganautomaticpositioningtable.Aresolutionof0.3mmwasachievedforextensionsintherangeof2500[mm]to4200[mm].

8661-16, Session 5

Eliminating illumination effects by discrete cosine transform (DCT) coefficients’ attenuation and accentuationShanDu,MohamedShehata,WaelBadawy,ChoudhuryARahman,IntelliviewTechnologiesInc.(Canada)

Illuminationvariationisoneofthemostsignificantfactorsaffectingtheperformanceoffacerecognitionandhasreceivedmuchattentioninrecentyears.Inthispaper,afaceimagepre-processingapproachwasproposedthatdealswiththeilluminationproblemtomakefacerecognitionrobusttoilluminationvariations.Thismethodeliminatestheilluminationeffectsandextractsilluminationinsensitivefeaturesatthesametimebyattenuatinglow-frequencyDCTcoefficientsandaccentuatinghigh-frequencyDCTcoefficientsinlogarithmdomain.Thisisbecauseilluminationvariationsmainlyconcentrateonthelow-frequencybandandunderpoorilluminations,thehigh-frequencyfeaturesbecomemoreimportantinrecognition.Toevaluatetheeffectivenessoftheproposedmethod,faceimageswithlargeilluminationvariationsareused.YaleB,ExtendedYaleBandCMU-PIEdatabasesareselectedforevaluation.Thismethoddoesnotrequiremodelingandmodelfittingsteps.Itcanbedirectlyappliedtosinglefaceimage,withoutanypriorinformationof3Dshapeorlightsources.Byexperiments,wecanseethattheproposedmethodoutperformsotherexistingmethods.

8661-17, Session 5

Non-rigid ultrasound image registration using generalized relaxation labeling processJong-HaLee,YeongKyeongSeong,MoonHoPark,Kyoung-GuWoo,SamsungAdvancedInstituteofTechnology(Korea,Republicof);JeonghunKu,Hee-JunPark,KeimyungUniversity(Korea,Republicof)

Thisresearchproposesanovelnon-rigidregistrationmethodforultrasoundimages.Themostpredominantanatomicalfeaturesin

medicalimagesaretissueboundaries,whichappearasedges.Inultrasoundimages,however,otherfeaturescanbeidentifiedaswellduetothespecularreflectionsthatappearasbrightlinessuperimposedontheidealedgelocation.Inthiswork,animage’slocalphaseinformation(viathefrequencydomain)isusedtofindtheidealedgelocation.Thegeneralizedrelaxationlabelingprocessisthenformulatedtoalignthefeaturepointsextractedfromtheidealedgelocation.Inthiswork,theoriginalrelaxationlabelingmethodwasgeneralizedbytaking“n”compatibilitycoefficientvaluestoimprovenon-rigidregistrationperformance.Thiscontextualinformationcombinedwitharelaxationlabelingprocessisusedtosearchforacorrespondence.Thenthetransformationiscalculatedbythethinplatespline(TPS)model.Thesetwoprocessesareiterateduntiltheoptimalcorrespondenceandtransformationarefound.Wehavetestedourproposedmethodandthestate-of-the-artalgorithmswithsyntheticdataandbladderultrasoundimagesofinvivohumansubjects.Experimentsshowthattheproposedmethodimprovesregistrationperformancesignificantly,ascomparedtootherstate-of-the-artnon-rigidregistrationalgorithms.

8661-18, Session 5

Mammogram CAD, hybrid registration, and iconic analysisArnaudBoucher,FlorenceCloppet,NicoleVincent,Univ.ParisDescartes(France)

Thispaperaimstodevelopacomputer-aideddiagnosis(CAD)comprisingtwosteps:amethodologytoregisterandanalyzepairsoftemporalmammograms.Theconceptof“file”,includingallthepastinformationonapatient,enablesjointanalysisofdifferentacquisitionstakenatdifferenttimes,andthedetectionofsignificantmodifications.Theregistrationdevelopedmethodisdesignedtosuperimposeatbestthedifferentbreastanatomicalstructures.Theregistrationisadaptedinordertomakedisappeardeformationundergonebytheacquisitionprocess.Inordertoreachthisgoal,areferentimageiscomputedfromcontrolpointsbasedonanatomicalfeaturesthatareextractedautomatically.Thenthesecondimageofthecoupleisrealignedonthereferentimage,usingacoarse-to-fineapproachbasedontwo1Dlineartransforms,thatallowsthedefinitionofacomplexdeformationfield,whichsimulatesmammographiclikedeformations.Thejointanalysisdetectstheevolutionbetweentwoimagesrepresentingthesamescene.Inordertoachievethis,itisimportanttotakeintoaccounttheregistrationerrorlimitstoadapttheobservationscale.Theapproachusedinthispaperisbasedonanimagesparserepresentation.Decomposedinregularpatterns,theimagesareanalyzedunderanewangle.TheCADisevaluatedusingrecallandprecisionofdifferencesinmammograms.

8661-19, Session 6

Neutron imaging for geothermal energy systemsPhilipR.Bingham,LarryAnovitz,YaromPolsky,OakRidgeNationalLab.(UnitedStates);LucDessieux,TheUniv.ofTennessee(UnitedStates);HassinaBilheux,OakRidgeNationalLab.(UnitedStates)

Geothermalenergysystemstransferheatfromtheearththroughwaterorotherfluidsandconverttheheattoenergythroughageneratorsystem.Thesesystemsaresuccessfullybeingusedforenergyproductioninregionswithnaturalhydrothermalcirculation.Inanefforttoincreasepotentialenergyproductionforhydrothermal,aneffortisunderwaytodevelopenhancedgeothermalenergysystems(EGS)thatwouldallowenergyproductionfromhotdryrockareas.DevelopmentofEGScapabilitiesrequirescharacterizationoffluidflowinfracturedrockatthehighpressuresandtemperaturesexperiencedinhotdryrockregions.DuetothehighcrosssectionofHydrogenforneutrons,neutronimagingprovideshighcontrastbetweenwaterandrock.Theauthorshaveperformedinitialexperimentstoquantifyfluidflowthroughrockcoresandhavedevelopedanenvironmentalchamberforneutronimagingofcoreswithfluidflowatpressuresexperience

Conference 8661


innature.Thispaperwillpresenttheimagingsystem,environmentalchamber,neutronimagingdata,andquantiativeresultscalculatedfromneutronradiographs.

8661-20, Session 6

Wave front distortion based fluid flow imagingEmishawD.Iffa,WolfgangHeidrich,TheUniv.ofBritishColumbia(Canada)

Inthispaper,atransparentflowsurfacereconstructionbasedonwavefrontdistortionisinvestigated.Acameralensisusedtofocustheimageformedbythemicro-lensarraytothecameraimagingplane.Theirradianceofthecapturedimageistransformedtofrequencyspectrumandthenthexandyspatialcomponentsareseparated.Arigidspatialtranslationfollowedbylowpassfilteringyieldsasinglefrequencycomponentoftheimageintensity.IndexofrefractionisestimatedfromtheinverseFouriertransformofthespatialfrequencyspectrumoftheirradiance.Theproposedmethodisevaluatedwithsyntheticdataofarandomlygeneratedindexofrefractionvalueandusedtovisualizeafuelinjectionvolumetricdata.

8661-21, Session 6

Autonomous ship classification using synthetic and real color imagesDenizKumlu,B.KeithJenkins,TheUniv.ofSouthernCalifornia(UnitedStates)

Autonomousshiprecognitionisanactiveareaformilitaryandcommercialapplications.Thisworkclassifiescolorimagesofshipsattainedusingcamerasmountedonshipsandinharbors.Ourdata-setscontain9differenttypesofshipwith18differentperspectivesforourtrainingset,developmentsetandtestingset.Thetrainingdata-setcontainsmodeledsyntheticimages;developmentandtestingdata-setscontainrealimages.Thedatabaseofrealimageswasgatheredfromtheinternet,and3DmodelsforsyntheticimageswereimportedfromGoogle3DWarehouse.Akeygoalinthisworkistousesyntheticimagestoincreaseoverallclassificationaccuracy.Wepresentanovelapproachforautonomoussegmentationandfeatureextractionforthisproblem.Supportvectormachineisusedformulticlassclassification.Thisworkreportsthreeexperimentalresultsformulticlassshipclassificationproblem.Firstexperimenttrainsonasyntheticimagedata-setandtestonarealimagedata-set,andobtainedaccuracyis87.8%.Secondexperimenttrainsonarealimagedata-setandtestsonaseparaterealimagedata-set,andobtainedaccuracyis87.8%.Lastexperimenttrainsonreal+syntheticimagedata-sets(combineddata-set)andtestsonaseparaterealimagedata-set,andobtainedaccuracyis93.3%.

8661-22, Session 6

Fast and flexible 3D object recognition solutions for machine vision applicationsIraEffenberger,JensU.Kühnle,Fraunhofer-InstitutfürProduktionstechnikundAutomatisierung(Germany);AlexanderVerl,Fraunhofer-IPA(Germany)

Inautomationandhandlingengineering,thefeedingofworkpiecesbetweenthedifferentstagesoftheproductionprocesschainisofspecialinterest.Oftenthepartsarestoredunorderedinbinsorlatticeboxesandhencehavetobeseparatedandorderedforfeedingpurposes.Thiscanbedonewithmechanicsspecificallyadaptedtothegeometricpropertiesoftheparts.Aspace-savingalternativeisusingarobottogriptheworkpiecesoutofabinorfromabelt.Forthiskindofapplicationsreliableandpreciseobjectrecognitionandlocalizationsystemsareneeded.Inthispaperwepresentfastandflexible3Dobjectrecognitionalgorithmsforhandlingapplications.Basedon3Dbest-fitalgorithmsforgeometricprimitivesandpre-

processingmethodsforsegmentationofthe3Dmeasuredataafastobjectlocalizationhasbeendeveloppedwhichcanbeeasilyadaptedtomodifiedorevennewparts.Theobjectrecognitionhasbeentestedondifferentrealindustrialpartsandtheresultswillbeshownandsummarized.

8661-23, Session 7

Low complexity smile detection technique for mobile devicesValeriaTomaselli,MirkoGuarnera,ClaudioMarchisio,SimoneMoro,STMicroelectronics(Italy)

Alowcomplexitysmiledetectiontechnique,abletodetectsmilesinavarietyoflightconditions,facepositionsandimageresolutionsisproposed.

Theapproachfirstlyrunsfacedetectionand,consequently,mouthdetection.Themouthdetectionisasimpleapplicationofgeometricalproportionsbetweenfaceandmouth,whichhavebeenstatisticallyestimated.Themouthsarethendescribedbyacombinationoftwodescriptors.ThefirstoneisthePyramidHistogramofOrientationGradients(PHOG),calculatedonalimitednumberoforientationbins.Theedgeorientationsarecomputedateachpixelofthemouthregionandnotonedgesonlytobeinvarianttoilluminationchanges.TheseconddescriptoristhePyramidLocalBinaryPattern(PLBP).PLBPistheconcatenationofLocalBinaryPatternhistogramsatdifferentpyramidlevels,anditisapowerfulmeanoftexturedescription.

Forclassificationweuseaminimumdistanceclassifierfromthecentroidsofthesmilingandnon-smilingclasses.Inparticular,Mahalanobisdistancehasbeenchosen,becauseittakesintoaccountthecorrelationsbetweenfeatures.Thistechniqueproducesthe94%ofareaundertheROC(AUROC)curve.Theproposedapproachobtainsbetterresultsthanothermorecomplextechniques,intermsofhitrateandAUROC.

8661-24, Session 7

Density-induced oversampling for highly imbalanced datasetsDanielFecker,VolkerMärgner,TimFingscheidt,TechnischeUniv.Braunschweig(Germany)

Theproblemofhighlyimbalanceddatasetswithonlysparsedataoftheminorityclassinthecontextoftwo-classclassificationisinvestigated.Theexaminedapplicationinthispaperisopticalon-lineprocessmonitoringoflaserbrazingwithonlyraresporadicoccurringdefects.

Anovelsyntheticdataoversamplingtechniqueisproposedwhichutilizesestimationsoftheprobabilitydensitydistributioninthefeaturespace.First,aGaussianmixturemodel(GMM)fromthedataofthewell-sampledmajorityclassisgeneratedandwithitshelpanewGMMisapproximatedwiththesparseminorityclassdatabyBayesianadaptation.RandomsyntheticdataisgeneratedfromtheadaptedGMMandanadditionalassignmentruleassignsthisdatatoeithertheminorityclassorelsediscardsit.Theobtainedsyntheticdataisemployedincombinationwiththeavailableoriginaldatatotrainasupportvectormachineclassifier.

Experimentswithdifferentamountsofminorityclassdatasamplesandcomparisonstoothermethodsshowthatthisapproachperformsverywellforhighlyimbalanceddatasets.

8661-25, Session 7

Coherent image layout using an adaptive visual vocabularyScottE.Dillard,Google(UnitedStates);MichaelJ.Henry,ShawnJoelBohn,LukeJ.Gosink,PacificNorthwestNationalLab.(UnitedStates)

Conference 8661


Whenqueryingahugeimagedatabasecontainingmillionsofimages,theresultofthequerymaystillcontainmanythousandsofimagesthatneedtobepresentedtotheuser.Weconsidertheproblemofarrangingsuchalargesetofimagesintoavisuallycoherentlayout,onethatplacessimilarimagesnexttoeachother.Imagesimilarityisdeterminedusingabag-of-featuresmodel,andthelayoutisconstructedfromahierarchicalclusteringoftheimagesetbymappinganin-ordertraversalofthehierarchytreeintoaspace-fillingcurve.Thislayoutmethodprovidesstronglocalityguaranteessoweareabletoquantitativelyevaluateperformanceusingstandardimageretrievalbenchmarks.Performanceofthebag-of-featuresmethodisbestwhenthevocabularyislearnedontheimagesetbeingclustered.Becauselearningalarge,discriminativevocabularyisacomputationallydemandingtask,wepresentanovelmethodforefficientlyadaptingagenericvisualvocabularytoaparticulardataset.Weevaluateourclusteringandvocabularyadaptationmethodsonavarietyofimagedatasetsandshowthatadaptingagenericvocabularytoaparticularsetofimagesimprovesperformanceonbothhierarchicalclusteringandimageretrievaltasks.

8661-26, Session 7

Shape recognition for capacitive touch displayIvanaGuarneri,AlessandroCapra,STMicroelectronics(Italy);GiovanniM.Farinella,SebastianoBattiato,Univ.degliStudidiCatania(Italy)

Withthegrowingdiffusionoftouchscreenbasedconsumerdevicesthedevelopmentofalgorithmsabletodiscriminateamongthedifferentshapesobtainedbytouchingthedevicedisplaybecomesveryimportant.Forinstancethedetectionandrecognitionofthefingersrepresentsfundamentalinformationinmanytouchbaseduserapplications.Thesealgorithmsarealsoextremelyusefultorecognizeaccidentaltouchesinordertoavoidinvoluntarytouchfunctionalities.

Takingintoaccounttheaforementionedmotivations,inthispaperwepresentatechniqueabletoclassifythefivemostcommonclassesofshapesacquiredwithacapacitivetouchdisplay:finger,ear,cheek,halfearhalfcheekandhandhold.Themainstepsoftheproposedalgorithmaresummarizedinthefollowing.Firstlyanoiseremovalfilterisappliedtoeachacquiredcapacitanceimagetoeliminatenoisyspikes.Secondly,thefeaturesusedforshapeclassificationareextracted.Finally,throughabaseddecisionaltree(C4.5)thefinalclassificationisperformed.

Thetechniquehasbeenproperlytestedonadatasetof1500capacitivemapsforeachofthefiveclasses.Thedatasethasbeenpopulatedbyinvolvingbothfemaleandmaleindividuals,presentingdifferentanatomiesandtherecognitionaccuracyachievedbytheproposedtechniqueis91.12%.


An elliptic phase-shift algorithm for high speed three-dimensional profilometryFuqinDeng,Univ.ofHongKong(HongKong,China);ZhaoLi,IstitutoItalianodiTecnologia(Italy);JiaChen,HarbinInstituteofTechnology(China);JiangwenDeng,ShunmingK.Fung,ASMPacificTechnologyLtd.(China);EdmundY.Lam,TheUniv.ofHongKong(HongKong,China)

Ahighthroughputisoftenrequiredinmanymachinevisionsystemsespeciallyontheassemblylineinthesemiconductorindustry.Todevelopanon-contactthree-dimensionaldensesurfacereconstructionsystemforreal-timesurfaceinspectionandmetrologyapplications,inthiswork,weprojectsinusoidalpatternsontotheinspectedobjectsandproposeahighspeedphase-shiftingalgorithmfordensesurfaceprofilometry.First,weuseanillumination-reflectivity-focus(IRF)modeltocharacterizethefactorsonformingtheintensitiesinphase-measuringprofilometry.Second,byvisualizingandanalyzingthecharacteristicintensitylocusprojectedontotheintensityspace,webuildatwo-dimensionalphasemaptostorethephaseinformationfor

eachpointintheintensityspace.Third,wedevelopanefficientellipticphase-shiftingalgorithm(E-PSA)forhigh-speedsurfaceprofilometry.Inthisnovelmethod,insteadofcalculatingthetime-consuminginversetrigonometricfunctions,weonlyneedtonormalizethemeasuredintensitydataandindexthebuilttwo-dimensionalphasemapbeforehandduringreal-timephasereconstruction.Finally,experimentalresultsshowthatitismorethantwotimesfasterthanconventionalphase-shiftingalgorithm.


An incompressible fluid flow model with mutual information for MR image registrationLeoTsai,Herng-HuaChang,NationalTaiwanUniv.(Taiwan)

Imageregistrationisoneofthefundamentalandessentialtaskswithinimageprocessing.Itisaprocessofdeterminingthecorrespondencebetweenstructuresintwoimages,whicharecalledthetemplateandreferenceimages.Thechallengeofregistrationistofindanoptimalgeometrictransformationbetweencorrespondingimagedata.ThispaperdevelopsanewMRimageregistrationalgorithmthatusesaclosedincompressibleviscousfluidmodelassociatedwithmutualinformation.Inourapproach,wetreattheimagepixelsasthefluidelementsofaviscousfluidflowgovernedbythenonlinearNavier-Stokespartialdifferentialequation(PDE).Wereplacethepressuretermwiththebodyforcemainlyusedtoguidethetransformationwithaweightingcoefficient,whichisexpressedbythemutualinformationbetweenthetemplateandreferenceimages.TosolvethismodifiedNavier-StokesPDE,weadoptedthefastnumericaltechniquesproposedbySeibold[MIT,2008].Theregistrationprocessofupdatingthebodyforce,thevelocityanddeformationfieldsisrepeateduntilthemutualinformationreachesaprescribedthreshold.WeappliedourapproachtoBrainWebandrealMRimages.Asconsistentwiththetheoryoftheproposedfluidmodel,wefoundthatourmethodaccuratelytransformedthetemplateimagesintothereferenceimagesbasedontheintensityflow.Experimentalresultsindicatethatourmethodisofpotentialinawidevarietyofmedicalimageregistrationapplications.


Improved skin detection method by iteratively eliminating pseudo-skin colors through combined skin filterOhyeolKwon,KyungpookNationalUniv.(Korea,Republicof);Kyung-AhKim,KyungpookNationalUniversity(Korea,Republicof);Sung-IlChien,KyungpookNationalUniv.(Korea,Republicof)

Inordertodetectmorereliablytheskincolorfrompseudo-skincolorswhicharesimilartotheskincolor,weproposeanimprovedskindetectionmethod,whichcandetectbyiterativelyeliminatingpseudo-skincolorsthroughacombinationoftheskincolordetectionfilters.First,weclassifyimagesintothenormalandtheabnormalskinimagesdependingonwhetherthepseudo-skincriterionexceedsathresholdvalueornot.Incaseofnormalskinimages,wedetermineafinalskincolorregionbymergingtworegionsdetectedbythetwoskinfilters.Asfortheabnormalskinimages,wedetecttheskincolorbyusingtwocombinedfilters.Then,thedetectedskincolorpixelsfromeachcombinedfilteraretranslatedintoCheddad’sfeaturerespectivelyandwecancreateahistogramofeachdetectedskinregion.Theskincolorisdetectedbyeliminatingthepseudo-skincolorsthroughOtsu’shistogrambinarization.Finally,wedetermineafinalskincolorregionbymergingtwoskinregionsdetectedbyapplyingthehistogrambinarization.Experimentalresultsshowthatourproposedmethodcandetecttheskincolorfromthenormalandtheabnormalskinimagesmoreaccuratelythanthatoftheconventionalmethod.

Conference 8661



A modified hierarchical graph cut based video segmentation approach for high frame rate video XuezhangHua,NanjingUniversityofPostandTelecommunications(China);SumitChakravarty,NewYorkInstituteofTechnology(UnitedStates);QiShe,BoyuWang,NanjingUniv.ofPostsandTelecommunications(China)

Videoobjectsegmentationentailsselectingandextractingobjectsofinterestfromavideosequence.VideoSegmentationofObjects(VSO)isacriticaltaskwhichhasmanyapplications,suchasvideoedit,videodecompositionandobjectrecognition.ThecoreofVSOsystemconsistsoftwomajorproblemsofcomputervision,namelyobjectsegmentationandobjecttracking.Thesetwodifficultiesneedtobesolvedintandeminanefficientmannertohandlevariationsinshapedeformation,appearancealterationandbackgroundclutter.Alongwithsegmentationefficiencycomputationalexpenseisalsoacriticalparameterforalgorithmdevelopment.Mostexistingmethodsutilizeadvancedtrackingalgorithmssuchasmeanshiftandparticlefilter,appliedtogetherwithobjectsegmentationschemeslikeLevelsetsorgraphmethods.Asvideoisaspatiotemporaldata,itgivesanextensiveopportunitytofocusontheregionsofhighspatiotemporalvariation.Weproposeanewalgorithmtoconcentrateonthehighvariationsofthevideodataandusemodifiedhierarchicalprocessingtocapturethespatiotemporalvariation.


Power and performance tradeoffs of GPGPU vs. multicore: a case study for stereo matching SaralaArunagiri,JaimeJaloma,RicardoPortillo,PatriciaJ.Teller,TheUniv.ofTexasatElPaso(UnitedStates)

GPGPUsandmulticoreprocessorshavebecomecommonplacewiththeirwideusageintraditionalhigh-performancecomputingsystemsaswellasmobile-computingdevices.Asignificantspeedupcanbeachievedforavarietyofgeneral-purposeapplicationsbyemployingtheparallelismmadeavailablebythesetechnologies.Unfortunately,thisspeedupisoftenaccompaniedbyhighpowerand/orenergyconsumption.Thus,energyconservationisincreasinglybecomingamajorconcernindesigningthesecomputingdevices.Themaindriverforenergyefficiencyincomputersystemsisthecostofpowerandcooling,aswellastheenvironmentalimpactofubiquitousPCsandmassivedatacenters.Ontheotherhand,formobiledevicesenergyconservationisdrivenbytheneedtoextendbatterylifeandpowercappingismandatedbytherestrictivepowerbudgetofmobileplatformssuchasunmannedaerialvehicles(UAVs).OurfocusistounderstandtheperformancetradeoffsinvolvedinexecutingArmyapplicationsonmobilecomputingplatforms.InthispaperweexplorethetradeoffsinvolvedinGPGPUvs.multicorecomputingintermsofexecutiontime,outputquality,andpowerandenergyconsumptionforstereomatching,whichisaheavily-investigatedcompute-intensivecandidateArmyapplicationformobilecomputingplatforms.


An efficient algorithm for food quality control based on multispectral signaturesJuanCarlosValdiviezo-Navarro,CarinaToxqui-Quitl,JoséFranciscoSolís-Villareal,AlfonsoPadilla-Vivanco,Univ.PolitécnicadeTulancingo(Mexico)

Multispectralimaginghasmotivatednewapplicationsrelatedtofoodqualitymonitoringduetoitscapabilityofanalysisbasedonspectralsignatures.Thismanuscriptintroducesafastandefficientalgorithmthatisusedincombinationwithamultispectralsystemfortheunsuperviseddeterminationoffoodquality.Ourspectralclassification

algorithmisbasedonthemin-Wandthemax-Mlatticeassociativememoriestodeterminethecentersofcorrespondingclusters.Giventwoclassesoffood,previouslyidentifiedashighandlowqualityproducts,wefirstregisteramultispectralimagefromthemandperformadimensionalityreductionbytakingintoaccountthosespectralbandsthatinvolvetheirmostsignificantabsorptionandreflectionspectra.Fromthereducedsetofimages,themin-Wandthemax-Mlatticeassociativememoriesarecomputedandasubsetoftheircolumnvectorsareusedasthemostrepresentativespectralsignaturesforeachclass,whichwillbeusedforlaterclassification.Theclassificationprocesswillbeaccomplishedbymeansofthespectralanglecomputedbetweeneachrepresentativespectraandanyotherpixelintheimage,andwhoseresultsallowustoclassifyregionsintheimagewithsimilarspectralresponses.Thetechniquehasbeenappliedfortheunsupervisedclassificationoflowandhighqualitycoffeegrains.Theachievedresultsstatethatourprocedureisefficientforfoodqualitycontrolandinspection.


Bottle inspector based on machine visionCarinaToxqui-Quitl,JuanA.Cardenas-Franco,AlfonsoPadilla-Vivanco,JuanCarlosValdiviezo-Navarro,Univ.PolitécnicadeTulancingo(Mexico)

APETbottleinspectorforqualitycontrolbasedonmachinevisionispresented.Theopticalsetupforimagingrequiresfivecameras,anilluminationsystemandasoftwareinterface.Toachieveacompleteinspectionofthebottle,itiscarriedoutinthreemodules.Withtheabove,itinspectsthefinish,wallandbaseofthePETbottleforthedetectionofdefectssuchastears,cracks,orstains.Inordertocapturetheimageofthefinish,thesystemutilizesdirectlightilluminationwithpolarizerstoavoidreflexionsandtwocamerasfromdifferentanglesforawholescanning.Atransmissionilluminationisusedforimagingthebaseandwallofthebottle.Theinspectionofthewallandfinishofthebottlerequiresaspecificalgorithmtosearchcracksandtears.Whiletheinspectionofthebasefinishdetectsdirtparticlesandevaluatethesizeofthebase.Therefore,differentalgorithmsarerequired.Thealgorithmforthefinishandwallinspectiondetectschecksandcracks.Themethodisbasedonthediscretewavelettransform(DWT).Amultiresolutionanalysisallowsdetectedgesandfinedetails.Andthesmalldefectsconnectedwithedgesusuallyneedhigherresolutiontodisplay.Also,theDWTfilteringusingalowpassfilterandahighpassfilter,thisallowsremovebackgroundinformation.Ontheotherhand,thealgorithmforthebaseinspectionthatdetectsdirtparticlesorstickyobjectsisbasedonapixelorregionvariation.Thesizeofthebottlebaseiscalculatedusinggeometricmoments.

Fromthis,threeproposeddefectmapsareusingtotrainaneuronalnetworkforclassificationofbottles.ThismethodclassifiesPETbottlesintotwoclasses,withandwithoutfault.

Experimentalresultsshowthatthebottleinspectorachievesacorrectclassificationrateof78%,90%and98%whenthesysteminspectsfinish,wallandbaserespectively.Theproposedmethodincreasesconsiderablythespeedofdetection.


Static sign language recognition using neural networksJoséFranciscoSolís-Villareal,OscarMorales-Álvarez,CarinaToxqui-Quitl,JuanCarlosValdiviezo-Navarro,Univ.PolitécnicadeTulancingo(Mexico)

Insignlanguagerecognitionprocess,segmentationisoneofthemostchallengingissues,manyworksbasedoncomputervisionsystem,usespecialclothes,tintedglovesoruniformbackgroundforleadingtothismatter.Forthisworkaninfraredsensorwasusedfordiscardthebackgroundjustasinpreviousreport[Solís,etal.,2012],whereitwasusebinaryimagestoextractone-dimensionaldescriptors,however,torecognizetheentirealphabetisnecessarytoextractinformationofthevisiblespectrum,soatwocamerasconfigurationwaschosenforthiswork,oneforsegmentingandtheotherforsigndataextraction.A

Conference 8661


databasewascreatedintheLaboratoryofOpticsandVisionSystemsoftheUniversity,23staticsignswerepicked-upfromAmericanSignLanguage,othersignswerediscardedbecausetheyareexpressedwithmovement,differentversionsofsamepersonsignswerestored.ThepurposeofthisworkistoachievestaticAmericanSignLanguageRecognitionwithacomputervisionbasedsystem,withouttheuseoftintedgloves,specialclothingoraparticularbackgroundcolor;themaindisadvantageistoavoidpassingthehandinfrontoftheface.Theimagepreprocessingofthedatabasebeginswithscaleandtranslationtransformstogetthehandintensitydatainvisiblespectrumsegmentedbyaninfraredsensor,thisallowsthesystemtocomputefiltersinspatialandfrequencydomains,atthispoint,wavelettransformwasselectedtoimproveaccuratedatalengthreductioninordertoprocesslessinformationinanartificialneuralnetwork,themultilayerperceptronwithsigmoidalneuronsachieveagoodclassificationperformancewithk-foldcross-validation.


Defect inspection technology for a gloss-coated surface using patterned illuminationTsuyoshiNagato,TakashiFuse,TetsuoKoezuka,FujitsuLabs.,Ltd.(Japan)

Inthispaper,wediscussthedevelopmentofaninspectionsystemforagloss-coatedsurfaceusingpatternedillumination.SinceanirregulardefectonaUV-curablecoatingsurfaceistransparent,itisdifficulttoobserveitintypicalillumination.Thus,byconsideringthenatureofthespecularreflectiononthegloss-coatedsurface,wedevelopedtheinspectionsystemwhichischaracterizedbypatternedilluminationandphase-shiftingmethod.Thissystemtakesmultiplespecularreflectionimagesofthestripepatterngeneratedbypatternedillumination,andcalculatesthephaseandmodulationofbrightnessperpixelaccordingtophase-shiftingmethod.Thephaseandmodulationimagescanberegardedasthedistributionofspecularreflectionangleandintensity,respectively.Oursystemcanobtainthedistributionofspecularreflectionangleandintensity.Hence,itcanobserveshapesofthetransparentirregulardefectandspotssuchasfingerprintsonthegloss-coatingsurface.Thissystemcanallowtheinspectionofthegloss-coatedsurfacewhichwasdifficultinaconventionalmethod.Additionally,itcanbealsoappliedtoamirrorsurfaceandatransparentobject.


HOG and CS-LBP based detector for surveillance of birdsMihreteabA.Kidane,ThangQ.Nguyen,NagaokaUniv.ofTechnology(Japan)

Thegatheringoflargepopulationofdetrimentalbirdssuchascrowscausesnoiseandenvironmentalpollutiontothevicinityareas.Hence,arobustimageprocessingtechniquecapableofdetectingandlocalizingbirdsinimagesplaysanimportantroleinmanycomputervisionapplications.Inthispaper,afeaturesetdetectorbasedonHistogramofOrientedGradients(HOG)andCenter-SymmetricLocalBinaryPattern(CS-LBP)forcrowbirdsisproposed.HOGisrobustincapturinglocalshapeinformationoredgedirectionsandCS-LBPdescriptorcapturesgradientbasedinformationandtextureinformation,thus,thecombinationofthesetwofeaturesiscapableofcapturingbetterbirdfeatures.BycombiningHOGandCS-LBPasafeatureset,detectionofcrowsunderdifferentlightingconditionscouldbecarriedout.AdatasetofcrowbirdswithawiderangeofposesandbackgroundswaspreparedandlearnedusingSupportVectorMachine(SVM).ExperimentalresultsonvarioustestimagesunderdifferentlightingconditionsshowthatHOGandCS-LBPbaseddetectorachievedadetectionresultof91%at0.02FPPIandonaverageadetectionresultof87%at0.01to0.04FPPI.ThedetectionperformanceachievedusingourHOGandCS-LBPbaseddetectoroutperformssomeofthestateoftheartalgorithms.


A semi-automatic annotation tool for cooking videoGianluigiCiocca,RaimondoSchettini,SimoneBianco,PaoloNapoletano,Univ.degliStudidiMilano-Bicocca(Italy);RobertoMargherita,GianlucaMarini,GiorgioGianforme,GiuseppePantaleo,AlmavivAS.p.A.(Italy)

Inordertocreateacookingassistantapplicationtoguidetheusersinthepreparationofthedishesrelevanttotheirprofiledietsandfoodpreferences,itisnecessarytoaccuratelyannotatethevideorecipes,identifyingandtrackingthefoodsandtheactionsofthecook.Thesevideospresentparticularannotationchallengessuchasfrequentocclusions,foodappearancechanges,andhighvariabilityoffine-grainedactions.

Manuallyannotatethevideosisatime-consuming,tediousanderror-pronetask.Fullyautomatictoolsthatintegratecomputervisionalgorithmstoextractandidentifytheelementsofinterestarenoterrorfree,andfalsepositiveandfalsenegativedetectionsneedtobecorrectedinapost-processingstage.

Wepresentaninteractive,semi-automatictoolfortheannotationofcookingvideosthatintegratescomputervisiontechniquesunderthesupervisionoftheuser.Theannotationaccuracyisincreasedwithrespecttocompletelyautomatictoolsandthehumaneffortisreducedwithrespecttocompletelymanualones.

Theperformanceandusabilityoftheproposedtoolarecomparedwithotherannotationtoolsonthebasisofthetimeandeffortrequiredtoannotatethesamevideosequences.Theaccuracyofthecomputervisionmodulesarealsoreportedanddiscussed.


Intensity and color descriptors for texture classificationClaudioCusano,Univ.degliStudidiMilanoBicocca(Italy);PaoloNapoletano,RaimondoSchettini,Univ.degliStudidiMilano-Bicocca(Italy)

Inthispaperwepresentanoveltexturedescriptorforcolortextureanalysisspeciallydesignedtodealwithchangesinillumination.Thedescriptor,thatwecalledIntensityColorContrastDescriptor(ICCD),isbasedonacombinationoftheLBPapproachwithameasureofcolorcontrastdefinedastheanglebetweentwocolorvectorsinanorthonormalcolorspace.TheICCDdescriptorisinvarianttoseveraltransformationsofthecolorspace.Itsrobustnesswithrespecttoglobalchangesinlightingconditionshasbeenexperimentallydemonstratedbycomparingitonstandarddatasetsagainstseveralothercolortexturedescriptorsinthestateoftheart.

Conference 8661


Conference 8662: Intelligent Robots and Computer Vision XXX: Algorithms and TechniquesMonday-Wednesday4–6February2013

Part of Proceedings of SPIE Vol. 8662 Intelligent Robots and Computer Vision XXX: Algorithms and Techniques

8662-1, Session 1

Control issues and recent solutions for voltage controlled piezoelectric elements utilizing artificial neural networks (Invited Paper)MarkoKauppinen,JuhaRöning,Univ.ofOulu(Finland)

Performingactuationinnanomanipulationatthenecessaryaccuracyislargelypossiblethankstothemanynewpiezoelectricactuationsystems.However,therearestillroomforaccuracyandspeedimprovementswiththeaidofbetterpiezoelectricelementmodelsandbettersensors.

Inthiswork,wewillcoversomeofthecontrolissuesrelatedespeciallytopiezoelectricactuationinnanomanipulationtasks.Wewillalsotakealookatsomeoftherecentimprovementsmadepossiblebymethodsutilizingartificialneuralnetworksforimprovingthegeneralizationcapabilityandtheaccuracyofpiezoelectrichysteresismodelsusedininversemodellingandcontrolofthesolid-statevoltagecontrolledpiezoelectricactuators.

Wewillalsobrieflydiscusstheproblemareasthatthepiezoelectriccontrolmethodresearchshouldbeespeciallyfocusedonandthebiggestproblemareasformanyoftheexistingmethods.Inaddition,someofthecommonissuesrelatedtotestingandresultrepresentationsarediscussed.

8662-2, Session 1

The 20th annual intelligent ground vehicle competition: building a generation of robotists (Invited Paper)BernardL.Theisen,AndrewD.Kosinski,U.S.ArmyTankAutomotiveResearch,DevelopmentandEngineeringCtr.(UnitedStates)

TheIntelligentGroundVehicleCompetition(IGVC)isoneoffour,unmannedsystems,studentcompetitionsthatwerefoundedbytheAssociationforUnmannedVehicleSystemsInternational(AUVSI).TheIGVCisamultidisciplinaryexerciseinproductrealizationthatchallengescollegeengineeringstudentteamstointegrateadvancedcontroltheory,machinevision,vehicularelectronicsandmobileplatformfundamentalstodesignandbuildanunmannedsystem.Teamsfromaroundtheworldfocusondevelopingasuiteofdual-usetechnologiestoequipgroundvehiclesofthefuturewithintelligentdrivingcapabilities.Overthepast20years,thecompetitionhaschallengedundergraduate,graduateandPh.D.studentswithrealworldapplicationsinintelligenttransportationsystems,themilitaryandmanufacturingautomation.Todate,teamsfromover80universitiesandcollegeshaveparticipated.Thispaperdescribessomeoftheapplicationsofthetechnologiesrequiredbythiscompetitionanddiscussestheeducationalbenefits.TheprimarygoaloftheIGVCistoadvanceengineeringeducationinintelligentvehiclesandrelatedtechnologies.Theemploymentandprofessionalnetworkingopportunitiescreatedforstudentsandindustrialsponsorsthroughaseriesoftechnicaleventsoverthefour-daycompetitionarehighlighted.Finally,anassessmentofthecompetitionbasedonparticipationispresented.

8662-3, Session 2

Visual homing with a pan-tilt based stereo cameraParameshNirmal,DamianLyons,FordhamUniv.(UnitedStates)

Visualhomingisanavigationmethodbasedoncomparingastoredimageofthegoallocationandthecurrentimage(currentview).Itistheorizedthatinsects,suchasantsandbees,employvisualhomingmethodstoreturntotheirnest[1].Visualhominghasbeenappliedtoautonomousrobotplatformsusingtwomainapproaches:holisticandfeature-based.Bothmethodsaimatdeterminingdistanceanddirectiontothegoallocation.NavigationalalgorithmsusingScaleInvariantFeatureTransforms(SIFT)havegainedgreatpopularityintherecentyearsduetotherobustnessofthefeatureoperator.ChurchillandVardy[2]havedevelopedavisualhomingmethodusingscalechangeinformation(HominginScaleSpace,HiSS)fromSIFT.

HiSSusesscalechangeinformationfromSIFTfeaturestodeterminedistancebetweentherobotandthegoallocation.Sincethescalecomponentisdiscretewithasmallnumberofelements[3],theresultisaroughmeasurementwithlimitedaccuracy.Wehavedevelopedamethodthatusesstereodata,resultinginbetterhomingperformance.Ourapproachutilizesapan-tiltbasedstereocamera,whichisusedtobuildwide-fieldimages.Weusethewide-fieldimagescombinedwithstereo-dataobtainedfromthestereocameratoextendthekeypointvectordescribedin[3]toincludeanewparameter,depth(z).Usingthisinfo,ouralgorithmdeterminesthedistanceandorientationfromtherobottothegoallocation.

WecompareourmethodwithHiSSinasetofindoortrialsusingaPioneer3-ATrobotequippedwithaBumbleBee2stereocamera.Weevaluatetheperformanceofbothmethodsusingasetofperformancemeasuresdescribedinthispaper.

8662-4, Session 2

Motion control for intelligent ground vehicles based on the selection of paths using fuzzy inferenceShiweiWang,TaskinPadir,WorcesterPolytechnicInstitute(UnitedStates)

Thispaperdescribesamotionplanningtechniqueforintelligentgroundvehiclesusingapathselectionalgorithmbasedonfuzzyinference.Theapproachextendsonthetentaclemotionplanningalgorithm.Tentacleswhicharespeed-dependentsetsofvirtualdrivablepathsrepresentasetofpre-calculatedtrajectoriesinthecoordinateframeofthevehiclelocatedatitscenterofgravity.Inourapproach,weutilizeafuzzyinferencebasedselectiontodecideonthepathtobefollowedbytheautonomousvehicle.Weformulatethemappingbetweentheinputsandoutputsusingfuzzylogicwithoutamathematicalmodel.ThepaperillustratesresultsfrombothasimulationcarriedoutusingMATLAB’sFuzzyLogicToolboxaswellasfieldtestsperformedonaClearpathHuskyA100groundvehiclewhichisprogrammedusingtheRobotOperatingSystem(ROS).Thesimulationtakesintoaccounttheroadmodelincludingobstaclesandlanes,thecalculationofspeed-dependentsetsoftentaclesandimplementsthefuzzyinferencebasedselectionmechanismandgeneratesaplannedpathinanobstaclecourseforthevehicletofollow.ThepracticalimplementationofthefuzzyinferenceruleonHuskyA100,a6-wheeledskid-steeredgroundvehicleoperatedinanoff-roadGPS-deniedenvironmentcoveredwithtrees,reliesona2Dmappingoftherobot’senvironmentusing


ascanninglaserrangefinder.Thepaperprovideadiscussionofthemeasurementmethodsandproceduresusedinthisresearch.

8662-5, Session 2

Panoramic stereo sphere visionWeijiaFeng,TianjinUniv.(China);BaofengZhang,TianjinUniversityofTechnology(China);JuhaRöning,Univ.ofOulu(Finland);XiaoningZong,YITian,TianjinUniversityofTechnology(China)

Conventionalstereovisionsystemshaveasmallfieldofview(FOV)whichlimitstheirusefulnessforcertainapplications.Whilepanoramavisionisableto“see”inalldirectionsoftheobservationspace,scenedepthinformationismissedbecauseofthemappingfrom3Dreferencecoordinatesto2Dpanoramicimage.Inthispaper,wepresentaninnovativevisionsystemwhichbuildsbyaspecialcombinedfish-eyelensesmodule,andiscapableofproducing3Dcoordinateinformationfromthewholeglobalobservationspaceandacquiringnoblindarea360??360?panoramicimagesimultaneouslyjustusingsinglevisionequipmentwithonetimestaticshooting.ItiscalledPanoramicStereoSphereVision(PSSV).Weproposedthegeometricmodel,mathematicmodelandparameterscalibrationmethod.Specifically,videosurveillance,roboticautonomousnavigation,virtualreality,drivingassistance,multiplemaneuveringtargettracking,automaticmappingofenvironmentsandattitudeestimationaresomeoftheapplicationswhichwillbenefitfromPSSV.

8662-6, Session 2

Stabilization and control of quad-rotor helicopter using a smartphone deviceAlokDesai,DahJyeLee,JasonMoore,Yung-PingChang,BrighamYoungUniv.(UnitedStates)

Inrecentyears,autonomousmicrounmannedaerialvehicles(micro-UAVs),ormorespecificallyhoveringmicro-UAVshaveprovedtobesuitableformanypromisingapplicationssuchasexploringofunknownenvironmentsandsearchandrescueoperations.TheyhavebecomeanimportanttoolinthefieldofindooraswellasoutdoorUAVresearch.TheearlyversionsofUAVshadnoon-boardcontrolcapabilities,andweredifficultformanualcontrolfromagroundstation.ManyUAVsnowareequippedwithon-boardcontrolsystemsthatreducetheamountofcontrolrequiredfromtheground-stationoperator.However,thelimitationsonpayloadandpowerconsumptionandcontrolwithouthumaninterferenceremainthebiggestchallenges.

Thispaperproposestouseasmartphoneasthesolecomputationaldevicetostabilizeandcontrolaquad–rotor.ThegoalistousethereadilyavailablesensorsinasmartphonesuchasGPS,accelerometer,rate-gyros,andcameratoassistinsomevision-relatedtaskssuchasflightstabilization,estimationofheightaboveground,targettracking,obstacledetection,andsurveillance.Weuseaquad-rotorplatformthathasbeenbuiltintheRoboticVisionLabatBrighamYoungUniversityforourdevelopmentandexperiments.AnAndroidsmartphoneisconnectedthroughtheUSBporttoanexternalhardwarethathasamicroprocessorandcircuitriestogeneratepulsewidthmodulationsignalstocontrolthebrushlessservomotorsonthequad-rotor.Thehigh-resolutioncameraonthesmartphoneisusedtodetectandtrackfeaturestomaintainadesiredaltitudelevel.Thevisionalgorithmsimplementedincludetemplatematching,Harrisfeaturedetector,RANSACsimilarity-constrainedhomography,colorsegmentation.Othersensorsareusedtocontrolyaw,pitch,androllofthequad-rotor.Thissmartphone-basedsystemisabletostabilizeandcontrolandisidealformicro-UAVsthathavesize,weight,andpowerlimitations.

8662-7, Session 2

Loop closure detection using local Zernike moment patternsEvangelosSariyanidi,IstanbulTeknikÜniv.(Turkey);OnurSencan,IstanbulTeknikÜniversitesi(Turkey);HakanTemelta,IstanbulTeknik

Üniv.(Turkey)

Loopclosure,asveryimportantprobleminmobilerobotics,canbedefinedasthecorrectidentificationofapreviouslyvisitedlocation.Thispaperpresentsanewloopclosuredetectiontechnique,whichreliesonvisuallandmarkdetection.Thelandmarksaredetectedbycombiningverysmallvisualfeatures,calledLocalZernikeMoment(LZM)patterns.TheLZMtransformation,whichisusedtoextracttheLZMpatterns,hasproventobeausefulrepresentationforcertainchallengingrecognitiontaskslikefacerecognition.Inthispaper,theLZMtransformationisusedtoextractcertainlocalfeaturescalledLZMpatterns,whichareextractedbylearningtheappearanceprobabilityoftheLZMfeaturesonanofflinedataset.Oncetheappearanceprobabilityofthepatternsislearnt,thenextstepistofindararecombinationoftheextractedLZMpatternsanduseitasavisuallandmark.Inordertofindauniquefeaturecombination,alandmarkextractionalgorithmthathasbeenintroducedinapreviousstudyisutilized.Theloopclosuredetectionframeworkthatisbeingproposediscomputationallyefficient,tothepointthatitcanoperatenearreal-time.Theoverallapproachhasbeenevaluatedontwodatasets,andtheachievedresultsarequitepromising.

8662-8, Session 3

Optimizing feature selection strategy for adaptive object identification in noisy environmentSagarPandya,TheUniv.ofSouthernCalifornia(UnitedStates);ThomasT.Lu,Tien-HsinChao,JetPropulsionLab.(UnitedStates)

Wepresentthedevelopmentofamulti-stageautomatictargetrecognition(MS-ATR)systemforcomputervisioninrobotics.Thispaperdiscussesourworkinoptimizingthefeatureselection,andfeatureextractionstrategiesoftheMS-ATRsystem.Wediscussamodifiedsaliencyalgorithmthatacceptstheguidanceoftargetfeaturesinlocatingregionsofinterest(ROI).Byintroducingtargetrelatedinputparameters,saliencybecomesmorefocusedandtaskoriented.ItisusedasaninitialstageforthefastROIdetectionmethod.Wefurtherpresentanovelmethodforoptimizingfeatureselectionbycombiningtheprincipalcomponentanalysis(PCA),theIndependentComponentAnalysis(ICA),thesaliencyandgeometricalgorithmstoextractthefeaturevectors.Geneticalgorithmandseveraloptimizingmethodsareusedtoselectaminimumsetoffeaturesthatbestdistinguishesthetargetfromthebackgroundandfalsetargets.Thefeaturevectorsarepassedtothelaststagefortargetidentificationandverification.TheresultsoftheMS-ATRsystemhaveshownincreaseddetectionrateandreducedfalsealarmrateinnoisysonarandvideoimagery.

8662-9, Session 3

GPU-based real-time trinocular stereo visionYuanbinYao,RJLinton,TaskinPadir,WorcesterPolytechnicInstitute(UnitedStates)

Stereovisionhasbeenextensivelyusedinautonomousgroundvehiclenavigation.Traditionally,moststereovisionapplicationsarebinocularwhichusesinformationfromahorizontal2-cameraarraytoperformstereomatchingandcomputethedepthimage.Trinocularstereo-visionwitha3-cameraarrayhasbeenprovedtoprovidehigheraccuracyinstereomatchingwhichcouldbenefitapplicationslikedistancefinding,objectrecognitionanddetection.However,thepresenceofthethirdimageincreasesthecomputationalburdenandhencenotpracticalinmanytimecriticalapplicationslikeautonomousrobotnavigation.DuetothenatureofitshighlyparallelizedSIMD(SingleInstructionMultipleData)architecture,GPGPU(GeneralPurposeGPU)computingcaneffectivelybeusedtoparallelizethelargedataprocessingandgreatlyacceleratethecomputationofalgorithmsusedintrinocularstereovision.So,theimplementationoftrinocularstereovisiononaGPGPUisaneffectivemethodforthedevelopmentofstereovisionapplication.

Thisworkfocusesondesigningandimplementingareal-timetrinocularstereovisionalgorithmwithGPU.Theimplementationmakesuseof

Conference 8662


OpenSourceComputerVisionLibrary(OpenCV)inC++andNVidiaCUDAGPGPUsolution.Asetofimageprocessingalgorithmsweredevelopedwithinthisframeworkandawinner-take-allmethodisappliedtoperformfusionofdisparitiesindifferentdirections.Theresultsarecomparedinaccuracyandcomputationspeedtoverifytheimprovement.

8662-12, Session 3

Remotely controlling of mobile robots using gesture captured by the Kinect and recognized by machine learning methodRoyChaomingHsu,Jhih-WeiJian,Chih-ChuanLin,NationalChiayiUniv.(Taiwan);Chien-HungLai,NationalChiayiUniversity(Taiwan);Cheng-TingLiu,NationalChiayiUniv.(Taiwan)

ItisabigbreakthroughtothegameindustryandinremotelyplayingofthegamebyusingKinectsensorofbodysensationresearch,whileremotelycontrollingofmobilerobotsisthemostpopularsubjectinthefieldofrobotics.ThemainpurposeofthispaperistouseKinectanditsbodysensationtechnologytodesignsimple,convenient,andforhumanityoperationremoterobotcontrolsystem.Inthisstudy,weuseKinectsensortocapturethehumanbodyskeletonwithdepthinformation,todesigngesturetrainingandidentificationtechnology,andfinallytocommandamobilerobotforcertainactions.ExperimentalresultsshowthatthedesignedremotelycontrollingofmobilerobotsusinggesturecapturedbytheKinectsensorcanachieve,onanaverage,morethan96%ofaccurateidentificationof7typesofgesturesandcancontrolasimulatede-puckrobotforfindthewayoutofasimulatedmaze.

8662-13, Session 4

Relating vanishing points to catadioptric camera calibrationWentingDuan,Univ.ofLincoln(UnitedKingdom);HuiZhang,UnitedInternationalCollege(China);NigelAllinson,Univ.ofLincoln(UnitedKingdom)

Thispaperpresentstheanalysisandderivationofthegeometricrelationbetweenvanishingpointsandcameraparametersofcentralcatadioptricsystems.Thesevanishingpointscorrespondtothethreemutuallyorthogonaldirectionsof3Drealworldcoordinatesystem(i.e.X,YandZaxes).Comparedtovanishingpoints(VPs)intheperspectiveprojection,theadvantagesofVPsundercentralcatadioptricprojectionarethattherearenormallytwovanishingpointsforeachsetofparallellines,sincelinesareprojectedtoconicsinthecatadioptricimageplane.Also,theirvanishingpointsareusuallylocatedinsidetheimageframe.WeshowthatknowledgeoftheVPscorrespondingtoXYZaxesfromasingleimagecanleadtosimplederivationofbothintrinsicandextrinsicparametersofthecentralcatadioptricsystem.Thisderivednoveltheoryisdemonstratedandtestedonbothsyntheticandrealdatawithrespecttonoisesensitivity.

8662-14, Session 4

Natural image understanding using algorithm selection and high-level feedbackMartinLukac,MichitakaKameyama,TohokuUniv.(Japan);KosukeHiura,TohokuUniversity(Japan)

NaturalImageprocessingandunderstandingencompasseshundredsoreventhousandsofdifferentalgorithms.Eachalgorithmhasacertainpeakperformanceforaparticularsetofinputfeaturesandconfigurationsoftheobjects/regionsoftheinputimage(environment).Toobtainthebestpossibleresultofprocessing,weproposeanalgorithmselectionapproachthatpermitstoalwaysusethemostappropriatealgorithmforthegiveninputimage.Thisisobtainedbyatfirstselectinganalgorithmbasedonlowlevelfeaturessuchas

colorintensity,histograms,spectralcoefficients.Theresultinghighlevelimagedescriptionisthenanalyzedforlogicalinconsistencies(contradictions)thatarethenusedtorefinetheselectionoftheprocessingelements.Theselectionstopswhenthehighlevelinconsistenciesareallresolvedornomoredifferentalgorithmscanbeselected.

8662-15, Session 4

Improving shape context using geodesic information and reflection invarianceVittalPremachandran,RamakrishnaKakarala,NanyangTechnologicalUniv.(Singapore)

Objectrecognitionisusuallyperformedbyextractingfeaturesandthenmatchingthefeaturestotestimages.Color,textureandshapeareimportantfeaturesofanobject.Shapeisarichinformationthathasnotbeenwellexploited.Inthispaper,anewwayforextractingtheshapeinformationfromagivenobject,isproposed.Todescribetheobject’sshapeinameaningfulway,theobject’scontoursarefirstextractedandsampledintoasetofuniformlyspacedpoints.Shapecontextsaregeneratedateachoftheseuniformlyspacedpoints.Ashapecontexta2Dhistogramofdistancesandangles.Theproposedvariantoftheshapecontextmakesuseofthegeodesicdistancebetweentwosampledpoints.TheuseofgeodesicdistancebettercapturestheshapeinformationthanthetraditionalEuclideandistance.Experimentalresultsonastandarddatabaseshowsthattheuseofgeodesicinformationhelpsinobtainingbetterobjectretrievalrates.

8662-16, Session 4

A proposed super-fast scheme for instant-detect-instant-kill of a ground-to-air missileChialunJ.Hu,SouthernIllinoisUniv.atCarbondale(UnitedStates)

Wecanapplyaslightlymodifiedlocalpolaredgedetection(LPED)methodtheauthorusedinthelast3yearstoabinaryIRpicturecapturedbyanIRcameraoperatedatroomtemperaturetogetthecentralofmasspoint(CMP)ofthehightemperature,enemysurfacetoairmissile(oraSAM)againstthelowtemperatureenvironmentINSTANTLYwhentheSAMisfired.ThenitwillautomaticallytracktheCMPofthefastflyingSAMbypredictingaheadthelocationoftheCMPinthenextIRpictureframe.Whenthisisautomaticallyconfirmedinthenextpictureframe,theoperatorscreenwillflash“TARGETLOCKED”sign.Theoperatorcanthenpressaredbuttontofireeitheralasergunoran“induction-ignitedair-to-air(orATA)missile”atthepredictedrendezvouspointoftheenemySAMandkilltheSAMinstantly.Forlasergunoperation,therendezvouspointisthepredictedCMPofthenextSAMposition.Fortheinduction-ignitedATAmissile,therendezvouspointisthecenterofainfluencespheresuchthatwhentheATAfliesintotheinfluencesphereoftheSAM,notnecessarilyhittingtheSAM,theATAwillexplodetoshoottheenemySAMdown.

Toenhancefurthertheaccuracy,weshouldusetwohelicopterstocross-firecooperativelywithhelicopterAtargetingattheSAMaimingathelicopterBandhelicopterBtargetingattheSAMaimingathelicopterA.Thiscross-firingstrategyisusedtoimprovethetargetcrosssectionbyaimingatthetargetfrom“sideview”insteadoffrom“head-onview”.

8662-17, Session 4

Video surveillance with alertsRobertJ.Wahlstedt,WhitworthUniv.(UnitedStates)

VSWAisaprogramthatintegratescomputervisiontogathernon-verbalcluesofsuspiciousbehavior.Thispaperdiscussessocialaspectsthatcharacterizethatareinconsistentthatposeathreatsoanalertcangotoanauthority.Thesesituationscanrangefromabankholduptodrugtrafficking.Thisprogramusesbodypostureconfidence,

Conference 8662


walkingpace,scanningforweaponsinimages,andfacialexpression,aswellasscanningforthreateninggestures.Thisprogramdoesnotalertbasedononevariablealonebutratherithasanindexoffactorsthatsuggestsuspicionorguilt.Thispaperdiscussesthealgorithmsandmethodologythatisintheprogram.Thisprogramhasthegoalofmakingtransit,libraries,schools,andbankssaferandlessinvasive.

8662-18, Session 4

Finger tracking for hand-held device interface using profile-matching stereo visionYung-PingChang,DahJyeLee,JasonMoore,AlokDesai,BeauJ.Tippetts,BrighamYoungUniv.(UnitedStates)

Hand-heldcommunicationorcomputingdeviceshavebecomeanimportantpartofhumanlife.Alargepercentageofpopulationsusethemregularly.Althoughconvenientfortheusertodialaphonenumberofselectmusictoplay,theycausesafetyproblems,especiallywhenusingthesedeviceswhiledriving.Astheintendeddesign,theuserhastousefingerstotouchthescreentooperatethem.Thesetouch-screenoperationsdistractdriversandputthemindangerormakethemdangeroustootherdriversorpedestrians.Anon-contactcontrolinterfacewillallowtheusertocontrolthedevicewithouttouchingthescreenandtofocusmoreondriving.Thechallengesofbuildingsuchinterfacearehowtolocalizeandtrackfingersinrealtime.

WerecentlydevelopedanewstereovisionalgorithmcalledIntensityProfileShape-Matchingfor3-Dhumangestureanalysisandobstacledetection.Thisalgorithmisabletoextract3-Dinformationfromapairofintensityprofilesfromstereoimagesinrealtime.Itmatchesintensityprofileshaperowbyrow,andproducesadisparitymapforanalysis.Bychoosingacertainintervalofdisparity,objectinacertaindistancerangecanbesegmented.Inotherwords,wedetectobjectnotbyitscolor,intensity,ortexturebutbydistance.Theadvantageofthisprofileshape-matchingalgorithmisthatdetectionofcorrespondencesreliesontheshapeofprofilenotonintensityvalueswhicharesubjectedtolightingvariations.Basedontheresulting3-Dinformation,themovementoffingersinspacefromaspecificdistancecanbedetermined.Fingerlocationandmovementcanthenbeanalyzedfornon-contactcontrol.

Thisalgorithm,likeothers,facesthetradeoffbetweenaccuracyandprocessingspeed.Higheraccuracyrequiresmoreresourcesandtime.Fortheproposedfingerlocalizationandtrackingapplication,weareabletofindabalancebetweenaccuracyandspeedandfindthatitissuitablefortheproposedapplication.

8662-19, Session 5

Training industrial robots with gesture recognition techniquesJenniferR.Piane,DePaulUniv.(UnitedStates)andYaskawaAmerica,Inc.(UnitedStates);DanielaRaicu,JacobFurst,DePaulUniv.(UnitedStates)

Thispaperdescribesthatsystemthatappliesgesturerecognitiontechniquesandmachinelearningmethodstobuildasystemthatgeneratesarobotmotionprogramfromapairofstereovideos.Thevideosusedtoevaluatethesystemare‘highlycluttered’andinsomecasesblurrytosupporta‘real’usageenvironment.Thispaperappliesacolor-thresholdingmodel,naïveBayesandSVMtodetectthehumanhand.ItperformsStereoMatchingontheregionwherethehandwasdetectedtofindrelative3Dcoordinates.Thesystemfinallyanalyzesthelistofcoordinatestodetermineapathfortherobottomove.Withlargeamountsofclutterandsomeblur,thesystemwas77%effectiveindetectingthehand.Theanalysisofrelative3Dcoordinateswasabletocompensateforerrorsandproduceeffectiverobotmotion.

8662-20, Session 5

A restrained-torque-based motion instructor: forearm flexion/extension-driving exoskeletonTakuyaNishimura,YoshihikoNomura,RyotaSakamoto,MieUniv.(Japan)

Theauthorsintroducedacost-effectiveidea:itwouldbeenoughtoinstructthetimingofstart-stopandtoaccelerateordeceleratejointanglerotationbecausesuchinstructions,inessentials,donotneedtoexertpowerandenergybutjustneedtotransmitmotioninformation.Thisconceptisveryimportantpointthatdifferentiatesourstudyfromtheordinary“highly-poweredstrongarm”typeofpoweredsuits.

Thispaperpresentsaresultofafundamentalexperimentforconstructingaguidelineofanexternaltorque-representingmechanicalspecification:takinganexamplewiththestart-upofaforearmbendingmotion,quantitiesofstimulusthresholdforhavingtheforearmmovepassivelybytheexternallyexertedtorquewereexaminedthroughpsychophysicalexperimentsusingaprototypeofamechanicalinterface:atorque-basedexoskeletoninstructorforinducingforearmflexionmotion.

8662-21, Session 5

3D recovery of human gaze in natural environmentsLucasPaletta,KatrinSantner,GeraldFritz,JOANNEUMRESEARCHForschungsgesellschaftmbH(Austria);HeinzMayer,JOANNEUMRESEARCHForschungsgesellschaftmbH(Germany)

Inmarketingandusabilityengineering,thetrackingofhumaneyemovements,i.e.eye-tracking,hasbeenthecentraltechnologyforcapturingvisualattention.Withinthelastcoupleofyearsminiaturizedmobileeye-trackingsystemshavebecomeavailableandbeensuccessfullyappliedindifferentareasofresearchwiththemajoradvantagetoevaluateattentioninthefieldwherethetaskofinterestisperformed.Amajordisadvantageofexistingeye-trackingtechnologyistheneedtomanuallyanalyzethehugeamountofcollectedvideodata.Inourworkweseektoattainalignmentofthehumangazedatawithina3Dreconstructionoftheenvironmentbyrecoveringthefull6DegreesofFreedom(DOF)poseandthesensorshouldprovidethree-dimensionalinformationoftheenvironment.Thismeansthatthehumangazeisfullyreconstructedandmappedtowardsthe3Dmodeloftheenvironmentandoncetheenvironmenthasbeenannotatedbefore,thesemanticsofthegazetrajectorycanbereconstructedthereafteraswell.ForthispurposeweapplytheSimultaneousLocalizationandMapping(SLAM)framework.Onastandardizedtestdataset,weoutperformEKFbasedsolutionsintermsoflocalizationaccuracybyatleastafactoroftwo.

8662-22, Session 6

CANINE: a robotic mine dogBrianStancil,JeffHyams,JordanShelley,KartikBabu,NeyaSystemsLLC(UnitedStates);HernanBadino,AayushBansal,DanielF.Huber,CarnegieMellonUniv.(UnitedStates);ParagBatavia,NeyaSystemsLLC(UnitedStates)

Thepurposeofthiseffortistodevelopthesensingandprocessingcapabilitiestoenableasmallmobilerobot(CANINE)toactasaminedog.Minedogswereusedtolookfordangerousobjectssuchasmines,unexplodedordnance,andtripwires.Forthiseffort,theconceptofoperationsistoshowCANINEanobjectofinterest,haveitlearnthesalientfeaturesofthatobject,andthensearchforandrecovertheobjectafterithasbeenthrowninafieldcontainingdistractorobjectsandotherobstacles.

Conference 8662


8662-23, Session 6

Development of dog-like retrieving capability in a ground robotDouglasC.MacKenzie,MobileIntelligenceCorp.(UnitedStates);RahulAshok,JamesM.Rehg,GeorgiaInstituteofTechnology(UnitedStates);GaryWitus,WayneStateUniv.(UnitedStates)

TheMobileIntelligenceTeam’sapproachtoaddressingtheCANINEoutdoorgroundrobotcompetitionwillbepresented.Thecompetitionrequireddevelopingarobotthatprovidedretrievingcapabilitiessimilartoadog,whileoperatingfullyautonomouslyinunstructuredenvironments.ThevisionteamconsistedofMobileIntelligence,theGeorgiaInstituteofTechnology,andWayneStateUniversity.Importantcomputervisionaspectsoftheprojectweretheabilitytoquicklylearnthedistinguishingcharacteristicsofnovelobjects,searchingimagesfortheobjectastherobotdroveasearchpattern,identifyingpeopleneartherobotforsafeoperations,correctlyidentifyingtheobjectamongdistractors,andlocalizingtheobjectforretrieval.

8662-24, Session 6

Multidisciplinary unmanned technology teammate (MUTT)NenadUzunovic,AnneSchneider,AlbertoLacaze,KarlMurphy,RoboticResearchLLC(UnitedStates);MarkDelGiorno,DelServices(UnitedStates)

TheuniqueapproachcombinedwiththelatesttechnicaldevelopmentsinautonomousmobilityandoperatorinteractionguidedtheRoboticResearch,LLC-ledteamtoafirstplacewininTARDEC’sCANINEcompetition.ThegoalofthecompetitionwastodeveloparoboticsystemthatwoulddisplayusefulcapabilitiessimilartothosefoundinanArmydog–suchasfetchingobjectsthrownbyanoperator.Usingonlygesturesorvoicecommands,therobotswouldlearnanewobjectatthestartofeachphase,findtheobjectafteritwasthrownintoafield,andreturntheobjecttotheoperator.Eachofthesixphasesbecamemoredifficult,includingclutterofthesamecolororshapeastheobject,movingandstationaryobstacles,andfindingtheoperatorwhomovedtoanewlocation.TheRoboticResearchTeamintegratedtechniquesincomputervision,speechrecognition,objectmanipulation,andautonomousnavigation.Amulti-filtercomputervisionsolutionreliablydetectedtheobjectswhilerejectingobjectsofsimilarcolororshape,evenwhiletherobotwasinmotion.Aspeech-basedinterfacewithshortcommandsprovidedclosetonaturalcommunicationofcomplicatedcommandsfromtheoperatortotherobot.Aninnovativegripperdesignallowedforefficientobjectpickup.Arobustautonomousmobilityandnavigationsolutionforgroundroboticplatformsprovidedfastandreliableobstacleavoidanceandcoursenavigation.Theresearchapproachfocusedonwinningthecompetitionwhileremainingcognizantandrelevanttorealworldapplications.

8662-25, Session 6

R-MASTIF: robotic mobile autonomous system for threat interrogation and object fetchAveekK.Das,SRIInternationalSarnoff(UnitedStates);DineshThakur,JamesFKeller,UniversityofPennsylvania(UnitedStates);SujitKuthirummal,ZsoltKira,SRIInternationalSarnoff(UnitedStates);MihailPivtoraiko,UniversityofPennsylvania(UnitedStates)

Autonomousrobotic“fetch”operation,wherearobotisshownanovelobjectandthenaskedtolocateitinthefield,successfullyretrieveitandbringitbacktothehumanoperator,isachallengingproblemthatisofinteresttothemilitary.TheCANINEcompetitionpresentedaforumforseveralresearchteamstotacklethischallengeusingstateoftheartinroboticstechnology.TheSRI-UPennteamfieldedamodifiedSegway

RMP200robotwithmultiplecamerasandlidars.Weimplementedauniquecomputervisionbasedapproachforcoloredobjecttraininganddetectiontorobustlylocatedpreviouslyunseenobjectsoutto15metersonmoderatelyflatterrain.WeintegratedSRI’sstateoftheartVisualOdometryforGPS-deniedlocalizationonourrobotplatform.Wealsodesignedauniquescoopingmechanismwhichallowedretrievalofuptobasketballsizedobjectswithareciprocatingfour-barlinkagemechanism.FurtherthesoftwareframeworkwasdevelopedusingROS(RobotOperatingSystem)whichisopensourceandwelladoptedbytheroboticscommunity.Wepresentadescriptionofthesystem,ourkeytechnicalcontributionsandexperimentalresults.

8662-26, Session 6

LABRADOR: a learning autonomous behavior-based robot for adaptive detection and object retrievalBrianM.Yamauchi,MarkMoseley,iRobotCorp.(UnitedStates);JonathanBrookshire,MIT(UnitedStates)

AspartoftheTARDEC-fundedCANINE(CooperativeAutonomousNavigationinaNetworkedEnvironment)Program,iRobotdevelopedLABRADOR(LearningAutonomousBehavior-basedRobotforAdaptiveDetectionandObjectRetrieval).LABRADORwasbasedontherugged,man-portable,iRobotPackBotunmannedgroundvehicle(UGV)equippedwithanexplosivesordnancedisposal(EOD)manipulatorarmandacustomgripper.ForLABRADOR,wedevelopedavision-basedobjectlearningandrecognitionsystemthatcombinedaTLD(track-learn-detect)filterbasedonobjectshapefeatureswithacolor-histogram-basedobjectdetector.Ourvisionsystemwasabletolearninreal-timetorecognizeobjectspresentedtotherobot.WealsoimplementedawaypointnavigationsystembasedonfusedGPS,IMU(inertialmeasurementunit),andodometrydata.Weusedthisnavigationcapabilitytoimplementautonomousbehaviorscapableofsearchingaspecifiedareausingavarietyofrobustcoveragestrategies–includingoutwardspiral,randombounce,randomwaypoint,andperimeterfollowingbehaviors.WhilethefullsystemwasnotintegratedintimetocompeteintheCANINEcompetitionevent,wedevelopedusefulperception,navigation,andbehaviorcapabilitiesthatmaybeappliedtofutureautonomousrobotsystems.


Method and application of active visual tracking based on illumination invariantsJieSu,HarbinUniv.ofScienceandTechnology(China);Gui-shengYin,HarbinEngineeringUniv.(China);LeiWang,Yong-qianSun,ZhengLiu,HarbinUniv.ofScienceandTechnology(China)

Toimproveidentificationrateandtrackrateforquicklymovingtarget,expandtrackingscopeandlowersensitivitytoilluminationvarying,anactivevisualtrackingsystembasedonilluminationinvariantsisproposed.Cameramotionpre-controlmethodbasedonparticlefilterpre-locationisusedtoimproveactivityandaccuracyoftrackforquicklymovingtargetbyforecastingtargetpositionandcontrolcamerajointsofTiltandPan.Pre-locationmethodusingparticlesamplefilteraccordingtoilluminationinvariantsoftargetisusedtoreducetheaffectofvaryingilluminationduringtrackingmovingtargetandtoimprovealgorithmrobust.Thecorrectnessandaccuracyweretestedthoughexperiments.Activity,realtimeandaccuracyareimprovedwhilesensitivitytoilluminationvariationisreduced.


Supervised linear dimensionality reduction with robust margins for object recognitionFadiDornaika,Univ.delPaísVasco(Spain);AmmarAssoum,LebaneseUniv.(Lebanon)

Conference 8662


LinearDimensionalityReduction(LDR)techniqueshavebeenincreasinglyimportantincomputervisionandpatternrecognitionsincetheypermitarelativelysimplemappingofdataontoalower

dimensionalsubspace,leadingtoefficientclassificationstrategies.Recently,alinearmethodcalledAverageNeighborsMarginMaximization(ANMM)wasproposedandshowntohavepowerfuldiscriminationproperties.Althoughgoodresultswereobtainedwiththismethod,itsuffersfromtwoshortcomings:i)itrequiresthesettingoftwoparameters-theneighborssizesforhomogeneousandheterogeneoussamples,andii)itcanbeverysensitivetothelabeloutlierssinceamarginaverageisused.

Inthispaper,weproposearobustlinearembeddingmethodwhosemarginsarecomputedovermediandistances.Experimentalresultsperformedonthreefacedatabases(ExtebdedYale,PIE,andPF01)showthattheproposedapproachcangivebettergeneralizationperformancethantheANMMmethod.OurproposedmethodismuchlesssensitivetooutliersthantheANMMmethodandtheNearestmiss/Nearesthitmethod.Theproposedapproachcouldalsobeappliedtoothercategoryofobjectscharacterizedbylargevariationsintheirappearance.


Using a multi-port architecture of neural-net associative memory based on the equivalency paradigm for parallel cluster image analysis and self-learningVladimirG.Krasilenko,VinnitsaSocialEconomyInstitute(Ukraine);AlexanderA.Lazarev,VinnitsaNationalTechnicalUniv.(Ukraine);SvetaK.Grabovlyak,DianaV.Nikitovich,VinnitsaSocialEconomyInstitute(Ukraine)

First,weconsiderequivalencymodels,includingmatrix-matrixandmatrix-tensorandwiththedualadaptive-weightedcorrelation,multi-portneural-netauto-associativeandhetero-associativememory(MPNNAAMandHAP),whichareequivalencyparadigmandthetheoreticalbasisofourwork.WemakeabriefoverviewofthepossibleimplementationsoftheMPNNAAMandoftheirarchitecturesproposedandinvestigatedearlierbyus.Themainbaseunitofsucharchitecturesisamatrix-matrixormatrix-tensorequivalentor.

WeshowthattheMPNNAAMbasedontheequivalencyparadigmandoptoelectronicarchitectureswithspace-timeintegrationandparallel-serial2Dimagesprocessinghaveadvantagessuchasincreasedmemorycapacity(morethananorderofthenumberofneurons!),highperformanceindifferentmodes(10^10-10^12connectionspersecond!)andtheabilitytoprocess,storeandassociativelyrecognizehighlycorrelatedimages.

Next,weshowthatwithminormodifications,suchMPNNAAMcanbesuccessfullyusedforhigh-performanceparallelclusteringprocessofimages.Weshowsimulationresultsofusingthesemodifiedforclusteringandlearningmodelsandalgorithmsforclusteranalysisofspecificimagesanddividethemintocategoriesofthearray.

Showexampleofaclusterdivisionof32images(40x32pixels)lettersandgraphicsfor12clusterswithsimultaneousformationoftheoutput-weightedspaceallocatedimagesforeachcluster.

Wewilldiscussalgorithmsforlearningandself-learninginsuchstructuresandtheircomparativeevaluationsbasedonMathCadsimulationsaremade.

Itwillbeshownthat,unlikethetraditionalKohonenself-organizingmaps,timeoflearningintheproposedstructuresofmulti-portneuronetclassifier/clusterizer(MPNNC)onthebasisofequivalencyparadigm,duetotheirmulti-portdecreasesbyordersandcannotexceed,insomecases,justafewepochs.

Estimatesshowthatinthetestclustering,selectedforexample,321280-elementimagesinto12groups,theformationofneuralconnectionsofthematrixdimensionof128*120elementsoccurstotensofiterativesteps(someepochs),andforasetoflearningpatternsconsistingof32suchimages,andattimeofprocessingof1-10microseconds,thetotallearningtimedoesnotexceedafewmilliseconds.

WewilloffercriteriaforevaluatethequalityofpatternsclusteringwithsuchMPNNAAM.

Conference 8662


Conference 8663: Video Surveillance and Transportation Imaging ApplicationsMonday-Wednesday4–6February2013

Part of Proceedings of SPIE Vol. 8663 Video Surveillance and Transportation Imaging Applications

8663-1, Session 1

Group localisation and unsupervised detection and classification of basic crowd behaviour events for surveillance applicationsNadejdaS.Roubtsova,PeterH.N.deWith,TechnischeUniv.Eindhoven(Netherlands)

Recently,inresponsetotheneedforautomaticmonitoringofdenselypopulatedhigh-riskpublicplaces,interesttocrowdbehaviouranalysishasbeengrowing.Insteadoftacklingcomplexsurveillancescenariosdirectlythroughmachinelearning,wepresentanoptical-flow-basedunsuperviseddetectionandclassificationalgorithmofthreeelementarycrowdbehaviourevents:splits,mergesandlateralslide-bymotions.Oursystemismotion-based,ratherthanobject-detection-based,andinherentlylocalisesmovinggroupsatframeintervals.Thisgroup-based,ratherthanindividual-based,approachallowsforscalabilityandadegreeofrobustnessagainstartifactsasinfalliblehumandetectionisnolongeressential.Thesystem’sperformanceischaracterisedbyhighdetectionandclassificationrecallscores.

Thepresentedframeworkcanbebuiltuponanditsapplicationsextendbeyondtheoriginallyintendedscope.Firstly,basicbehaviourevents,localisedbothspatiallyandtemporally,togetherwiththecurrentgrouppositioninformationbythemselvesindirectlyhelpsketchthemonitoredenvironmentthroughthecrowd’sreactiontoit(e.g.anobstaclecausingacrowdsplit).Secondly,thepresentedframework,alonewithminorextensionsand/orwhenintegratedwithstaticcontentanalysisalgorithms(e.g.trafficsigndetection),canserveforanalysisofcomplexsurveillancescenariossuchasmassevacuation,unregulatedroadcrossingorunauthorisedovercrowding.

8663-2, Session 1

Gaussian mixtures for anomaly detection in crowded scenes HabibUllah,Univ.degliStudidiTrento(Italy);LorenzaTenuti,UniversityofTrento(Italy);NicolaConci,Univ.degliStudidiTrento(Italy)

NoAbstractAvailable

8663-3, Session 1

Space-time correlation filters for human action detectionJosephA.Fernandez,B.V.K.VijayaKumar,CarnegieMellonUniv.(UnitedStates)

Toautomatevideosurveillancesystems,algorithmsmustbedevelopedtoautomaticallydetectandclassifybasichumanactions.Manytraditionalapproachesfocusontheclassificationofactions,whichusuallyassumespriordetection,tracking,andsegmentationofthehumanfigurefromtheoriginalvideo.Ontheotherhand,actiondetectionisamoredesirableparadigm,asitiscapableofsimultaneouslocalizationandclassificationoftheaction.Thismeansthatnopriorsegmentationortrackingisrequired,andmultipleactioninstancesmaybedetectedinthesamevideo.

Correlationfiltershavebeentraditionallyappliedforobjectdetectioninimages.Inthispaper,wereporttheresultsofourinvestigationusing

correlationfiltersforhumanactiondetectioninvideos.Correlationfiltershavepreviouslybeenexploredforactionclassification,butthisisthefirsttimetheyareevaluatedforthemoredifficulttaskofactiondetection.Inaddition,weinvestigateseveralpracticalimplementationissues,includingparameterselection,reducingcomputationaltime,andexploringtheeffectsofpreprocessingandtemporalocclusion(i.e.,lossofvideoframes)onperformance.

8663-4, Session 1

Recognition of two-person interaction in multi-view surveillance video via proxemics cues and spatio-temporal interest pointsBoZhang,PaoloRota,NicolaConci,Univ.degliStudidiTrento(Italy)

NoAbstractAvailable

8663-5, Session 1

Weighted symbolic analysis of human behavior for event detectionAndreaRosani,GiuliaBoato,FrancescoG.B.DeNatale,Univ.degliStudidiTrento(Italy)

Videounderstandinghasbecomeahighinterestresearchtopicinrecentyears,withvariousapplications,suchasvideobrowsing,content-basedvideoindexingandvisualsurveillance.Theautomationofthisprocessisstillachallengingtask.Symbolicapproachesintroduceagreatsimplificationinthedata,transformingthemintochainsofmeaningfulpatterns.Thisallowstoovercomeacommondrawbackresultingfromtheclutterproducedbylow-levelprocessingoperations,embeddingsignificantcontextualinformationintothedata,aswellasusingsimplesyntacticapproachestoperformthematchingbetweenactualsequencesandmodels.InthispaperweproposeanovelframeworkthatemploysasymbolicrepresentationofcomplexactivitiesthroughsequencesofatomicactionsbasedonaweightedContext-FreeGrammar(SVM-CFG).Thevideounderstandingisperformedatdifferentabstractionlayers,retrievinganomalousactivitiesandlimitingtheinfluenceofnoise.

8663-6, Session 2

A resource allocation framework for adaptive selection of point matching strategiesQuentinDeNeyer,LiSun,ChristopheDeVleeschouwer,Univ.CatholiquedeLouvain(Belgium)

OurworkinvestigatesthePTZautotrackingproblem,whichbuildsontheclose-viewvideoscapturedbyamovingcamera.Thiscaseisindeedfundamentallydifferentfromconventionaltrackingscenarios,generallydealingwithwidefieldofviews,inwhichtheobjectofinterestcanbetrackedbydiscriminatingitfromitssurroundingbackground.Inclearwords,theconventionaltrackingalgorithms,whichrelyonthediscriminationbetweenthetargetanditsbackground,arenotrelevantinourclose-viewcontext.Hence,weproposetoadoptapoint-basedmodelofthetarget,soastotracktheobjectbasedonthematchingofpointsbetweenconsecutiveframesofthevideo.

Conference 8663


Thepaperintroducesanoriginalcost-benefitframeworktoadaptthenumberofpointsandtheirindividualmatchingmetricstomaximizetheglobalframeregistrationqualityunderacomplexityconstraint.HandlingthecomplexityisofparticularimportanceintheautotrackingscenariosincecomplexitydirectlyaffectsthedelayaffectingthePTZcontrolloop.Ourpoint-basedrepresentationmodelappearstobebotheffectiveandefficientinregisteringpairsofframe.

8663-7, Session 2

Human behavior analysis using trajectory data in a PTZ surveillance systemSanjeevKumar,BalasubramanianRaman,IndianInstituteofTechnologyRoorkee(India)

ThispaperpresentsamodernvideosurveillancesystemcomposedbyaPTZcameraforhumanmotionanalysis.Thetrajectoryisobtainedfromthevideostreamofhumanmotionandclassifiedasnormaloranomalous.Backgroundsubtractionapproachhasbeenusedtoobtainthetrajectoryofhumanmotion.ThetrajectoryisrepresentedbyanonuniformrationalB-spline(NURBS)curveintermsofapolynomial.EachcoordinateofNURBSpolynomialisrepresentedbyafunctionoftime.Inthiscontext,thepositionofhumancanbeobtainedjustbythetimeinformationwithanaccuratelocalization.ThecontrolpointsandweightsoftheNURBScurvesareusedtogeneratethefeaturevectorforclassification.Abinarysupportvectormachineisusedastheclassifierforthetrajectoryclassification.Experimentalresultsarepresentedfromthevideostreamsofalobbywhereanumberofpeoplearemovingindifferentdirections.Besidesshowingtheanalysisoftheobtainedresults,wecompareourresultswithsomeexistingtrajectoryclusteringapproaches.

8663-8, Session 2

Collaborative real-time scheduling of multiple PTZ cameras for multiple objects tracking in video surveillanceYu-CheLiu,Chung-LinHuang,NationalTsingHuaUniv.(Taiwan)

ThevideosurveillancesystemswithmanyCCTVcamerasperformpoorlyontasksthatrequirehigherresolutionfacialimagesforbiometricidentification.OnepromisingsolutionisusingthePan-Tilt-Zoom(PTZ)camerastocoverapotentiallymuchlargerareaandobtainahigherresolutionimagethroughzoomcapability.So,somesurveillancesystemscombinetraditionalpassiveCCTVcameraswiththeactivePTZcamerasinmaster/slaveconfigurations.

VideosurveillancesystemwithfixedCCTVcamerascannotprovidehigherresolutionimagesofthetargetsuchasclose-upfacialimages.Oncetheobjectapproachessomeblindspots,itcannotbeobserved.Therefore,manysurveillancesystemscombinethepassiveCCTVcameraswiththePTZcamerasinmaster/slaveconfiguration.TheactivePTZcamerasshouldtrackandobservethedesignatedtargetobject,andthenswitchtheirtargetobjects(orhand-off)oncetheobservationgoalchanges.

Thecameranetworkcanresolvetheconflictsoftaskingmultiplecamerastoobserveseveralpedestriansimultaneously.ThemainchallengeariseswhenthenumberofsubjectsandactivitiesexceedsthenumberofPTZcamerassothattheschedulingandcontrollingofthePTZcamerasbecomenontrivial.Themulti-PTZcamerassystemdecidesthebestallocationofthecameratoobservedifferenthumansubjectsinthemeanwhilesatisfiestheobservationcriterionthatrequiresseamlessclose-upvideooftheobjectsofinterest.Here,weformulaPTZcamerasassignmentandhandoffasaplanningproblemwhosesolutionachievesoptimalcamerautilizationforthepredefineobservationgoal.

Manyliteraturesinvideosurveillancesystemsfocusonmulti-peoplemulti-camera(MPMC)objecttrackinginalargeheterogeneouscameranetwork.UsingfixedCCTVcamerasperformspoorlyontasksthatrequirehigherresolutionfacialimagesforbiometricidentification.TheMPMCtrackingsystemneedstoconsiderthefollowingconditions:(1)thenumberofpedestrianandactivitiesappearsinthescenemay

exceedsthenumberofavailablePTZcameras,(2)thedualcamerapairsrequiredtocoveralltheareatoavoidtheoccurrenceofblindspots,(3)Thecomputationcomplexityofobjecttrackingandlocationincreases.

ThispaperformulatesPTZcameraassignmentandhand-offasaplanningproblem.Ourgoalistooptimizethecamerautilizationtorecordseamless,highqualityvideoofroamingindividualbyusingonlymultiplePTZcameras.

8663-9, Session 2

Tracking small targets in wide area motion imagery dataAlexMathew,VijayanK.Asari,Univ.ofDayton(UnitedStates)

Inthispaper,weproposeamethodtotrackverysmalltargetssuchaspedestrians,inWideAreaMotionImagery(WAMI)data.OurmethodisevaluatedonAFRLColumbusLargeImageFormat(CLIF)WAMIdata.Theapproachinvolvesfoursteps–imageregistration,featureextraction,locationestimationandtemplateupdate.Speeded-UpRobustFeautres(SURF)isusedforimageregistration.Forfeatureextraction,weuseanovelmethodinwhichtheimageisrepresentedasamatrixofhistograms.AKalmanfilter,incombinationwiththeproposedfeatureextractiontechnique,isusedfortracking.Aslidingwindowisusedtosearchforpossiblelocationofthetargetwithinthesearchregiongivenbythefilter.Insteadoftreatingtheproblemasanobjectdetectionproblem,wetreatthisasanimageretrievalproblemanduseEarthMover’sDistance(EMD)tocomputea‘dissimilarityvalue’.Thetargetlocationistakenastheonethatproducestheleastvalue.Theproblemoftemplatedriftissolvedbylocalizingthetargetwithablobdetectionalgorithm.WeshowtherobustnessofthealgorithmbygivingacomparisonwithothermethodssuchasSURF,SIFT,HoGandmean-shifttracking.

8663-10, Session 2

Feature descriptors for object matching in real-time tracking applicationsGernotLoibner,OliverSidla,SLREngineeringOG(Austria)

Thisworkpresentsareal-timetrackingframeworkwhichisdesignedtoachievetrackingwithatleast8-10framespersecondonalowpowersmartcamera.Applicationsoftheproposedtrackingframeworkispedestrianandvehicletrackinginindoorandoutdoorenvironments.Ourproposedtrackingframeworkiskeptsimpleinordertoavoidunnecessarycomputation,butithasproventobeeffectiveinlowtomediumcrowdedsituations.Thisworkconcentratesonappearancebasedsimilaritymeasuresusedwhichareusedtoverifymatchesfromframetoframefori)thedecisionwhetheradetectionactuallymatcheswiththeextrapolatedpositionfromthepreviousframe,ii)notonlybetweenframesbutalsobetweendifferentcameras,iii)todecidewhetherobjectsmightbe‚goodobjecttotrack‘whichsignificantlydifferfromtheaverage.

TheobjectdescriptorswhichhavebeentestedareallfastenoughsothattheycaneusedinourembeddedsystemparalleltoLKopticalflowandHOGdetectormodules.Specificallywehavetestedthefollowingfeaturedescriptors:

-BRIEF

-ORB

-BRISK

-FREAK

-HOG

-DCT

-colorhistograms

Basedonasetofseveralhundredmanuallyextractedobjectsamples(pedestriansonly)wetesteveryfeaturedescriptoronitsabilityto

i)matchidenticalobjectsfromdifferentframes

ii)todescribeeachobjectandseparateitfromotherobjectsinfeature-space

Conference 8663

Conference 8663


Eachtestwillreportalsoontherelativecomputationcostontheembeddedhardwareunderconsideration.Thetracking-frameworkwillhasbeentestedonreal-worldvideoswithannotatedgroundtruthtrajectoriesinordertoevaluateitsperformance.

8663-11, Session 3

Identification of overloaded ships based on video surveillance in inland riverLeiXie,WuhanUniv.ofTechnology(China)

Toreducewatertrafficaccident,identificationofoverloadedshipshasbeenofmajorimportanceforenforcinginlandrivermanagement,especiallyinsomedevelopingcountries.Drivenbyrisingoilpricesorothereconomicbenefits,theshipoverloadphenomenoncontinuedtooccurinChina.Therefore,overloadedshipdetectionhasbeenofakeyfactorofshippingsafety.Thispaperpresentedarobustmethodfordetectingoverloadedshipandtheproposedalgorithmincludedthreestages:shipdetection,shiptrackingandoverloadedshipidentification.Shipdetectionwasakeystepandtheconceptofshiptrackingisbuiltupontheship-segmentationmethod,inwhichthealgorithmaboutbackgroundestimation,backgroundupdating,backgroundsubtractionandshipdetectionhasbeendescribed.Accordingtothesegmentedshipshape,apredictmethodbasedonKalmanfilterhasbeenproposedtotrackeachship.Thedescribedidentificationsystemincludedavideocameraandahighdefinitioncamera,whichledtoanecessarycoordinatetransformationinsystemmodelofKalmanfilter.Thedataofshiplengthandshipspeedcouldbeusedtoidentifyoverloadedship.Theproposedmethodhasbeentestedonanumberofmonocularshipimagesequencesandtheexperimentalresultsshowedthatthealgorithmwasrobustandreal-time.

8663-12, Session 3

Ship detection in port surveillance based on context and motion saliency analysisXinfengBao,SvitlanaZinger,TechnischeUniv.Eindhoven(Netherlands);RobWijnhoven,ViNotionB.V.(Netherlands);PeterH.N.deWith,TechnischeUniv.Eindhoven(Netherlands)

Automaticshipdetectioniscrucialforportsurveillanceinshiptrafficmonitoringandmanagement.Itisachallengingandcomplextaskduetothedynamicwaterbackgroundandlargevariationsofshipappearances.Inthispaper,wepresentanovelshipdetectionapproachbasedontheanalysisofcontextualinformationandmotionsaliencyinthevideoframes.Foreachframe,thewaterregionisfirstlabeledusingacombinationofgraph-basedsegmentationwithregion-levelrecognition.Thedetectedwaterregionisthenusedascontextualinformationtolocatethecandidateships,whichareconnectedcomponents(blobs)labeledasnon-water.Weassumethatmovingshipsaretheblobshavingsalientmotion,wherethesaliencymeasureisdefinedbycomparingthemotionofcandidateshipsandthesurroundingbackground.Themainadvantagesofourapproacharethatitrequiresnopriorknowledgeofshipappearancesandcandelineatethecontoursofeachdetectedship.Theexperimentalresultsshowthatourapproachcandetectmovingshipswithlargevariationsincharacteristic(shape,size,speed),withhighrecallandprecisionrates.

8663-13, Session 3

A CubeSat-size surveillance satellite conceptJeremyStraub,DebabrataGhosh,NaimaKaabouch,RonaldA.Fevig,TheUniv.ofNorthDakota(UnitedStates)

Anarchitectureforahigh-resolutionremotesensingsatellitedesignedtobelaunchedintolowEarthorbitispresented.Atanaltitudeof300kilometers,theproposedsatelliteiscapableofproducingimageswithaspatialresolutionofbelowonemeterthroughthecombinationof

limitedonboardopticsandcomputerimageenhancement.

Thesatelliteincorporatespassivestabilizationandlimitedattitudecontrolcapabilities.Itispoweredbysolarpanelsonfivefacesandoperatesinanextremelylimitedmodewhennotoveritsdesignatedtargetarea,allowingitsbatteriestobechargedoverthetwo-thirdsofitsorbitwhereitisnotineclipseanddischargedtoprovidetherequiredpowerwhenoverthetargetand/orgroundstation.Thesatelliteincorporatessignificantonboardcomputationalcapabilitiesforimageprocessing.Anonboardmosaickingsystemisusedtoreducedatatransferrequirements(vianottransmittingoverlappingareas)andprovideaready-for-userwide-areaimage.

Surveillanceapplications(e.g.,military,intelligence,defenseassistancetocivilauthorities,emergencymanagement,andEarthscience)canbenefitdramaticallyfromalow-cost,rapidly-deployable,visiblelightremotesensingsatellite.TheCubeSatformfactorisidealfortheseapplications.Rapidlaunchmechanismsarepoisedtoallowin-the-fieldCubeSatdeployment,intheimminentfuture.

8663-14, Session 3

An edge directed super resolution technique for multimedia applicationsOsborndeLima,SreenathRaoVantaram,SankaranarayananPiramanayagam,EliSaber,RochesterInstituteofTechnology(UnitedStates);KurtR.Bengtson,Hewlett-PackardCo.(UnitedStates)

Surveillanceapplicationsintoday’sworld,requiresimageswithsuperiorvisualqualityandcleardistinguishablefeaturesinordertomakeinformeddecisionswithrespecttosecurityandlawenforcement.Thedemandforhighresolutionimagesindifferentapplicationsalongwithastipulationofreducedcosthaspavedwaytosuperresolutionenhancementtechniques.Superresolutioncanbeviewedasasoftwaresolutionthatimpactshardwarelimitations.Theproposedalgorithmisamultiplepassiterativealgorithmaimedatproducingbetterdefinedimageswithsharperedges.Thebasicpremisebehindthisalgorithmisinterpolatingalongtheedgedirectionandthusreducingblurthatcomesfromtraditionalinterpolationtechniqueswhichoperateacrosstheedgeinsomeinstances.Theprocessutilizesmultipleimagesshiftedbyasubpixelamountandyieldsasinglehigherresolutionimage.Apolynomialwarpingregistrationapproachdeterminestherelativeshiftsbetweentheframeswhichinturnleadsthealgorithmtoproduceahighresolutionimagewithimprovedquality.Thealgorithmdevelopedwasappliedtograyscaleandcolorimagesandshowedfavorableresultsonawidevarietyofdatasetsobtainedfromsurveillanceandprintingapplications.

8663-15, Session 4

A smart camera based traffic enforcement system: experiences from the fieldOliverSidla,GernotLoibner,SLREngineeringOG(Austria)

Railwaycrossings,evenwhensecuredbyautomatedbarriers,poseathreattodriversdayandnight.Especiallydriverswhichtrytocrossrailwaytracksinthelastmomentsbeforeabarriercloseshavearelativelyhighprobabilitytocausesevereaccidents.Inordertomonitorunsecuredrailwaycrossingsandtodetectandfinered-lightjumpers,wehavedevelopedavisionbasedsystemwhichiscompletelyintegratedintoasmartcamera.Triggeredbyanelectricalsignalfromtherailway,thecamerabeginstoobservethecrossing.Assoonasavehiclemovesoverthestoppinglineandthevehicleiswelloverthisline,imagesequencesarerecordedandstoredonthesystem.SystemarchitectureThisworkconcentratesonthesystemarchitectureandsetupintermsofHWandSWmodules.Thesmartcamerasneedtoprovidereadablelicenseplateimagesofthosecarswhichviolatethetrafficrulesattheinstallationsite.Tothisendwehadtotakeseveralmeasurestobeableachievethisgoalduetothedifficultcamerageometryinthefield:

-optical,geometricsetup.IRilluminationandproperfiltersareusedtominimizetheeffectofenvironmentallight.

Conference 8663


-aSWcontrolledimagingcontrolonthecameratooffseteffectsofstrongdirectsunlightandveryhighcontrastscenariosduringthedarkhours.

Algorithmformovingvehicledetection

ThedetectionofmovingvehiclesisbasedonsparseopticalflowWewillreportonresultsoftheLKtrackerwithFASTfeaturesandtheAGHASTcornerpointdetectorascomparison.

Themovingvehicledetectionworksasfollows:

Foreveryvideoframe:

i)LKopticalflowupdate

ii)Removetrackedpointswhichdonotmove

iii)Removetrackedpointswhichdonotmovealongdirectionofthestreetlane

iv)Clusterpointsaccordingtosimilarvelocity,direction,anddistances

v)Validclustersareaddedtoaringbuffer

vi)Ifvalidclustersexistforadurationofmorethan300ms,weflagthedetectionofavehicle

Experimentalresultsanddetectorevaluation

Thedetectioncapabilityandfalsealarmrateofthesystemsisevaluatedusingseveraldatasets:

-videosequencesformthesiteforevaluationoftheabsolutedetectionrate

-actualdetectionsfromtheinstallationsiteinordertoevaluatethefalsealarmrate

-evaluationofonsitevideosequencestoassesthedetectioncapabilityduringnight-time

8663-16, Session 4

Algorithm design for automated transportation photo enforcement camera image and video quality diagnostic check modulesAjayRaghavan,BhaskarSaha,PaloAltoResearchCenter,Inc.(UnitedStates)

Photoenforcementdevicesfortrafficrulessuchasredlights,stops,andspeedareincreasinglybeingdeployedincitiesaroundtheworldtoensuresmoothtrafficflowandpublicsafety.Thesearetypicallyunattendedfieldedsystems,andsoitisimportanttoperiodicallycheckthemforpotentialimage/videoqualityproblemsthatmightinterferewiththeirintendedfunctionality.Thereisinterestinautomatingsuchcheckstoreducetheoperationaloverheadandhumanerrorinvolvedinmanuallycheckinglargecameradevicefleets.Examplesofproblemsaffectingsuchcameradevicesincludeexposureissues,focusdrifts,obstructions,misalignment,downloaderrors,andmotionblur.Furthermore,insomecases,inadditiontothesub-algorithmsforindividualproblems,onealsohastocarefullydesigntheoverallalgorithmandlogictocheckforandaccuratelyclassifyingtheseindividualproblems.Examplesincludecameramisalignmentthatcancausesomesceneelementstogooutoffocusforwide-areascenesordownloaderrorsthatcanbemisinterpretedasanobstruction.Therefore,thesequenceinwhichthesub-algorithmsareutilizedisalsoimportant.Thispaperpresentsanoverviewoftheseproblemsalongwithimageandvideoqualitysolutionstodetectandclassifysuchfaults.

8663-17, Session 4

Full-body occlusion handling in vision-based traffic density countingEvangelosPalinginis,KeitaroKamiya,ManWooPark,IoannisBrilakis,JorgeA.LavalJr.,RandallGuensler,GeorgiaInstituteofTechnology(UnitedStates)

Vision-basedtrafficsurveillancesystemsareamidstmostreliable,inexpensiveandhighlyapplicablemethodologiesinsurveyingtrafficconditions.However,theimplementationofthesestrategiesis

limitedunderoccludingandilluminatingconditions.Giventhattheaforementionedconditionsleadtoeitherover-countedorunder-countedtrafficdata,theproposedmotion-basedframeworkaimsatovercomingtheselimitationsbyemployingamethodologynotassociatedwiththetrajectoryofcars’movement.Withinthemethodology,thebackgroundsubtractionisfollowedbytheHistogramofOrientedGradients(HOG)whichistrainedbylinearSupportVectorMachine(SVM);andHaar-likefeaturestrainedbyAdaboost.Bothareimplementedsoastodetecteachvehicleclosethegroundtruth.Accordingly,theirlocationisestimatedtheautomaticallydetecteddataareconvertedintotherealwordcoordinatesystem.Theproposedmethodwillbetestedontovarious15-minuteslongvideofromdifferentlocationalongtheGeorgiahighwaycorridoratthetimeofadaywhereseveretrafficjamisobserved.Preliminaryresultsindicatethereliabilityandeffectivenessoftheproposedmethodologyinprovidingtrafficdensitycounts.Futureworkmayrelyontheextensionoftheproposedmethodologyinordertotransformthedataoftheremovedvehicleintothedataofthenewvehicleautomatically.

8663-20, Session 5

Vehicle identification in two non-overlapping viewsTzung-YuHsieh,NationalTsingHuaUniv.(Taiwan);Chung-LinHuang,AsiaUniv.(Taiwan)

Vehicleidentificationintwonon-overlappedviewsisanon-trivialproblembecausetheappearanceofthesamevehicleintwocamerasmaynotbesimilar.Directmatchingcanbedifficultduetodramaticillumination/appearanceandaspectchanges.Insteadofdirectlyfindingtheprobabilityofmatchbetweentwoobservationsofthesamevehicleappearingintwonon-overlappingviews,weapplytheso-callednon-metricdistanceembeddingsofvehicleobservationswithinonecameraandtheircorrespondingembeddingsinanothercameratocharacterizethesimilaritiesanddifferencebetweenthevehiclesacrosscameras.Theembeddingarewithrespecttoexamplespre-selectedforeachcamera,andtheoutputistheprobabilityofthesamevehicle.Theembeddingprocessisbasedonconceptthattheobservationsofthevehiclesappearinginthesamecamerawillbesimilarlyaffectedbythesameilluminationandaspectchanges.Therefore,robustalignmentandmatchingwithinthesamecameracanaccuratelycapturethesimilarityanddifferenceamongstthevehicles.Thevehicleinanycameracanberepresentedasavectorofdistancetotherespectiveexampleswithinthatcamera.Thenwecantwosuchvectorrepresentationstoverifywhetherthetwoobservationsrepresentthesamevehicleornot.

Tofindthesimilaritydistance,wefindtheedgemapofthetargetvehicleandcomparewithexamplevehicletofindtheedgedistance.Besidestheedges,wealsocomparethecolorsimilarityofthetargetvehicleandexamples.Afterembedding,wetraintheSupportVectorMachine(SVM)byusingasetofsimilarvectorpairsandnon-similarvectorpairs.Finally,inthetestingprocess,afterembedding,wehaveavectorpairwhichcanbeverifybyapplyingtheSVMtoidentifywhetheritisasimilarvectorpairornot.

Oursystemconsistsof(1)Foregroundvehiclesegmentation,(2)Feature(edgesandcolor)extraction,(3)embeddingvectorgenerationbasedontheexamplepairs,and(4)SVMclassificationbasedontheembeddingvectorpair.Intheexperiments,wedemonstratetheacceptablerecognitionrateforthevehiclepairsappearingintwosceneswithtotallydifferentviewsandilluminations.Ourmethoddemonstratesabetterperformancethantheconventionalthresholdingmethod.

8663-21, Session 5

Vehicle classification for road tunnel surveillanceAndrésFrias-Velázquez,PeterVanHese,AleksandraPi?urica,WilfriedPhilips,Univ.Gent(Belgium)

Vehicleclassificationfortunnelsurveillanceaimstonotonlyretrievevehicleclassstatistics,butalsopreventaccidentsbyrecognizing

Conference 8663


vehiclescarryingdangerousgoods.Inthispaper,wedescribeamethodtoclassifyvehicleimagesthatexperiencedifferentgeometricalvariationsandchallengingphotometricalconditionssuchasthosefoundinroadtunnels.Unlikepreviousapproaches,weproposeaclassificationmethodthatdoesnotrelyonlengthandheightestimationofthevehicles.Alternatively,weproposeanoveldescriptorbasedontracetransformsignaturestoextractsalientandnon-correlatedinformationofthevehicleimages.Also,weproposeametricthatmeasurestheshapecomplexityofthevehiclesbasedoncornerpointdetection.Asaresult,thesefeaturesdescribethevehicle’sappearanceandshapecomplexityindependentlyofthescale,pose,andilluminationconditions.Experimentswithvehiclescapturedfromthreedifferentcamerasconfirmthesaliencyandrobustnessofthefeaturesproposed,achievinganoverallaccuracyof97.5%fortheclassificationoffourdifferentvehicleclasses.Forvehiclestransportingdangerousgoods,ourclassificationschemeachievesanaveragerecallof97.6%ataprecisionof98.6%forthecombinationoflorriesandtankers,whichisaverygoodresultconsideringthesceneconditions.

8663-36, Session 5

Using visual analytics model for pattern matching in surveillance dataMohammadSHabibi,TennesseeStateUniv.(UnitedStates)

Inapersistentsurveillancesystemhugeamountofdataiscollectedcontinuouslyandsignificantdetailsarelabeledforfuturereferences.Inthispaper,weexplainamethodtosummarizevideodataasaresultofidentifyingeventsbasedonsuchtaggedinformation,leadingtoaconcisedescriptionofbehaviorwithinasectionofextendedrecordings.Anefficientretrievalofvariouseventsthusbecomesthefoundationfordeterminingapatterninsurveillancesystemobservations,bothinitsextendedandfragmentedversions.Thepatternsconsistingofspatiotemporalsemanticcontentsareextractedandclassifiedbyapplicationofvideodataminingongeneratedontology,andcanbematchedbasedonanalystsinterestandrulessetforthfordecisionmaking.Theproposedextractionandclassificationmethodusedinthispaperusesquerybyexampleforretrievingsimilareventscontainingrelevantfeatures,andiscarriedoutbydataaggregation.SincestructureddataformsmajorityofsurveillanceinformationourVisualAnalyticsmodelemploysKD-Treeapproachtogrouppatternsinvariantspaceandtime,thusmakingitconvenienttoidentifyandmatchanyabnormalburstofpatterndetectedinasurveillancevideo.Severalexperimentalvideowerepresentedtoviewerstoanalyzeindependentlyandwerecomparedwiththeresultsobtainedinthispapertodemonstratetheefficiencyandeffectivenessoftheproposedtechnique.

8663-37, Session 5

Situation exploration in a persistent surveillance system with multidimensional dataMohammadSHabibi,TennesseeStateUniv.(UnitedStates)

Thereisanemergingneedforfusinghardandsoftsensordatainanefficientsurveillancesystemtoprovideaccurateestimationofsituationawareness.Thesemostlyabstract,multi-dimensionalandmulti-sensordataposeagreatchallengetotheuserinperforminganalysisofmulti-threadedeventsefficientlyandcohesively.ToaddressthisconcernaninteractiveVisualAnalytics(VA)applicationisdevelopedforrapidassessmentandevaluationofdifferenthypothesesbasedoncontext-sensitiveontologyspawnfromtaxonomiesdescribinghuman/humanandhuman/vehicle/objectinteractions.AmethodologyisdescribedhereforgeneratingrelevantontologiesinaPersistentSurveillanceSystem(PSS)anddemonstrateshowtheycanbeutilizedinthecontextofPSStotrackandidentifygroupactivitiespertainingtopotentialthreats.TheproposedVAsystemallowsforvisualanalysisofrawdataaswellasmetadatathathavespatiotemporalrepresentationandcontent-basedimplications.Additionallyinthispaper,atechniqueforrapidsearchoftaggedinformationcontingenttorankingandconfidenceisexplainedforanalysisofmulti-dimensionaldata.Lastly

theissueofuncertaintyassociatedwithprocessingandinterpretationofheterogeneousdataisalsoaddressed.

8663-22, Session 6

Vehicle presence analysis for law enforcement applications and parking lot managementYuriyLipetski,OliverSidla,SLREngineeringOG(Austria)

Theefficientandrobustdetectionofthepresenceofvehiclesinrestrictedparkingareasisimportantforapplicationsinlawenforcementaswellasfortheenforcementofparkingrulesonprivateproperty.Wepresentourworktowardsthisgoalaimedattheapplicationofvehicledetectioninurbanenvironments.Themethodistobesuitedforsmartcameraswhichshavetooperateautonomouslyoverextendedperiodsoftime.Oursystemisdevelopedaspartofabiggerresearcheffortwhichcombinesonsitevehiclepresencedetectionandanassociatedwebmanagementsystemwhichisintendedtomonitor,steerandreroutedeliveryvehicles.Ourdetectionapproachisbasedonappearancebasedmodelingwithareabasedfeaturecomputationandasubsequentclassificationalgorithm.TheworkpresentedherewillshowevaluationresultsontositesinthecityofViennafromtheoperationoftwosmartcamerasoveranextendedperiodoftime.

8663-23, Session 6

Video-based parking occupancy detection MichaelDeruytter,KevinAnckaert,TraficonN.V.(Belgium)

ITStechnologiesareincreasinglyusedfornear-roadapplications,suchasparkingareamanagement.Atleasttwousecasesshowthatthereisaneedforasmarterapproachtowardsparkingmanagement.Firstofall,becauseofashortageoftruckparksalonghighwaystruckdriversareforcedtoparkonthehardshoulder,whichincreasestheriskofaccidents.Secondly,citiesarecopingwithlargeamountsofvehicletraffic,generatedbymotoriststhataredrivingaroundinsearchforaparkingsport.Inbothcases,parkingguidanceandmanagementsystemscanalleviatetheproblem.Inthecaseoftrucks,suchsystemsallowtruckdriverstodynamicallyplantheirrouteandpreventthemfromhavingtoparkonthehardshoulder.Inthecaseofcitypassengertraffic,motoristscansignificantlyreducetheirtraveldistanceinsearchforaparkingspotbyrelyingoninformationgeneratedbyaparkingguidancesystem.Acriticalfactorofparkingsystemsisoccupancydetection.Varioustechnologies,suchasvideodetection,microwaveradardetectionandelectromagneticloops,canbeusedtodetectvehiclepresence.Inthispaper,wepresentthebenefitsofavideo-basedapproachfortruckandcarparkoccupancydetection.

8663-24, Session 6

Motorcycle detection and counting using stereo camera, IR camera, and microphone arrayBoLing,MigmaSystems,Inc.(UnitedStates);DavidR.P.Gibson,FederalHighwayAdministration(UnitedStates);DanMiddleton,TexasA&MTransportationInstitute(UnitedStates)

Thedetection,classification,andcharacterizationarethekeytoenhancingmotorcyclesafety,motorcycleoperationsandmotorcycletravelestimation.AveragemotorcyclefatalitiesperVehicleMileTraveled(VMT)arecurrentlyestimatedat30timesthoseofautofatalities.Althoughithasbeenanactiveresearchareaformanyyears,motorcycledetectionandclassificationstillremainachallengingtask.WorkingwithFHWA,wehavedevelopedahybridmotorcycledetectionandcountingsystemusingasuiteofsensorsincludingstereocamera,thermalIRcameraandunidirectionalmicrophonearray.TheIRthermalcameracancapturetheuniquethermalsignaturesassociatedwithmotorcycle’sexhaustpipesthatoftenshowbrightelongatedblobsinIRimages.Thestereocamerainthesystemisusedtodetectthe

Conference 8663


motorcyclistwhocanbeeasilywindowedoutinthestereodisparitymap.Ifmotorcyclistisdetectedthroughhisorher3Dbodyrecognition,motorcycleisdetected.Microphonesareusedtodetectmotorcyclesthatoftenproducelowfrequencyacousticsignals.Allthreemicrophonesinthemicrophonearrayareplacedinstrategiclocationsonthesensorplatformtominimizetheinterferencesofbackgroundnoisesfromsourcessuchasrainandwind.Testresultsshowthatthissystemhasanexcellentperformance.

8663-25, Session 6

Vehicle-driven video compression/decompression for fast and efficient search in large video databasesOrhanBulan,XeroxResearchCtr.(UnitedStates);EdgarBernal,RobertLoce,XeroxResearchCenter(UnitedStates);WenchengWu,XeroxResearchCtr.(UnitedStates)andXeroxResearchCtr.(UnitedStates)

Videocamerashavebeencommonlydeployedathighways,tollbooths,trafficlights,stopsigns,orlocalroadsbyauthorizedentitiesformonitoringtrafficorlawenforcement.Thevideoscapturedbythesecamerasaretypicallycompressedandstoredinlargedatabases.Performingsearchesforaspecificvehicle(e.g.,vehicleinvolvedinacrime)withintheselargedatabasesofcompressedvideosisoftenrequired.Inthispaper,weproposeavideocompressionanddecompressionalgorithmforfastandefficientvehiclesearchinginlargevideodatabases.Theproposedalgorithmselectsthereferenceframes(i.e.,I-frames)basedonavehiclepositioninthescenewhilecompressingavideosequence.Aspecificvehicleinthecompressedvideoisthensearchedonlyacrossthereferenceframes,whichdonotrequiretodecompressthewholevideosequence.Ourexperimentalresultsacrossthevideoscapturedinalocalroadshowsthattheproposedalgorithmsignificantlyreducesthespaceforsearchingavehicleinacompressedvideocapturedespeciallyinlighttraffic.

8663-26, Session 7

Vehicle occupancy detection camera position optimization using design of experiments and standard image referencesPeterPaul,MartinHoover,MojganRabbani,XeroxCorp.(UnitedStates)

Onechallengeinvehicleoccupancydetectionisachivingcameralineofsightintoavehiclecabinordertodetectoccupants.Seats,pillars,rooftops,vehiclebodyframes,andotheroccupantsaresomeoftheoccludingitems.Inaddition,differingvehicleshapesandsizes,aswellasvariationsinoccupantsalsoleadtodifficulties.Anapproachtocamerasetupforvehicleoccupancydetectionwhichisrobusttotheseeffectsisproposedhere.Themethodusesrigrousstatisticaltechniquesfromthefieldofdesignofexperiments(DOE).Inaddition,thetechniqueofusingastandardimagereference(SIR)isusedtodetermineaquantifiedoutputtobeoptimized.Theapproachisperformedonavehiclerearseatoccupancydetectiontask.Experimentalresultsshowsignificantimprovementoverthebaseline.

8663-27, Session 7

HOV/HOT occupancy detection systemWayneDaley,JackW.Wood,JohnM.Stewart,ColinUsher,OmarArif,JohnTurgeson,ErinHanson,GeorgiaTechResearchInstitute(UnitedStates)

OnetechniquetobetterutilizeexistingroadwayinfrastructureistheuseofHOVandHOTlanes.Technologytomonitortheuseoftheselaneswouldassistmanagersandplannersinefficientroadwayoperation.Therearenoavailableoccupancydetectionsystemsthatperformatacceptablelevelsofaccuracyinpermanentfieldinstallations.Themaingoalofthisresearcheffortistoassessthe

possibilityofdeterminingpassengerusewithimagingtechnology.Thisisespeciallychallengingbecauseofrecentchangesintheglasstypesusedbycarmanufacturerstoreducethesolarheatloadonthevehicles.Wedescribeinthisresearchasystemtousemulti-planeimagingwithappropriatewavelengthselectionforsensingpassengersinthefrontandrearseatsofvehiclestravellinginHOV/HOTlanes.Theprocessofdeterminingthegeometricrelationshipsneeded,thechoiceofilluminationwavelengths,andtheappropriatesensorsaredescribed,takingintoaccountdriversafetyconsiderations.Thepaperwillalsocoverthedesignandimplementationofthesoftwareforperformingthewindowdetectionandpeoplecountingutilizingbothimageprocessingandmachinelearningtechniques.Theintegrationofthefinalsystemprototypewillbedescribedalongwiththeperformanceofthesystemoperatingatarepresentativelocation.

8663-28, Session 7

Joint histogram between color and local extrema patterns for object trackingSubrahmanyamMurala,Q.M.J.Wu,Univ.ofWindsor(Canada);BalasubramanianRaman,R.P.Maheshwari,IndianInstituteofTechnologyRoorkee(India)

Inthispaper,anewalgorithmmeantforobjecttrackingapplicationisproposedusinglocalextremapatterns(LEP)andcolorfeatures.Thestandardlocalbinarypattern(LBP)encodestherelationshipbetweenreferencepixelanditssurroundingneighborsbycomparinggraylevelvalues.TheproposedmethoddiffersfromtheexistingLBPinamannerthatitextractstheedgeinformationbasedonlocalextremabetweencenterpixelanditsneighborsinanimage.Further,thejointhistogrambetweenRGBcolorchannelsandLEPpatternshasbeenbuildwhichisusedasafeaturevectorinobjecttracking.TheperformanceoftheproposedmethodiscomparedwithNingetal.onthreebenchmarkvideosequences.TheresultsafterbeinginvestigatedproposedmethodshowasignificantimprovementinobjecttrackingapplicationascomparedtoNingetal.

8663-29, Session 8

Adaptive real-time road detection using VRay and A-MSRG in complex environmentsSunHeeWeon,SungIlJoo,HyungIlChoi,SoongsilUniv.(Korea,Republicof)

Thispaperproposesanadaptivedetectionmethodfordetectingroadregionsthathaveambiguousboundarieswithinnaturalimages.Theproposedmethodachievesreliablepartitioningoftheroadregionwithinanaturalenvironmentwherenoiseispresentthroughthefollowingtwostages.Inthefirststage,weseparateoutcandidateregionsoftheroadbydetectingtheroad’sboundarythroughtheRadialregionsplitmethodusingVRay(Vanishingpoint-constrainedray).Inthesecondstage,weapplysocalledAdaptive-MultipleSeedRegionGrowing(A-MSRG)approachintotheseparatedcandidateregioninordertoidentifytheroadregioninrealtime.TheA-MSRGisanenhancedversionoftheSeedRegionGrowing(SRG).Forperformanceevaluation,thisstudyassessedefficiencybasedontheresultsofregiondetectionachievedthroughtheproposedcombinationoftheRadialregionsplitmethodandA-MSRG.WealsoconductedcomparisonsagainsttheexistingSRGandMSRGmethodstoconfirmthevalidityoftheproposedmethod.

Themaintaskofthefirststageistodetecttheroadboundaryandthecandidateregioninrealtimewithinacomplexnaturalenvironment.ThistaskisaccomplishedthroughtheRadialregionsplitmethod,whichincludesthreesteps.Theyarethestepofadaptiveedgedetection,thestepofvanishingpointdetectionusingHoughTransformandthestepofdetectingtheroadboundaryusingVRay.

ThesecondstageisthestageofroadregiondetectionusingtheA-MSRGmethod.Inthisstage,theroadcandidateregionformedbytheextractedoptimalboundariesisadaptivelypartitionedintoblocksusingtheviewpoint’saltitudetoextractthemultipleseedblocks.Thenweapplythelocalweightedvaluebasedonthemeasurementoftheblock’snon-uniformityandweusetheA-MSRGmethodformerging

Conference 8663


andexpandingtheblockregiontoperformtheprocessofadaptivelynavigatingtheroadregion.Toensureitsstrongperformanceinresponsetofeaturedistortionsthatarecausedbychangesinperspectivedependingonthepointofviewandinresponsetochangesintheformofthesidewalk.

8663-30, Session 8

Intensity estimation method of LED array for visible light communicationTakanoriIto,TomohiroYendo,NagaokaUniv.ofTechnology(Japan);AraiShintaro,KagawaNationalCollegeofTechnology(Japan);TakayaYamazato,HirakuOkada,ToshiakiFujii,NagoyaUniv.(Japan)

Thispaperfocusesonaroad-to-vehiclevisiblelightcommunication(VLC)systemusingLEDtrafficlightasthetransmitterandcameraasthereceiver.ThetrafficlightiscomposedofahundredofLEDsontwodimensionalplain.Inthissystem,dataissentastwodimensionalbrightnesspatternsbycontrollingeachLEDofthetrafficlightindividually,andtheyarereceivedasimagesbythecamera.Here,thereareproblemsthatneighboringLEDsonthereceivedimagearemergedduetolessnumberofpixelsincasethatthereceiverisdistantfromthetransmitter,and/orduetoblurringbydefocusofthecamera.Becauseofthat,biterrorrate(BER)increasesduetorecognitionerrorofintensityofLEDs

Tosolvetheproblem,weproposeamethodthatestimatestheintensityofLEDsbysolvingtheinverseproblemofcommunicationchannelcharacteristicfromthetransmittertothereceiver.TheproposedmethodisevaluatedbyBERcharacteristicswhichareobtainedbycomputersimulationandexperiments.Intheresult,theproposedmethodcanestimatewithbetteraccuracythantheconventionalmethods,especiallyincasethatthereceivedimageisblurredalot,andthenumberofpixelsissmall.

8663-31, Session 8

An improved background segmentation method for ghost removalsWaqasHassan,PhilipBirch,RupertYoung,ChrisChatwin,Univ.ofSussex(UnitedKingdom)

Videosurveillancehasbecomecommonforthemaintenanceofsecurityinawidevarietyofapplications.However,theincreasinglylargeamountsofdataproducedfrommultiplevideocamerafeedsismakingitincreasinglydifficultforhumanoperatorstomonitortheimageryforactivitieslikelytogiverisetothreats.Thishasledtothedevelopmentofdifferentautomatedsurveillancesystemsthatcandetect,trackandanalyzevideosequencesbothonlineandofflineandreportpotentialsecurityrisks.Segmentationofobjectsisanimportantpartofsuchsystemsandnumerousbackgroundsegmentationtechniqueshavebeenusedinliterature.Onecommonchallengefacedbythesetechniquesisadaptionindifferentlightingenvironments.Anewimprovedbackgroundsegmentationtechniquehasbeenpresentedinthiswherethemainfocusistoaccuratelysegmentpotentiallyimportantobjectsbyreducingtheoverallfalsedetectionrate.Historicedgemapsandtrackingresultsareanalyzedforthispurpose.Theideaistoobtainanuptodateedgemapofthesegmentedregionhighlightedasforegroundareasandcomparethemwiththestoredresults.Theedgemapsareobtainedusinganoveladaptiveedgeorientationbasedtechniquewhereorientationoftheedgeisused.Experimentalresultshaveshownthatthediscussedtechniquegivesmorethana95%detectionsuccessratewhentestedindifferentindoorandoutdoorenvironments.

8663-32, Session 9

Retail video analytics: an overview and survey (Invited Paper)QuanfuFan,PrasadGabbur,NormanHaas,SharathchandraPankanti,HoangTrinh,IBMThomasJ.WatsonResearchCtr.(UnitedStates)

Todayretailvideoanalyticshasgonebeyondthetradi-tionaldomainofsecurityandlosspreventionbyprovidingretailersinsightfulbusinessintelligencesuchasstoretrafficstatisticsandqueuedata.Suchinformationallowsforen-hancedcustomerexperience,optimizedstoreperformance,reducedoperationalcosts,andultimatelyhigherprofitabil-ity.Thispapergivesanoverviewofvariouscamera-basedapplicationsinretailaswellasthestate-of-the-artcomputervisiontechniquesbehindthem.Italsopresentssomeofthepromisingtechnicaldirectionsforexplorationinretailvideoanalytics.

8663-33, Session 9

Video-CRM: understanding customer behaviors in storesIsmailHaritaoglu,Anvato,Inc.(UnitedStates);MyronD.Flickner,DavidBeymer,IBMAlmadenResearchCtr.(UnitedStates)

Thispaperdescribestworeal-timecomputervisionsystemscreated10yearsagothatdetectandtrackpeopleinstorestoobtaininsightsofcustomerbehaviorwhileshopping.Thefirstsystemusesasinglecolorcameratoidentifyshoppinggroupsinthecheckoutline.Shoppinggroupsareidentifiedbyanalyzingtheinter-bodydistancescoupledwiththecashier’sactivitiestodetectcheckouttransactionsstartandendtimes.Thesecondsystemusesmultipleoverheadnarrow-baselinestereocamerastodetectandtrackpeople,theirbodypostureandpartstounderstandcustomerinteractionswithproductssuchas”customerpickingaproductfromashelf”.Inpilotstudiesbothsystemsdemonstratedreal-timeperformanceandsufficientaccuracytoenablemoredetailedunderstandingofcustomerbehaviorandextractactionablereal-timeretailanalytics.

8663-35, Session 9

Human object articulation for CCTV video forensicsIffatZafar,MuhammadFraz,EranEdirisinghe,LoughboroughUniv.(UnitedKingdom)

Videoforensicsapplicationsoftenrequirethearticulationofthekeyobjectofinterest,namelyahumanobject.Forexampleitmayberequiredtoidentifypersonwearingaredcolouredtopatthehighestlevelofarticulation.Howeveritmayalsobeusefultoperformafurtherdetailedarticulationsuchaspersonwearingashortsleeved,whiteandbluetopthatincludestext(oralogo)andablacktrouser.

ArticulatinghumanobjectsinvideoforensicswhenperformedinCCTVfootageischallengingduetothepresenceofnoise,limitationsofresolution,changinglevelsandsourcesofsceneillumination,variationsofdirectionofviewetc.Weproposetheuseofimagepre-processingpipelinethatincludesamultitudeofenhancementoperationsthatdealswiththeabovechallengespriortoperformingobjectarticulation.Theobjectsofinterestsareidentifiedbymovingobjectdetectionfollowedbythebinaryclassificationofsuchobjectsintohumansandnon-humans.Theidentifiedhumanobjectsarefurtherarticulatedbydividingthehumanfigureintohead,upperbodyandatorsoarea.Theheadareaischeckedfordetectingthepresenceofskintoneandfacialfeaturestoidentifywhetherthepersoniswalkingtowardsthecamerafromthecameraoralternativelyidentifythecolourofhair.Thetoppartofbodyisclusteredintodominantcoloursthatareusedtodeterminethepresenceofskintonetoidentifyindividualswearingshortsleevedshirtsandalsotodeterminemultiplecoloursandpresenceoftextinthetopbeingworn.Asimilaranalysisiscarriedoutforthetorsoareaenablingdetailedarticulation.

Weprovidedetailedexperimentalresultsandanin-depthanalysisprovidingsubjectiveandobjectivemetricstoprovelevelofperformanceachievedbytheproposedimageprocessingapplicationonrealCCTVfootage.

Conference 8663


Conference 8664: Imaging and Printing in a Web 2.0 World IVMonday-Tuesday4–5February2013

Part of Proceedings of SPIE Vol. 8664 Imaging and Printing in a Web 2.0 World IV

8664-1, Session 1

Videos in tangible productsReinerFageth,RalfWieting,CeWeColorAG&Co.OHG(Germany)

Highqualityvideoscanbetakenwithnearlyeverycamera,digitalpointandshootcameras,DSLRsaswellassmartphones.HighqualitymeansvideowithHDresolutionsaswellashighqualitywithrespecttostory-telling.Thesevideoscanbeviewedonnearlyeveryscreenandeasilysharedviasocialnetworks.Alloftheseusecasesdonotallowtomonetizethelargenumberofvideos(snippets)taken.Thesevideosarenotlinkedtootherpersonaldatasuchaspictures,maps,textdescriptionsoranykindoftangibleproducts.

Particularlythesevideoscannotbecombinedwiththecurrentstate-of-the-artstorytelling,printedphotobooks.Intheproposedpaperanapproachofimplementinguser-definedpicturesoutofthevideoimplementedinthephotobooktoreachanevenmorecompellingstoryaswellasimplementingthewholevideoviaQRcodesprintedinthephotobookisdescribed.TheQRtechnologyisoftenusedtolinkdirectlytoawebsite,butnottopersonalizedcontent.TheimplementationisdoneviaadesktopbasedsoftwaretogenerateCEWEPHOTOBOOKS.Additionallytheproposedtechnologyalsoenablesprintedproductstobeeasilypersonalizedwithpersonalmessages,e.g.greetingcardsorbusinesscards.

8664-2, Session 1

HP Smart Marketing Suite: the future of dynamic content publishing GaloGimenez,EduardoArgollo,Hewlett-PackardCo.(Spain);JitendraKumar,G.K.Tejaswi,Hewlett-PackardLabs.India(India)

Inordertoproduceanddeliverenterprisedocuments,theenterprisepublishingspacehandlesawiderangeandagreatloadofstructured,semi-structuredandnon-structuredcontent.Becauseoftheirsingletouch-pointfocus,siloprocessorientationordatacentralizedorganization,existentContentManagementSystemsfallbehindonbeingacompletepublishingsolution,especiallywhenfocusingonmediumandlargeenterprises.InthispaperweintroduceHPSmartMarketingSuite,acontentcentricplatformthatdoescontentorganizationandpublishingforlargeenterprisestherightway.HP-SMSfirstclientisHPitselfwhereitprovidesreal-timeaccesstomorethan7millionassets(includingdigitalmediaandstructuredproductioninformation)storedinmultipledistributedrepositories.HPSMSservices30thousandinternaland240thousandsexternalusers.ItisestimatedthatHPwillsaveover$100millionduring3yearswiththedeploymentofHPSMS.

8664-3, Session 1

TopicSelect: a platform for automated publications of Web contentIldusAhmadullin,Hewlett-PackardLabs.(UnitedStates);YuZhang,Xiao-WeiWu,Hewlett-PackardChinaCo.,Ltd.(China);JerryLiu,Hewlett-PackardLabs.(UnitedStates)

NoAbstractAvailable

8664-4, Session 1

The CloudBoard Research Platform: an interactive whiteboard for corporate usersJohnBarrus,EdwardL.Schwartz,RicohInnovations,Inc.(UnitedStates)

NoAbstractAvailable

8664-5, Session 2

Automatic page composition with nested sub-layoutsAndrewA.Hunter,Hewlett-PackardLabs.(UnitedKingdom)

Thispaperextendsourpreviousworkonthelayoutofcroppedimages.Thenewworkdemonstratesasolutiontothemoregeneralchallengeoflayingoutcontentwithnoinherentsizeorshape.Thesolutionisparticularlyeffectivefordeeplynestedcontentsuchasmagazinepageswherelayoutsoftencontainotherlayoutsincomplexcontenthierarchies.

Aswedemonstratedatlastyear’sconference,someimages,especiallyconsumerphotographs,arebesttreatedasmutableelementswithaspectratiosandpotentiallayoutsthataremutuallydependent.Whenelementsbeinglaidoutaremutablesub-layoutsofsubsidiarycontent,theissuesaresimilar.Ateachlevelofadocumenthierarchy,thespaceallocatedfortheelementsdependsbothonthelayoutrequirementsfortheotherelementsatthesamelevelandalsoontheinternallayoutoptionsoftheelementsatthenextleveldownthehierarchy.

Wewilldescribeasolutioninwhichelementsprovidedatatothenextlevelupthehierarchythatenablesaninitialallocationofspacetobalancethespacedistributionfollowedbyasecondaryallocationtooptimizethelayoutaesthetics.

8664-6, Session 2

Graph-based layout analysis for PDF documentsCanhuiXu,ZhiTang,XinTao,YunLi,CaoShi,PekingUniv.(China)

Toincreasetheflexibilityandenrichthereadingexperienceofe-bookonsmallportablescreens,inthispaper,agraphbasedmethodisproposedtosegmentthePDFpageswhicharedifficulttocopewithwhenusingtraditionalXYcutandwhitespacemethods.Unliketheimagebaseddocumentanalysis,digitalborndocumenthasitsinherentadvantageslikerepresentingcharactersandimagesinexplicitform,whichcanbestraightforwardlyexploitedforlayoutanalysis.Tointegratetheapplicationofimage-baseddocumentlayoutanalysismethodsandtheinherentmeta-datastructureinformationprovidedbydigital-borndocumentslikePDF,alltheelementsineachpageareprocessed,includingtext,image,graphicelementsordecorations.Graph-basedmethoddevelopedcancapturecertainperceptuallyimportantnon-localimagecharacteristicsforsegmentationpurposes.InsteadofusingpixelpointsorCCstate,eachpageelementorprimitivecorrespondingtoavertexareconstructedinthegraph.Allthetextelementscanbeconnectedbyestablishinganeighborhoodsystem.Delaunaytessellationisappliedinthisregard.ImplementationofthismethodbyDelaunaytessellationandfeature-basedclassificationispresented.Graphbasedmethodservingasabottom-upmethodisusedtogroupthepageelementsaccordingtoedgeweightsliketheproximityandcompactness.Thefonttypeandfontsizeareusedtoinatop-downmannertoobtainthedesiredblock


segmentation.Hierarchically,textlinedetectionisthenaccomplishedbystatisticallyfilteringtheedgeswithorientation.TheexperimentalresultsoncomplexlayoutpagesofPDFbookshaveshownsatisfactoryperformance.

8664-7, Session 2

Learning probabilistic document template models via interactionIldusAhmadullin,NiranjanDamera-Venkata,Hewlett-PackardLabs.(UnitedStates)

NoAbstractAvailable

8664-8, Session 2

Interaction paradigms for photo book layout on mobile devicesPhilippSandhaus,CeWeColorAG&Co.OHG(Germany)

Gettingfromacollectionofphotostothefinalprintedphotobookmeansalotofeffortfortheenduser:Photoshavetobeselected,beplacedontheindividualpagesandhavetobelaidoutinaniceandpleasingway.Withtheadventofmobiledevicesthisprocessisalsogettingmobile.Herewearefacedwithacoupleofchallenges,e.g.limitedscreensizeandlessfine-grainedmanipulationcapabilitiesofscreenobjectsbecauseoftouch-basedinteraction.Ontheotherhand,touch-basedinteractionisgenerallyconsideredasmuchmoreintuitiveastheobjectsonthescreenaredirectlymanipulatedincontrasttothemouseasaninputdevice.Thechallengeistocompensatetheseweaknesses.

Withoursystemtheuserisabletoveryefficiently,roughlyplacetextandphotosonthesinglephotobookpagesandanautomaticlayoutsystemautomaticallyadjuststheseroughlayoutswiththehelpofanautomaticlayoutenginebasedonasetofcarefullyselectedlayoutrules.Thegoalofthissystemistoprovidetheuserwithaneasywaytoveryfastdesignaphotobookonamobiledeviceinanintuitiveway.

8664-9, Session 3

Non-Manhattan layout extraction algorithmAzizaSatkhozhina,PurdueUniv.(UnitedStates);IldusAhmadullin,Hewlett-PackardLabs.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates)

NoAbstractAvailable

8664-10, Session 3

Automatic design of colors for magazine covers AliJahanian,JanP.Allebach,PurdueUniv.(UnitedStates);JerryLiu,QianLin,DanielTretter,EamonnO’Brien-Strain,SeungyonC.Lee,NicLyons,Hewlett-PackardLabs.(UnitedStates)

Weproposearecommendationsystemfortheautomaticdesignofcolorsfortheautomaticprocessofmagazinecoverdesign.Inthedesignofamagazinecover,makingasetofdecisionsregardingthecolordistributionofthecoverimageandthecolorsofothergraphicalandtexturalelementsisconsideredtobethecolordesignconcept.Thisconceptaddressesanumberofsubjectivechallenges:whatsetofcolorsisaestheticallypleasingyetalsocontributestothepurposeofthedesign,thelegibilityoftextualelements,andtheconsistencyofstylewiththeclassofmagazine.Oursolutionstoautomaticcolordesigncomprisethequantificationofthesechallengesbydeployinganumberofwell-knowncolortheories.Furthermore,weusethesecolortheoriesasscalestoanalyzethecolordesignofsomewell-knownmagazinecoverstodelineaterulesforautomaticcolordesign.Asa

result,oursystemallowstheusertouploadanumberofimages,andittellshim/herwhichimageisbetter,color-wise,foradesign.Thisrecommendationisdonebyasemanticextractionofa3-colorpalettefromtheimage.Basedonthiscolorpalette,thesystemalsomakesanumberofdecisionsforthecolorofthetext.

8664-11, Session 3

Content-aware automatic cropping for consumer photosHenryTang,DanielTretter,QianLin,Hewlett-PackardLabs.(UnitedStates)

Consumerphotosaretypicallyauthoredonce,butneedtoberetargetedforreuseinvarioussituations.Theseincludeprintingaphotoondifferentsizepaper,changingthesizeandaspectratioofanembeddedphototoaccommodatethedynamiccontentlayoutofwebpagesordocuments,adaptingalargephotoforbrowsingonsmalldisplayssuchasmobilephonescreens,andimprovingtheaestheticqualityofaphotothatwasbadlycomposedatthecapturetime.Inthispaper,weproposeanovel,effective,andcomprehensivecontent-awareautomaticcropping(hereafterreferredtoas``autocrop’’)methodforconsumerphotostoachievetheabovepurposes.Ourautocropmethodcombinesthestate-of-the-artcontext-awaresaliencydetectionalgorithm,whichaimstoinferthelikelyintentofthephotographer,andthe``branch-and-bound’’efficientsubwindowsearchoptimizationtechnique,whichseekstolocatethegloballyoptimalcroppingrectangleinafastmanner.Unlikemostcurrentautocropmethods,whichcanonlycropaphotointoanarbitraryrectangle,ourautocropmethodcanautomaticallycropaphotointoeitherarectangleofarbitrarydimensionsorarectangleofthedesiredaspectratiospecifiedbytheuser.Theaggressivenessofthecroppingoperationmaybeeitherautomaticallydeterminedbythemethodormanuallyindicatedbytheuserwithease.Inaddition,ourautocropmethodisextendedtosupportthecroppingofaphotointonon-rectangularshapessuchaspolygonsofanynumberofsides.Itmayalsobepotentiallyextendedtoreturnmultiplecroppingsuggestions,whichwillenablethecreationofnewphotostoenrichtheoriginalphotocollections.Ourexperimentalresultsshowthattheproposedautocropmethodinthispapercangeneratehigh-qualitycropsforconsumerphotosofvarioustypes.

8664-12, Session 3

Automatic photobook: focusing on image selection and image layout based on content and compositionShao-FuXue,PurdueUniv.(UnitedStates);HenryTang,DanielTretter,QianLin,Hewlett-PackardLabs.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates)

Automaticphotobookgenerationshouldconsiderthediversity,coverage,aestheticsofthephotosbeingselectedandplacementoftheselectedphotosonthephotobook.Itisimportantthatthealgorithmidentifiesthephotosthatareimportanttotheuserandthealgorithmgeneratesthelayoutswhichareaestheticallypleasing.

Inthisproject,weproposedacompleteworkflowforgeneratingautophotobooks.Basedontheproposedsimilaritymeasure,whichtakesaccountofthetimeofexposure,face,andthemeinformation,thealgorithmautomaticallyfindsrepresentativephotosthatcovertheconceptsoftheoriginalinputcollectionofphotos.Eachoftheselectedbasephotogoesintoonepageofthealbum.Thenafewsimilarphotosareselectedwithrespecttoeachofthebasephotoasneighboringphotos.

Forallthepotentialimagestoputonthesamepage,dominantsemanticlinesandsalientregionsareidentifiedinthephotos.Thesevisualcluesareusedtoidentifythemajorphototoputonthedominantpositionofthepage,aswellastodeterminewhethertoputotherpotentialphotosontothepage.Thedecisionismadefromthevisualpath,visualdistanceandotheraestheticsmeasures.

Conference 8664


8664-13, Session 4

Visualization and exploration for recommender systems in enterprise organizationZachiKarni,Hewlett-PackardLabs.IsraelLtd.(Israel);LiorShapira,MicrosoftResearch,(UnitedStates)

Wepresentamethodforvisualization,navigationandexplorationofinformationassetsinenterpriseorganizations.Therehasbeenaproliferationofinformationassetssuchas,emails,documents,presentationsandmoreinenterprises.Toolsarerequiredwhichassistusersinlocatingrelevantdatafortheirwork,inthecontextoftheirorganization.Severalmethodssuchassearchenginesandrecommendersystemshavebeendevelopedtohandlebig-dataandefficientlylocaterelevantassets.Inthesetheresultsareusuallypresentedwithnocontextandtheuserhasnoabilitytoinfluence,refineortofurtherexplore.Inourvisualizationmethodausercanseetheassetslocatedorrecommendedforhim,butinadditionalsogainsanunderstandingwhy.Focusingonauser,werenderaplanarvisualizationlistingasetofrecommendedassets.Theassetsareorganizedsuchthatsimilarassetsresidenearbyonthescreen,centeredaroundreal-timegeneratedcategories.Weuseacombinationoficonography,textandtagclouds,withmaximaluseofscreenrealestate,andkeepitemsfromoverlappingtoproduceourresults.Oursystemenablesvisualexplorationoftheresultsandletstheuserrefinethequiresaccordingtohisneeds.

8664-14, Session 4

Augmented distinctive features with color and scale invarianceYanLiu,XiaoqingLu,YeyangQin,ZhiTang,PekingUniv.(China);JianboXu,PekingUniv.FounderGroupCorp.(China)

Forobjectswiththesametexturebutdifferentcolors,itisdifficulttodiscriminatethemwiththetraditionalscaleinvariantfeaturetransformdescriptor(SIFT),becauseitworksforgrayscaleimagesonly.Thusitisimportanttokeepahighprobabilitytomakesurethattheusedkeypointsarecouplesofcorrectpairs.Inaddition,meandistributedkeypointsaremuchmoreexpectedthanoverdenseandclusteredkeypointsforimagematchandotherapplications.Inthispaper,weanalyzethesetwoproblems.First,weproposeascale-invariantmethodtoextractamoremeandistributedkeypointsrelyingonilluminationintensityinvariancebutobjectreflectancesensitivityvariancevariable.Second,wemodifythekeypoint’scanonicaldirectionaccumulatederrorbydispersingeachpixel’sgradientdirectiononarelativedirectionaroundthecurrentkeypoint.Atlast,webuildthedescriptorsonaGaussianpyramidandmatchthekeypointswithourenhancedtwo-waymatchingregulations.ExperimentsareperformedontheAmsterdamLibraryofObjectImagesdatasetandsomesyntheticimagesmanually.TheresultsshowthattheextractedkeypointshavebetterdistributioncharacterandlargernumberthanSIFT.Thefeaturedescriptorscanwelldiscriminateimageswithdifferentcolorbutwiththesamecontentandtexture.

8664-15, Session 4

Video text tracking and recognitionBaokangWang,ChangsongLiu,XiaoqingDing,TsinghuaUniv.(China)

NoAbstractAvailable

8664-16, Session 4

Image vectorization using blue-noise samplingJiaojiaoZhao,JieFeng,BingfengZhou,PekingUniv.(China)

Abstract:

Currentimagevectorizationtechniquesmainlydealwithimageswithsimpleandplaincolors.Forfull-colorphotographs,manydifficultiesstillexistinobjectsegmentation,featurelineextraction,andcolordistributionreconstruction,etc.

Inthispaper,weproposeahigh-efficiencyimagevectorizationmethodbasedonblue-noisesampling.Themethodfirstgeneratesasetofsamplingpointsontheimageplanebyanimprovedadaptiveerror-diffusionsamplingalgorithm.Thepointsethasidealblue-noisepropertyandcanwellpreservethefeaturesintheimage.Thenafteratriangulationonthispointset,theinformationsuchascolorsandgradientintheimagearerecordedintheresultingtrianglemeshtoformavectorimage.Aftercertainimageediting,e.g.scalingortransforming,thewholeimagecanbereconstructedbycolorinterpolatinginsideeachtriangle.

Experimentsshowthatthemethodhashighperformingefficiencyandabilitiesinfeature-preservingandimagereconstruction.Thisimagevectorizationmethodwillbringbenefitstomanyapplications,e.g.imagecompressing,editing,transmittingandresolutionenhancement,especiallyintheenvironmentswithlimitedcomputationalresourcessuchasmobiledevices.

8664-17, Session 4

Non-iterative normalized feature extraction in large viewpoint variances based on PCA of gradientJianZhang,SongCao,DiWen,TsinghuaUniv.(China)

Effectivelocalfeatureextractionisfundamentalforcontent-basedimageanalysisandretrievalinmultimediaapplication.However,itisdifficulttoachievedistinguishablelocalfeaturesinlargeviewpointvariances.Inthispaper,weproposeanovelnon-iterativeapproachofnormalizedfeatureextractioninlargeviewpointvariances,whichadaptslocalregionstorotation,scalevarianceandrigiddistortionfromaffinetransformation.Ourapproachisbasedontwokeyideas:1)Localizationandscaleselectioncanbedirectlyachievedwiththecentroidandcovariancematrixofpixelsdistributioninalocalregion.2)PrincipalComponentAnalysis(PCA)ongradientgivesinformationontexture,thusitcanbeusedtogetaresampledregionwhichisisotropicintermsofvarianceofgradient.Experimentsdemonstratethatournormalizedapproachhassignificantimprovementonfeatureextractioninlargeviewpointvariances.

8664-18, Session 5

Geometric enumerated chrominance watermark embed for spot colorsAlastairM.Reed,RobertLyons,JohnStach,DigimarcCorp.(UnitedStates)

Themajorityofpackagingisprintedusingspotcolorsmainlyforreasonsofcostandcolorconsistency,andtoachieveawidecolorgamutonthepackage.Mostwatermarkingtechniquesaredesignedtoembedawatermarkineithercyan,magenta,yelloworblackforprintedimagesorred,greenandbluefordigitalimagesthatarebeingdisplayed.

Ourmethodisdesignedtoaddresstheproblemofwatermarkingspotcolorimages.Animagecontaining2ormorespotcolorsisembeddedwithawatermarkin2ofthespotcolors,withthemaximumsignalstrengthwithinauserselectablevisibilityconstraint.

Theusercaninsertthemaximumwatermarksignal,whilemeetingtherequiredvisibilityconstraint.ThemethodhasbeenappliedtothecaseoftwospotcolorsandimageshavebeenproducedwhicharemorethantwiceasrobusttoGaussiannoiseasasinglecolorimagewhichisembeddedusingaluminanceonlywatermarktothesamevisibility.

Conference 8664


8664-19, Session 5

Dual resolution 2D color barcodeZhigangFan,XeroxCorp.(UnitedStates);YonghuiZhao,AppleInc.(UnitedStates);ShengeWang,QalcommInc.(UnitedStates);HengzhouDing,XeroxCorp(UnitedStates)

QuickResponse(QR)codesandother2-dimensional(2-D)barcodeshavebecomeawidelyacceptedmethodforautomaticallyidentifyingandtrackingobjects.Theabilityofmobiledevicestoreadtheencodedinformationhasmadethesecodesincreasinglypopularinagrowinglistofapplications,suchashealthcare,electronicsandproductinformationonpackages.

Currently,thetypicalQRcodeisaround2cmx2cminsize.However,manysmallpackagesrequirethat2-Dcodesbe0.75cmx0.75cm.Unfortunately,detectionofthesesmallcodesischallengingformanymobiledevices.Blurring,mainlycausedbyopticallimitationsofthecamerasystemsandhandmotion,magnifiesitsimpactonsmallimagescontainminutedetails.

Inthispaper,abarcodesystemispresentedwhichincludesanencoderthatconfiguredtogeneratea2-Dcolorbarcodewithadualresolutionstructure.Itcontainsahighresolutionlayerthatiscodedinluminanceandisinconsistencywiththeconventional2-Dbarcode,andalowresolutionlayerprovidingadditionalerrorcheckinginformation,thatiscodedinchrominanceandisrobusttoblurring.Theproposedbarcodeiscompatibletoitsunderlyingconventionalblackandwhite2-Dbarcodeasitcanbereadbytheirdecoders.Itsadvantageisadditionalreliabilitywhenacolordecoderisused.Inparticular,itenhancesthedecodingaccuracyfordevicessuchasmobiledevicesforbarcodesprintedinsmallsizes.

8664-20, Session 5

HP EOS cross platform mobile solutionJunhuaFu,HuaZhang,Hewlett-PackardChinaCo.,Ltd.(China);JerryLiu,Hewlett-PackardLabs.(UnitedStates)

NoAbstractAvailable

8664-21, Session K1

Tangible imaging systems (Keynote Presentation)JamesA.Ferwerda,RochesterInstituteofTechnology(UnitedStates)

NoAbstractAvailable

8664-22, Session 7

Event detection from the microblogJingxuanLi,FloridaInternationalUniv.(UnitedStates);WeiPeng,TongSun,XeroxCorp.(UnitedStates)

Eventdetectionhasbeenattractingresearchers’attentionforalongtime.Todetecteventsfromthemicroblogisbecomingahottopicrecentlybecauseitreflectswhathappenedintheworldinatimelyfashion,thuscanhelppeoplemonitorrealtimeevents.Forexample,amobileserviceprovidermaywanttomonitorusers’feedbackthroughthemicroblog.However,giventhecollectionofmicroblogdocuments,e.g.,TweetsfromTwitter,performingeventdetectionisnon-trivialforthenaturalofmicroblogdocuments–short.Meanwhile,onlinemicroblogdocumentsusuallymeanstextstreamsavailableonthewebforeveryminute.Howtoquicklydetectcurrenteventsfrommicroblogdocumentsisachallenge.Inthiswork,weproposeandimplementa“New”and“Anomalous”eventdetectionframeworkwhichcanbeefficientlyappliedovermicroblogstreams.

8664-23, Session 7

Person-based video summarization and retrieval by tracking and clustering temporal face sequencesTongZhang,Hewlett-PackardLabs.(UnitedStates);DiWen,TsinghuaUniv.(China);XiaoqingDing,TsinghuaUniversity(China)

Peopleareoftenthemostimportantsubjectsinvideos.Itishighlydesiredtoautomaticallysummarizetheoccurrenceofdifferentpeopleinalargecollectionofvideoandquicklyfindthevideoclipscontainingaparticularpersonamongthem.Inthispaper,wepresentaperson-basedvideosummarizationandretrievalsystem,VideoWho,whichextractstemporalfacesequencesinvideosandgroupsthemintoclusters,witheachclustercontainingvideoclipsofthesameperson.Thisisaccomplishedbasedonadvancedfacedetectionandtrackingalgorithms,togetherwithasemi-supervisedfaceclusteringapproach.Thesystemachievedgoodclusteringaccuracywhentestedonahybridvideosetincludinghomevideo,TVplaysandmovies.Ontopofthistechnology,anumberofapplicationscanbebuilt,suchasautomaticsummarizationofmajorcharactersinvideos,person-relatedvideosearchontheInternetandpersonalizedUIsystemsetc.Especially,wearemakingthisasacloudservicefordevelopersandconsumerstomanagevideocontent.

8664-24, Session 7

Building an internet-scale web clipping systemEamonnO’Brien-Strain,Hewlett-PackardLabs.(UnitedStates)

NoAbstractAvailable

8664-25, Session K2

That thing in your pocket is really a computer!: the future of mobile imaging (Keynote Presentation)EdwardJ.DelpIII,PurdueUniv.(UnitedStates)

NoAbstractAvailable

Conference 8664


Conference 8665: Media Watermarking, Security, and Forensics 2013Tuesday-Thursday5–7February2013

Part of Proceedings of SPIE Vol. 8665 Media Watermarking, Security, and Forensics 2013

8665-1, Session K1Converting watermark research into a technology startup (Keynote Presentation)JaapHaitsma,Civolution(UnitedStates)

InthistalkIwillpresentmypersonalexperienceofhowthewatermarkandlateronfingerprintresearchresultsofjustasmallteamofscientistsinPhilipsResearch(ofwhichIwasone)inthelate90’swasconvertedintoacompanycalledCivolutionthatiscurrentlyemployingoverahundredpeoplethatworkonprovidingwatermarkingandfingerprintingsolutionstocustomersworldwide.Furthermoreitwilltouchuponthedifferentapplicationsanddeploymentsofwatermarkingandfingerprintingtechnologyandprovidesomeinsightsinthechallengesahead.Ihopethistalkcangiveinspirationandsomeadvicetootherscientiststoconverttheirresearchresultsintoatechnologystartup.Havingyourownalgorithmssolveproblemsorgiveenhancedexperiencestorealcustomersisaverygratifyingexperience.

8665-2, Session 1

Insertion deletion robust audio watermarking: a set theoretic, dynamic programming approachAndrewNadeau,GauravSharma,Univ.ofRochester(UnitedStates)

Todate,desynchronizationattacksareakeyproblemforaudiowatermarks,whichhavelimitedtheirsuccessinapplicationssuchascontentauthenticationanddigitalrightsmanagement.Thispaperproposesanaudiowatermarkingschemerobusttobothdesynchronizationfromrandominsertionsanddeletions,andnon-desynchronizingperturbationsfromMP3compression.Theproposedschemeembedsmultiplespreadspectrum(SS)watermarksinthepoly-phasequadraturemodulatedfilterbank(PQMF)subbandsofanaudiosignalusingasettheoreticframeworkforinformedembedding.ThesettheoreticframeworkallowsconstraintsonMP3robustness,perceptualfidelity,anddetectabilitytobeenforcedconcurrentlyforthemultiplewatermarks.Blindcorrelationdetectionofshortwatermarkblocksfollowedbydynamictimewarping(DTW)efficientlysearchesforaoptimalsequenceofblockdetectionsinspiteofinsertionsanddeletions.Unlikepriorart,DTWisintegratedintoSSdetectionanddoesnotrequiretheoriginalaudiosignalforalignment.

Novelcontributionsoftheproposedaudiowatermarkingschemeinclude:

1)robustnesstonon-desynchronizingperturbationsusingasettheoreticframework,2)insertionanddeletionrobustnessusingDTWaspartofblinddetection,and3)reductionoffalsepositivestypicalofhighdimensionalsearchesbycrossvalidationofmultiplewatermarks.

8665-3, Session 1

Impeding forgers at photo inceptionMatthiasKirchner,InternationalComputerScienceInstitute(UnitedStates);PeterWinkler,HanyFarid,DartmouthCollege(UnitedStates)

Wedescribeanewconceptformakingphototamperingmoredifficultandtimeconsuming,andthereforemorepronetoerror.Werecordthecamerapreviewandcameramotioninthemomentsjustpriortoimagecapture.Thisinformationispackagedalongwiththefullresolutionimage.Anysubsequentmanipulationofthefinalimagewillhavetobepropagatedtobeconsistentwiththisdata–adecidedlymoredifficultundertaking.

8665-4, Session 1

Watermark embedding in optimal color directionRobertLyons,AlastairM.Reed,JohnStach,DigimarcCorp.(UnitedStates)

Towatermarkspotcolorpackagingimagesonechangesavailablecolorinkstocreateawatermarksignal.Bychangingtheinkdensitiesatapointonecanembedthewatermarksignalindifferentcolordirections.Wedescribetheoptimalcolordirectionthatembedsthemaximumsignalwhilekeepingthevisibilitybelowsomeacceptablevalue.Theoptimalcolordirectiondependsonthedefinitionofthewatermarksignal,thestartingcolorfortheimageregionandtheinkdensityconstraints.

Afteradescriptionofthegeneralproblemweshalldescribetwoinkembeddingandtrytofindtheoptimaldirectionthatwillmaximizerobustnessatagivenvisibility.Ifwedefinethewatermarksignalas(R+G+B)/3,thentheoptimaltwocolorembedderwillinsertcolorwithloworzeroluminancechanges.Wecomparetheoptimalcolorembeddertoasinglecolorembedder.

8665-5, Session 1

Video game watermarkingWaldemarBerchtold,MarcelSchäfer,Fraunhofer-InstitutfürSichereInformations-Technologie(Germany);HuajianLiu,FraunhoferInstituteSIT(Germany);FábioTouceiraTakahashi,UniversityofPorto(Portugal);AndréSchmitz,waza!servicesUG(Germany);SaschaZmudzinski,MartinSteinebach,Fraunhofer-InstitutfürSichereInformations-Technologie(Germany);JonasWieneke,TechnicalUniversityofDarmstadt(Germany)

Thepublishersofvideogamessufferfromillegalpiracyandinformationleakagecausedbyend-consumers,“releasegroups”orinsidersshortlyafterorevenbeforetheoffcialreleaseofanewvideogame.MechanismstopreventoratleastpostponethisillegalredistributionareDRMorcopyprotectionmechanisms.However,thesemechanismsareveryunpopular,becausetheyrestrictthecustomersinplayingthegameanddemandahighadministrationeffortfromthedevelopersand/ordistributors.Evenworse,mostcopyprotectionmechanismshaveproventobeinsecureas“patches”forcircumventionusuallyareavailablequicklyandeasytoget.Tosatisfythechallengesofsecurityandusability,thisworkintroducestheideaofusingdigitalwatermarkingtoprotectallavailableandsuitablemediatypesandsoftwarebinariescontainedinavideogame.Athree-layeredwatermarkingdeploymentapproachalongtheproductionchainisproposedtodetectleakageinthereleasephaseaswellasduringthedevelopmentprocessofaboxedvideogame.Theproposedapproachfeaturesbothcopyrightwatermarkingandcollusionsecurefingerprintsembeddedastransactionwatermarkmessagesincomponentsofvideogames.Wediscussthecorrespondingnewchallengesandopportunities.Inaddition,aprototypewatermarkingalgorithmispresentedtodemonstratetheadaptionnecessityofclassicalimagewatermarkingwhenappliedtovideogamestosatisfiestherequirementsfortransparency,securityaswellasperformance.Thewatermarkcapacityissignificantlyincreasedwhileinter-mediaandinter-fileembeddingisenabledandtheassociatedsynchronizationchallengeissolvedbyrobusthashes.


8665-6, Session 2

Banknote authentication with mobile devicesVolkerLohweg,JanLeifHoffmann,HeleneDörksen,RolandHildebrand,EugenGillich,Ostwestfalen-LippeUniv.ofAppliedSciences(Germany);JürgHofmann,JohannesSchaede,KBA-NotaSysS.A.(Switzerland)

Maintainingconfidenceinsecuritydocuments,especiallybanknotes,isandremainsamajorconcernforthecentralbanksinordertomaintainthestabilityoftheeconomyaroundtheworld.InthispaperwedescribeanimageprocessingandpatternrecognitionapproachwhichisbasedontheSound-of-Intaglioconceptfortheusageinsmartdevicessuchassmartphones.Today,inmanyworldregionssmartphonesareinuse.Thesedevicesbecomemoreandmorecomputingunits,equippedwithresource-limited,buteffectiveCPUs,cameraswithillumination,andflexibleoperatingsystems.Hence,itappearstobeobvious,toapplythosesmartphonesforbanknoteauthentication,especiallyforvisuallyimpairedpersons.Ourapproachshowsthatthosedevicesarecapableofprocessingthedataundertheconstraintsofimagequalityandprocessingpower.Even,pointingoutstrictlythatamobiledeviceassuchisnotanindustrialproductforharshenvironments,itispossibletousemobiledevicesforbanknoteauthentication.Theconceptisbasedonanewstrategyforconstructingadaptivewaveletsfortheanalysisofdifferentprintpatternsonabanknote.Furthermore,abanknotespecificfeaturevectorisgeneratedwhichdescribesanauthenticbanknoteeffectively.Amulti-stageLinear-discriminant-analysisclassifiergeneratesstableandreliableoutput.

8665-7, Session 2

Visibility enhancement and validation of segmented latent fingerprints in crime scene forensicsAndreyMakrushin,Otto-von-Guericke-Univ.Magdeburg(Germany);TobiasKiertscher,FachhochschuleBrandenburg(Germany);MarioHildebrandt,JanaDittmann,Otto-von-Guericke-Univ.Magdeburg(Germany);ClausVielhauer,FachhochschuleBrandenburg(Germany)

Forensicinvestigatorsarepermanentlylookingfornoveltechnologiesforfastandeffectiverecoveringoflatentfingerprintsatacrimescene.Traditionally,thisworkisdonemanuallyandthereforeconsideredverytimeconsuming.Highlyskilledexpertsapplychemicalreagentstoimprovevisibilityoftracesandusedigitalcamerasoradhesivetapetoliftprints.Throughanautomationofthesurfaceexamination,largerareascanbeinvestigatedfaster.Thisworkamplifiestheexperimentalstudyoncapabilitiesofachromaticwhite-lightsensor(CWL)regardingthecontact-lessliftingoflatentfingerprintsfromdifferentlychallengingsubstrates.ThecrucialadvantageofaCWLsensorcomparedtotakingdigitalphotographsisthesimultaneousacquisitionofluminanceandtopographyofthesurface,extendingthestandardtwodimensionalimageprocessingtotheanalysisofthree-dimensionaldata.Thepaperfocusesontheautomaticvalidationoflocalizedfingerprintregions.Incontrasttostatisticalfeaturesfromluminancedata,previouslyusedforlocalization,weproposethestreakinessofapatternasthebasicfeatureindicatingthefingerprintpresence.Regionsareanalyzedforstreakinessusingbothluminanceandtopographydata.Asaresult,thehumanexpertssignificantlysavetimebydealingwithalimitednumberofapprovedfingerprints.Theexperimentsshowthatthevalidationperformanceintermsofequalerrorratedoesnotexceed6%evenonverychallengingsubstratesregardinghigh-qualityfingerprints.

8665-8, Session 2

Printed fingerprints at crime scenes: a faster detection of malicious traces using scans of confocal microscopesMarioHildebrandt,StefanKiltz,JanaDittmann,Otto-von-Guericke-Univ.Magdeburg(Germany)

Fingerprinttracesareanimportantpartofforensicinvestigationstoidentifypotentialperpetrators.Withtheadventofthepossibilityofprintingtracesforqualityassurancepurposesitisalsopossibletoplacemalicioustracesoncrimescenes.Inforensicsexaminersareawareofmultipleidenticaltraces,e.g.producedbystampingfingerprints.Thetechniqueofprintingfingerprintsusingartificialsweatallowsforcreatingdifferentversionsofthesamefingerprint,similartotheresiduefromafingerwhichisalmostnever100percentidenticaltoanotherlatentfingerprint.Hence,Kiltzetal.(2011)introduceafirstframeworkforthedetectionofsuchmalicioustracesinsubjectiveevaluationsbasedondotpatternsofaminoacid.Hildebrandtetal.(2012)introduceafirstautomatedapproachforthedetectionofprintedfingerprintsusinghighresolutionscansfromaChromaticWhiteLightsensor.However,thereportedrecognitionaccuracyisinsufficientforforensicinvestigations.

Inthispaperweproposeanimprovedfeatureextractionforscansusingaconfocalmicroscopetoreducetheoverallanalysistimeandtoincreasetherecognitionaccuracy.Ourfirstpreliminaryevaluationon55printedand55realfingerprintsonanoverheadfoilindicatesasignificantreductionofthenecessaryanalysistime.Therecognitionaccuracyinthisfirsttestsetisincreasedto100percent.

8665-9, Session K2

YouTube content ID: background, challenges, and the roadmap ahead (Keynote Presentation)ThabetAlfishawi,Google(UnitedStates);DavidErb,GoogleZürich(Switzerland)

YouTubelaunchedContentIDin2007asatooltoenablecontentownerstoidentifycontenttheyownautomaticallyandapplyingtheirdesiredpolicyonit:monetize,block,takedownortrack.Overtheyears,ContentIDevolvedtobecomeanintegralpartofYouTube’secosystem.ContentIDisamongthemostcomprehensiveaudiovisualdatabasesintheworldasitscansover100yearsofuser-uploadedvideoeverydayagainstmorethan100yearsofreferencefiles.

InthistalkwewillprovidebackgroundonContentID,origins,evolutionandinnerworkings.Thenwe’lltakeadeeperdivetofocusonthecurrentchallengesfacingContentIDbreakingthemdowninthreecategories:technical,businessandbehavioral.Finallywe’llclosewithanoverviewoftheroadmapaheadforContentID.

8665-10, Session 3

Sensor fingerprint digests for fast camera identification from geometrically distorted imagesMiroslavGoljan,JessicaFridrich,BinghamtonUniv.(UnitedStates)

Incameraidentificationusingsensorfingerprint,itisabsolutelyessentialthatthefingerprintandthenoiseresidualfromagiventestimagebesynchronized.Ifthesignalsaredesynchronizedduetoageometricaltransformation,fingerprintdetectionbecomessignificantlymorecomplicated.Besidesconstructingthedetectorinaninvarianttransformdomain(whichlimitsthetypeofthegeometricaltransformation)amoregeneralapproachistoapplyageneralizedlikelihoodratiodetector,whichinvolvesasearchfortheunknownnuisanceparametersofthetransform.Thissearch,however,isingeneralveryexpensiveasitrequiresnumerousresamplingsoftheentireimage(orfingerprint).Inthisabstract,weproposeameasurethatsignificantlyreducesthesearchcomplexitybyreducingtheneedtoresampletheentireimagetoamuchsmallersubsetofthesignalcalledthefingerprintdigest.Thetechniquecanbeappliedtoanarbitrarygeometricaldistortion,includingnon-linearlens-distortioncorrection.Thefullversionofthispaperwillincludeextensiveexperimentsandamathematicalframeworkforselectingthedigestthatminimizesthedetectionerrorforselectedcommongeometricaltransforms.

Conference 8665


8665-11, Session 3

Case studies and further improvements on source camera identificationKenjiKurosawa,KenroKuroki,Ken’ichiTsuchiya,NaoakiIgarashi,NorimitsuAkiba,NationalResearchInstituteofPoliceScience(Japan)

Theauthorshaveconductedvariousforensicexaminationsofimagesinthepolicelaboratory.Inthispaper,actualcaseexamplesandfurtherimprovementsonsourcecameraidentificationareshown.Thespecifictopicsarefollowing(a)to(d).

(a)Inordertoimproveperformanceofsourcecameraidentification,thehybrididentificationmethodusingdarkcurrentandPRNUnoiseisproposed.Thepreliminaryexperimentalresultshowedthatidentificationperformancewasimprovedwiththeproposedmethod.

(b)Thesourcecameraidentificationhasbeenperformedintheactualfivecriminalcases,suchasthemurdercase,andsoon.Theprocedurewasasequentialexaminationofhotpixelcoordinatesvalidationfollowedbythesimilarityevaluationofsensornoisepattern.Theauthorscouldclearlyprovethatthequestionedcriminalsceneshadbeenrecordedbythequestionedcamerasinfourcasesofthefive.

(c)TheexperimentresultswithtwentyCCDmodulesofthesamemodelshowedthatindividualcameraidentificationfordarkimageswaspossiblebyusingnon-uniformnatureofCCDchargetransfer.

(d)Itwasalsoshownthatthesourcecameraidentificationmethodcanbeappliedfordetectingforgeryonthetimeaxisofvideosequences,suchasinsertionofasceneobtainedbytheothercameras.

8665-12, Session 3

Forensic analysis of interdependencies between vignetting and radial lens distortionAndréFischer,ThomasGloe,TechnischeUniv.Dresden(Germany)

Thisabstractstartsadiscussionontheforensicanalysisofinterdependenciesbetweenlensaberrationsontheexampleofvignettingandradiallensdistortion.Startingwithabriefdiscussiononthebackground,weassumetheopticalcentreofvignettingandradiallensdistortionsimilarandexpectfurtherdependenciesintheappear-anceofboth.Estimatingaberrationparametersincombinationcanhelptoincreasetheaccuracyandmakesinvestigationsmorereliable.Afterashortdiscussionofourtestsetup,wegivearoughlistofexperimentalresults.Weproposeanovelmethodtomakethecharacteristicofvignettinginherentinanimagevisibleusingavectorplot.Thisallowsforensicinvestigatorstoinspectandcomparepatternsofvignettingvisually.Ourpreliminaryresultsdocumenttheinfluenceofcamerarotationandindicatepreviouslyunknownchallengesintheforensicanalysis.However,estimatingcamerarotationmighthelptoauthenticatemetadatatag‘orientation’.Unexpectedly,theestimatedopticalcentreforvignettingandradiallensdistortiondonotcoincide,whichmightbeaproblemoftheemployedmodelsoranimperfectilluminationofthecheckerboardimages.Spacelimitationsprohibitacompletedocumentationofourstudyandwewillprovidedetailedresultsinthefinalpaperincaseofacceptance.

8665-13, Session 3

A sneak peek into the camcorder pathCherifBenZid,SéverineBaudry,TechnicolorS.A.(France);BertrandChupeau,TechnicolorR&DFrance(France);GwenaëlDoërr,TechnicolorS.A.(France)

Thecamcorderpathreferstothecommunicationschannelwhereavideofileisrenderedonadisplaydeviceandre-capturedbyacamerarecorder.Severalsignalprocessingprimitivesareexpectedtoberobustagainstsuchdistortion.Forinstance,forensicvideowatermarksshouldberecoveredevenifthepiratecamcordstheTV/cinemascreenontowhichthemovieisdisplayed.Video-basedsecond-screenapplicationsimplicitlyexpecttheunderlyingwatermarkingor

fingerprintingtechnologytosurvivecamcording.Still,robustnesstothiscommunicationschannelishardlyevaluatednowadaysbecausebenchmarkingcampaignssimplyprovetobetoocumbersomeinpractice.

Camcordingdisplayedcontentmodifiesthevideoessenceinanumberofways:geometricdistortions,temporaltransforms,non-uniformandvaryingluminancetransformations,saturation,coloralteration,etc.Itisthereforechallengingtodesignasimulatorthatwouldmimicthecamcorderpathandthatcouldthusbeusedinasystematicmannerforbenchmarkingwhileavoidinglogisticshurdles.Asafirststepinthisdirection,wefocusinthisstudysolelyonluminancetransforms.Inparticular,weinvestigatethreedifferentphenomena,namely:(i)thesteadystateluminanceresponse,(ii)thespatialnonuniformity,and(iii)thetransientluminanceresponse.

8665-14, Session 4

Ballistic examinations based on 3D data: a comparative study of probabilistic Hough Transform and geometrical shape determination for circle-detection and segmentation of cartridge bottomsRobertFischer,FachhochschuleBrandenburg(Germany);ClausVielhauer,Otto-von-GuerickeUniv.Magdeburg(Germany);MarioHildebrandt,Otto-von-GuerickeUniversityMagdeburg(Germany);StefanKiltz,JanaDittmann,Otto-von-Guericke-Univ.Magdeburg(Germany)

Theapplicationofcontact-lessoptical3Dsensingtechniquesyieldingdigitaldatafortheacquisitionoftoolmarksonforensicballisticspecimensfoundatcrimescenes,aswellasthedevelopmentofcomputer-aided,semi-automatedfirearmidentificationsystemsthatareusing3Dinformation,arecurrentlyemergingfieldsofresearchwithrisingimportance.Traditionally,theexaminationofforensicballisticspecimenisdonemanuallybyhighlyskilledforensicexpertsusingcomparisonmicroscopes.Apartlyautomationofthecomparisontaskpromisesexaminationresultsthatarelessdependentonsubjectiveexpertiseandfurthermoreareductionofthemanualworkneeded.Onenecessaryrequirementfortheexaminationofforensicballisticspecimensisareliablecircle-detectionandsegmentationofcartridgebottoms.Thisinformationislaterusedforexampleforalignmentandregistrationtasks,determinationofregionsofinterest,andlocallyrestrictedapplicationofcomplexfeature-extractionalgorithms.InthisworkweareusingaKeyenceVK-X105laser-scanningconfocalmicroscopetoacquireaveryhighdetailtopographyimage,alaser-intensityimage,andacolorimageoftheassessedcartridgebottomssimultaneously.TheworkisfocusedonacomparisonofHough-Transformandgeometric-shapedeterminationforcircle-detectiononcartridgebottomsusing3Daswellas2Dinformation.Wecomparethepre-processingcomplexity,therequiredprocessingtime,andtheabilityforareliabledetectionofalldesiredcircles.Weassumethattheutilizationofgeometric-shapedetectioncanreducetherequiredprocessingtimeduetoalesscomplexprocessing.ForapplicationofshapedeterminationaswellasforHough-Transformweexpectamorereliablecircle-detectionwhenusingadditional3Dinformation.Ourfirstexperimentalevaluation,using100cartridgesshotfrom3differentfirearmsshowspositivetendencytoverifythesesuppositions.

8665-15, Session 4

Photocopier forensics based on arbitrary text charactersChangyouWang,XiangweiKong,ShizeShang,DalianUniv.ofTechnology(China);Xin’gangYou,BeijingInstituteofElectronicTechnologyandApplication(China)

Aphotocopieddocumentcanprovidecharacteristicsofthephotocopiersoastodistinguishwhichphotocopieritcomesfrom.Whilehowtoextracttheoptimalintrinsicfeaturesiscriticalinphotocopierforensics.Inthispaper,charactertexturefeaturesare

Conference 8665


extractedastheintrinsicfeaturesandaphotocopierforensicsmethodbasedonthetextureanalysisofarbitrarytextcharactersisproposed.Firstly,imagepreprocessingprocessispracticedtogetindividualcharacterimages.Nextthreesetsoftexturefeaturesareextractedfromeachindividualcharacter,includingthegraylevelfeatures,thegradientdifferentialmatrix(GDM)featuresandthegraylevel-gradientco-occurrencematrix(GLGCM)features.TheneachindividualcharacterinadocumentisclassifiedusingaFisherclassifierandamajorityvoteschemeisperformedonthecharacterclassificationresultstoidentifythesourcephotocopier.Experimentalresultsonsevenphotocopiersprovetheeffectivenessoftheproposedmethodandanaveragecharacterclassificationaccuracyof88.47%canbeachieved.

8665-16, Session 4

Accelerating video carving from unallocated spaceHariKalva,FloridaAtlanticUniv.(UnitedStates);AnishParikh,NirmaUniv.ofScience&Technology(India);AvinashSrinivasan,GeorgeMasonUniv.(UnitedStates)

Videocarving,theprocessofrecoveringvideofromunallocateddiskspace,hasbecomeanessentialtoolindigitalforensics[1,2,3].TherehasbeensomepriorworkonvideocarvingincludingopensourcetoolssuchasDefraser.Harddiskcapacityhascontinuedtoincreaseand1TBdisksarenowcommononconsumergradelaptopsandPCs.Asthediskcapacitygrows,thecomplexityofvideocarvinggrowsastheamountofunallocatedspacealsogrows.Inthispaperwepresentalgorithmstoacceleratevideocarving.


A histogram shifting based RDH scheme for H.264/AVC with controllable driftZafarShahid,LIRMMLabs(France);WilliamPuech,Univ.Montpellier2(France)

Thispaperpresentsanefficientmethodforreversibledata-hiding(RDH)inH.264/AVCintrabitstreaminanopen-loopfashion.Theproposedschemehasaminimalcontrollabledrift,whichisproportionaltothepayload.Thedataisreversiblyembeddedinquantizedtransformcoefficients(QTCs)usinghistogramshiftingoutsidethereconstructionloop.IngeneralhistogramshiftingbasedRDHmethodsforQTCs,thereisamismatchonencoderanddecodersidebecauseofmodificationofboundarypixelwhichareusedforspatialintraprediction.Itresultsinaseveredegradationofthevisualqualityofwatermarkedvideo.Intheproposedscheme,wehavesolvedasystemoflinearequationswhichmakesitpossibletoperformembeddingintheQTCswithoutchangingthevalueoftheboundarypixels.Itmakesitpossibletoembeddatainthetransformdomainwithoutintrapredictiondrift.Moreover,thedistortionisproposedtoonlythosebitsofpayloadwhichare‘0’.Experimentalresultshowsthattheproposedschemehasaminimalincreaseinbitrateandguaranteesthecompleterecoveryoftheoriginalvideocontentafterextractionoftheembeddedcontent.

8665-19, Session 5

On the role of side information in steganography in empirical coversJessicaFridrich,BinghamtonUniv.(UnitedStates)

Inanattempttoalleviatethenegativeimpactofunavailablecovermodel,somesteganographicschemesutilizetheknowledgeoftheso-called“precover”whenembeddingsecretdata.Theprecoveristypicallyahigher-resolution(unquantized)representationofthecover,suchastherawsensoroutputbeforeitisconvertedtoan8-bitperchannelcolorimage.TheprecoverobjectisonlyavailabletothesenderbutnottotheWarden,whichseemstogiveafundamentaladvantagetothesender.Inthispaper,weprovidetheoreticalinsightforwhyside-informedembeddingschemesforempiricalcoversmight

providehighlevelofsecurity.Byadoptingapiece-wisepolynomialmodelcorruptedbyAWGNforthecontent,weprovethatwhenthecoverissufficientlynon-stationary,embeddingbyminimizingdistortionw.r.t.theprecoverismoresecurethanbypreservingamodelestimatedfromthecover(theso-calledmodel-basedsteganography).Moreover,theside-informedembeddingenjoysfourtimeslowersteganographicFisherinformationthanLSBmatching.

8665-20, Session 5

Embedding operations and locations for steganography in H.264 videoAndreasNeufeld,AndrewD.Ker,Univ.ofOxford(UnitedKingdom)

ThisworkstudiesthefundamentalbuildingblocksforsteganographyinH.264compressedvideo:theembeddingoperationandthechoiceofembeddinglocations.Ouraimistoinformthedesignofbettervideosteganography,atopiconwhichtherehasbeenrelativelylittlepublicationsofar.Wedeterminethebestembeddingoption,fromasmallmenuofembeddingoperationsandlocations,asbenchmarkedbyanempiricalestimateofMaximumMeanDiscrepancy(MMD)forfirst-andsecond-orderfeaturesextractedfromavideocorpus.Ahighly-stableestimateofMMDcanbeformedbecauseofthelargesamplesize.Thebestembeddingoperation(so-calledF5)isidenticaltothatfoundbyarecentstudyofstillcompressedimagesteganography,butinvideotheoptionsforembeddinglocationarericherandtheoptimalchoiceinvolvesunequaluseoflumaandchromachannels.

8665-21, Session 5

Video steganography with multi-path motion estimationYunCao,XianfengZhao,FenghuaLi,InstituteofInformationEngineeringofChineseAcademyofSciences(China);NenghaiYu,Univ.ofScienceandTechnologyofChina(China)

Thispaperproposesanovelvideosteganographyduringmotionestimation.Comparedwithexistingschemes,thenewapproachhasbeenenhancedfromtwoaspectstoimprovesteganographicsecurity.First,toreducethesinglechangedistortion,multiplequicksearchfunctionsareemployedtogenerateoptimizedalternatesformotionvectorreplacement.Secondly,toimproveboththeembeddingandcomputationalefficiencies,aflexibleembeddingstructureisdesignedtoperformmatrixembedding.

8665-22, Session 6

Random projections of residuals as an alternative to co-occurrences in steganalysisVojtechHolub,JessicaFridrich,TomasDenemark,BinghamtonUniv.(UnitedStates)

Today,themostreliabledetectionofsteganographyinempiricalcoversources,suchasdigitalimages,hasbeenachievedusingmachine-learningbyrepresentingimageswithco-occurrencematricesofneighboringnoiseresidualsamplescomputedusinglocalpixelpredictors.Inthispaper,weproposeanalternativestatisticaldescriptionbybinningrandomprojectionsofresidualsfromalocalneighborhood.Thesizeandshapeoftheneighborhoodallowthesteganalysttofurtherdiversifythestatisticaldescriptionandthusimprovesteganalysisdetection.Otherkeyadvantagesofthisapproachincludethepossibilitytomodellonger-rangedependenciesamongpixelsandmakinguseofinformationthatwaspreviouslyunderutilizedinthemarginalsofco-occurrences.Moreover,theproposedapproachismuchmoreflexible,allowingthesteganalysttobetterbalanceperformanceandfeaturedimensionality(complexity).Wedemonstratetheusefulnessoftheproposedapproachonhighlyadaptiveembeddingschemes,suchasHUGOandWOW.

Conference 8665


8665-23, Session 6

The challenges of rich features in universal steganalysisTomasPevny,CzechTechnicalUniv.inPrague(CzechRepublic);AndrewD.Ker,Univ.ofOxford(UnitedKingdom)

NoAbstractAvailable

8665-24, Session 6

Exploring multitask learning for steganalysisJulieM.Makelberge,AndrewD.Ker,Univ.ofOxford(UnitedKingdom)

Thispaperintroducesanewtechniqueformulti-actorsteganalysis.Inconventionalsettings,itisunusualforoneactortogenerateenoughdatatobeabletotrainapersonalizedclassifier.Ontheotherhand,inanetworktherewillbemanyactors,betweenthemgeneratinglargeamountsofdata.Priorworkhaspooledthetrainingdata,andthentriedtodealwithitsheterogeneity.Inthiswork,weusemultitasklearningtoaccountfordifferencesbetweenactors’imagesources,whilestillsharingdomain(globally-applicable)information.Wetackletheproblembylearningseparatefeatureweightsforeachactor,andsharinginformationbetweentheactorsthroughtheregularization.Thisway,thedomaininformationthatisobtainedbyconsideringallactorsatthesametimeisnotdisregarded,buttheweightsareneverthelesspersonalized.Thispaperexploreswhethermultitasklearningimprovesaccuracyofdetection,bybenchmarkingnewmultitasklearnersagainstpreviouswork.

8665-25, Session 7

A cost-effective decision tree based approach to steganalysisLiyunLi,PolytechnicInstituteofNewYorkUniv.(UnitedStates);HusrevT.Sencar,TOBBUniv.ofEconomicsandTechnology(Turkey);NasirMemon,PolytechnicInstituteofNewYorkUniv.(UnitedStates)

Animportantissueconcerningreal-worlddeploymentofsteganalysissystemsisthecomputationalcostofac-quiringfeaturesusedinbuildingsteganalyzers.Conventionalapproachtosteganalyzerdesigncruciallyassumesthatallfeaturesrequiredforsteganalysishavetobecomputedinadvance.However,asthenumberoffeaturesusedbytypicalsteganalyzersgrowintothousandsandtimingconstraintsareimposedonhowfastadecisionhastobemade,thisapproachbecomesimpractical.Toaddressthisproblem,wefocusonmachinelearningaspectofsteganalyzerdesignandintroduceadecisiontreebasedapproachtosteganalysis.Theproposedsteganalyzersystemcanminimizetheaveragecomputationalcostformakingasteganalysisdecisionwhilestillmaintainingthedetectionaccuracy.Todemonstratethepotentialofthisapproach,aseriesofexperimentsareperformedonwellknownsteganographyandsteganalysistechniques.

8665-26, Session 7

Quantitative steganalysis using rich modelsJanKodovsky,JessicaFridrich,BinghamtonUniv.(UnitedStates)

Inthispaper,weproposearegressionframeworkforsteganalysisofdigitalimagesthatutilizestherecentlyproposedrichmodels--high-dimensionalstatisticalimagedescriptorsthathavebeenshowntosubstantiallyimproveclassical(binary)steganalysis.Ourproposedsystemisbasedongradientboostingandutilizesasteganalysis-specificvariantofregressiontreesasbaselearners.Theconductedexperimentsconfirmthattheproposedsystemoutperformspriorquantitativesteganalysis(bothstructuralandfeature-based)acrossawiderangeofsteganographicschemes:HUGO,LSBreplacement,nsF5,BCHopt,andMME3.

8665-27, Session 8

Stegatone performance characterizationYung-YaoChen,PurdueUniv.(UnitedStates);RobertUlichney,Hewlett-PackardLabs.(UnitedStates);JanP.Allebach,PurdueUniv.(UnitedStates);MatthewGaubatz,Hewlett-PackardLabs.(UnitedStates);StephenPollard,Hewlett-PackardLabs.(UnitedKingdom)

Embeddingdatainhardcopyisinwidespreaduseforapplicationsthatincludepointingthereadertoon-linecontentbymeansofaURL,tracingthesourceofadocument,labelingandpackaging.Mostsolutionsinvolveplacingovertmarksonthepage.Themostcommonare1D,2D,and3D(color)barcodes.However,whilebarcodesareapopularmeansforencodinginformationforprintedmatter,theyaddunsightlyovertcontent.

Inordertoavoidsuchovertcontent,Stegatonesareclustered-dothalftonesthatencodeadatapayloadbysingle-pixelshiftsofselecteddot-clusters.InStegatone,wecanembedinformationinimagesorgraphics–notintheimagefileasisdoneintraditionalwatermarking,butinthehalftoneontheprintedpage.However,therecoveryperformanceofstegatonesisnotwellunderstoodacrossawidevarietyofprintingtechnologies,models,andresolutions,alongwithvariationsofscanningresolution.Itwouldthusbeveryusefultohaveatooltoquantifystegatoneperformanceunderthesevariables.Theresultswouldthenbeusedtobettercalibratetheencodingsystem.WeconductthreetestprocedurestocharacterizeStegatoneperformance.TheexperimentalresultscharacterizeStegatoneperformanceforanumberofprinters,scanners,andresolutions.

8665-28, Session 8

Image tampering localization via estimating the non-aligned double JPEG compressionLanyingWu,XiangweiKong,BoWang,ShizeShang,DalianUniv.ofTechnology(China)

Inthispaper,wepresentanefficientmethodtolocalizetheforgedpartsinatamperedJPEGimage.TheforgedregionusuallyundergoesadifferentJPEGcompressionwiththebackgroundregioninJPEGimageforgeries.WhenaJPEGimageiscroppedtoanotherhostJPEGimageandresavedinJPEGformat,theJPEGblockgridofthetamperedregionoftenmismatchestheJPEGblockgridofthehostimagewithacertainshift.Thisphenomenoniscallednon-aligneddoubleJPEGcompression(NA-DJPEG).Inthispaper,weidentifydifferentJPEGcompressionformsbyestimatingtheshiftofNA-DJPEGcompression.OurshiftestimatingapproachisbasedonthepercentageofnonzerosofJPEGcoefficientsindifferentsituations.Comparedtopreviouswork,ourtamperinglocalizationmethod(i)performancesbetterfordealingwithsmallimagesize,(ii)isrobusttocommontamperingprocessingsuchasresizing,rotating,featheringandsoon,(iii)doesn’tneedaimagedatabasetotrainamachinelearningbasedclassifierortogetaproperthreshold.

Conference 8665


Conference 8666: Visual Information Processing and Communication IVWednesday-Thursday6–7February2013

Part of Proceedings of SPIE Vol. 8666 Visual Information Processing and Communication IV

8666-1, Session 1

Fairness issues in resource allocation schemes for wireless visual sensor networksKaterinaPandremmenou,LisimachosP.Kondi,KonstantinosE.Parsopoulos,Univ.ofIoannina(Greece)

Thisworkaddressestheproblemoffairnessandefficiencyevaluationofvariousresourceallocationschemesforwirelessvisualsensornetworks(VSNs).Theseschemesareusedtooptimallyallocatethesourcecodingrates,channelcodingrates,andpowerlevelsamongthenodesofawirelessdirectsequencecodedivisionmultipleaccess(DS-CDMA)VSN.Alloftheconsideredschemesoptimizeafunctionofthevideoqualitiesofeachnode.However,thereisnosingleschemethatmaximizesthevideoqualityofeachnodesimultaneously.Infact,allpresentedschemesareabletoprovideaPareto-optimalsolution,meaningthatthereisnoothersolutionthatissimultaneouslypreferredbyallnodes.Thus,itisnotclearwhichschemeresultsinthebestresourceallocationforthewholenetwork.Tohandletheresultingtradeoffs,inthisstudy,ametricthatconsidersbothfairnessandperformanceissuesisusedforthequalitativeevaluationoftheresultsofeachscheme.Ideally,adesirableschemecouldachievehightotalutilitywhilebeingequallyfairtoallnodesandrequiringlowamountsofpower.Forthisreason,wealsoevaluatedtheschemesintermsofthetotalconsumedpowerrelativewiththetotalachievedutility.

8666-2, Session 1

Discussion on information theoretic and simulation analysis of linear shift-invariant edge detection operatorsBoJiang,NationalInstituteofAerospace(UnitedStates)

Generally,thedesignsofdigitalimageprocessingalgorithmsandimagegatheringdevicesremainseparate.However,experimentsshowthattheimagegatheringprocessprofoundlyimpactstheperformanceofdigitalimageprocessingandthequalityoftheresultingimages.Weproposedanend-to-endinformationtheorybasedsystemtoassesslinearshift-invariantedgedetectionalgorithms,wherethedifferentparts,suchasscene,imagegathering,andprocessing,areassessedinanintegratedmannerusingShannon’sinformationtheory.Weevaluatedtheperformanceofthedifferentalgorithmsasafunctionofthecharacteristicsofthesceneandtheparameters,suchassampling,additivenoiseetc.,thatdefinetheimagegatheringsystem.Theedgedetectionalgorithmisregardedashavinghighperformanceonlyiftheinformationratefromthescenetotheedgeimageapproachesitsmaximumpossible.Thisgoalcanbeachievedonlybyjointlyoptimizingallprocesses.Tovalidateourinformationtheoreticalconclusions,aseriesofexperimentssimulatedthewholeimageacquisitionprocesswereconducted.Aftercomparisonanddiscussionbetweentheoreticanalysisandsimulationanalysis,wecandrawaconclusionthattheproposedinformation-theoreticassessmentprovidesanewtoolwhichallowsustocomparedifferentlinearshift-invariantedgedetectorsinacommonenvironment.

8666-3, Session 1

Analysis of uncoded image communication over noisy channels with unequal protection

RuiqinXiong,PekingUniv.(China);XiaopengFan,HarbinInstituteofTechnology(China);FengWu,MicrosoftResearchAsia(China);WenGao,PekingUniversity(China)

Forimagecommunicationovernoisychannels,conventionalcodingschemesgenerallysufferfromthresholdeffectwhenchannelqualityfallsunderacertainthreshold.Anuncodedanalogtransmission(UAT)schemenamedSoftCast,ontheotherhand,hasrecentlyshowntoprovidegracefulqualitytransitionandachievesuperiorperformancetoexistingstate-of-the-artdigitalmethods,forchannelswithwideinstantaneoussignal-to-noiseratiorange.However,themechanismofUATandthefactorsthatinfluencethedistortion-powerefficiencyofUATarenotinvestigatedthoroughlyinliteratures.Inthispaper,wepresentatheoreticalanalysisforuncodedanalogtransmission.Thecontributionsarethree-fold.Firstly,aformulationfortheend-to-endperformanceofUATwithunequalnoiseprotectionisdeveloped.Ametriccalled“dataactivity”isintroducedtomeasurethecomplexityofasourceinthecontextofUAT.Secondly,weshowthat,forsignalswithstrongcorrelation,significantperformancegaincanbeprovidedbypromotingenergydiversityindatausingadecorrelationtransform.Thirdly,weshowthatthetransformgaincanberealizedonlyiftheenergydiversityindataisknownbyboththeencoderandthedecoderandproperlyexploitedforunequalnoiseprotection.Specifically,thegranularityfordescribingthedataenergydiversityknowledgesharedbytheencoderandthedecodercontrolstheactualachievedperformancegain.Theanalysespresentedinthepaperareverifiedbyexperimentalresults.


Block-layer bit allocation for quality constrained video encoding based on constant perceptual qualityChaoWang,XuanqinMou,WeiHong,Xi’anJiaotongUniv.(China);LeiZhang,TheHongKongPolytechnicUniv.(HongKong,China)

Inlossyimage/videoencoding,thereisacompromisebetweenthenumberofbits(rate)andtheextentofdistortion.Bitsneedtobeproperlyallocatedtodifferentsources,suchasframesormacroblocks(MBs).Sincethehumaneyesaremoresensitivetothedifferencethantheabsolutevalueofsignals,theMINMAXcriterionsuggeststominimizingthemaximumdistortionofthesourcestolimitthequalityfluctuation.Therearemanyworksaimedtosuchconstantqualityencoding,however,almostallofthemfocusontheframelayerbitallocation,andusePSNRasthequalityindex.WesuggestthatthebitallocationforMBsshouldalsoaimattheconstantquality,andfurthermore,perceptualqualityindicesshouldbeusedinsteadofPSNR.Basedonthisidea,weproposeamulti-passblock-layerbitallocationschemeforqualityconstrainedencoding.Theexperimentalresultsshowthattheproposedmethodcanachievemuchbetterencodingperformance.


A bilateral hole filling algorithm for time-of-flight depth cameraSeung-WonJung,SamsungAdvancedInsituteofTechnology(Korea,Republicof)

Time-of-flight(ToF)depthcameraprovidesabsolutedepthvaluesbymeasuringthephasedelaybetweentheemittedandreceivedlightsignals.Sincethephasedelayiscalculatedbycomparingelectric


chargesaccumulatedatthephotodiode,thesaturationoccurswhentheamountofelectricchargesexceedsthecapacityofthephotodiode.SuchasaturationphenomenonisparticularlyproblematicintheToFdepthcamerasincealongexposuretimeisrequiredforaccuratedepthacquisition.Inthispaper,wepresentaneffectivealgorithmtocompensateforasaturatedregion,i.e.aholeregion,indepthimagesacquiredbytheToFcamera.Theproposedalgorithmrecoverstheholeregionbytakingintoaccountcontourpixelssurroundingtheholeregion.Inparticular,eightcontourpixelsareselectedandthengroupedintofourpairsaccordingtothefourrepresentativedirections,i.e.horizontal,vertical,andtwodiagonaldirections.Thefourpairsofcontourpixelsarethencombinedviaamodifiedbilateralfilteringframeworkinwhichthefiltercoefficientsareobtainedbyconsideringthedepthvaluedifferencebetweentwodepthpixelsineachpairandthegeometricdistancebetweentheholepixelandthepixelinthecontour.Experimentalresultsdemonstratetheeffectivenessoftheproposedalgorithm.


Utilization of reverse super-resolution techniques to reduce image size for transmissionJeremyStraub,TheUniv.ofNorthDakota(UnitedStates)

Thereductionoffilesizefortransmissionofimageryoveraconstrainedcommunicationspathprovidessignificantbenefitinamultitudeofbandwidth-constrainedapplicationsincludingtransmissionfrominterplanetaryspacecrafttoEarth,transmissionfromorbittoEarth,andbandwidthreductionforvariouscommercialandentertainmentapplications.Commonmethodsutilizevarioustechniquesrangingfromtheidentificationandsymbolicreplacementofcommonpatternsandvariousloss-causingtechniques,whichreplaceregionswithabstractions.Thispaperpresentsanapproachwhichutilizesashareddatabaseandsuper-resolutiontechniquestodownsizeandre-enhancedatatofacilitatelow-date-ratetransmissionofimagedata.

8666-5, Session 2

Improved motion prediction for scalable extensions of HEVC (Invited Paper)XiaoyuXiu,InterDigital,Ltée(Canada);YongHe,YuwenHe,YanYe,InterDigital,Inc.(UnitedStates)

Duetotheemergenceandmaturityofnewadvancedvideocompressiontechnologies,anewvideocodingstandard,HighEfficiencyVideoCoding(HEVC),isunderdevelopmentbytheJointCollaborativeTeamonVideoCoding(JCT-VC)oftheITU-TVCEGandtheISO/IECMPEG,whichhasalreadydemonstratedsignificantimprovementinvideocompressionefficiencycomparedtotheexistingvideocodingstandardssuchasH.264/AVC.Asthefirstversionofsingle-layerHEVCstandardcomesclosetocompletion,therearegreatintereststoextendthestandardwithscalablecapabilities.Inthispaper,severaladvancedcodingtechniquesarepresentedtoimprovethemotionpredictionprocessforscalableextensions.Experimentalresultsarepresentedtovalidatetheirefficiency.

8666-6, Session 2

An HEVC extension for spatial and quality scalable video coding (Invited Paper)TobiasHinz,PhilippHelle,HaricharanLakshman,MischaSiekmann,JanStegemann,HeikoSchwarz,DetlevMarpe,ThomasWiegand,Fraunhofer-InstitutfürNachrichtentechnikHeinrich-Hertz-Institut(Germany)

NoAbstractAvailable

8666-7, Session 2

Towards a next generation open-source video codec (Invited Paper)JimBankoski,RonaldBultje,AdrianGrange,JohnKoleszar,DebarghaMukherjee,PaulWilkins,YaowuXu,Google(UnitedStates)

Googlehasrecentlystarteddevelopinganextgenerationopen-sourcevideocodec-calledVPNext,aspartoftheexperimentalbranchoflibvpxincludedintheWebMproject(http://www.webmproject.org/).StartingfromtheVP8videocodecreleasedbyGooglein2010asthebaseline,anumberofenhancementsandnewtoolshavebeenaddedtoimprovethecodingefficiency.Sofar,atleast25%bit-ratereductionhasbeenachievedoverVP8onawiderangeoftestsets.ThispaperwillpresentatechnicaloverviewofthecurrentstatusofthisprojectalongwithcomparisonsmadetoVP8andothermainstreamstate-of-the-artvideocodecs.

Thenewtoolsthathavebeenaddedsofarinclude:moremodesforINTRAprediction,variousformsofcompoundINTERprediction,?-pelmotionvectorsand8-tapswitchablesubpelinterpolationfilters,filteredprediction,improvedmotionreferencegenerationandmotionvectorcoding,improvedentropycodingandframe-leveladaptiveentropycodingforvarioussymbols,improvedloopfiltering,incorporationoflarger32x32super-blocksandcodingmodes,framelevelsegmentationtogroupsimilarareastogether,andinclusionoflarger8x8and16x16DCTs,aswellas4x4and8x8AsymmetricDSTs.Othertoolsarebeingactivelyworkedonaswell.

8666-8, Session 3

Scalable extensions of HEVC for next generation services (Invited Paper)LouisJ.Kerofsky,JaneZhao,KiranMisra,AndrewSegall,SharpLabs.ofAmerica,Inc.(UnitedStates)

NoAbstractAvailable

8666-9, Session 3

An improved hypothetical reference decoder for HEVC (Invited Paper)SachinG.Deshpande,SharpLabs.ofAmerica,Inc.(UnitedStates);MiskaHannuksela,NokiaResearchCtr.(Finland);KimihikoKazui,FujitsuLabs.,Ltd.(Japan);ThomasSchierl,Fraunhofer-InstitutfürNachrichtentechnikHeinrich-Hertz-Institut(Germany);GaryJ.Sullivan,MicrosoftCorp.(UnitedStates);Ye-KuiWang,QualcommInc.(UnitedStates)

HypotheticalReferenceDecoder(HRD)describesahypotheticaldecodermodelthatspecifiesconstraintsonaconformingbitstreamthatanencodingprocessmayproduce.HEVCbuildsuponandimprovesthedesignofthegeneralizedhypotheticalreferencedecoderofH.264/AVC.ThispaperdescribessomeofthemainimprovementsofhypotheticalreferencedecoderforHEVC.

8666-10, Session 3

On lossless coding for HEVC (Invited Paper)WenGao,MinqiangJiang,HaopingYu,HuaweiTechnologiesCo.,Ltd.(UnitedStates)

NoAbstractAvailable

Conference 8666


8666-11, Session 3

Edge adaptive intra field de-interlacing of video imagesVladimirLachine,GregorySmith,LouieLee,QualcommInc.(Canada)

Expandingimagebyanarbitraryscalefactorandtherebycreatinganenlargedimageisacrucialimageprocessingoperation.Deinterlacingisanexampleofsuchoperationwhereavideofieldisenlargedinverticaldirectionwith1:2scalefactor.Themostadvanceddeinterlacingalgorithmsuseafewconsequentinputfieldstogenerateoneoutputframe.Inordertosavehardwareresourcesinvideoprocessors,missinglinesineachfieldmaybegeneratedwithoutreferencetotheotherfields.Linedoubling,knownas“bobbing”,isthesimplestintrafielddeinterlacingmethod.However,itgeneratesalotofvisualartifacts.Eveninterpolationofaninsertedlinefromafewneighboringlinesbyverticalfiltermayproducesuchvisualartifactsas“jaggies”.

Inthisworkwepresentedgeadaptiveimageup-scalingalgorithm,whichcanproduce“jaggies”freevideooutputframes.Asafirststep,anedgeanditsparametersineachinterpolatedpixelaredetectedfromgradientsquaredtensorbasedonlocalsignalvariances.Then,accordingtotheedgeparametersincludingorientation,anisotropyandvariancestrength,thealgorithmdeterminesfootprintandfrequencyresponseoftwo-dimensionalinterpolationfilterfortheoutputpixel.Filter’scoefficientsaredefinedbyedgeparameters,sothatqualityoftheoutputframeiscontrolledbylocalcontent.Theproposedalgorithmoftheedgeadaptiveimageup-scalinghasbeenhardwareimplementedinvideodisplayprocessorforintrafielddeinterlacingofvideoimages.

8666-12, Session 3

On the efficiency of image completion methods for intra prediction in video coding with large block structuresDimitarDoshkov,OscarJottrand,ThomasWiegand,PatrickNdjiki-Nya,Fraunhofer-InstitutfürNachrichtentechnikHeinrich-Hertz-Institut(Germany)

Intrapredictionisafundamentaltoolinvideocodingwithhybridblock-basedarchitecture.Recentinvestigationshaveshownthatoneofthemostbeneficialelementsforahighercompressionperformanceinhigh-resolutionvideosistheincorporationoflargerblockstructures.Inthiswork,weinvestigatetheperformanceofnovelintrapredictionmodesbasedondifferentimagecompletiontechniquesinanewvideocodingschemewithlargeblockstructures.ImagecompletionmethodsexploitthefactthathighfrequencyimageregionsyieldhighcodingcostswhenusingclassicalH.264/AVCpredictionmodes.ThisproblemistackledbyinvestigatingtheincorporationofseveralintrapredictorsusingtheconceptofLaplacepartialdifferentialequation(PDE),LeastSquare(LS)basedlinearpredictionandtheAutoRegressivemodel.Amajoraspectofthisarticleistheevaluationofthecodingperformanceinaqualitative(i.e.codingefficiency)manner.Experimentalresultsshowsignificantimprovementsincompression(upto7.41%)byintegratingtheLS-basedlinearintraprediction.

8666-13, Session 4

Depth-layer-based multiview image synthesis and coding for interactive z- and x-dimension view switchingYuMao,TheGraduateUniv.forAdvancedStudies(Japan);GeneCheung,YushengJi,NationalInstituteofInformatics(Japan)

NoAbstractAvailable

8666-14, Session 4

Wyner-Ziv coding of depth maps exploiting color motion informationMatteoSalmistraro,MarcoZamarin,SørenO.Forchhammer,TechnicalUniv.ofDenmark(Denmark)

Distributedcodingofmulti-viewdataanddepthmapsisaninterestingandchallengingresearchfield,whoseinterestisgrowingthankstotherecentadvancesindepthestimationandthedevelopmentofaffordabledevicesabletoacquiredepthinformation.Inapplicationslikevideosurveillanceandobjecttracking,theavailabilityofdepthdatacanbebeneficialandallowformoreaccurateprocessing.Inthesescenarios,theencodingcomplexityistypicallylimitedandthereforedistributedcodingapproachesaredesirable.

Inthispaperanovelalgorithmfordistributedcompressionofdepthmapsexploitingcorrespondingcolorinformationisproposed.Thankstothehighcorrelationofthemotionincoloranddepthvideos,motioninformationfromthedecodedcolorsignalcaneffectivelybeexploitedtogenerateaccuratesideinformationforthedepthsignal,allowingforhigherrate-distortionperformancewithoutincreasingthedelayatthedecoderside.Theproposedschemehasbeenevaluatedagainststate-of-the-artdistributedvideocodingtechniquesappliedondepthdata.ExperimentalresultsshowthattheproposedalgorithmcanprovidePSNRimprovementbetween2.18dBand3.40dBondepthdatacomparedtothereferencedecoder.

8666-15, Session 4

Multimodal image registration by iteratively searching keypoint correspondencesYongLi,RobertL.Stevenson,Univ.ofNotreDame(UnitedStates)

Thispaperproposesamultimodalimageregistrationalgorithmthroughsearchingthebestmatchedkeypointsbyemployingtheglobalinformation.Keypointsaredetectedfromimagesfromboththereferenceandtestimages.Foreachtestkeypoint,acertainnumberofreferencekeypointsarechosenasmappingcandidates.Atripletofkeypointmappingsdetermineanaffinetransformation,andthenitisevaluatedwiththesimilaritymetricbetweenthereferenceimageandthetransformedtestimagebythedeterminedtransformation.Aniterativeprocessisconductedontripletsofkeypointmappings,andforeverytestkeypointupdatesandstoresitsbestmatchedreferencekeypoint.Thesimilaritymetricisdefinedtobethenumberofoverlappededgepixelsoverentireimages,allowingforglobalinformationbeingincorporatedinevaluatingtripletsofmappings.ExperimentalresultsshowthattheproposedalgorithmcanprovidemoreaccurateregistrationthanexistingmethodsonEO-IRimages.

8666-16, Session 4

A spatially varying PSF model for Seidel aberrations and defocusJonathanD.Simpkins,RobertL.Stevenson,Univ.ofNotreDame(UnitedStates)

Contrarytocommonassumptionsintheliterature,theblurkernelcorrespondingtolens-effectblurhasbeendemonstratedtobespatially-varyingacrosstheimageplane.Existingmodelsforthecorrespondingpointspreadfunction(PSF)areeitherparameterizedandspatially-invariant,orspatially-varyingbutad-hocanddiscretely-defined.

Inthispaper,wedevelopandpresentanovel,spatially-varying,parameterizedPSFmodelthataccountsforSeidelaberrationsanddefocusinanimagingsystem.Wealsodemonstratethattheparametersofthismodelcaneasilybedeterminedfromasetofdiscretely-definedPSFobservations,andthatthemodelaccuratelydescribesthespatialvariationofthePSFfromatestcamera.

Conference 8666


8666-17, Session 5

Efficient determination of intra predictability in H.264/AVC and similar codecsSeyfullahH.Oguz,QualcommInc.(UnitedStates)

IntrapredictionasintroducedbyH.264/MPEG-4AVCisaverypotentimageandvideocompressiontooltoexploitspatialcorrelationwithinapicturepriortotransformcoding.InanextendedformithasbeenalsoadoptedtotheHEVCstandard.Thesignificantlyincreasedintracodingefficiencyprovidedbythistoolcomparedtoearliercodingstandardscomesattheexpenseofconsiderablyincreasedcomputationalrequirements.Withincreasingspatio-temporalresolutionsforvideosignalsandtheconstraintsimposedbyreal-timeencoding,thesignificanceofcomputationalrequirementsbecomesmorepronounced.

Inthispaper,anovel,computationallyefficientalgorithmtoassesstheintrapredictabilityofanimageblockisintroduced.Intheproposedalgorithm,theinputimageblocksizeischosentobelargerthanthesmallestblocksizeusedforintrapredictionforexample8x8and4x4respectively.Thelargersizeoftheimageblockinputandanalyzedenablescapturingtheprominentdirectionalstructurepresentintheimageblockwithinasupportregionlargerthanthe(smallest)blocksizeusedforintrapredictioni.e.atalargerscale,andhencecontributestotheoverallefficiencyofthealgorithm.Theproposedalgorithmsubsamplesthepixelsoftheinputimageblockontwoormore2Dgrids,theaxesofwhicharerotatedintheplane.Forexample,forH.264/MPEG-4AVC,two4x4subsamplinggridsareused,theaxesofwhicharerotatedbypi/4radiansrelativetoeachother.Thepixelsoftheinputimageblocke.g.8x8sized,aresubsampledonthesetwodifferent4x4grids,and4x4Hadamardtransforms–alreadyinH.264/MPEG-4AVCtoolset–areappliedtobothsubsampledpixelsets.Analysisofthetransformcoefficientsresultingfrombothgrids’transformsprovidesaccurate,conciseinformationaboutthepresenceandnatureofanyprominentsmoothordirectionalstructuree.g.horizontal,vertical,oblique(thetwodifferentdiagonals),presentwithintheinputimageblock.Hence,inadditiontoinferringtheintrapredictabilityoftheinputimageblockatalargescale,theproposedalgorithmalsopointstothepotentialintrapredictiondirection(s)ineffect.FeaturesofthealgorithmsuchastheuseofHadamardtransformsandsimplesumsofcoefficientmagnitudesbasedcomparisonsfacilitateitsfastcomputation.

RDefficiencycomparisonswithanotherfastintrapredictionalgorithmandfutureworkconsiderationsareprovided.

8666-18, Session 5

Cubic-panorama image dataset analysis for storage and transmissionSaeedSalehi,EricDubois,Univ.ofOttawa(Canada)

Inthisworkweaddresstheproblemofcubic-panoramaimagedatasetanalysisforstorageandtransmissionwithemphasisondisparityestimationanddisparityvectortranscoding.Basedonourearlierworksoncubic-panoramaimagedatasetsandbyusingpropertiesoftheEpipolargeometry,anoveldisparityestimationanddisparityvectortranscodingmethodsuitableforcubic-panoramaimagedatasetsispresented.Ouraimistoachieveincreasedrate-distortionperformancewithequivalentcomputationalcomplexityforstorageascomparedtotheclassicsolutions.Comparingthecomputationalcomplexity,searchrange,andsearchdirection,itisshownthatourmethodismoreintelligent.PerformanceimprovementisshownbothsubjectivelyandobjectivelyindisparityestimationerrorusingPSNR(PeakSignaltoNoiseRatio)measure.Furthermore,obtainedresultsareusedinthetransmissionstageoftheproject.Ourdisparityvectortranscodingmethodcanbeusedtoconvertcubic-panoramasofIPIPformattorectangularviewsofsimilarpredictionstructureveryefficiently.SimilarapproachcanbeusedincaseofIBIBstructure.Subsequently,inapplicationswhereatthetransmissionstageIPPPsequenceispreferred,ourmethodcanbefollowedbyavideotranscodingschemetoconvertthesequencefromrectangularIBIBformattorectangularIPPPformat.

8666-19, Session 5

Efficient streaming of stereoscopic depth-based 3D videosDogancanTemel,MohammedAabed,GeorgiaInstituteofTechnology(UnitedStates);MashhourSolh,TexasInstrumentsInc.(UnitedStates);GhaassanAlregib,GeorgiaInstituteofTechnology(UnitedStates)

Inthispaper,wesummarizeourcurrentworkonreconstructingthedepthmapfrommonocularcuesanddepthmapstatistics.Wefirstanalyzeagroundtruthdepthmaptoextractasetofdepthcuesorstatistics.Then,basedonthesedepthcues,weprocessthecoloredreferencevideoandgenerateanestimateofthedepthmappermonocularcue.Themonocularcuesweconsiderinthisworkaremotion,texture,andintensity.TheprocessingofeachchannelintheYCRCBcolorspaceisconductedseparately.Wetestedourapproachondifferentvideosequenceswithdifferentmonocularproperties.Theresultsshowthattheextracteddepthmapsgeneratea3Dvideowithqualityclosetothevideorenderedusingthegroundtruthdepthmap.Wereportobjectiveresultsusing3VQMandPSNR.Furthermore,weanalyzethesavingsinbitrateasaconsequenceofeliminatingtheneedfortwovideocodecs,oneforthereferencecolorvideoandoneforthedepthmap.Inthiscase,onlythedepthcuesaresentasasideinformationwiththecolorreferencevideoandthiscanleadtobit-ratesavingsupto40%.

Conference 8666


Conference 8667A: Mobile Imaging System Design and Image QualityTuesday5–5February2013

Part of Proceedings of SPIE Vol. 8667 Multimedia Content and Mobile Devices

8667-43, Session 7

Mobile-based text detection and recognition for visually impaired personsMustafaI.Jaber,JeremiSudol,BingSong,IpplexHoldingsCorp.(UnitedStates)

Inthispaper,weproposeasystemfordetectingandlocalizingtextregionsinimagesandvideoscapturingprintedpage,books,magazine,mailenvelope,andreceiptinreal-timeusingsmartphonecamera.Thesystemincludesstagesfori)identifyingtextregionsfromlow-resolutionvideoframes,ii)generatingaudiofeedbacktoguidethevisuallyimpairedpersonaltocapturetheentiretextregioninthescene,iii)triggeringthecameratocaptureahigh-resolutionstill-imageofthesamescene,iv)recognizingthetextregionsusingoff-the-shelfopticalcharacterrecognitiontoolthatrunsonthemobiledeviceorinthecloud,andv)pronouncingtherecognizedtextusingtext-to-speechmodule.Theoriginalityoftheproposedalgorithmstandsinitsreal-timeaudioguidedfeedbacktocaptureanacceptableimagefortheOCRengine.Methodsforcornerdetection,connectedcomponentanalysis,andparagraphstructuretestareusedintheproposedtextdetectionmodule.ThealgorithmhasbeentestedoniPhonedevisewhereenhancedperformancewasachieved.Theusagesimplicityandavailabilityoftheapplicationonsmartphonesgaveitsadvantageovertraditionalscanner-basedOCRsystems.

8667-45, Session 7

Determination of sensor oversize for stereo-pair mismatch compensation and image stabilizationPrajitKulkarni,AptinaImagingLLC(UnitedStates)

Stereoscopiccamerasconsistoftwosensormodulesthatintheoryaremountedparalleltoeachotheratafixeddistancealongasingleplane.Practicaltolerancesinthemanufacturingandassemblyprocesscan,however,causemismatchesintherelativeorientationofthemodules.Onesolutiontothisproblemistodesignsensorsthatimagealargerfield-of-viewthanisnecessarytomeetsystemspecifications.Thisrequiresthecomputationofthesensoroversizeneededtocompensateforthevarioustypesofmismatch.Thisworkintroducesamathematicalframeworktocomputetheseoversizevaluesformismatchalongeachofthesixdegreesoffreedom.Thebasisofthemathematicaltreatmentisthattheoriginalfield-of-viewforeachsensorwillneedtobemaintaineddespiteanyrelativedisplacementbetweenthesensormodules.Onemoduleisconsideredasthereferenceandtheextremeraysofthefield-of-viewofthesecondsensoraretracedinordertoderiveequationsfortherequiredhorizontalandverticaloversize.Asafurtherapplication,bymodelinguserhand-shakeasthedisplacementofthesensorfromitsintendedposition,thesedeterministicequationscouldbeusedtoestimatethesensoroversizerequiredtostabilizeimagesthatarecapturedusingcellphones.

8667-46, Session 8

Image quality evaluation using moving targetsUweArtmann,ImageEngineeringGmbH&Co.KG(Germany)

Theimagequalityishighlyinfluencedbythesignalprocessingfornoiseandresolutionandtheprocessingisthemainreasonforthelossoflowcontrast,finedetails,thesocalled“textureblur”.Wepresentan

approachtodescribetheimageprocessinginmoredetail.

Allstandardizedtestmethodsuseadefinedchartandrequire,thatthechartandthecameraarenotmovedinanywayduringtest.Inthispaper,wepresentourresultsinvestigatingtheinfluenceofchartmovementduringthetest.

Differentcharts,optimizedfordifferentaspectsofimagequalityevaluation,aremovedwithadefinedspeedduringthecapturingprocess.Thechartmovementwillchangetheinputforthesignalprocessingdependingonthespeedofthetargetduringthetest.

Thebasictheoreticalchangesintheimagewillbetheintroductionofmotionblur.Withtheknownspeedandthemeasuredexposuretime,wecancalculatethetheoreticalmotionblur.Wecomparethetheoreticalinfluenceofthemotionblurwiththemeasuredresults.Weusedifferentmethodstoevaluateimagequalityparametervs.motionspeedofthechart.ForthistestseriesweuseaD-SLRcameraasareferenceanddifferentmobilephonecameras.

8667-47, Session 8

Multiple-field approach for aberration correction in miniature imaging systems based on wafer-level productionEricLogean,ToralfScharf,NicolasBongard,HansPeterHerzig,EcolePolytechniqueFederaledeLausanne(Switzerland);MarkusRossi,Heptagon(Switzerland)

Mobileimagingsystemsareoftenlimitedbytheobjectivelens.Herewepresentanintermediateapproachbetweenthecostlytraditionalobjectivesandthelow-resolutionobjectivesinspiredbytheinsecteyes.Ourmulti-fieldapproachusesasmallnumberofopticalchannelseachimagingaportionofthedesiredfieldofview.Thefull-fieldimageisobtainedbydigitalreconstruction.Eachopticalchannelisofadequatedimensiontoobtainimageswithanangularresolutionclosetotheresolutionofthehumaneye,andsimultaneously,itiskeptsimpleforeasymassproductionusingwafer-leveltechnology.

Wepresentthedesignandfabricationoftwoprototypes,eachusing9plano-convexlenses(9channels)toimageafieldof+/-30degreeswithfieldcurvaturecorrection.ThefirstprototypeusesglasslensesfromEdmundOpticsgluedonawafertoformanmulti-fieldimagingsystemwithanf-numberof3.Thesecondprototypeuseslensesfabricatedbythereflowtechnique.Ithasanf-numberof5.Experimentalresultsincludingsimpleimagereconstructionsarepresented.

Inconclusion,wehavepresentedobjectivesfabricatedwithmicro-opticstechnologyandusingthemulti-fieldconceptwhicharethin,simpletomount,robust,andeasilyreplicated.

8667-48, Session 8

Color correction using multi-cycle feedback system for adaptive color image quality in mobile phone cameraSetaekOh,JaeminJoo,JinWookKwon,ChangChunGoh,SangJoKim,SAMSUNGElectronicsCo.,Ltd.(Korea,Republicof)

Recently,Cameramodulesysteminmobilephoneissteadilydevelopedbytheminiatureofchipsizeandoptimizedlensdesignforhighendlevel.Howeverithasdefiniteadvantagessuchasusefulsize,optimizedweightandefficientpowercontrolinCMOSimagesensor,itisdifficulttomakeitimplementhighqualityimageonaccountofskillfullimitationofimageprocessing.Colorcorrectionisimportantinvisionapplicationsandprocesstoimprovecolorconstancyof


image.ColorsfromCMOSorCCDimagesensordonotmatchcolorsofthephotographedobjectduetooutputcharacteristic’sdifferencecomparedtostandardcolor.Therefore,weneedcolorcorrectiontocomplementthatweaknesssimilartoidealcameraimage.Intheconventionalmethod,colorcorrectionisimplementedbyempiricalmethodintermsoflotsofiterativeexperiments.Inthispaper,weproposefaithfultotheoriginalcolorsadaptivecolorcorrectionusingmulticyclefeedbacksystemforhighqualityimage.Multicyclefeedbacksystemistheadaptivemethodthatcolormatrixatcolorspace(R-G-B-Y-M-C)isseparatedandeachcolor’svaluereproducesrepetitively.Also,wecanevaluateaproposedcolorcorrectionmethodquantitativelyusingcolormatchingmethod.Theacquiredexperimentresultsshowtheproposedcolorcorrectionisclosetothequantitativevalueofidealortargetcamera.Inaddition,wecanconfirmthedramaticimprovementoftheimagequalityforrealtakenpictureapplicability.

8667-49, Session 8

Auto-focus algorithm based on statistical blur estimationPrajitKulkarni,AptinaImagingLLC(UnitedStates)

Conventionalauto-focustechniquesinmovable-lenscamerasystemsuseameasureofsharpnesstodeterminethelenspositionthatbringsthesceneintofocus.Thispaperpresentsanovelwavelet-domainapproachtodeterminethepositionofbestfocus.Incontrasttocurrenttechniques,theproposedalgorithmestimatesthelevelofblurinthecapturedimageateachlensposition.ThedegreeofblurinanimageisquantifiedusingameasureofGaussianityinthehigh-passfilteredversionoftheimage.Thealgorithmovercomesseveralshortcomingsofthesharpness-basedapproach.Sharpnessestimationrequirestheapplicationofband-passfiltersandissensitivetoimagenoise.Thisleadstolargerfiltersandtheuseofdata-dependentthresholdswhichnecessitatestuningandcalibrationfordifferentimagingconditions.Theproposedblur-basedmethodworksonthestatisticscomputedfromafilteredversionoftheinputimage.Sincenoisehasnoimpactonperformance,thealgorithmworkswithashortfilterandisdevoidofthresholdsandparametertuning.Theseadvantages,alongwiththeoptimizationpresentedinthepaper,maketheproposedveryattractiveforhardwareimplementationoncellphones.Experimentsprovethatthealgorithmperformsverywellonnoisyimagesandunderresolutionanddatascaling.

8667-77, Session 8

Nokia PureView oversampling technology (Invited Paper)TeroVuori,JuhaAlakarhu,EeroSalmelin,AriPartinen,NokiaCorp.(Finland)

NoAbstractAvailable

8667-50, Session 9

Low light performance of digital still camerasDietmarWüller,ImageEngineeringGmbH&Co.KG(Germany)

Lookingatthelatestdevelopmentsincamerasthepixelraceisstillgoingonandpixelpitchescloseto1micronhavebecomereality.Especiallyinmobilecamerassensorsizeandtherelatedmodulesizematters.Withshrinkingpixelsizesthesensitivityofsensorsdecreases.Moderntechnologiestrytocompensateforthis.Ontheonehandbacksideilluminationincreasesthequantumefficiencyofthesignalitself.Ontheotherhandimageprocessingisusedtoincreasethequalityoftheamplifiedsignalafterwards.

Onemajoraspectofimageprocessingisnoisereductionthatdecreasestheamountofnoiseintheimages.Unfortunatelytheprocessingalgorithmsfacetheproblemtodifferentiatebetween

finedetailintheimagesandnoise.Thereforethedenoisingalsoaffectslowcontrastfinedetailandreducesresolutionintheimages.Noisereductionistypicallyalsohigherincolorchannelsthanontheluminancepartofthesignalresultingadecreaseofcolorfidelity.

Whileloweringtheilluminationlevelsomeexperimentslookatthedifferentimagequalityaspectsforahugevarietyofcameras.Thepaperwilldiscussandsummarizetheresultsoftheseexperimentsandwillbeaninputforthediscussionofthestandardscommitteeonhowtoproceedforwardtocreatealowlightperformancemeasurementstandard.

8667-51, Session 10

Noise evaluation standard of image sensor using visual spatio-temporal frequency characteristicsTakeyukiFujii,ShoichiSuzuki,ShinichiroSaito,SonyCorp.(Japan)

Regardingthenoiseevaluationofimagesensor,itisimportanttoestablishthevisualnoisestandard,whichisanoiseevaluationmetricusingvisualcharacteristics.

Thevisualnoiselevelcanvarydependingontheviewingdistance,spatialfrequency,colorandviewingconditionsandframerate.AmethodofmeasuringthevisualnoiselevelisprovidedinISO15739.Visualcharacteristicsdependoncontrastandframerate;however,themethoddoesn’tconsiderthat.Weproposesolutionstosolvetwoproblemsusingvisualspatio-temporalfrequencycharacteristics.

Firstly,weinvestigatedvisualspatialfrequencycharacteristicsthatdependoncontrastandproposeanewevaluationmethod.Itshowsthattheimagesensorwithlargepixelscountiseffectiveinnoisereduction.

Secondly,weinvestigatedvisualtemporalfrequencycharacteristicsandproposeanewevaluationmethodforthemovingimage.Itshowsthattheimagesensorwithhighframerateiseffectiveinnoisereduction.

Finally,bycombiningtwoproposedmethods,weshowthemethodinwhichnoiseevaluationispossibleinbothastillimageandmovingimagesequences.Weappliedtheproposalmethodtomovingimagesequencesacquiredbytheimagesensorandinvestigatedthevalidity.Theproposedmethodshowedgoodperformanceintheevaluationexperiment.

Conference 8667A


Conference 8667B: Mobile Computational PhotographyMonday-Tuesday4–5February2013


8667-52, Session 11

Image deblurring in smartphone devices using built-in inertial measurement sensorsOndrejSindelar,CharlesUniv.inPrague(CzechRepublic);FilipSroubek,InstituteofInformationTheoryandAutomation(CzechRepublic)

Longexposurehand-heldphotographyisdegradedwithblur,whichisdifficulttoremovewithoutpriorinformationaboutthecameramotion.Inthiswork,weutilizeinertialsensors(accelerometersandgyroscopes)inmodernsmartphonestodetectexactmotiontrajectoryofthesmartphonecameraduringexposureandremoveblurfromtheresultingphotographybasedontherecordedmotiondata.ThewholesystemisimplementedonAndroidplatformandembeddedinthesmartphonedeviceresultinginaclose-to-real-timedeblurringalgorithm.Toourknowledgethisisthefirstattemptinthisdirectionandrendersthemethodparticularlyappealingforendusers.Theperformanceoftheproposedsystemisdemonstratedinreal-lifescenarios.Wehavejustifiedthespace-invariantsimplificationforcertaincameramotions,butsimultaneouslywehaveuncoveredintrinsicsourcesofspace-variantblur,suchasrollingshutter.Thespace-variantimplementationofthedeblurringalgorithm,whichwouldsolvesomeofthecurrentissues,isintheorypossible,butthecomputationalcostonthesmartphonemaybetoohigh.Itwillbeatopicofourfutureresearchtofindoutwhetherthisisviable.

8667-53, Session 11

Compensating specular highlights for non-Lambertian projection surfacesChen-TaiKao,Tai-HsiangHuang,HomerH.Chen,HuaLee,NationalTaiwanUniv.(Taiwan)

Wehavepresentedamethodforradiometriccompensationofspecularhighlightofanon-Lambertianprojectionsurface.Anattractivefeatureofthemethodisthatitrelaxesthelimitationontheviewer’spositionwithrespecttothecamera.Ourmethodisabletoautomaticallyreconstructspecularhighlightresponseoftheprojectionsurfaceanddynamicallycompensatetheimageforvieweratanarbitraryviewingangle.Weimplementedthisalgorithmontheradiometriccompensationframeworkproposedin[1].Asapreprocessingmoduleofthewholeprocamsystem,theproposedalgorithmissimpleandefficient.Onlyonecalibrationimageisneeded.Experimentalresultsshowthatouralgorithmcaneffectivelypredictthespecularlightandeliminatethephotometricdistortionseenatanyarbitraryviewingangle.Theprojectionqualityisgreatlyimproved.Sincethealgorithmisdesignedforaprocamsystem,itiswellapplicableformobiledeviceswithprojectorandcameraboundtogetherandusinganearbywallastheadhocscreenforprojection.

8667-54, Session 12

On plenoptic camera resolutionAndrewLumsdaine,IndianaUniv.(UnitedStates);TodorG.Georgiev,QualcommInc.(UnitedStates);LilieLin,IndianaUniv.(UnitedStates)

Plenopticcamerascapturethe4Dradianceofascene--twodimensionsofspatialinformationandtwodimensionsofangularinformation.Finalimagesarerenderedfromthe4Ddatausingintegralprojection,ofteninconjunctionwithashearingtransformationusedtoeffect“refocusing”ofthefinalimage.Thevarioustransformationsappliedtotheradiancecomplicatethequestionofwhatisthe

achievableresolutionofthefinalrenderedimage.Inthispaperweanalyzethegeometryofdiscreteplenopticfunctioncapturetodeterminetheoreticallimitsofplenopticcameraresolutionandprovideexperimentalresultstoverifyouranalyses.

8667-55, Session 12

Wave analysis of a plenoptic system and its applicationsSapnaA.Shroff,KathrinBerkner,RicohInnovations,Inc.(UnitedStates)

Traditionalimagingsystemsdirectlyimagea2Dobjectplaneontothesensor.Plenopticimagingsystemscontainalensletarrayattheconventionalimageplaneandasensoratthebackfocalplaneofthelensletarray.Inthisconfigurationeachlensleteffectivelyimagestheapertureofthemainimaginglensatthesensor.Thereforethedatacapturedatthesensorretainsangularlight-fieldinformation.Ifafilterarrayisplacedatthepupilapertureofthemainimaginglens,eachlensletimagesthefilterarrayontothesensor.Thisenablestheseparationofmultiplefiltermodalitiesgivingsinglesnapshot,multi-modalimages.Plenopticsystemsareincreasinglybeingusedinrecenttimesforimaginginvariousapplications.Astheapplicationofthesesystemsmovestowardsmicroscopesandothercomplexsystemsandaspixelsizesbecomesmaller,considerationofdiffractioneffectsinthesesystemsbecomesincreasinglyimportant.Wediscussaplenopticsystemanditswavepropagationanalysisforboththecoherentandincoherentcases.Wesimulateasystemresponseusingouranalysisanddiscussvariousapplicationsofthesystemresponsepertainingtoplenopticsystemdesign,implementationandcalibration.

8667-56, Session 12

Fourier analysis of the focused plenoptic cameraAndrewLumsdaine,LilieLin,IndianaUniv.(UnitedStates);TodorG.Georgiev,QualcommInc.(UnitedStates)

Thefocusedplenopticcameraisarecentlydevelopedapproachtolightfieldcapturethatusesthemicrolensarrayasanimagingsystemfocusedonthefocalplaneofthemaincameralens.Sincelightfieldscanbecapturedwithsignificantlyhigherspatialresolutionthanwiththetraditionalapproach,imagescanberenderedatresolutionsthatmeettheexpectationsofmodernphotographers.Thefocusedplenopticcameracaptureslightfieldswithadifferenttradeoffbetweenspatialandangularinformationthanwiththetraditionalapproach.Tomorerigorouslycharacterizethesetradeoffs,includingthelimitsofthisnewapproach,thispaperpresentsaFourieranalysisofthefocusedplenopticcamera.Basedonthisanalysis,wealsopresentanextendedFourier-slicerenderingalgorithmthatcanbeusedtorenderhigh-resolutionimagesfromlightfields.

8667-57, Session 13

Design of user interfaces for selective editing of digital photos on touchscreen devicesThomasBinder,MeikelSteiding,ManuelWille,NilsKokemohr,NikSoftwareGmbH(Germany)

Wheneditingimagesitisoftendesirabletoapplyafilterwithaspatiallyvaryingstrength.Withtheusualselectiontoolslikegradient,lasso,or


brushtools,creatingmaskscontainingsuchspatiallyvaryingstrengthvaluesistime-consumingandcumbersome.Wepresentaninteractivefilteringapproachwhichallowstoprocessphotosselectivelywithouttheintermediatestepofcreatingamaskcontainingstrengthvalues.Inusingthisapproach,theuseronlyneedstoplacereferencepoints(calledcontrolpoints)ontheimageandadjuststhespatialinfluenceandfilterstrengthforeachcontrolpoint.Thefilteristhenappliedselectivelytotheimage,withstrengthvaluesinterpolatedforeachpixelbetweencontrolpoints.Theinterpolationisbasedonamixtureofdistancesinspace,luminance,andcolor;itisthereforealow-leveloperation.Sincethemaingoaloftheapproachistomakeselectiveimageeditingintuitive,easyandplayful,emphasisisputontheuserinterface:Wedescribetheevolutionprocessofdevelopinganexistingmouse-drivenuserinterfaceintoatouchscreen-drivenone.Manyquestionneededtobeansweredanew,suchashowtopresentasliderwidgetonatouchscreen.Severalvariantsarediscussedandcompared.

8667-58, Session 13

Touch HDR: photograph enhancement by user controlled wide dynamic range adaptationSteveM.Verrall,QualcommInc.(UnitedStates);HasibSiddiqui,QualcommInc(UnitedStates);KalinAtanassov,SergioR.Goma,VikasRamachandra,QualcommInc.(UnitedStates)

HighDynamicRange(HDR)technologytypicallyconsistsof(1)acquiringawidedynamicrangeimageand(2)adaptingthewidedynamicrangeimagetofittothedisplayrangeofthedevice.Thefirstpartcanbeachievedinmultipleways,twocommonexamplesarebyusingaspecialsensororbycombiningtwoormoreimageswithsameordifferentexposures.Thesecondpartlocallyadaptsthetonemappingfunctionthuseffectivelyre-usingtheavailablerange.Inthispaperweaddressthesecondpartandwedescribeaphotoeditingtechnique,referredtoas“TouchHDR”thatenablestheusertocreateanewimagebyselectivelyenablingandspecifyingtheaggressivenessoftheadaptation.

8667-59, Session 13

Image and video denoising and enhancement through frame stacking and alignmentKalinAtanassov,JamesNash,SergioR.Goma,VikasRamachandra,QualcommInc.(UnitedStates);HasibSiddiqui,QualcommInc(UnitedStates)

Duetothecurrentcell-phonecameraspixelsize,thereisagapinthelow-lightandhighdynamicperformanceofthosecamerascomparedwithtraditionalcameras(DSC/DSLRs).Mostofthesystemsconnectedtothosecamerashavesignificantlymoreprocessingresourcesthantraditionalcameras.Mobilecomputationalphotographyisanemergingfieldthattriestoleveragethoseresourcesandcompensatethelackinperformance.Inthispaperweproposeanalgorithmthatreducesthenoiseandincreasesthedynamicrangeofafinalimageby“stacking”sequentiallycapturedframes.Theframestackingisdonebyframealignmentsubjecttoaprojectivetransformandtemporalanisotropicdiffusion.Ourapproacheffectivelyaccomplishesanaverageofimagesinwhichwereducethequantizationanddarkcurrentnoiseuptothelimitofthefixedpatternnoise.Furtherimageimprovementisaccomplishedbyfixedpatternnoisecharacterizationandremoval.Toenhancetheresultweapplyadaptivecontrastenhancement,andshow10-20timesdynamicrangeimprovementfortypicalscenes

8667-60, Session 13

Accelerating defocus blur magnificationFlorianKriener,ThomasBinder,ManuelWille,NikSoftwareGmbH(Germany)

Ashallowdepthoffieldisoftenusedasacreativeelement,forexampleinportraitphotography.This,however,comesatthecostofexpensive

andheavycameraequipment,suchaslargesensorDSLRbodiesandfastlenses.Incontrast,cheapsmall-sensorcameraswithfixedlensesusuallyexhibitalargerdepthoffieldthanissometimesdesirable.Inthiscaseacomputationalsolutionissuggesting,sinceashallowdepthoffieldcannotbeachievedbyopticalmeans.Onepossibilityistoalgorithmicallymagnifythedefocusbluralreadypresentintheimage.Yet,existingalgorithmicsolutionsinthisdirectionsufferfrompoorperformanceduetotheill-posednessoftheproblem:Theamountofdefocusblurcanbeestimatedatedgesonly;homogeneousareasdonotcontainsuchinformation.Inordertoenhancethedefocusblur,thesegapsneedtobeclosedwhichrequirestosolveaverylargeoptimizationproblem.

Weproposeafasterwaytopropagatetheamountofblurfromtheedgestotheentireimagebysolvingtheoptimizationproblemonacoarsescale,followedbyedge-awareupsamplingusingtheoriginalimageasguide.Theresultingapproximatedefocusmapcanbeusedtosynthesizeimageswithshallowdepthoffieldwithaqualitycomparabletotheoriginalapproach.Thisisdemonstratedbyexperimentalresults.

8667-62, Session 14

Adaptive DOF for plenoptic camerasAlexanderOberdörster,Fraunhofer-InstitutfürAngewandteOptikundFeinmechanik(Germany);HendrikP.A.Lensch,EberhardKarlsUniv.Tübingen(Germany)

Plenopticcameraspromisetoprovidearbitraryre-focusingthroughasceneafterthecapture.Inpractice,however,therefocusingrangeislimitedbythedepthoffield(DOF)oftheplenopticcamera.Forthefocusedplenopticcamera,thisrangeisgivenbytherangeofobjectdistancesforwhichthemicrolensimagesareinfocus.

Weproposeatechniqueofrecordinglightfieldswithanadaptivedepthoffocus.Betweenmultipleexposures--ormultiplerecordingofthelightfield--themicrolensarrayoftheplenopticcameraismoved.ThedepthandqualityoffocusisadjustedbychangingthenumberofexposuresandthespacingoftheMLAmovements.

Incontrasttotraditionalcameras,extendingtheDOFdoesnotforceanall-in-focusimage.Instead,therefocusrangeisextended.Thereisfullcreativecontrolaboutthefocusdepth;imageswithshalloworselectivefocuscanbegenerated.

8667-63, Session 14

Plenoptic depth map in the case of occlusionsZhanYu,JingyiYu,Univ.ofDelaware(UnitedStates);AndrewLumsdaine,IndianaUniv.(UnitedStates);TodorG.Georgiev,QualcommInc.(UnitedStates)

Recentrealizationofhand-heldplenopticcamerashasgivenrisetopreviouslyunexploredeffectsinphotography.DesigningamobilephoneplenopticcameraisbecomingfeasiblewiththesignificantincreaseofcomputingpowerofmobiledevicesandtheintroductionofSystemonChip.However,capturinghighnumberofviewsisstillimpracticalduetospecialrequirementssuchasultra-thincameraandlowcosts.Inthispaper,weanalyzeamobileplenopticcamerasolutionwithsmallnumberofviews.Suchcameracanproducerefocusablehighresolutionfinalimageifadepthmapisgeneratedforeverypixelinthesparsesetofviews.

Withthecapturedmulti-viewimages,theobstacletorecoverahigh-resolutiondepthisocclusions.Torobustlyresolvethese,wefirstanalyzethebehaviorofpixelsinsuchsituations.Weshowthatevenundersevereocclusion,onecanstilldistinguishdifferentdepthlayersbasedonstatistics.Weestimatethedepthofeachpixelonbydiscretizingthespaceinthesceneandconductingplanesweeping.Specifically,foreachgivendepth,wegatherallcorrespondingpixelsfromotherviewsandmodelthein-focuspixelsasaGaussiandistribution.Weshowhowitispossibletodistinguishocclusionpixels,andin-focuspixelsinordertofindthedepths.Finaldepthmapsarecomputedinrealscenescapturedbyamobileplenopticcamera.

Conference 8667B


8667-64, Session 14

Reduced depth of field using multi-image fusionBorisAjdin,Univ.ofUlm(Germany);TimoAhonen,NokiaResearchCtr.(UnitedStates)

Thispaperpresentsanovelmulti-imagefusionmethodforartificiallyreducingdepthoffieldinphonecameraphotographs.Inadditiontotwohigh-resolutionimages,onefocusedontheobjectofinterestselectedbytheuserusingasingletap,andanotheronewiththemaximumbackgroundblur(achievedbyfocusingatthenearfocusplane),thesystemcapturesaVGAresolutionfocalstackofimageswithvariablefocussettings.Thefocalstackisthenusedtoguideagraphcutbasedsegmentationoftheobjectofinterestfromthesharphigh-resolutionimagebycomparingperpixelthesharpnessprofileacrossthestackwiththesharpnessprofileofthepointofinterest.Afterwardsthesegmentedobjectisblendedintothebackgroundfromtheblurryhigh-resolutionimage,resultinginavisuallypleasingimagewithashallowdepthoffieldeffect.

8667-65, Session 14

Optimizing depth of field based on a range map and a wavelet transformMikeWellner,PatternRecognitionLab.(Germany);ThomasKäster,PatternRecognitionCo.GmbH(Germany);ThomasMartinetz,ErhardtBarth,Univ.zuLübeck(Germany)

Theimagingpropertiesofsmallcamerasinmobiledevicesexcluderestricteddepthoffieldanddistance-dependentblurwithasensationofdepth.Algorithmicsolutionstothisproblemusuallyfailbecausehigh-quality,denserangemapsarehardtoobtain,especiallywithamobiledevice.However,methodslikestereo,shapefromfocusstacks,andtheuseofflashlightsmayyieldcoarseandsparserangemaps.Astandardprocedureistoregularizesuchrangemapstomakethemdenseandmoreaccurate.Inmostcases,regularizationleadstoinsufficientlocalizationandsharpedgesindepthcannotbehandledwell.Inawaveletbasis,animageisdefinedbyitssignificantwaveletcoefficients,onlytheseneedtobeencoded.Ifwewishtoperformrange-dependentimageprocessing,weonlyneedtoknowtherangeforthesignificantwaveletcoefficients.Wethereforeproposeamethodthatdeterminesasparserangemaponlyforsignificantwaveletcoefficients,andthenweightsthewaveletcoefficientsdependingontheassociaterangeinformation.Theimagereconstructedfromtheresultingwaveletrepresentationexhibitsspace-variant,range-dependentblur.Wepresentdifferentresults,e.g.basedonimagesandrangemapsobtainedwiththestereocameraofthemobilephoneLGP920.

8667-66, Session 15

Resolution and sensitivity of wafer-level multi-aperture camerasAlexanderOberdörster,Fraunhofer-InstitutfürAngewandteOptikundFeinmechanik(Germany);HendrikP.A.Lensch,EberhardKarlsUniv.Tübingen(Germany)

Thescalinglimitsofmulti-aperturesystemshavebeenwidelydiscussedfromaninformation-theoreticalstandpoint.Whiletheseargumentsarevalidasanupperlimit,thereal-worldperformanceofmass-marketsystemsformobiledevicesisstillrestrictedbyopticalaberrations.Wearguethataberrationscanbemoreeasilycontrolledwithcertainarchitecturesofmulti-aperturesystems,especiallythosemanufacturedonwaferscale(WLO).Wecomplementouranalysiswithmeasurementsofonesingle-andonemulti-apertureWLOcamera,showingthatmulti-aperturesystemscanindeeddeliversuperiorperformance.Weexaminebothsharpnessandsensitivity,givingmeasurementsofMTFandSNR.

8667-67, Session 15

Refinement of depth maps by fusion of multiple estimatesBalajiKrishnamurthy,AnubhaRastogi,ADOBESystemsIndiaPrivateLtd.(India)

NoAbstractAvailable

8667-68, Session 15

On the application of the plenoptic camera to mobile phonesIciarMontillaGarcia,InstitutodeAstrofísicadeCanarias(Spain);MartaPuga,JoseG.Marichal-Hernandez,JonasPhilippLüke,JoséManuelRodríguez-Ramos,Univ.deLaLaguna(Spain)

TheplenopticcamerawasoriginallycreatedtoallowthecaptureoftheLightField,afour-variablevolumerepresentationofallraysandtheirdirections,thatallowsthecreationbysynthesisofa3Dimageoftheobservedobject.Thismethodhasseveraladvantageswithregardto3Dcapturesystemsbasedonstereocameras,sinceitdoesnoteneedframesynchronizationorgeometricandcolorcalibration.Andithasmanyapplications,from3DTVtomedicalimaging.Aplenopticcamerausesamicrolensarraytomeasuretheradianceanddirectionofallthelightraysinascene.Thearrayisplacedatthefocalplaneoftheobjectivelens,andthesensorisatthefocalplaneofthemicrolenses.Inthispaperwestudytheapplicationofoursuperresolutionalgorithmtomobilephonescameras.Withacommercialcamera,itisalreadypossibletoobtainimagesofgoodresolutionandenoughnumberofrefocusedplanes,justplacingamicrolensarrayinfrontofthedetector.

8667-69, Session 15

A novel 3D model for rotation/zoom motion deblurZhihuiLi,SilongPeng,XiYuanHu,InstituteofAutomation(China);XiaojingXu,InstituteofForensicScience(China)

Spatiallynon-uniformimageblurcausedbycamerashakeisacommonproblemfordomesticdigitalcameras.Rota-tion/Zoommotionblurisaspecialtypeofspatiallynon-uniformimageblur.However,mostoftheexistingap-proachesforsolvingspatiallyvariantblurusetoocomplexmodelsandsufferfromhighcomputationalcost.Andtherehaveseldompaperdiscussiontheestimationmethodsofzoom/rotationparameters.Onthecontrary,basedontherealimagecaptureprocess,weproposeanoveltimespatiallyinvariant3-Dconvolutionmodelforrotation-blurandzoom-blur.Underthismodel,rotation-blurandzoom-blurrestora-tionbecomequitesimpleandafastandefficientalgorithmcanbeusedtosolvesuchkindofspatiallyvariantblur.Andwecanestimatetherotation/zoomparametersbasethe3-Dmodel.Furthermore,wepresenttheblinddeblurringmethodofzoom/rotationblur.Theexperimentalresultsdemonstratetheeffectiveandaccuracyofourmethods.

8667-70, Session 16

Lytro camera technology: theory, algorithms, performance analysis (Keynote Presentation)SergioR.Goma,TodorG.Georgiev,QualcommInc.(UnitedStates);AndrewLumsdaine,IndianaUniv.(UnitedStates)

NoAbstractAvailable

Conference 8667B


8667-71, Session 17

A new fusion-based low light still-shot stabilizationYoung-SuMoon,SamsungAdvancedInstituteofTechnology(Korea,Republicof)andSAMSUNGElectronicsCo.,Ltd.(Korea,Republicof);Shi-HwaLee,SamsungAdvancedInstituteofTechnology(Korea,Republicof)andSAMSUNGElectronics,Ltd.(Korea,Republicof)

Digitalcamerasunderadarkilluminationrevealssomeartifactslikemotionblurinalong-exposedshotorsalientnoisecorruptionina(HighISO)short-exposedshot.Topreventsuchartifacts,multi-framefusionofeitherdifferentlyexposedimagesormultipleshort-exposedimageshasbeenstudiedactively,andasapracticalstill-shotstabilization,theapproachusingmultipleshort-exposureimageshasbeguntobeappliedtoconsumercameras.However,itrequireshighlycomplicatedandtime-consumingcomputationalprocedurestoachieveeffectivemotioncompensatednoisefilteringandbrightnessrestorationinadifficultsituationofstrongnoisecorruption,lowbrightnessanddistortedcolor.

Inthispaper,weproposeanewfusion-basedlow-lightstabilizationapproach,whichinputsbothshort-exposureimagesandoneproper(orlong)-exposure(blurry)image.First,acoarse-to-finemotioncompensatedtemporalnoisefilteringissuggestedforgettingacleanimagefromthemultipleshort-exposurenoisyimages.Morespecifically,globalmotionestimation,block-wiselocalmotionestimation,andmotioncompensatedtemporalnoisefilteringareiteratedinahierarchicalframeworktoacquirebetterdenoisinganditsspeed-up.Then,onlineimagerestorationtothedenoisedimageisfollowedtoobtainavisuallygood(properlybrightandcolored)result.Ourtestresultsshowtheeffectivenessoftheproposedalgorithm.

8667-72, Session 17

Real-time skeleton tracking for embedded systemsFotiColeca,Univ.zuLübeck(Germany)andgestigonGmbH(Germany);SaschaKlement,gestigonGmbH(Germany);ThomasMartinetz,ErhardtBarth,Univ.zuLübeck(Germany)

Touch-freegesturetechnologyisbeginningtobecomemorepopularwithconsumersandmayhaveasignificantfutureimpactoninterfacesfordigitalphotography.However,almosteverycommercialsoftwareframeworkforgestureandposedetectionisaimedateitherdesktopPCsorhigh-poweredGPUs,makingmobileimplementationsforgesturerecognitionanattractiveareaforresearchanddevelopment.InthispaperwepresentanalgorithmforhandskeletontrackingandgesturerecognitionthatrunsonanARM-basedplatform(PandaboardES,OMAP4460architecture).Thealgorithmusesself-organizingmapstofitagiventopology(skeleton)intoa3Dpointcloud.Thisisanovelwayofapproachingtheproblemofposerecognitionasitdoesnotemploycomplexoptimizationtechniquesordata-basedlearning.Afteraninitialbackgroundsegmentationstep,thealgorithmisraninparallelwithheuristicswhichdetectandcorrectartifactsarisingfrominsufficientorerroneousinputdata.WethenoptimizethealgorithmfortheARMplatformusingfixed-pointcomputationandtheNEONSIMDarchitecturetheOMAP4460provides.Wetestedthealgorithmwithtwodifferentdepth-sensingdevices(MicrosoftKinect,PMDCamboard).Forbothinputdeviceswewereabletoaccuratelytracktheskeletonatthenativeframerateofthecameras.


Reconstruction of the image on the Cartesian lattice from a finite number of projections in computed-tomographic imagingNanDu,TheUniv.ofTexasatSanAntonio(UnitedStates);YushengFeng,UnivofTexasatSanAntonio(UnitedStates);ArtyomM.Grigoryan,TheUniv.ofTexasatSanAntonio(UnitedStates)

Inthispaper,wediscussanewapproachofreconstructingtheimagefromafinitenumberofprojectionsusingthetensorrepresentationoftheimage,afterprocessingtheprojectionsbysolvingthespecialsystemsofequations.Inthediscretespace,allcomponentsofthetensortransformaredefinedastheray-sumsoftheimageintheCartesianlattice,whichcanexactlybecalculatedfromtheray-integrals.Todescribethistransformationofintegralsfromtheimageplanetothelattice,weintroducetheconceptofshiftedgeometricalrays,orG-raysthatallowforeachprojectiontocalculatethesplitting-signalbysolvingasystemofalgebraicequationsdescribedbythebinaryToeplitzmatrices.Then,thediscreteimageiscalculatedbytheinversetensortransformorthroughtheinverse2-DDFT.Theproposedapproachispresentedforthecontinuousmodel,whentheimagef(x,y)isontheunitsquare[0,1]x[0,1]andconsistsofN2cellsofconstantintensityeachontheCartesianlatticeNxN,whereNisapoweroftwooraprime.TheproposedmethodwasimplementedinMATLABandC++,andtheexperimentalresultsareillustrated,fortheimagesofsizeupto1024x1024.


Method of G-particles for image reconstruction from a finite number of projectionsArtyomM.Grigoryan,NanDu,TheUniv.ofTexasatSanAntonio(UnitedStates)

Novelmethodofsummationofprojectionsisproposed,whichusesthetensorrepresentationoftheimageasasetofsplitting-signalsdefinedbysumsofparallelrayspassingthroughtheCartesianlattice.Thesesignalsdefinethedirectioncomponentsoftheimageandcanbecalculatedfromtheray-integralsofsetsofshiftedparallelrays,whicharedefinedinauniquewayforeachprojection.Thissetofraysiscalledthegeometrical,orG-rays.Weintroducetheconceptofthepointmapofprojections,wheneachimageelementisconsideredasaparticledescribedbyafieldfunctionintensorrepresentation.ThemapofeachparticleontheCartesianlatticeisconsideredintheformofamatrixdescribingallG-rayspassingthroughtheparticle.TheconsiderationoffieldfunctionsforG-particlesleadstoafieldtransformationoftheimagewithfollowingreconstruction.Thesetofprojectionsisdefinedbythetensorrepresentation.However,othersetsofprojectionscanalsobeconsidered;thesetsdefinedbyasmallnumberofangles,symmetricandnon-symmetricsetsofangles,aswellasrandomsetsofangles.Experimentalresultsoftheproposedreconstructionaregivenandcomparedwiththefilteredbackprojection.

Conference 8667B


Conference 8667C: Multimedia on Mobile Devices 2013Monday-Wednesday4–6February2013


8667-17, Session 4

Enabling customer self service through image processing on mobile devicesJoernKreutel,FachhochschuleBrandenburg(Germany);SaschaHellmann,BrandenburgUniversityofAppliedSciences(Germany);JörnKreutel,BeuthUniversityofAppliedSciences(Germany)

Wedescribeaninnovativeuseofaugmentedrealitytechnologyinamobileapplicationthatisdistributedtoendcustomersofanenterprise.Usingtheapplication,customerswillbeabletosolveissuesrelatedtotheenterprise’sproductswithoutrequiringassistancebyahumancustomerserviceagent.

8667-18, Session 4

Cognitive styles and visual qualitySatuJumisko-Pyykkö,TampereUniv.ofTechnology(Finland)

Assessorsarethemainmeasurementinstrumentsinsubjectivequalityofexperienceevaluation.Althoughtheperceptualopportunitiesandconstrainsareinfluencedbymultipledemographicandpsychographicfactors,theyaretypicallydisregardedasapartofqualityassessment(overviewJumisko-Pyykkö,2011).Cognitivestylesrefertoindividual’sconsistentapproachestoorganizeandrepresentinformation(RidingandRayner,1998).Goalofthisstudyistoexploreinfluenceofcognitivestylesonvisualqualityrequirements.Thedata-collectionisconductedusingtheStyleofProcessing(SOP)questionnaire(Childersetal.,1985)inthreevisualvideoqualityexperimentswithatotalof80participants.Allparticipantswerecategorizedintofourgroupsaccordingtosensorialpreferencesininformationprocessing(visual,auditive,bimodal–highprocessing,andbimodal-lowprocessing).Theexperimentswereconductedinthecontrolledcircumstanceswhenvaryingdepthinvideoqualitywithseveralcontenttypesonthemobiledevice.Theresultsshowedvariationinqualityrequirementsbetweenthesegroups.Finally,theseresultsalsoindicatethatsensorialprocessingstylesareessentialtoexploreforsamplecharacterizationinqualityassessmentandforexploringmoreuser-awarequalityadjustmentsinfutureservicesandproducts.

8667-19, Session 4

Subjective evaluation of HEVC in mobile devicesRayGarcia,GeneralDynamicsItronixCorp.(UnitedStates);HariKalva,FloridaAtlanticUniv.(UnitedStates)

Mobilecomputeenvironmentsprovideauniquesetofuserneedsandexpectationsthatdesignersmustconsider.Withincreasemultimediauseinmobileenvironments,videoencodingmethodswithinthesmartphonemarketsegmentarekeyfactorsthatcontributetopositiveuserexperience.Currentlyavailabledisplayresolutionsandexpectedcellularbandwidtharemajorfactorsthedesignermustconsiderwhendeterminingwhichencodingmethodsshouldbesupported.Thedesiredgoalistomaximizetheuserexperience,reducecost,andreducetimetomarket

SubjectivecomparisonsaremadebetweenH.264/AVCandHEVCencodingstandardsinaccordancewithDouble-stimulusimpairmentscale(DSIS)asdefinedbyITU-RBT.500-13.TestenvironmentarebasedonsmartphoneLCDresolutionsandexpectedcellularbitrates,suchas200kbpsand400kbps.

Subjectivefeedbackshowsbothencodingmethodsareadequateat400kbpsconstantbitrate.However,anoticeableuserexperiencegapwasobservedfor200kbpsconstantbitrate.SignificantlylessH.264subjectivequalityisnoticedwithvideosequencesthathavemultipleobjectsmovingandnosinglepointofvisualattraction.VideosequenceswithsinglepointsofvisualattractionorfewmovingobjectstendedtohavehigherH.264subjectivequality.

8667-20, Session 5

Location-based tracking using long-range passive RFID and ultrawideband (UWB) communications (Invited Paper)FaranakNekoogar,LawrenceLivermoreNationalLab.(UnitedStates)

Theabilitytolocateandtrackfirstrespondersinsidebuildingsisanimportantproblemforwhichtherearecurrentlynogoodsolutions.Indoorlocalizationandnavigationposesmanychallengesforsearchandrescueteams(i.e.firefighters)suchasinabilitytodeterminetheirexactlocationandcommunicatewiththeincidentcommanderoutsidethebuilding.AlthoughRFnavigationandtrackingsystemshavemanyadvantagesoverothertechnologies,theharshindoorRFenvironmentdemandsnewwaysofdevelopingandusingRFsensorandcommunicationsystems.Inthispaper,weintroduceanintegratedsystemforreliableindoornavigationthatincludes(i)long-rangepassiveRFtagswithreaddistanceofover300feet,and(ii)ultra-wideband(UWB)communicationslinkforsendingreaderdatafrominsidethebuildingtooutsideinordertolocateandtrackfirstresponders.Themobileplatformarchitecturedescribedinthispapermakeoptimaluseoflong-rangepassivetags,andtakesadvantageofthefrequencydiversityofUWBcommunicationsystemsforareliable,robustandyetlow-costdesign.Further,theUWBcommunicationallowstransmissionofbothvoiceanddataoverthesameharshRFenvironment.

8667-21, Session 5

Real-time content-aware video retargeting for tunnel vision assistanceThomasA.Knack,AndreasE.Savakis,RochesterInstituteofTechnology(UnitedStates)

Imageandvideoretargetingtechnologieshavebecomeaneffectivemeansofresizingmediaforaspectratioconstrainedapplicationsandareusefulformobilesmartphonesystems.Inthispaper,anovelreal-timevideoretargetingmodelisproposedformobilephoneimplementationtoassistindividualswithtunnelvision.Spatialandtemporalimprovementstothebaselineseamcarvingalgorithmprovidethebasisforourmodel.Seamcarvingisacontent-awareretargetingoperatorwhichdefines8-connectedhorizontalorverticalpaths,orseamsofpixels.Theoptimalityoftheseseamsisbasedonaspecificenergyfunction.Seamremovalpermitschangesintheaspectratiowhilesimultaneouslypreservingimportantregions.Thisworkintroducesavideoretargetingmodelfortunnelvisionassistancewhichincorporatesspatialandtemporalconsiderationstopreservevisualintegrity.Facedetectionmasksandsaliencemapsareprovidedtoachievemorecomprehensiveresults.Additionally,formulationofanoveltemporalcoherencemeasureispresentedthatallowsforretargetingofstreamingvideo.IntegrationoftheproposedmodelwithanAndroidmobileplatformdemonstratesitsportabilityandpotentialforuseinmobilelowvisionassistancesystems.


8667-22, Session 5

Human movement activity classification approaches that use wearable sensors and mobile devicesSahakI.Kaghyan,YerevanStateUniv.(Armenia);HakobG.Sarukhanyan,InstituteforInformaticsandAutomationProblems(Armenia);DavidAkopian,TheUniv.ofTexasatSanAntonio(UnitedStates)

Thispaperreviewsdifferentmethodsandapproachesofsolvinghumanactivityclassificationproblem.Toclassifyactivitycurrentworkreviewsfollowingdatacollectionmethods:1)rawdataretrievingfrommobiledevices’sensors(suchasGPSsensor,accelerometerandgyroscope),and2)signalvalues’collectingprocessandtransferringtoserversimultaneouslyfromseveralbiaxialaccelerometers,attachedtodifferentpartsofhumanbody.Generally,signalswereuninterruptedlytransferredfrommentionedsensorsorfromsmartphoneviawirelessnetworktoserver,locatednearby,forrawdatafurtheranalysisandrecognition.Currentworkalsonotesdifferentalgorithmswhichwereusedinclassificationduringincomingdataanalyzingprocess.Fromcomparingtheseresultswitheachotheritwillbeclearwhichapproachofactivityclassificationwillbemoreefficienttousedependingonlimitationsthatmaybeandmeans,availabletosolveactivityclassificationproblem.

8667-23, Session 5

Concept for practical exercises for studying autonomous flying robots in a university environmentRicardoBand,Johann-SebastianPleban,StefanSchön,ReinerCreutzburg,ArnoFischer,FachhochschuleBrandenburg(Germany)

Theaimofthispaperistodemonstratetheusefulnessofaconceptofpracticalexercisesforstudyingautonomousflyingrobotsforcomputersciencestudentsinauniversityenvironment.

Itisshownhowthestudentscanassemble,program,fly,networkandapplyautonomousflyingrobots(drones,quadrocopters,hexacopters,octocopters,helicopters,helicams,bugbots,...)indifferentexercisesandimprovetheirskillsandtheoreticalandpracticalknowledgeindifferentaspects.

8667-24, Session 5

Applications of multimedia technology on autonomous flying robots for university technology transfer projectsRicardoBand,Johann-SebastianPleban,StefanSchön,ReinerCreutzburg,ArnoFischer,FachhochschuleBrandenburg(Germany)

Theaimofthispaperistogiveanoverviewofthewideareaofpossibleapplicationsofmultimediatechnologyonautonomousflyingrobotsforuniversitytechnologytransferprojects.

Inparticularwesummarizetheusefulnessofdifferentimagingandvideotechnologyapplicationsinordertostrengthenthelinksbetweenindustrialandadministrativepartnersandtheuniversity.

8667-25, Session 6

Mobile variable data pages: from apps to Cloud printingNathanMoroney,Kok-WeiKoh,Hewlett-PackardLabs.(UnitedStates)

Variabledatapagesorprintingisthedynamicgenerationofpartiallyorcompletelycustomizedcontentforprintproduction.Inacommercial

printproductioncontext,thesepagesareoftengeneratedusingcentralizedtools,databasesandprintengines.Onechallengeinacentralizedproductionenvironmentismanagingthevariabledata,whichcanincludepersonalinformationorvendordata.Mobilecomputingpresentsaninterestingalternativeforgenerationofmobilevariabledatapages.Inamobilecontext,individualappscancreatehighlycustomizedcontentmakinguseofdistributedprocessing,dataandprinters.Wepresentresultsforanexperimentalapplication,tobedescribedingreaterdetailelsewhere,whichmakesuseofthePDFgenerationtoolsincludedwiththedeveloperSDKtocreatepersonalizedletter-formattedpageswhicharethenprintedusingHP’sePrintcloudprintingtechnology.Inthisway,mobiledevicesgenerateuniquedatawhichisthenformattedbytheappasaprint-readydocumentwhichisthenprintedviaadistributednetworkofprinters,withouttheneedforanintermediatedriverordedicatedgeneralpurposeformattingengine.Thiscapabilityisageneralpurposeonethatscalesintwodirections.First,theappandmobiledeviceareusedtogenerateandmanagethevariabledatacontent.Ifaprocessingmodulecanbeimplementedgiventhelimitationsofthetargetdevicethenthegenerationofvariabledatapageswillscaledirectlywiththenumberofmobilecomputingdevices.Second,useofadistributedprintproductionprocessmeansthatprintingwillalsoscaledirectlywiththenumberofprintersmakinguseofthecloudprintinginfrastructure.ThispaperwilldescribethespecificsofthecreationofthevariabledatapagesusingaspecificdeveloperSDKandourexperiencesusingtheePrintcloudprintingtechnology.ThepaperwillalsoincludeaprovocativeexampleinwhichuseofthePDFgenerationtoolsinthemobileenvironmentallowedthecreationofahighlyspecializedprintedswatch-bookinmannerthatwascompetitivewithusinghighlyspecializeddesktoptools.

8667-26, Session 6

Digitized forensics: retaining a link between physical and digital crime scene traces using QR-codesMarioHildebrandt,StefanKiltz,JanaDittmann,Otto-von-Guericke-Univ.Magdeburg(Germany)

Thedigitizationofphysicaltracesfromcrimescenesinforensicinvestigationsentrainsthechallengeofcreatingalinkbetweenthetwoormorerepresentationsofthesametrace.Inordertobeforensicallysound,especiallythetwosecurityaspectsofintegrityandauthenticityneedtobemaintainedatalltimes.Especiallytheadherencetotheauthenticityusingtechnicalmeansprovestobeachallengeattheboundarybetweenthephysicalobjectanditsdigitalrepresentations.Inthisarticleweproposeanewmethodoflinkingphysicalobjectswithitsdigitalcounterpartsusingtwo-dimensionalbarcodesandadditionalmeta-dataaccompanyingtheacquireddataforintegrationintheconventionalbaggingandtaggingprocess.UsingtheexemplarychosenQR-codeasparticularimplementationofabarcodeandamodeloftheforensicprocess,wealsosupplyameanstointegrateoursuggestedapproachintoforensicallysoundproceedings.

Weusetheexampleofthedigitaldactyloscopyasaforensicdiscipline,wherecurrentlyprogressisbeingmadebydigitizingsomeoftheprocessingsteps.Weshowanexemplaryprototypicimplementationofthesuggestedapproachusingasmartphoneasamobiledevicefortheverificationofthephysicaltracetoextendthechain-of-custodyfromthephysicaltothedigitaldomain.

8667-27, Session 6

Smart apps for applied machine learning on mobile devices - the MOMO projectStefanEdlich,MathiasVogler,BeuthHochschulefürTechnikBerlin(Germany)

TheMOMOprojectisamultimillion€researchprojectsupportedbyEFRErunatBeuthUniversityofTechnologyBerlin(App.Sc.).Itconsistsoftwoparts:EcoMobilityandMobilecomputing.ThelattergoalistodevelopnewSmartphoneandTabletapplicationsformobilecomputing

Conference 8667C


(asNFC,indoornavigation,etc.).Asub-projectofMOMOissmartappsandadressingmachinelearning.Inthefirstpartwedevelopedasmartappforamusementparksasanexample.Theusersweretrackedandmachinelearningalgorithmshavebeenappliedthereaftertopredictvisitorbehaviourintheseparks(orbuildings,concerts,airports,etc.).Thisexperienceleadustomoreresearchintermsofa)machinelearning(ML)resultvisualization,b)MLimplementationandc)MLconfigurationandexecutiononmobiledevices.Inthispaperweshowexamplesofimplementationsandrealizationsforthesethreeareas.Furthermoreweshowhowsuchapplicationscanitselfbeusedbyusers/visitors-andnotonlyscientists/parkoperators-tocreatenew(visual)experiences.

8667-28, Session 6

Real-time volume rendering of digital medical images on an iOS deviceChristianNoon,DreamworksAnimationSKG,Inc.(UnitedStates);JosephHolub,IowaStateUniv(UnitedStates);EliotWiner,IowaStateUniv.(UnitedStates)

Performinghighquality3Dvisualizationsonmobiledevices,whiletantalizinglycloseinmanyareas,isstillaquitedifficulttask.Thisisespeciallytruefor3Dvolumerenderingofdigitalmedicalimages.Allowingthiswouldempowermedicalpersonnelapowerfultooltodiagnoseandtreatpatientsandtrainthenextgenerationofphysicians.ThisresearchfocusesonperformingrealtimevolumerenderingofdigitalmedicalimagesoniOSdevicesusingcustomdevelopedGPUshadersfororthogonaltextureslicing.Aninteractivevolumerendererwasdesignedanddevelopedwithseveralnewfeaturesincludingdynamicmodificationofrenderresolutions,anincrementalrenderloop,ashader-basedclippingalgorithmtosupportOpenGLES2.0,andaninternalbackfacecullingalgorithmforproperlysortingrenderedgeometrywithalphablending.Theapplicationwasdevelopedusingseveralapplicationprogramminginterfaces(APIs)suchasOpenSceneGraph(OSG)astheprimarygraphicsrenderercoupledwithiOSCocoaTouchforuserinteraction,andDCMTKforhandlingofDICOMfiles.Thedevelopedapplicationisabletorendervolumedatasetsover450slicesupto50-60framespersecond,dependingonthespecificdeviceandmodel.AllrenderingisdonelocallyonthedevicesonoInternetconnectionisrequired.

8667-29, Session 6

MessageSpace: a messaging system for health researchRodrigoD.Escobar,DavidAkopian,TheUniv.ofTexasatSanAntonio(UnitedStates);DeborahParra-Medina,LauraA.Esparza,TheUniv.ofTexasHealthScienceCtr.atSanAntonio(UnitedStates)

Mostdevelopedcountrieshavebeenincreasingtheirhealthcareexpendituresovertheyears.Increasesofaround30%and50%ofthepercapitaincomeofthosecountrieshavebeennoticedinthelasteightyears.Diseaseprevalenceandmedicaltreatmentsrepresentsaroundtwo-thirdsoftheseraisings.Thistrendrevealstheimportanceofdiseaseprevention.Basically,themorepeopleareawareofhowtopreventdiseases,themoreexpensescanbecut,thusmakinganimpacttothewholesociety.ShortMessageService(SMS)messageshavebeenusedtoempowerhealthcareandhealthpromotion,butexistingSMSbasedhealthpromotionsystemsdonotcompletelyaddresshealthpromotionresearchneeds.Inordertoempowerhealthcarepromotionandeducationresearch,wepresentasystemwhosemaingoalistofulfilltheneedsofhealthcareresearchers.WedosobyintegratingandextendingexistingtechnologiesincludingJ2EE,databases,mobilemessaging,andSmartphonesapplicationsamongothers.TheSMSisusedasthekeytechnologytosendmessagesandpollstoresearchprojects’participants.TheintegrationofthementionedtechnologiesalongwiththeShortMessageServiceuniquecharacteristicsandusagepatternsleadthewaytowardsprovidingapromisingsystemforsupportinghealth-promotionresearch.

8667-30, Session 6

Multi-resolution edge detection with edge pattern analysisBoJiang,NationalInstituteofAerospace(UnitedStates)

Edgedetectionisdefinedastheprocessofdetectingandrepresentingthepresenceofandlocationsofimagesignaldiscontinuities.Generally,theedgedetectionoperationhastwomainsteps:filtering,anddetectionandlocalization.Inthefirststep,findinganoptimalscaleofthefilterisanill-posedproblem,especiallywhenasingle(global)scaleisusedovertheentireimage.Multi-resolutiondescriptionoftheimagewhichcanrepresenttheimagefeaturesoccurringinarangeofscalesisused,whereacombinationofGaussianfilterswithdifferentscalescanamelioratethesinglescaleissue.Inthesecondstep,oftenedgedetectorshavebeendesignedtocapturesimpleidealstepfunctionsinimagedata,butrealimagesignaldiscontinuitiesdeviatefromthisidealform.Anotherthreetypesofdeviationsfromthestepfunctionwhichrelatetorealdistortionsoccurringinnaturalimagesareexamined.Thesetypesareimpulse,ramp,andsigmoidfunctionswhichrespectivelyrepresentnarrowlinesignals,simplifiedblureffects,andmoreaccurateblurmodeling.Generalrulesforedgedetectionbasedupontheclassificationofedgetypesintofourcategories-ramp,impulse,step,andsigmoidaredevelopedfromthisanalysis.Theperformanceanalysisonexperimentssupportsthattheproposedalgorithmdoesleadtomoreeffectiveedgedetectionandlocalizationwithimprovedaccuracies.


Clientside Skype forensics: an overviewTinaMeißner,KnutKröger,ReinerCreutzburg,FachhochschuleBrandenburg(Germany)

SkypecommunicationovertheInternetisverypopularnowadays.TheaimofthispaperistogiveanoverviewontheclientsideSkypeForensicsandtoexplainwhatdataisstoredontheuser’scomputerandhowthesedatacanbereadoutinaforensicinvestigation.Both,themanualanalysisandtheanalysisbyopensourcetoolsandcommercialtoolsaredescribedandthecorrespondingproblemsareaddressed.


Real-time human activity classification using tri-axial accelerometer of mobile device and SQLite database engineSahakI.Kaghyan,Armenian-Russian(Slavonic)Univ.(Armenia);DavidAkopian,TheUniv.ofTexasatSanAntonio(UnitedStates);HakobG.Sarukhanyan,InstituteforInformaticsandAutomationProblems(Armenia)

Themovementoftelemedicinefromdesktopplatformstowirelessandmobileconfigurationsmayhaveasignificantimpactonfuturehealthcare.Physicalactivityisrecognizedasoneofthemostimportantdimensionsofhumanhealth.Availablesensorsinmobiledevicesallowforactivityrecognitionand,thus,physicalhealthassessment.Duringrecentyearstherehavebeenmanypublicationsonactivityclassification.Theseapproachestypicallyexploitwearablesensorsattachedtopatient’sbodytocollectnecessaryinformationconcerningactivity.Inthispaper,ourfocusisonactivityrecognitionusingsensorsbuiltininsmartphones.Particularlytri-axialaccelerometersonAndroidsystemsareused.Rawsensorsignalsequencesarecollectedatprogrammedsamplingrates.TheprocessingisperformedusingSQLiterelationaldatabasemanagementsysteminsmartphonememory.Asportabledatabaseallowssimultaneouslyreadandwritedata,somobileapplicationdoesdatacollectionandacquiredinformationanalyzingatthesametimeinordertodoreal-timeactivityclassificationstraightonsmartphone.

Conference 8667C



Gradient-based fusion of infrared and visual face images using support vector machine for human face identificationPriyaSaha,MrinalK.Bhowmik,TripuraUniv.(India);DebotoshBhattacharjee,JadavpurUniv.(India);BarinK.De,TripuraUniv.(India);MitaNasipuri,JadavpurUniv.(India)

HumanFaceIdentificationhasachievedalotofattentionfromresearchcommunityduringpastseveralyears.But,performanceoffaceidentificationsystemdegradesinuncontrolledenvironmentwhenrandomlightingsourcesfromdifferentdirectionschangevisualrepresentationssignificantly.Inthispaper,wehaveproposedgradientbasedfusionmethodofgradientvisualandcorrespondinginfraredfaceimagestoovercometheproblemofilluminationvaryingconditions.Thistechniquemainlyextractsilluminationinsensitivefeaturesunderdifferentconditionsforeffectivefacerecognitionpurpose.Thegradientimagecomputedfromavisiblelightimage,thesubject’sfaceiscomparedwiththegradientimageextractedfromathermalfaceimageofthesamesubject.Theimagefusionofinfraredimageandcorrespondingvisualgradientimageisdoneinwaveletdomainbytakingthemaximuminformationofapproximationanddetailedcoefficients.ThesefusedimagesaretakenfordimensionreductionusingIndependentComponentAnalysis(ICA).Thereduced1?49faceimagesaretakenfortrainingandtestingpurposesfromdifferentclassesofdifferentdatasetsofIRISfacedatabase.WehavetakenSVMmulticlassstrategy‘one-vs.-all’inourexperiment.LinearkernelandPolynomialkernelwithdegree3areusedinourexperimentaskernelfunctions.Theexperimentresultsrevealthatourmethodcanperformwellforthefaceimagesunderdifferentlightingconditions.


Future mobile access for open-data platforms and the BBC-DaaS systemStefanEdlich,SonamSingh,IngoPfennigstorf,BeuthHochschulefürTechnikBerlin(Germany)

Wedevelopaplatformwhichprovidesamarketplacefordatawhichcanactasbaseforapplicationsorinformationseekersrelyingonthisdata.Weexaminevariousmethodstofacilitateautomaticaugmentationofmetadataforeffectivequeryingrelatedtasksinascalablemannerfortheplatform.Dataisautomaticallypreprocessed,analyzed,visualized,transformed,filteredandmadeaccessibletointerestedpartiesthroughaccesspointslikeREST,Multimediadevicesordirectlyaccessiblethroughwebapplication.Weexaminestrategiesforautomaticpricingofdatausageusingvariousparameterse.g.theamountofdata,bandwidthused,CPUandmemorycostsorlicensecostsspecifiedbytheprovider.

DataasaService(DaaS)providestateoftheartservicesfortransformations,visualizations,filtering,sortingofdataandanalytics,pricingascoreandanextensibleframeworkforadaptingtochangingrequirementsofdifferenttypesofdata,dataprovidersanddataconsumers(mostlyapplications).

ThekeyissuesofthisposterareadvancedwaysofmobileaccesstoopendataplatformslikeBBC-DaaSandopen-dataplatformsingeneral.


HIPAA compliance for messaging systems: MessageSpace case studyAnuvrathJoshi,RodrigoD.Escobar,DavidAkopian,TheUniv.ofTexasatSanAntonio(UnitedStates);DeborahParra-Medina,LauraA.Esparza,TheUniv.ofTexasHealthScienceCtr.atSanAntonio(UnitedStates)

NoAbstractAvailable


Location tracking forensics on GPS handhelds, mobile navigation devices, and smartphonesStefanSack,KnutKröger,ReinerCreutzburg,FachhochschuleBrandenburg(Germany)

Thespreadofnavigationdeviceshasincreasedsignificantlyoverthelast10years.WiththehelpofthecurrentdevelopmentofevensmallerNavigationreceiverunitsitispossibletonavigatewithalmostanycurrentSmartphone.Duetotheincreaseduseofnavigationdevicestheirrelevancetoforensicinvestigationshasrisenrapidly.Navigationdataisforensicallyinterestingbecausebytheresidenceofthedevicesinmostcasesthelocationandthetravelledpathoftheownercanbereconstructed.InthisworkpracticesforforensicanalysisofNavigationdevicesaredeveloped.Differentdeviceswillbeanalyzedanditisattempted,bymeansofforensicprocedurestorestorethetraveledpathofthedevice.Foranalysisofthevariousdevicesdifferentsoftwareandhardwareisused.Therewillbepresentedcommonproceduresforsecuringandtestingofmobiledevicesandsomespecialtiesintheinvestigationofeachdevice.ThedifferentclassesconsideredareGPShandhelds,mobilenavigationdevicesandsmartphones.Itwillbeattempted,toreadalldataofthedevice.Thisisrealizedbytheusageofcurrentforensicsoftwaree.g.TomTologyorOxygenForensicSuite.Tolimitthepracticalworkthedataextractionisfocusedonthefrequentlyuseddeviceofaclass,astheprocedureformanygroupsofdevicescanbeidentical.InthepresentworkaGPShandheld,aTomTomGO700,aniPhone3GandaHTCDesire(Android)isusedbecausetheyhaveawidecirculation.


Conception of a course for professional training and education in the field of computer and mobile forensics - Part II: Android ForensicsKnutKröger,ReinerCreutzburg,FachhochschuleBrandenburg(Germany)

ThegrowthofAndroidinthemobilesectorandtheinteresttoinvestigatethesedevicesfromaforensicpointofviewhasrapidlyincreased.ManycompanieshavesecurityproblemswithmobiledevicesintheirownITinfrastructure.Torespondtotheseincidents,itisimportanttohaveprofessionaltrainedstaff.Itisnecessarytofurthertraintheownemployeesinthefieldofmobileforensicsbecausealotofcompaniesworkwithverysensitivedata.Motivatedbythesefacts,thispapershowsprovenpracticalexamplesfortrainingapproachesandpracticalexercisestoinvestigatetheforensicallymostimportanttasksofAndroidmobiledevices.


Possibilities and modification of the forensic investigation process of solid state drivesFrankIrmler,KnutKröger,ReinerCreutzburg,FachhochschuleBrandenburg(Germany)

Thispaperdealswiththepossibilitiesoftheforensicinvestigationofsolid-statedrives.Theaimsofthisstudyaretoclarifytheinformationgainedbyaforensicanalysisofthesemediaandtoexplainthediffenerencestoconventionalforensicexaminationsofconventionalharddiskdrives.Foreachtestseries,avarietyofhardwareandsoftwarewasused.Oneinterestingresultisthatthebuild-intrimfunctionoftheSSDhasgreatimpactontheresultoftheforensicinvestigation.

Conference 8667C



Mobile learning in medicineSabriSerkanGulluoglu,IstanbulArelUniv.(Turkey)

Thispaperoutlinesthemaininfrastructureforimplicatingmobilelearninginmedicineandpresentasamplemobilelearningapplicationformedicallearningwithintheframeworkofmobilelearningsystems.

Mobiletechnologyisdevelopingnowadays.Inthiscaseitwillbeusefultodevelopdifferentlearningenvironmetsusingtheseinnovationsininternetbaseddistanceeducation.M-learningmakesthemostofbeingonlocation,providingimmediateaccess,beingconnected,andacknowledgeslearningthatoccursbeyondformallearningsettings,inplacessuchastheworkplace,home,andoutdoors.Centraltom-learningistheprinciplethatitisthelearnerwhoismobileratherthanthedeviceusedtodelivermlearning.Theintegrationofmobiletechnologiesintotraininghasmadelearningmoreaccessibleandportable.Mobiletechnologiesmakeitpossibleforalearnertohaveaccesstoacomputerandsubsequentlylearningmaterialandactivities;atanytimeandinanyplace.Mobiledevicescaninclude:mobilephone,personaldigitalassistants(PDAs),personaldigitalmediaplayers(egiPods,MP3players),portabledigitalmediaplayers,portabledigitalmultimediaplayers.

Mobilelearning(m-learning)isparticularlyimportantinmedicaleducation,andthemajorusersofmobiledevicesareinthefieldofmedicine.Thecontextsandenvironmentinwhichlearningoccursnecessitatesm-learning.Medicalstudentsareplacedinhospital/clinicalsettingsveryearlyintrainingandrequireaccesstocourseinformationandtorecordandreflectontheirexperienceswhileonthemove.

Themaintitleswillbemobilelearning,medicaleducation,sub-branchesofmedicine,suitableeducationbranchesformobilityand,maindifferencesofm-learningandnormallearningsystems,samplem-learninginterfacemodelformedicinefortabletcomputerandmobilephonesystems,Conclusionsandadvisestothefuture.

Asaresultofthispaper,thispaperstrivestocompareandcontrastmobilelearningwithnormallearninginmedicinefromvariousperspectivesandgiveinsightsandadvisesintotheessentialcharacteristicsofbothforsustainingmedicaleducation.


Overview and forensic investigation approaches of the gaming console Sony PlayStation PortableRalphSchön,StephanSchön,KnutKröger,ReinerCreutzburg,FachhochschuleBrandenburg(Germany)

ThispaperdealswithforensicallyinterestingfeaturesoftheSonyPlaystationPortablegameconsole.Theconstructionandtheinternalstructureareanalyzedmoreprecisely.Interestingforensicfeaturesoftheoperatingsystemandthefilesystemarepresented.


An efficient and fast iris location algorithmGuangyuanJiang,ChangchunNormalUniv.(China)

Theaccuracyandspeedofirislocalizationaffectrecognitionsystemperformanceintheirisrecognitionsystem.Basedonanalyzingsomeprevailingirislocalizationalgorithms,inthispapertheedgeinformationofpupilisextractedbytheleast-squaremethod,andtheirisoutercircleisextractedbytheimprovedCannyOperatorplusHoughTransform,andtheexperimentalresultshowsthatthislocalizationmethodisfastandofhighprecision.Thenoisefromirisregionincludeseyelid,eyelash,eyelidshadowandspecularreflections.Thesegmental-secondarylinearlocalizationmethodadoptingedgedetectionandRadonTransformisproposedtoremovethenoisefromeyelidontheeyelidlocalization,theeyelashnoiseandeyelidshadowareremovedbythresholdmethod,andtheexperimentalresultshowsthatthealgorithmisefficientandaccurate.

Conference 8667C


Conference 8667D: Multimedia Content Access: Algorithms and Systems VIIMonday-Wednesday4–6February2013


8667-1, Session K1

The three R’s of computer vision: recognition, reconstruction and reorganization (Keynote Presentation)JitendraMalik,Univ.ofCalifornia,Berkeley(UnitedStates)

NoAbstractAvailable

8667-2, Session 1

Video structure extraction (Invited Paper)RemiTrichet,RamNevatia,TheUniv.ofSouthernCalifornia(UnitedStates)

Actionrecognitionisachallengingcomputervisiontaskthathasaplethoraofapplicationsinnowadayssocietyofpervasivetechnology,suchassemanticindexing,orsportsequencesskimming.Approachescanbroadlybesplinteredin2categories:structuralandstatisticalbasedstrategies.Astheformertriestomatchthedatatosomepre-definedmodel,thelatterclassifieseventsaccordingtofeaturedistributions.

Butineverycase,theutilizedtechniquesareanattempttobuildhighersemanticinformationfromraw,low-levelfeatures.Thistalkwillfocusonsomecommondifficultiesthatthecommunityisfacinginthisscopeandtheploysusedtotacklethesechallenges.

8667-3, Session 1

Taming the wild: acoustic segmentation on consumer-produced videosGeraldFriedland,BenjaminElizalde,InternationalComputerScienceInstitute(UnitedStates)

Inthelastdecade,thenumberofconsumer-producedvideosintheInternethasincreasedexponentially.Forexample,oneofthemostpopularwebsitesforconsumer-producedvideos,Youtube,claimsthat72hoursofvideoareuploadedeveryminute,resultinginnearly14yearsofcontentuploadedeveryday.Thisresultsinamassivedemandfortechniquesthatcanindexthedataforretrieval.Afirststepinautomaticindexingisusuallysegmentation.Segmentationistheprocessofidentifyingtheboundariesbetweenclasses.Inspeechprocessing,examplesoftheseclassesarephones,words,sounds,speech,music,andsilence.Whilehighaccuracieshavebeenreachedfortraditional,corpus-based,supervisedsegmentationtasks,segmentationapproacheson‘’wild”videosarestillamajorchallenge.Inconsumer-producedmedia,onecannotrelyonanysinglecharacteristictodrawboundariesbetweenclassesanditisdifficulttopre-trainmodelsbecauseofthehighvarianceinthedata.Consumer-producedaudiomaycontainlowqualityaudio,noisyenvironments,singing,music,overlappingsituations,andotherunexpectedcircumstances.Thistalkpresentsourapproachtoahybridsegmentationsystemforconsumerproducedvideos,whichconsistsofbothunsupervisedclustering(usingso-calledaudiopercepts)andsupervisedclassificationbasedonaneventspace.

8667-4, Session 1

Sparse conditional mixture model: late fusion with missing scores for multimedia event detectionRameshM.Nallapati,EricYeh,SRIInternational(UnitedStates)

Inthiswork,wepresenttheSparseConditionalMixtureModel(SCMM)whichaddressestheproblemoflatefusioninmultimediaeventdetectioninthepresenceofmissingdetectionscoresfromoneormoremodalities.OurexperimentsprovethattheSCMMmodelsignificantlyoutperformsthetraditionalConditionalMixturemodelandseveralotherbaselinesontheTRECtaskofMultimediaEventDetection,bydynamicallyadaptingthemodeltocaptureonlytheobservedmodalitiesforagivenvideoclip.

8667-5, Session 1

Can object detectors aid Internet video event retrieval?DavideModolo,CeesSnoek,Univ.vanAmsterdam(Netherlands)

TheproblemofeventrepresentationforautomaticeventdetectioninInternetvideosisacquiringanincreasingimportance,duetotheirapplicabilitytoalargenumberofapplications.Existingmethodsfocusonrepresentingeventsintermsofeitherlow-leveldescriptorsordomain-specificmodelssuitedforalimitedclassofvideoonly,ignoringthehigh-levelmeaningoftheevents.Ultimatelyaimingforamorerobustandmeaningfulrepresentation,inthispaperwequestionwhetherobjectdetectorscanaidvideoeventretrieval.Weproposeanexperimentalstudythatinvestigatestheutilityofpresent-daylocalandglobalobjectdetectorsforvideoeventsearch.Byevaluatingobjectdetectorsoptimizedforhigh-qualityphotographsonlow-qualityInternetvideo,weestablishthatpresent-daydetectorscansuccessfullybeusedforrecognizingobjectsinwebvideos.Weuseanobject-basedrepresentationtore-ranktheresultsofanappearance-basedeventdetector.ResultsonthechallengingTRECVIDmultimediaeventdetectioncorpusdemonstratethatobjectscanindeedaideventretrieval.Whilemuchremainstobestudied,webelievethatourexperimentalstudyisafirststeptowardsrevealingthepotentialofobject-basedeventrepresentations.

8667-6, Session 1

Multimedia event detection using visual concept signaturesEhsanYounessian,MichaelQuinn,TerukoMitamura,AlexHauptmann,CarnegieMellonUniv.(UnitedStates)

MultimediaEventDetection(MED)isamultimediaretrievaltaskwiththegoaloffindingvideosofaparticulareventinalarge-scaleInternetvideoarchive,givenexamplevideosandtextdescriptions.Inthispaper,wemainlyfocusonan’ad-hoc’scenarioinMEDwherewedonotuseanyexamplevideo.WeaimtoretrievetestvideosbasedontheirvisualsemanticsusingaVisualConceptSignature(VCS)generatedforeacheventonlyderivedfromtheeventdescriptionprovidedasthequery.VisualsemanticsaredescribedusingtheSemanticINdexing(SIN)featurewhichrepresentsthelikelihoodofpredefinedvisualconceptsinavideo.TogenerateaVCSforanevent,weprojectthegiveneventdescriptiontoavisualconceptlistusingtheproposedtextualsemanticsimilarity.ExploringSINfeature


properties,weharmonizethegeneratedvisualconceptsignatureandtheSINfeaturetoimproveretrievalperformance.WeconductdifferentexperimentstoassessthequalityofgeneratedvisualconceptsignatureswithrespecttohumanexpectationandinthecontextoftheMEDtasktoretrievetheSINfeatureofvideosinthetestdatasetwhenwehavenooronlyveryfewtrainingvideos.

8667-7, Session 2

A fast approach for integrating ORB descriptors in the bag of words modelCostantinoGrana,DanieleBorghesani,RitaCucchiara,Univ.degliStudidiModenaeReggioEmilia(Italy)

InthispaperweproposetointegratetherecentlyintroducesORBdescriptorsinthecurrentlyfavoredapproachforimageclassification,thatistheBagofWordsmodel.InparticulartheproblemtobesolvedistoprovideaclusteringmethodabletodealwiththebinarystringnatureoftheORBdescriptors.Wesuggesttouseak-meanslikeapproach,calledk-majority,substitutingEuclideandistancewithHammingdistanceandmajorityselectedvectorasthenewclustercenter.ResultscombiningthisnewapproachwithotherfeaturesareprovidedovertheImageCLEF2011dataset.

8667-8, Session 2

Video-based analysis of motion skills in simulation-based surgical trainingQiangZhang,LinChen,QiongjieTian,BaoxinLi,ArizonaStateUniv.(UnitedStates)

Analysisofmotionexpertiseisanimportantprobleminmanydomainsincludingsportsandsurgery.Inrecentyears,surgicalsimulationhasemergedattheforefrontofnewtechnologiesforimprovingtheeducationandtrainingofsurgicalresidents.Insimulation-basedsurgicaltraining,akeytaskistoratetheperformanceoftheoperators,whichisdonecurrentlybyseniorsurgeons.Thispaperintroducesanovelsolutiontothisproblemthroughemployingvision-basedtechniques.Wedevelopanautomatic,video-basedapproachtoanalyzingthemotionskillsofasurgeoninsimulation-basedsurgicaltraining,whereasurgicalactioniscapturedbymultiplevideocameraswithlittleornocalibration,resultinginmultiplevideostreamsofheterogeneousproperties.Typicalmultiple-viewvisiontechniquesareinadequateforprocessingsuchdata.Weproposeanovelapproachthatemploysbothcanonicalcorrelationanalysis(CCA)andthebag-of-wordsmodeltoclassifytheexpertiselevelofthesubjectbasedontheheterogeneousvideostreamscapturingboththemotionofthesubject’shandsandtheresultantmotionofthetools.Experimentsweredesignedandperformedtovalidatetheproposedapproachusingrealisticdatacapturedfromresidentsurgeonsinlocalhospitals.Theresultssuggestthattheproposedapproachmayprovideapromisingpracticalsolutiontotherealworldproblemevaluatingmotionskillsinsimulation-basedsurgicaltraining.

8667-9, Session 2

Adaptive segmentation grids for human action recognition in videosNicolasBallas,BertranDelezoide,Commissariatàl’ÉnergieAtomique(France)

Inthispaper,weintroduceanadaptivesegmentationgridschemeforactionrecognitioninunconstrainedvideos.State-of-artssolutionsfocusingonactionrecognitionembedslow-levelfeaturesspatio-temporalcontextinaBag-of-Wordsmodelthroughstaticallydefinedsegmentationgrids.Whileprovidingacoarselocalizationoflow-levelfeatures,thoseapproachestendtobelimitedbytheirgridsrigidityandregulargeometry.Toadressthisissueweproposetolearnspatio-temporalgridsdirectlyfromthevideodata.Weproposenewsegmentationgrids,namedAdaptiveGrid(AG),whicharelearntand

adaptedtoagiventrainingdataset.OuradapativegridsarethenexploitedbyaBag-of-Wordsmodelattheaggregationstepforactionrecognition.Ourproposalisevaluatedon4publiclyavailabledatasetsshowinganimprovementoverthestate-of-art.

8667-10, Session 2

Exploiting visual search theory to infer social interactionsPaoloRota,Duc-TienDang-Nguyen,NicolaConci,NicuSebe,Univ.degliStudidiTrento(Italy)

Inthispaperweproposeanewmethodtoinferhumansocialinteractionsusingtypicaltechniquesadoptedinliteratureforvisualsearchandinformationretrieval.Themainpieceofinformationweusetodiscriminateamongdifferenttypesofinteractionsisprovidedbyproxemicscuesacquiredbyatracker,andusedtodistinguishbetweenintentionalandcasualinteractions.Theproxemicsinformationhasbeenacquiredthroughtwodifferentmetrics:thefirstoneisrelatedtothecurrentdistancebetweensubjects,whilethesecondonemeasurestheO-spacesynergybetweensubjects.TheobtainedvaluesaretakenateverytimestepinatemporalslidingwindowandthenuncorrelatedusingtheDFTtransformation.Theobtainedfeaturesareeventuallymergedintoanuniquearray.AllthepossibleDFTtracksoftheavailabledatasetareclusteredviaK-meansalgorithm.TheclustersarereorganizedusingasecondtemporalwindowintoaBagOfWordsframework,soastobuildthefeaturevectorthatwillfeedtheSVMclassifier.

8667-11, Session 3

Machine perception for content discovery at YouTube (Keynote Presentation)PaulNatsev,Google(UnitedStates)

YouTube’smissionisforYOUtodiscoverandshapetheworldthroughvideo.Attheheartofthismissioniscontentdiscovery,ortheproblemoffindinginterestingcontentrelevanttoagiventopicoruser.ThisproblemisparticularlychallenginggiventhevarietyandvolumeofYouTubevideos:morethananhourofvideoisuploadedtoYouTubeeverysecond(that’smorethantenyearsworthofcontenteveryday).Inthistalk,IwillgiveanoverviewofsomeworkinthemachineperceptiondepartmentatGoogleResearchaimingtoimprovecontentdiscoveryatYouTube.Specifically,IwillpresentseveralcasestudiesofapplyingmachineperceptionandmachinelearningatYouTubescaletotackleproblemssuchasautomaticallyidentifyingandlabelingcelebritiesandtouristlandmarksinvideo,taggingvideossemantically,discoveringtalentonYouTube,andusinggamestocrowdsourcevideodiscoveryonYouTube.

8667-12, Session 3

Presentation video retrieval using automatically recovered slide and spoken textMatthewL.Cooper,FXPaloAltoLab.(UnitedStates)

Videoisbecomingaprevalentmediumfore-learning.Lecturevideoscontaintextinformationinboththevisualandauralchannels:thepresentationslidesandlecturer’sspeech.Thispaperexaminestherelativeutilityofautomaticallyrecoveredtextfromthesesourcesforlecturevideoretrieval.Toextractthevisualinformation,weapplyvideocontentanalysistodetectslidesandopticalcharacterrecognitiontoobtaintheirtext.Automaticspeechrecognitionisusedsimilarlytoextractspokentextfromtherecordedaudio.Weperformcontrolledexperimentswithmanuallycreatedgroundtruthforboththeslideandspokentextfrommorethan60hoursoflecturevideo.Wecomparetheautomaticallyextractedslideandspokentextintermsofaccuracyrelativetogroundtruth,overlapwithoneanother,andutilityforvideoretrieval.Resultsrevealthatautomaticallyrecoveredslidetextand

Conference 8667D


spokentextcontaindifferentcontentwithvaryingerrorprofiles.Experimentsdemonstratethatautomaticallyextractedslidetextenableshigherprecisionvideoretrievalthanautomaticallyrecoveredspokentext.

8667-13, Session 3

VidCat: an image and video analysis service for personal media managementLeeBegeja,EricZavesky,ZhuLiu,DavidGibbon,RaghuramanGopalan,BehzadShahraray,AT&TLabs.Research(UnitedStates)

Thispaperpresentsasystemforstorage,organizationandconsumptionofconsumermedia.Photoandvideocaptureforconsumershasneverbeeneasier,sotheneedforanenterprise-scalemediaservicethatdoesnotburdentheuserduringupload,share,andorganizationiscritical.Cloud-basedstorageandconsumptionofpersonalphotosandvideosprovidesincreasedaccessibility,functionality,andsatisfactionformobileusers.Onecloudservicefrontierthatisrecentlygrowingisthatofpersonalmediamanagement.Inthiswork,asystemcalledVidCatispresentedthatassistsusersinthetagging,organization,andretrievaloftheirpersonalmediabyfacesandvisualcontentsimilarityinadditiontotimeanddateinformation.Itcouplesalow-levelanalysisenginewithnetwork-basedmediaservicesfortraditionalorganizationandretrievaltasks.Algorithmsthatperformautomatedcontentsegmentation,intelligentkeyframeselection,near-duplicatedetection,andfacedetectionandsimilarityaredescribedandarepresentedwithevaluationsoninternationallyrecognizeddatasets.DesigndecisionsforharmoniouscouplingoftheVidCatback-endmediaserviceandatablet-basedclientprototypewithfourpowerfulconsumerusecasesarealsodiscussed.


Audio stream classification for multimedia database searchMariaTeresaArtese,ConsiglioNazionaledelleRicerche(Italy);SimoneBianco,Univ.degliStudidiMilano-Bicocca(Italy);IsabellaGagliardi,ConsiglioNazionaledelleRicerche(Italy);FrancescaGasparini,Univ.degliStudidiMilano-Bicocca(Italy)

SearchandretrievalofhugearchivesofMultimediadataisachallengingtask.Aclassificationstepisoftenusedtoreducethenumberofentriesonwhichtoperformthesubsequentsearch.Inparticular,whennewentriesofthedatabasearecontinuouslyadded,afastclassificationbasedonsimplethresholdevaluationisdesirable.

InthisworkwepresentaCART-based(ClassificationAndRegressionTree])classificationframeworkforaudiostreamsbelongingtomultimediadatabases.ThedatabaseconsideredistheArchiveofEthnographyandSocialHistory(AESS),whichismainlycomposedofpopularsongsandotheraudiorecordsdescribingthepopulartraditionshandeddowngenerationbygeneration,suchastraditionalfairs,andcustoms.

Thepeculiaritiesofthisdatabasearethatitiscontinuouslyupdated;theaudiorecordingsareacquiredinunconstrainedenvironment;andforthenon-experthumanuserisdifficulttocreatethegroundtruthlabels.

Inourexperiments,halfofalltheavailableaudiofileshavebeenrandomlyextractedandusedastrainingset.Theremainingoneshavebeenusedastestset.Theclassifierhasbeentrainedtodistinguishamongthreedifferentclasses:speech,music,andsong.Alltheaudiofilesinthedatasethavebeenpreviouslymanuallylabeledintothethreeclassesabovedefinedbydomainexperts.


Structuring a sharded image retrieval databaseEricLiang,AvidehZakhor,Univ.ofCalifornia,Berkeley(UnitedStates)

Inpreviousworkwedescribedanapproachtolocalizationbasedinimageretrieval.Specifically,weassumecoarselocalizationbasedonGPSorcelltowerandrefineitbymatchingausergeneratedimagequerytoageotaggedimagedatabase.Wepartitiontheimagedatasetintooverlappingcells,eachofwhichcontainsitsownapproximatenearest-neighborssearchstructure.Bycombiningsearchresultsfrommultiplecellsasspecifiedbycoarselocalization,wehavedemonstratedsuperiorretrievalaccuracyonalargeimagedatabasecoveringdowntownBerkeley.Inthispaper,weinvestigatehowtoselecttheparametersofsuchasysteme.g.sizeandspacingofthecells,andshowhowthecombinationofmanycellsoutperformsasinglesearchstructureoveralargeregion.WeusetwodatasetsfromBerkeleyandOaklandtodemonstrateourresults.Weconcludethat(a)cellradiusshouldbechosensuchthatthenumberoffeaturesisafewmillionratherthanafewtensofmillion;(b)evenwhencellsof20millionfeaturesareused,itispreferabletocombinetheirresultsratherthanuseasinglelargecell;(c)acelldensitywheretheradiusofcellisequaltothedistancebetweenthecenterofcellsoffersthebestaccuracyandcomplexityperformance.


Diversification of visual media retrieval results using saliency detectionOlegMuratov,GiuliaBoato,FrancescoG.B.DeNatale,Univ.degliStudidiTrento(Italy)

Diversificationofretrievalresultsallowsforbetterandfastersearch.

Recentlytherehasbeenproposeddifferentmethodsfordiversificationofimageretrievalresultsmainlyutilizingtextinformationandtechniquesimportedfromnaturallanguageprocessingdomain.However,imagescontainvisualinformationthatisimpossibletodescribeintextandtheuseofvisualfeaturesisinevitable.Visualsaliencyisinformationaboutthemainobjectofanimageimplicitlyincludedbyhumanswhilecreatingvisualcontent.Forthisreasonitisnaturallytoexploitthisinformationforthetaskofdiversicationofthecontent.

Inthisworkwestudywhethervisualsaliencycanbeusedforthetaskofdiversificationandproposeamethodforre-rankingimageretrievalresultsusingsaliency.Theevaluationhasshownthattheuseofsaliencyinformationresultsinhigherdiversityofretrievalresults.

Conference 8667D


Technologies- 3D Imaging, Interaction,

and Metrology

- Visualization, Perception and Color

- Image Processing

- Image Capture

- Multimedia Processing and Applications

- Computer Vision

Conferences and Courses2–6February2014

Location Hilton San Francisco, Union SquareSan Francisco, California, USA

ElectronicImaging

2014

We’re moving to San Francisco in 2014!

Technologies for digital imaging systems, 3D display, image quality, multimedia, and mobile applications

Mark your calendarwww.electronicimaging.org

Ei13 Abstracts L

Documents

Transcript of Ei13 Abstracts L