Canadian Bioinformatics Workshops · Module bioinformatics.ca •Data Communication / Presentation...
Transcript of Canadian Bioinformatics Workshops · Module bioinformatics.ca •Data Communication / Presentation...
CanadianBioinformaticsWorkshops
www.bioinformatics.ca
Module bioinformatics.ca2Module #: Title of Module
Module7DataVisualization
AnamariaCrisan
Module bioinformatics.ca
LearningObjectivesofModule
• Bytheendofthislectureyoushouldbeableto:– Understanddatavisualizationdesignatahigh-level– Breakdownandevaluatedatavisualizations– Beawareofgenomicepidemiologydatavisualizationtoolsandothertools
Module bioinformatics.ca
• Wewon’twatchthiswholevideo,butveryworthspendingtimetowatchonyourownhttps://www.youtube.com/watch?v=hVimVzgtD6w
So- whydatavisualization?
Module bioinformatics.ca
https://www.youtube.com/watch?v=j4Ut4krp8GQ
So- whydatavisualization?
Module bioinformatics.ca
Datavisualizationisn’tjust anartproject
Module bioinformatics.ca
Design Art
Ideastakenfrom@rachelbinx’s 2016OpenVistalkAndhttp://featureguru.com/art-vs-design.html
DataVisualization(Iarguedatavisualizationismuchmoreaboutdesign)
Artvs.design
Module bioinformatics.ca
Function&form
Form(appearance)follows function(howitworks)
Module bioinformatics.ca
• DataCommunication/Presentation– Showsomeresultorfinding– Figuresinpapersorpublicreports(static)– Videoswithpapers(animated,butnotinteractive)– Example:Nextstrain.org;microreact.org (interactive)
• DataExploration– Findtrends,patterns,orclusters(amongotherthings)– Example:R(Shiny);Excel;Tableau(withsomelimits);D3
• HypothesisAssessment– Identifywhetherdatacontainspre-conceivedtrend,pattern,orcluster
Howdoweusedatavisualization?
Module bioinformatics.ca
• PartI:DataVisualizationDesign&Analysis– Principles&methodsforthinkingsystematicallyaboutvisualizationdesign&evaluation
• PartII:Tools&Examples– Overviewofsomeexistingtools– We’lldiveintosomeofthemmoredeeplyinthetutorial
Overview
Module bioinformatics.ca
Part1:DataVisualizationDesign&Analysis
Module bioinformatics.ca
Layersofadatavisualization
Module bioinformatics.ca
Why?Whydoyouneedtovisualizedata?
What?Whatkindofdataisbeingvisualized?
How?Howisdatabeingvisualized?Howdoothersinteractwiththedata?
(Motivation)
(Data)
(Visualand InteractionDesign)
Datavisualizationinthreequestions
Layersofadatavisualization
Peopletendtojumptothislevelandignorewhyandwhat
Module bioinformatics.ca
Why?
What?
How?
Design EvaluationDoesthevisualizationsolvearelevantproblem?
Areyouusingtherightdata,orderiving therightdata?
Arethevisualandinteractivedesignchoicesappropriate?
LayersofadatavisualizationDatavisualizationinthreequestions
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
WHY WHAT HOW
T.Munzner (2009)- Anestedmodelforvisualizationdesignandvalidation
Introductiontothenestedmodel
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
DESIGN
EVALUATION
Useaniterative process
Introductiontothenestedmodel
Module bioinformatics.ca
Aniterativeapproachtodevelopmentallowsustogetfeedbackbeforecommittingtoineffectivedesignchoices
Operationalizingthenestedmodel
Useaniterative agile process
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
DESIGN
ADesignStudyMethodology(Paraphrased)
M.Sedlmair etal.(2012)- Adesignstudymethodology:lessonsfromthetrenches
1. Partnerwithagroupofstakeholdersthathaveaproblem
Operationalizingthenestedmodel
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
DESIGN
2. Askwhatdata stakeholdersuse(isitavailable)?
3. Askwhatstakeholdersdowiththedata[tasks]
ADesignStudyMethodology(Paraphrased)
Operationalizingthenestedmodel
M.Sedlmair etal.(2012)- Adesignstudymethodology:lessonsfromthetrenches
Module bioinformatics.ca
Note!Don’tjustuserawdata
Operationalizingthenestedmodel
T.Munzner (2014)– VisualizationDesignandAnalysis
Module bioinformatics.ca
Note!Don’tjustuserawdata
Operationalizingthenestedmodel
https://xkcd.com/1138/
Example:Using(raw)absolutecounts,whenyouactuallyneedtoderiverates
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
DESIGN
4. Exploreifothervisualizationshaveaddressedthisproblem andsetoftasks&data
5. Implementyourownsolution
ADesignStudyMethodology(Paraphrased)
Operationalizingthenestedmodel
M.Sedlmair etal.(2012)- Adesignstudymethodology:lessonsfromthetrenches
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
DESIGN
4. Exploreifothervisualizationshaveaddressedthisproblem andsetoftasks&data
5. Implementyourownsolution(partorallofthatsolutioncouldbeanewalgorithm)
ADesignStudyMethodology(Paraphrased)
Operationalizingthenestedmodel
M.Sedlmair etal.(2012)- Adesignstudymethodology:lessonsfromthetrenches
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
EVALUATION
6. Testmultiplealternatives(includingnewonesyoudevelop)withstakeholders
7. Gatherqualitative&quantitativeevaluationdata
Operationalizingthenestedmodel
M.Sedlmair etal.(2012)- Adesignstudymethodology:lessonsfromthetrenches
ADesignStudyMethodology(Paraphrased)
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
DESIGN
EVALUATION
FORMATIVEEVALUATIONS
Manycyclesofformativeevaluations
ADesignStudyMethodology(Paraphrased)
Operationalizingthenestedmodel
M.Sedlmair etal.(2012)- Adesignstudymethodology:lessonsfromthetrenches
Module bioinformatics.ca
1. Partnerwithagroupofstakeholdersthathaveaproblem
2. Askwhatdata stakeholdersuse(isitavailable)?
3. Askwhatstakeholdersdowiththedata[tasks]
4. Exploreifothervisualizationshaveaddressedthisproblem andsetoftasks&data
5. Implementyourownsolution(visand/oralgorithm)
6. Testmultiplealternatives(includingnewonesyoudevelop)withstakeholders
7. Gatherqualitative&quantitativeevaluationdata
Puttingitalltogether
M.Sedlmair etal.(2012)- Adesignstudymethodology:lessonsfromthetrenches
Module bioinformatics.ca
Why
What
How
How
Design
Eval.
Puttingitalltogether
QualitativeMethods,DomainKnowledge
Qualitative&QuantitativeMethods
Design&CognitiveScience
ComputerScience
Methodology
T.Munzner (2014)– VisualizationDesignandAnalysis
Module bioinformatics.ca
• InformationVisualization(infovis)isadisciplinewithincomputersciencethatstudiesdatavisualization
• Infovisisayoung&evolvinginterdisciplinaryarea
• Alternativemethodsforvisualizationdesign&evaluationexist,butarenotpresentedhere
Anoteondatavisualizationmethods
Module bioinformatics.ca
DomainProblem
Data+Tasks
Visual+InteractionDesignChoices
Algorithm
Divingintohow
WHY WHAT HOW
• Lastbitoftheory– let’stalkmoreaboutthehow• Thiswillbeacherry-picked,high-leveloverview
Module bioinformatics.ca
Visualizationbasics:display&interaction
T.Munzner (2014)– VisualizationDesignandAnalysis
Module bioinformatics.ca
Encoding&decoding
R.Kosara (EagerEyes)– https://eagereyes.org/basics/encoding-vs-decoding
Module bioinformatics.ca
Visualizationbasics:marks&channelsMark:Geometricprimitive(basicbuildingblock)
Channel:Controlstheappearanceofmarks
T.Munzner (2014)– VisualizationDesignandAnalysis
Module bioinformatics.ca
Visualizationbasics:marks&channels
T.Munzner (2014)– VisualizationDesignandAnalysis
Module bioinformatics.ca
Colour=Continent
Transparency=Density
Revealdetailonhover
X-axisposition:Expenditure
y-axispo
sition:life
expectancy
Visualizationbasics:marks&channelsExample of these concepts in action
Module bioinformatics.ca
Differentdataset,butit’susingsomeofthesamechannels(colour,transparency,size)
Visualizationbasics:marks&channelsExample of these concepts in action
Thisisaterriblevisualization
(morebaddatavisat:viz.wtf )
Module bioinformatics.ca
J.Heer (2010)– CrowdsourcingGraphicalPerception:UsingMechanicalTurktoAssessVisualizationDesign
• Calculated(log)errorratesfordifferenttypesofbasicdatavisualizations
• Cleveland&McGill(1984),firststudy
• CrowdsourcedResults(2010),usingMechanicalTurk
• Reproducibleresults• Differencesinerrorratesfor
differentvisualizations
Visualizationbasics:marks&channels
Module bioinformatics.ca
Visualizationdesignaffectshowpeoplewillunderstand,interpret,andactonunderlyinginformationMoreonperception:http://bit.ly/2oQZJBC
Encoding&decoding
Module bioinformatics.ca
Visualizationbasics:display&interaction
T.Munzner (2014)– VisualizationDesignandAnalysis
Module bioinformatics.ca
Visualizationbasics:display&interaction
T.Munzner (2014)– VisualizationDesignandAnalysis
Module bioinformatics.ca
Example:Nextstrain.org
(aboveisananimatedgif)
FilterChange
Module bioinformatics.ca
Example:TreeJuxtaposer
T.Munzner (2006)– TreeJuxtaposer:ScalableTreeComparisonusingFocus+Context withGuaranteedVisibilityhttps://www.youtube.com/watch?v=GdaPj8a9QEo
SelectJuxtapose
(aboveisananimatedgif)
ComparingPhylogeneticTrees
Module bioinformatics.ca
Example:TreeJuxtaposer
T.Munzner (2006)– TreeJuxtaposer:ScalableTreeComparisonusingFocus+Context withGuaranteedVisibilityhttps://www.youtube.com/watch?v=GdaPj8a9QEo
ComparingPhylogeneticTreesNavigateJuxtapose
(aboveisananimatedgif)
Module bioinformatics.ca
• Datavisualizationisnotjustartisticorgraphicdesign
• Thinkaboutwhy-what-how– Datavisualizationlayers:thenestedmodel– Datavisualizationmethods:designstudymethodology– We’lloperationalizetheseconceptsinthetutorial
• Relationshipbetweenhowdataisencoded andhowothersdecode theinformation– Somevisualizationseasiertounderstandthatothers– Somestrategiesforhow tovisualizedata
Takeawaysfromthispart
Module bioinformatics.ca
Part2:ExamplesandTools
Module bioinformatics.ca
Datavisualizationinthewild• Manysourcesofdatavisualizationsacrossdisciplinesandeveninpopularpublications
• ExamplesfrominfoVis community:– Mizbee– Pathfinder:VisualizingPaths– TreeJuxtaposer
• ExamplesfromMicrobialGenomicCommunity:– TheIRIDAplatformintegratesanumberoftools
• PhyloViz- online• IslandViewer
– Microreact
• Generalpurposetools:– R,Python,D3,etc.
We’lltalkmoreaboutthese
Module bioinformatics.ca
DataVisualizationToolsRecreatingonechartusing24tools:http://bit.ly/2gRGx1J
Module bioinformatics.ca
DataVisualizationToolsRecreatingonechartusing24tools:http://bit.ly/2gRGx1J
Module bioinformatics.ca
PhyloVizOnlineWe’regoingtolookattheFasta exampledataset
Module bioinformatics.ca
PhyloVizOnline
• Let’stalkthroughthedesignchoicesofthistool:• Why-what-how
– Whywasthisvisualizationdeveloped?– Whatdatadoesituse?– Howdidauthorschoosetovisualizethedata?
• Marksandchannels– Howisthedataencoded?
• Interactions– Howcanothersinteractwiththedata?
• Whatdoyouthink?– Doyouagreewiththedecisionstheymade?– Wouldyoudoitdifferently?Ifyes,how?
Module bioinformatics.ca
IslandViewer(http://bit.ly/1LFRPkV)
Module bioinformatics.ca
IslandViewer(http://bit.ly/1LFRPkV)
• Let’stalkthroughthedesignchoicesofthistool:• Why-what-how
– Whywasthisvisualizationdeveloped?– Whatdatadoesituse?– Howdidauthorschoosetovisualizethedata?
• Marksandchannels– Howisthedataencoded?
• Interactions– Howcanothersinteractwiththedata?
• Whatdoyouthink?– Doyouagreewiththedecisionstheymade?– Wouldyoudoitdifferently?Ifyes,how?
Module bioinformatics.ca
Microreact(http://bit.ly/2ooye1f)We’regoingtolookattheEbola example
Module bioinformatics.ca
Microreact(http://bit.ly/2ooye1f)
• Let’stalkthroughthedesignchoicesofthistool:• Why-what-how
– Whywasthisvisualizationdeveloped?– Whatdatadoesituse?– Howdidauthorschoosetovisualizethedata?
• Marksandchannels– Howisthedataencoded?
• Interactions– Howcanothersinteractwiththedata?
• Whatdoyouthink?– Doyouagreewiththedecisionstheymade?– Wouldyoudoitdifferently?Ifyes,how?
Module bioinformatics.ca
• Toolstobuildyourownvisualization– Includesstatic&interactivecapabilities
• Toolsthatothershavebuiltforyoutouse– Toolsandlibrariesdifferineaseofuse&flexibility– Manyoptionsavailable,choosewhatworksbestforyou
• Thinkingsystematicallyaboutdatavisualizationstools– Usingwhy-what-howtoassesspros&cons
Takeawaysfromthispart
Module bioinformatics.ca
Whathavewelearnedtoday?
Image:nounproject
Module bioinformatics.ca
• Understanddatavisualizationdesignatahigh-level– Thedifferencebetweendesign&art– Layersofadatavisualization(thenestedmodel)– Aflavour ofdatavisualizationmethodologies
• Breakdownandevaluatedatavisualizations– Usingwhy-what-how– Beingawareofencoding&decoding– Usingmarksandchannels– Thinkingthroughdisplay&interaction
• Beawareofgenomicepidemiologydatavisualizationtoolsandothertools– Generalpurposetools– Communitydriventools:infovis,microbialgenomics
Returningtothelearningobjectives
Module bioinformatics.ca
• We’llimplementashiny(https://shiny.rstudio.com/)applicationinthetutorial
• Thedesignoftheshinyapplicationwillbeinspiredbythetoolswe’veseen
Upnext
Module bioinformatics.ca
WeareonaCoffeeBreak&NetworkingSession