PC GAMES Media Technologies Theodoros Nikitopoulos Chamilothoris.
Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods...
Transcript of Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods...
![Page 1: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/1.jpg)
CS639:DataManagementfor
DataScienceLecture17:EvaluatingMachineLearningMethods
TheodorosRekatsinas
1
![Page 2: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/2.jpg)
Announcements
1. Youcanseeyourexamsduringofficehours
2. Homeworkwillbeannouncedlaterthisweek;wewillhaveonlytwomoreprojectsnotthree
2
![Page 3: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/3.jpg)
Today
1. EvaluatingMLmodels
3
![Page 4: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/4.jpg)
Howcanwegetanunbiasedestimateoftheaccuracyofalearnedmodel?
![Page 5: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/5.jpg)
Testsets
• Howcanwegetanunbiasedestimateoftheaccuracyofalearnedmodel?
• Whenlearningamodel,youshouldpretendthatyoudon’thavethetestdatayet
• Ifthetest-setlabelsinfluencethelearnedmodelinanyway,accuracyestimateswillbebiased
5
![Page 6: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/6.jpg)
Learningcurves
• Howdoestheaccuracyofalearningmethodchangeasafunctionofthetraining-setsize?
• Thiscanbeassessedbylearningcurves
![Page 7: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/7.jpg)
Learningcurves
• Givenatraining/testsetpartition• Foreachsamplesizesonthelearningcurve
• (optionally)repeatntimes• Randomlyselectsinstancesfromthetrainingset• Learnthemodel• Evaluatethemodelonthetestsettodetermineaccuracya• Plot(s,a)
![Page 8: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/8.jpg)
Validation(tuning)sets
• Supposewewantunbiasedestimatesofaccuracyduringthelearningprocess(e.g.tochoosethebestlevelofdecision-treepruning)?
Partition training data into separate training/validation sets
![Page 9: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/9.jpg)
Limitationsofusingasingletraining/testpartition• Wemaynothaveenoughdatatomakesufficientlylargetrainingandtestsets
• Alargertestsetgivesusmorereliableestimatesofaccuracy(i.e.,alowervarianceestimate)
• But… alargertrainingsetwillbemorerepresentativeofhowmuchdataweactuallyhaveforlearningprocess
• Asingletrainingsetdoesnottellushowsensitiveaccuracyistoaparticulartrainingsample
![Page 10: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/10.jpg)
Randomresampling
• Wecanaddressthesecondissuebyrepeatedlyrandomlypartitioningtheavailabledataintotrainingandsetsets.
![Page 11: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/11.jpg)
Stratifiedsampling
• Whenrandomlyselectingtrainingorvalidationsets,wemaywanttoensurethatclassproportionsaremaintainedineachselectedset
![Page 12: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/12.jpg)
Crossvalidation
![Page 13: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/13.jpg)
Crossvalidationexample
• Supposewehave100instances,andwewanttoestimateaccuracywithcrossvalidation
![Page 14: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/14.jpg)
Crossvalidationexample
• 10-foldcrossvalidationiscommon,butsmallervaluesofnareoftenusedwhenlearningtakesalotoftime
• Inleave-one-outcrossvalidation,n=#instances• Instratifiedcrossvalidation,stratifiedsamplingisusedwhenpartitioningthedata
• CVmakesefficientuseoftheavailabledatafortesting• Notethatwheneverweusemultipletrainingsets,asinCVandrandomresampling,weareevaluatingalearningmethodasopposedtoanindividuallearnedmodel
![Page 15: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/15.jpg)
Internalcrossvalidation
• Insteadofasinglevalidationset,wecanusecross-validationwithinatrainingsettoselectamodel(e.g.tochoosethebestlevelofdecision-treepruning)
![Page 16: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/16.jpg)
Confusionmatrices
• Howcanweunderstandwhattypesofmistakesalearnedmodelmakes?
![Page 17: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/17.jpg)
Confusionmatrixfor2-classproblems
![Page 18: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/18.jpg)
Isaccuracyanadequatemeasureofpredictiveperformance?
![Page 19: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/19.jpg)
Otheraccuracymetrics
![Page 20: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/20.jpg)
ROCcurves
![Page 21: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/21.jpg)
ROCcurveexample
![Page 22: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/22.jpg)
ROCcurvesandmisclassificationcosts
![Page 23: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/23.jpg)
AlgorithmforcreatinganROCcurve
![Page 24: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/24.jpg)
Otheraccuracymetrics
![Page 25: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/25.jpg)
Precision/recallcurves
![Page 26: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/26.jpg)
ToAvoidCross-ValidationPitfalls
![Page 27: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/27.jpg)
ToAvoidCross-ValidationPitfalls
![Page 28: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/28.jpg)
ToAvoidCross-ValidationPitfalls
![Page 29: Lecture 18 Eval ML - GitHub Pages · 2019-05-08 · Lecture 17: Evaluating Machine Learning Methods Theodoros Rekatsinas 1. Announcements 1. You can see your exams during office hours](https://reader035.fdocuments.in/reader035/viewer/2022070913/5fb479067dd4c31efd70b1a3/html5/thumbnails/29.jpg)
AblationStudies