Stories Behind Kaggle Competitions with Wendy Kan from Kaggle
Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com
-
Upload
dwight-richards -
Category
Documents
-
view
226 -
download
0
Transcript of Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com
![Page 1: Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com](https://reader036.fdocuments.in/reader036/viewer/2022082505/56649f135503460f94c26c5e/html5/thumbnails/1.jpg)
Mebi 591D –BHI Kaggle Class
Baselines
http://winter2014-mebi591d-kaggleclass.weebly.com/
![Page 2: Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com](https://reader036.fdocuments.in/reader036/viewer/2022082505/56649f135503460f94c26c5e/html5/thumbnails/2.jpg)
Baseline (I.) What is a baseline for?
(a) a reasonable 1st approach to your problem (b) meant to be quick and to get system running (c) allow you to see improvements
What should be included? (a) your system should be able to take in any test
set and output your prediction (b) you should be able to give evaluation scores on
any test set presented (c) you should be able to visualize which instances
your errors occur inDue in 4 weeks: Start early!
![Page 3: Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com](https://reader036.fdocuments.in/reader036/viewer/2022082505/56649f135503460f94c26c5e/html5/thumbnails/3.jpg)
Baseline (II.) Examples of how to use your baseline
Case 1. Named-entity recognition task, choose to use sequential CRF implementation Baseline: use unigram features Further experiments: bigram features, POS, etc Change to 2-step classification, change tagging
![Page 4: Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com](https://reader036.fdocuments.in/reader036/viewer/2022082505/56649f135503460f94c26c5e/html5/thumbnails/4.jpg)
Baseline (II.)
Case 2. Predict stock market price Baseline: HMM – previous stock price same
time Further experiments: add derivative
features, add features from news Can try several other classifications Can use some kind of boosting algorithm
![Page 5: Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com](https://reader036.fdocuments.in/reader036/viewer/2022082505/56649f135503460f94c26c5e/html5/thumbnails/5.jpg)
Evaluation Metrics (I.) RECAP from last time --- never evaluate on test set
when building your system -- why? You are cheating! Overtraining on mistakes and noise (won’t generalize)
Using a development set or cross-validation A development set is another set you split out just like
the test set (~10%) Used to evaluate Used for tuning parameters
Cross-validation sets Split data to N pieces, use N-1 pieces as training, 1 as test,
then repeat Nx to get variations of scores
![Page 6: Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com](https://reader036.fdocuments.in/reader036/viewer/2022082505/56649f135503460f94c26c5e/html5/thumbnails/6.jpg)
Evaluation Metrics (II.) Multi-class categorization
Precision, recall, f1-score AUC curve Why may these not measure things well?
Class imblance! Use micro- and macro- definitions
Numeric predictions RMS-error Nearest neighbor error
![Page 7: Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com](https://reader036.fdocuments.in/reader036/viewer/2022082505/56649f135503460f94c26c5e/html5/thumbnails/7.jpg)
Error analysis Good to see where your system makes error so
you can introduce better features (or a better model)
Good to see where you are getting false positives and false negatives
Confusion matrices for classification are helpful It’s a (n label)x(n label) matrix where
rows/columns represent gold and system predictions
Numbers in matrix represent counts
![Page 8: Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com](https://reader036.fdocuments.in/reader036/viewer/2022082505/56649f135503460f94c26c5e/html5/thumbnails/8.jpg)
Tasks Decide strategy(next week)
What is baseline How work will be divided What resources you will use
Baseline system (4 weeks) Include prediction module Includes evaluation module Be able to visualize your errors for error
analysis