Special Topics in Educational Data Mining
description
Transcript of Special Topics in Educational Data Mining
![Page 1: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/1.jpg)
Special Topics in Educational Data Mining
HUDK5199Spring term, 2013February 25, 2013
![Page 2: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/2.jpg)
Today’s Class
• Feature Engineering and Distillation - What
![Page 3: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/3.jpg)
Special Rules for Today
• Everyone Votes• Everyone Participates
![Page 4: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/4.jpg)
Feature Engineering
• Not just throwing spaghetti at the wall and seeing what sticks
![Page 5: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/5.jpg)
Construct Validity Matters!
• Crap features will give you crap models
• Crap features = reduced generalizability/more over-fitting
• Nice discussion of this in Sao Pedro paper I assigned
![Page 6: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/6.jpg)
What’s a good feature?
• A feature that is potentially meaningfully linked to the construct you want to identify
![Page 7: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/7.jpg)
Let’s look at some features used in real models
• Split into groups of 3-4
• Take a sheet of features
• Which features (or combinations) can you come up with “just so” stories for why they might predict the construct?
• Are there any features that seem utterly irrelevant?
![Page 8: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/8.jpg)
Each group
• Tell us what your construct is
• Tell us your favorite “just so story” (or two) from your features
• Tell us which features look like junk
• Everyone else: you have to give the feature a thumbs-up or thumbs-down
![Page 9: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/9.jpg)
Now…
• Let’s take a break
![Page 10: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/10.jpg)
I need 3 volunteers
![Page 11: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/11.jpg)
Volunteers
• #1, #2: “Wee dee dee dee”
• #3: “Weema wompa way”
![Page 12: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/12.jpg)
Everyone else
• Has to sing a verse of “In the jungle…”
• With an animal that no one else has mentioned yet
![Page 13: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/13.jpg)
In the jungle….
![Page 14: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/14.jpg)
Now that we’re all feeling creative
![Page 15: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/15.jpg)
Now that we’re all feeling creative
• Break into *different* 3-4 person groups than last time
![Page 16: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/16.jpg)
Now that we’re all feeling creative
• Make up features for Assignment 4
• You need to– Come up with a new feature– Justify how you can would it from the data set– Justify why it would work
![Page 17: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/17.jpg)
I need a volunteer
![Page 18: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/18.jpg)
I need a volunteer
• Your task is to write down the features suggested
• And the counts for thumbs up/thumbs down
![Page 19: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/19.jpg)
Now…
• Each group needs to read their favorite feature to the class and justify it
• Who thinks this feature will improve prediction of off-task behavior?
• Who doesn’t?
• Thumbs up, thumbs down!
![Page 20: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/20.jpg)
Comments or Questions
• About Assignment 4?
![Page 21: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/21.jpg)
Special Request
• Bring a print-out of your Assignment 4 solution to class
![Page 22: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/22.jpg)
Next Class
• Monday, February 27
• Feature Engineering and Distillation – HOW
• Assignment Due: 4. Feature Engineering
![Page 23: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/23.jpg)
Excel
• Plan is to go as far as we can by 5pm• We will continue after next class session
• Vote on which topics you most want to hear about
![Page 24: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/24.jpg)
Topics• Using average, count, sum, stdev (asgn. 4 data set)• Relative and absolute referencing (made up data)• Copy and paste values only (made up data)• Using sort, filter (asgn. 4 data set)• Making pivot table (asgn. 4 data set)• Using vlookup (Jan. 28 class data set)• Using countif (asgn. 4 data set)• Making scatterplot (Jan. 28 class data set)• Making histogram (asgn. 4 data set)• Equation Solver (Jan. 28 class data set)• Z-test (made up data)• 2-sample t-test (made up data)
• Other topics?
![Page 25: Special Topics in Educational Data Mining](https://reader036.fdocuments.in/reader036/viewer/2022062301/56815236550346895dc07c46/html5/thumbnails/25.jpg)
The End