ei.is.tue.mpg.de/~msajjadi
Peer-Grading in a Course on Algorithms and Data Structures:
Machine Learning Algorithms do not Improve over Simple Baselines
Mehdi S. M. Sajjadi (MPI IS Tübingen)
Morteza Alamgir (Uni Hamburg)
Ulrike von Luxburg (Uni Tübingen)
Learning @ Scale, 26.04.2016
Peer-Grading Setting
- How to aggregate grades?
- Are peer grades as accurate as TA grades?
- Challenges:
  - Inexperience
  - Bias
  - Limited number of grades
  - Cheating
Our University Class
- Algorithms and Data Structures (tasks ranging from easy to demanding)
- 1 semester, 220 students, ~14,000 grades
- Groups of 3 students for solving exercises
- Detailed scoring rubric
- Grading done by everyone on their own
- All submissions graded in 3 different ways:
  - Self-assessment, peer-grading, TA grading
A glance at one exercise…
- Easy multiple choice with proofs, 0 or 1 point each
- Mean absolute deviation from TA grade:
  - 0.08 Peer
  - 0.12 Self
  - 0.08 Peer + Self
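The deviation measure above is straightforward to compute; a minimal sketch with made-up grades (the values below are illustrative, not the course data):

```python
# Mean absolute deviation (MAD) of aggregated grades from the TA grade.
# The grades below are made-up examples; the real dataset is on the
# authors' websites.

def mad(grades, ta_grades):
    """Mean absolute deviation between two grade lists of equal length."""
    return sum(abs(g - t) for g, t in zip(grades, ta_grades)) / len(ta_grades)

ta = [1, 0, 1, 1, 0]      # TA grades (0/1 multiple-choice points)
peer = [1, 0, 1, 0, 0]    # aggregated peer grades, one disagreement
print(mad(peer, ta))      # 0.2
```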
…and another exercise.
- Design an algorithm, prove its runtime
- Algorithms with bad runtime
- Students did not realize their mistake
- Mean absolute deviation from TA grade:
  - 0.18 Peer
  - 0.28 Self
  - 0.21 Peer + Self
Overall Grade Comparison
- Overall bias:
  - 0.06 Peer
  - 0.12 Self
- Large variance
- A good base for improvements?
Probabilistic Model
- z_ag ~ N(s_a + b_g, 1/r_g)   (reported scores)
- s_a ~ N(mu, sigma^2)         (true scores)
- b_g ~ N(0, eta^2)            (bias)
- r_g ~ Gamma(alpha, beta)     (reliability)
- EM algorithm for parameter estimation
- Works well on artificial data
  - Other algorithms in the literature yield similar results
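A simplified coordinate-ascent sketch of fitting this kind of model — not the authors' exact EM; the update rules, priors, and default hyperparameters here are simplifying assumptions:

```python
import statistics

# Simplified alternating estimation for the peer-grading model (a sketch,
# not the authors' exact EM; updates and priors are simplified):
#   z_ag ~ N(s_a + b_g, 1/r_g)   reported scores
#   s_a  ~ N(mu, sigma^2)        true scores
#   b_g  ~ N(0, eta^2)           grader bias
#   r_g  ~ Gamma(alpha, beta)    grader reliability

def estimate(z, n_iters=50, mu=0.5, sigma2=0.1, eta2=1.0):
    """z is a list: z[a] maps grader id -> score reported for assignment a."""
    graders = sorted({g for scores in z for g in scores})
    s = [statistics.mean(scores.values()) for scores in z]  # init: plain mean
    b = {g: 0.0 for g in graders}   # bias estimates
    r = {g: 1.0 for g in graders}   # reliability (precision) estimates
    for _ in range(n_iters):
        # True scores: precision-weighted mean of debiased grades, plus prior.
        for a, scores in enumerate(z):
            num = mu / sigma2 + sum(r[g] * (zg - b[g]) for g, zg in scores.items())
            den = 1.0 / sigma2 + sum(r[g] for g in scores)
            s[a] = num / den
        # Biases: per-grader mean residual, shrunk toward 0 by the prior.
        for g in graders:
            resid = [zg - s[a] for a, scores in enumerate(z)
                     for gg, zg in scores.items() if gg == g]
            b[g] = sum(resid) / (len(resid) + 1.0 / eta2)
        # Reliabilities: regularized inverse residual variance per grader.
        for g in graders:
            sq = [(zg - s[a] - b[g]) ** 2 for a, scores in enumerate(z)
                  for gg, zg in scores.items() if gg == g]
            r[g] = (len(sq) + 1.0) / (sum(sq) + 1.0)
    return s, b, r

# Toy check: grader 0 consistently grades 0.2 higher than grader 1,
# so the model should assign grader 0 the larger bias estimate.
z = [{0: 0.8, 1: 0.6}, {0: 0.5, 1: 0.3}, {0: 1.0, 1: 0.8}]
s, b, r = estimate(z)
```

The priors keep the model identifiable: without them, shifting all biases up and all true scores down by the same constant would leave the likelihood unchanged.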
C. Piech et al. Tuned Models of Peer Assessment in MOOCs. EDM, 2013.
Results on our dataset
What about the ranking?
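The slides do not specify the ranking metric; one common choice is Spearman's rank correlation between TA grades and aggregated peer grades, sketched here (assuming no tied grades):

```python
# Spearman rank correlation between two grade lists -- a sketch of one
# common way to compare rankings (the slides do not name the metric).
# The simple formula below assumes no tied grades.

def ranks(xs):
    """Rank of each element (0 = smallest), assuming distinct values."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    r = [0] * len(xs)
    for rank, i in enumerate(order):
        r[i] = rank
    return r

def spearman(xs, ys):
    n = len(xs)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(xs), ranks(ys)))
    return 1 - 6 * d2 / (n * (n * n - 1))

print(spearman([3, 1, 2], [3, 1, 2]))  # 1.0 for identical rankings
```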
Why no improvements?
Possible Reasons I
- Amount of data? (6 peer grades per assignment)
  - Same results with 15 peer grades
  - Bias and reliability estimated over all assignments
- Wrong priors?
  - They barely change the results
- Unmotivated students / useless reviews?
  - Very few, and the models should benefit from them anyway
- Different types of assignments?
  - Grouping them by grading difficulty did not change the results
Possible Reasons II
- (Different) TAs as baseline?
  - The 6 TAs graded similarly
  - Some noise in the ground truth does not hurt the models
- Bias vs. reliability!
  - Most errors are due to low reliabilities, not bias
  - This kind of error is hard to correct (even in artificial models)
- Other sources of errors!
  - Errors are often a result of lacking knowledge
  - Hard to correct for this
Conclusion
- The models fail to improve over the mean estimator
- Main reasons:
  - The sources of error differ from the model assumptions
  - Not much bias
  - Reliability is difficult to estimate and correct for
- Are complicated models acceptable to students?
- Is peer grading a viable option for university courses?
- The dataset is publicly available on our websites