Report - Evaluation Experiment Result Models Evidence for answering ... · • EVAL F1 : Overall F1-score on CoNLL-2012-test set • Average disagreement rate : Average d(x,y) across examples.

Please pass captcha verification before submit form