Report - Deep Learning via Hessian-free Optimization
asamir/cifar/HFO_James.pdf

Gradient descent is bad at deep learning (cont.)
Two hypotheses for why gradient descent fails: increased frequency
