One hidden layer Neural Network Neural Networks …cs230.stanford.edu/files/C1M3.pdf ·...
Transcript of One hidden layer Neural Network Neural Networks …cs230.stanford.edu/files/C1M3.pdf ·...
Andrew Ng
What is a Neural Network?
!"!#!$
%&
x
w
b
' = )*! + , - = .(') ℒ(-, %)
x
4["]
,["]'["] = 4["]! + ,["] -["] = .('["]) '[#] = 4[#]-["] + ,[#] -[#] = .('[#]) ℒ(-[#], %)
!"!#!$
%&
4[#]
,[#]
Andrew Ng
! = #!$ + &
#!$ + &'
$($)$*
+(!) ' = ./!
' = +(!)
Neural Network Representation
$($)$*
./
$($)$*
./
Andrew Ng
Neural Network Representation
!"!#!$
%&
'""
'#"
'$"
'("
)"" = +"" ,! + /"["], '"["] = 3()"" )
)#" = +#" ,! + /#
["], '#["] = 3()#" )
)$" = +$" ,! + /$["], '$["] = 3()$" )
)(" = +(" ,! + /(
["], '(["] = 3()(" )
Andrew Ng
Neural Network Representation learningGiven input x:
! " = $ " % + ' "
( " = )(! " )
! , = $ , ( " + ' ,
( , = )(! , )
%"%,%-
./
(""
(,"
(-"
(0"
Andrew Ng
Vectorizing across multiple examples
!"!#!$
%&
' " = ) " ! + + "
, " = -(' " )' # = ) # , " + + #
, # = -(' # )
Andrew Ng
! " ($) = ' " (($) + * "
+ " ($) = ,(! " $ )! - ($) = ' - + " ($) + * -
+ - ($) = ,(! - $ )
Vectorizing across multiple examplesfor i = 1 to m:
Andrew Ng
!"!#!$
%&
Recap of vectorizing across multiple examplesfor i = 1 to m
' " ()) = , " !()) + . "
/ " ()) = 0(' " ) )' # ()) = , # / " ()) + . #
/ # ()) = 0(' # ) )…1 = !(") !(#) !(2)
/["](#)A["] = /["](") /["](2)…
6 " = , " 1 + . "
7 " = 0(6 " )6 # = , # 7 " + . #
7 # = 0(6 # )
Andrew Ng
Activation functions
!"!#!$
%&
' " = ) " ! + + "
, " = -(' " )' # = ) # , " + + #
, # = -(' # )
Given x:
Andrew Ng
Activation function
! " = $ " % + ' "
( " = )["](! " )! . = $ . ( " + ' .
( . = )[.](! . )
Given x:
%"%.%/
01
Andrew Ng
![#] = &[#]' + )[#]'
&["]
)["]
+[#] = ,(![#]) ℒ(+[0], y)![0] = &[0]' + )[0] +[0] = ,(![0])
Neural network gradients&[$]
)[$]
Andrew Ng
!"[$] = !'[$]( ) *
!+[$] = !'[$]
!'[)] = " $ ,!'[$] ∗ .[)]′(z ) )
!"[)] = !'[)]3,
!+[)] = !'[)]
Summary of gradient descent!'[$] = ([$] − 5
Andrew Ng
!"[$] = '[$] − )
!*[$] = !"[$]' + ,
!-[$] = !"[$]
!"[+] = * $ .!"[$] ∗ 0[+]′(z + )
!*[+] = !"[+]5.
!-[+] = !"[+]
!6["] = 7["] − 8
!*["] = 1:!6["]7 $ ,
!-["] = 1:;<. >?:(!6 " , '5A> = 1, BCC<!A:> = DE?C)
!6[$] = * " %!6["] ∗ 0[$]′(Z $ )
!*[$] = 1:!6[$]G%
!-[$] = 1:;<. >?:(!6 $ , '5A> = 1, BCC<!A:> = DE?C)
Summary of gradient descent