Roots of Equations

1

Roots of Equations

Bracketing Methods

2

Root

We are given f(x), a function of x, and we want to find α such that

f(α) = 0

α is called the root of the equation f(x) = 0, or the zero of the function f(x)

3

Example: Interest RateSuppose you want to buy an electronic appliance from a shop and you can either pay an amount of 12,000 or have a monthly payment of 1,065 for 12 months. What is the corresponding interest rate?

1)1(

)1(

n

n

x

xxPA

A is the monthly payment

P is the loan amount

x is the interest rate per period of time

n is the loan period

To find the yearly interest rate, x, you have to find the zero of

1)1(

)1(000,12065,1

1212

121212

x

xx

We know the payment formulae is:

4

Finding Roots Graphically

• Not accurate

• However, graphical view can provide useful info about a function.– Multiple roots?– Continuous?– etc.

5

The following root finding methods will be introduced:

A. Bracketing MethodsA.1. Bisection MethodA.2. Regula Falsi (False-position Method)

B. Open MethodsB.1. Fixed Point IterationB.2. Newton Raphson's MethodB.3. Secant Method

6

Bracketing Methods

Theorem: If a function f(x) is continuous in the interval [a, b] and f(a)f(b) < 0, then the equation f(x) = 0 has at least one real root in the interval (a, b).

7

Usually• f(a)f(b) > 0 implies zero or even

number of roots – [figure (a) and (c)]

• f(a)f(b) < 0 implies odd number of roots– [figure (b) and (d)]

8

Exceptional Cases

• Multiple roots– Roots that overlap at one

point.– e.g.: f(x) = (x-1)(x-1)(x-2) has

a multiple root at x=1.

• Functions that discontinue within the interval

9

Algorithm for bracketing methods

Step 1: Choose two points xl and xu such that f(xl)f(xu) < 0

Step 2: Estimate the root xr (note: xl < xr < xu)

Step 3: Determine which subinterval the root lies:

if f(xl)f(xr) < 0 // the root lies in the lower subinterval

set xu to xr and goto step 2

if f(xl)f(xr) > 0 // the root lies in the upper subinterval

set xl to xr and goto step 2

if f(xl)f(xr) = 0

xr is the root

10

How to select xr in step 2?

1. Bisection MethodGuess without considering the characteristics of

f(x) in (xl, xu)

2. False Position Method (Regula Falsi)Use "average slope" to predict the location of

the root

11

A.1. Bisection Method

• Each guess reduce the search interval by half

2lu

r

xxx

12

Bisection Method – Example

40)1(38.667

)( 146843.0 xex

xf

Find the root of f(x) = 0 with an approximated error below 0.5%. (True root: α=14.7802)

13

Example (continue)

n xl xr xu f(xl) f(xr) f(xu) f(xl)f(xu) εa

0 12 16 6.067 -2.269

1 12 14 16 6.067 1.569 -2.269 > 0

2 14 15 16 1.569 -0.425 -2.269 < 0 6.667%

3 14 14.5 15 1.569 0.552 -0.425 > 0 3.448%

4 14.5 14.75 15 0.552 0.0590 -0.425 > 0 1.695%

5 14.75 14.875 15 0.0590 -0.184 -0.425 < 0 0.840%

6 14.75 14.8125 14.875 0.0590 -0.0629 -0.184 < 0 0.422%

%100newr

oldr

newr

ax

xx

%219.0%1007802.14

8125.147802.14

t

14

Error BoundsThe true root, α, must lie between xl and xu.

xl xuxr

xl(1) xu

(1)

After the 1st iteration, the solution, xr(1), should be within an

accuracy of)1()1(

2

1lu xx

xr(1)

Let xr(n) denotes xr in the nth iteration

15

Error Bounds

xl(2) xu

(1)xu(2)

Suppose the root lies in the lower subinterval.

xr(2)

)1()1(2

)2()2(

2

1

2

1lulu xxxx

After the 2nd iteration, the solution, xr(2), should be

within an accuracy of

16

Error Bounds

)1()1()1()1(2

)()(

2

1...

2

1

2

1lun

nl

nu

nl

nu xxxxxx

In general, after the nth iteration, the solution, xr(n),

should be within an accuracy of

If we want to achieve an absolute error of no more than Eα

)(log

2

1

)1()1(

2

)1()1(

E

xxn

Exx

lu

lun

17

Implementation Issues• The condition f(xl)f(xr) = 0 (in step 3) is difficult to

achieve due to errors.

• We should repeat until xr is close enough to the root, but we don't know what the root is!

• One possible solution is to estimate the error as

and repeat until ea < es (acceptable error)

%100newr

oldr

newr

ax

xxe

18

Bisection Method (as C function)

// xl, xu: Lower and upper bound of the interval// es: Acceptable relative percentage error// iter_max: Maximum # of iterationsdouble Bisect(double xl, double xu, double es, int iter_max) { double xr; // Est. root double xr_old; // Est. root in the previous step double ea; // Est. error int iter = 0; // Keep track of # of iterations

xr = xl; // Initialize xr in order to // calculating "ea". Can also be "xu". do { iter++; xr_old = xr;

19

xr = (xl + xu) / 2; // Estimate root

if (xr != 0) ea = fabs((xr – xr_old) / xr) * 100;

test = f(xl) * f(xr);

if (test < 0) xu = xr; else if (test > 0) xl = xr; else ea = 0;

} while (ea > es && iter < iter_max);

return xr;}

20

Additional Implementation Issues

• Function call is a relatively slow operation.

• In the previous example, function f() is called twice in each iteration. Is it necessary?– We only need to update one of the bounds (see step 3

in the algorithm for the bracketing method).

21

Revised Bisection Method (as C function)

double Bisect(double xl, double xu, double es, int iter_max) { double xr; // Est. Root double xr_old; // Est. root in the previous step double ea; // Est. error int iter = 0; // Keep track of # of iterations double fl, fr; // Save values of f(xl) and f(xr)

xr = xl; // Initialize xr in order to // calculating "ea". Can also be

"xu". fl = f(xl); do { iter++; xr_old = xr;

xr = (xl + xu) / 2; // Estimate root fr = f(xr);

22

if (xr != 0)

ea = fabs((xr – xr_old) / xr) * 100;

test = fl * fr;

if (test < 0) xu = xr; else if (test > 0) { xl = xr; fl = fr; } else ea = 0;

} while (ea > es && iter < iter_max);

return xr;}

23

Additional Implementation Issues

• Testing if f(xl)f(xr) is positive or negative directly could result in underflow or overflow.

• How can we address this problem?– We can test if both the values have the same sign

24

Comments on Bisection Method

• The method is guaranteed to converge.

• However, the convergence is slow as we gain only one binary digit in accuracy in each iteration.

25

A.2. Regula Falsi Method• Also known as the false-position method, or linear

interpolation method.

• Unlike the bisection method which divides the search interval by half, regula falsi interpolates f(xu) and f(xl) by a straight line and the intersection of this line with the x-axis is used as the new search position.

• The slope of the line connecting f(xu) and f(xl) represents the "average slope" (i.e., the value of f'(x)) of the points in [xl, xu ].

26

rl

l

ru

u

xx

xf

xx

xf )()(

)()(

))((

ul

uluur xfxf

xxxfxx

27

False-position vs Bisection

• False position in general performs better than bisection method.

• Exceptional Cases:– (Usually) When the deviation of f'(x) is high

and the end points of the interval are selected poorly.

– For example,

3.1,0with

1)( 10

ul xx

xxf

28

Iteration xl xu xr εa (%) εt (%)

1 0 1.3 0.65 35

2 0.65 1.3 0.975 33.3 25

3 0.975 1.3 1.1375 14.3 13.8

4 0.975 1.1375 1.05625 7.7 5.6

5 0.975 1.05625 1.015625 4.0 1.6

Iteration xl xu xr εa (%) εt (%)

1 0 1.3 0.09430 90.6

2 0.09430 1.3 0.18176 48.1 81.8

3 0.18176 1.3 0.26287 30.9 73.7

4 0.26287 1.3 0.33811 22.3 66.2

5 0.33811 1.3 0.40788 17.1 59.2

Bisection Method (Converge quicker)

False-position Method

29

3.1,0with

1)( 10

ul xx

xxf

30

Summary• Bracketing Methods

– f(x) has the be continuous in the interval [xl, xu] and f(xl)f(xu) < 0

– Always converge– Usually slower than open methods

• Bisection Method– Slow but guarantee the best worst-case convergent

rate.

• False-position method– In general performs better than bisection method

(with some exceptions).

Roots of Equations

Documents

Transcript of Roots of Equations