IMM...IMM 20.11.2002 In tro duction to In terv al Analysis Ole Caprani Ka j Madsen Hans Bruun...

IMM

20.11.2002

Introduction to

Interval Analysis

Ole Caprani

Kaj Madsen

Hans Bruun Nielsen

0.25

0.5

0 0.25 0.5

0.25

0.5

0 0.25 0.5

Abstract. Interval analysis was introduced by Ramon Moore in 1959

as a tool for automatic control of the errors in a computed result, that

arise from input error, rounding errors during computation, and trunca-

tion errors from using a numerical approximation to the mathematical

problem. An important application is the enclosure of ranges of func-

tions, which enables us e.g. to solve equations with interval coeÆcients,

to prove existence of certain solutions, and to solve global optimization

problems. We discuss the basic concepts of interval analysis and present

methods for interval solution of some numerical problems.

Contents

1. Introduction 3

2. Interval Arithmetic 6

2.1. Real Intervals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

2.2. Interval Vectors and Matrices . . . . . . . . . . . . . 6

2.3. Real Interval Arithmetic . . . . . . . . . . . . . . . . . . 8

2.4. Floating Point Interval Arithmetic . . . . . . . . 9

3. Interval Algebra 11

4. Interval Functions 12

4.1. Interval Hulls . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

4.2. Interval Extensions . . . . . . . . . . . . . . . . . . . . . . . 13

4.3. Interval Extensions for Rational Functions 14

5. The Set of Intervals as Metric Spaces 16

5.1. Continuity of Interval Functions . . . . . . . . . . . 19

5.2. Lipschitz Continuity . . . . . . . . . . . . . . . . . . . . . . 20

5.3. Width of an Interval . . . . . . . . . . . . . . . . . . . . . . 22

6. Linear and Quadratic Interval Extensions 24

6.1. Linear Interval Extensions . . . . . . . . . . . . . . . . 25

6.2. Mean Value Forms . . . . . . . . . . . . . . . . . . . . . . . . 27

6.3. Quadratic Interval Extensions . . . . . . . . . . . . . 30

7. Accurate Computation of Interval Hulls 32

8. Interval Integration 37

8.1. Basic Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

8.2. Interval Simpson . . . . . . . . . . . . . . . . . . . . . . . . . . 40

9. Roots of Functions 44

9.1. Basic Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

9.2. Interval Version of Newton's Method . . . . . . 46

9.3. Multidimensional Newton's Method . . . . . . . 52

9.4. Krawczyk's Version of Newton's Method . . 57

10. Linear Systems of Equations 61

10.1. Gaussian Elimination . . . . . . . . . . . . . . . . . . . . . 62

10.2. Sign-Accord Algorithm . . . . . . . . . . . . . . . . . . . . 67

11. Global Optimization 73

11.1. Basic Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

11.2. Use of Gradient Information . . . . . . . . . . . . . . 76

References 79

Notation 81

Index 82

1.

Introduction

Ingeneraltherearethreesourcesoferrorinconnectionwithnumerical

computations:roundingerrors,truncationerrorsandinputerrors.For

estimationofthee�ectofsucherrorsitmaybeadvantageoustocompute

withintervalsofnumbers.Inthisintroductionwegivesomesimple

examples.Inthefollowingchapterswegiveamorestringentpresentation

ofintervalcalculation.

Example1.1.

Firstconsiderroundingerrors.Assumethatweworkwith3

signi�cantdigitsandnormalrounding,andwishtocompute

y

=

1�0:531+

0:531

2

2

Weget 0

:469+

0:282

2

;

0:469+0:141;

0:610

Howlargeistheerrorinthiscomputedresult?

Inthissimpleexampleitiseasyto�ndtheanswer:allwehavetodois

touseasuÆcientnumberofdigitsinthecomputation,

0:469+

0:281961

2

;

0:469+0:1409805;

0:6099805

Sotheerrorinthe�rstresultis0:610�0:6099805

=

0:00001950.

Itisnotalwayspossible,however,tousethisapproach.Theremaybe

toomanydigitstokeeptrackofordivisionsmaygivein�nitedecimal

fractionsthatneedrounding.Insteadwemustestimatetheerroror

keeptrackofitduringcomputation.Thelatterisourchoicewhenwe

calculatewithintervals.

Example1.2.

Withtheproblemfromthe�rstexample,thwvalueof0:531

2

iscontainedintheinterval[0:281;0:282],andweget

0:469+

0:531

2

2

20:469+[0:281;0:282]�1 2

�0:469+[0:140;0:141]=

[0:609;0:610]

1.Introduction

4

Thus,weknowthatthetruevalueofyisbetween0:609and0:610.During

computationtheendpointsoftheintervalswereroundedinthedirections

thatensuredthattheresultingintervalcontainedthetrueresult.

By

monitoringintervalsinthisspecialwaywecankeeptrackofrounding

errors.

Example1.3.

Next,welookattruncationerrors.Wewishtocompute

y=e�

0:531

,andusetheTaylorserieswithremainderterm,

ex

=

1+x+

x2

2!

et;

wheretisbetween0andx

Weareinterestedinthecasex<0,andsinceet2[0;1]forallt<0,we

get

ex

21+x+

x2

2!

[0;1]:

(1.1)

Withx=�0:531we�nd

e�

0:531

21�0:531+

0:531

2

2

[0;1]

20:469+[:140;0:141]�[0;1]

20:469+[0;0:141]=

[0:469;0:610]

Againwefoundanintervalthatissuretocontainthecorrectresult.The

computationshowsthatintervalcalculationsallowsussimultaneouslyto

keeptrackofroundingerrorsandtruncationerrors.

Example1.4.

Finally,toillustrateinputerrors,supposethatinsteadof

x=

�0:531weonlyknowthatx2[�0:532;�0:531].Wecanestimate

thee�ectofthisuncertaintyinindata:Equation(1.1)gives

ex

21+[�0:532;�0:531]+

[�0:532;�0:531]2

2

�[0;1]

2[0:468;0:470]+

[0:280;0:284]2

2

�[0;1]

2[0:468;0:470]+[0;0:142]=

[0:468;0:612]

Thecomputationshowsthatallthreesourcesoferrorcanbeincluded,and

theresultisanintervalthatcontainsallvaluesfexgforxintheinterval

[�0:532;�0:531].

5

1.Introduction

Theresultsinthelasttwoexamplesarenotimpressivewithrespectto

accuracy.However,wecancopewiththatbyincreasingthenumberof

digitstorepresenttheintervalendpoints,byincreasingthenumberof

termsintheTaylorseries,andbyusingspecialtechniquestocompute

mappings.Thesesubjectsaretreatedinthefollowingchapters.

2.

IntervalArithmetic

Westartthischapterwithamoreformaldescriptionofintervalsand

thenwespecifywhatwemeanby\calculatingwithintervals".The

treatmentfallsintwoparts.Firstweassumethatthemathematically

correctversionofthefourarithmeticoperationsisavailable.Next,we

becomerealistic{weuse oatingpointnumberstorepresenttheinterval

endpointsand oatingpointarithmetictodothecalculationswiththese

endpoints.

2.1.

RealIntervals

A

realintervalA

isthebounded,closedsubsetoftherealnumbers

de�nedby A

=

[a;a]=

fx2Rja�x�ag;

wherea;a2Randa�a.WeuseI(R)todenotethesetofsuch

intervals.AsingletonAisadegenerateinterval:a=a.

Forlateruseweintroducefoursimpleconcepts,themidpointm(A),the

radiusr(A),theupperboundjAjandthewidthw(A)oftheintervalA,

m(A)=

1 2(a+a);

r(A)

=

1 2(a�a);

jAj=

maxa2

A

jaj=

maxfjaj;jajg;

w(A)=

a�a=

2r(A):

(2.1)

NotethatifX

isasingleton,thenm(X)=X

andr(X)=0.

2.2.

IntervalVectorsandMatrices

AnintervalvectorVinI(Rn)hasncomponents(coordinates),eachof

whichisaninterval,Vi2I(R);i=1;:::;n.

Theconceptsde�nedin(2.1)havestraightforwardextensionstointerval

vectors.The�rstthreearede�nedcomponentwise,

7

2.IntervalArithmetic

m(V) i=

m(Vi);

r(V) i=

r(Vi);

jVj i=

jVij:

(2.2)

Thus,eachoftheseisavectorinRn.Weshallfurthermakeuseof

thewidthandthenorm

ofanintervalvector.Bothofthesearescalars,

w(V);kVk2R+.

w(V)=

2kr(V)k1

=

maxi

fw(Vi)g;

kVk=

kjVjk1

=

maxi

jVij:

(2.3)

Example

2.1.

LetV

=

([�3;1];[3;5])�I(R

2).Then

m(V)=

(1;4);

r(V)=(2;1);

jVj=(3;5);

w(V)=4;

kVk=5:

AnintervalmatrixM

inI(Rm�

n)hasm

rowsandncolumns.Each

elementisaninterval,Mij2I(R);i=1;:::;m;j=1;:::n,andagain

theconceptsfrom(2.1)arede�nedelementwise,whilethescalarsfrom

(2.3)generalizeto

w(M)=

maxi;j

fw(Mi;j)g;

kMk=

kjMjk1

=

maxi

fX j

jMijjg:

(2.4)

Example2.2.

LetM

=� [4;6]

[0;2]

[�2;0][2;4]� .

Thenm(M)=

� 51

�13

� ,

r(M)=

� 11

1

1� ,

jMj=

� 62

2

4� ,

w(M)=2,

kMk=8.

Example2.3.

LetY2I(Rm

)denotetheintervalvectorobtainedasY=

MV,whereM

2I(Rm�

n)andV2I(Rn).Itcanbeshownthatwiththe

de�nitionsin(2.3)and(2.4)itholdsthat

kYk�kMk�kVk:

Weshallusethisappraisalinchapters9and10.

2.3.RealIntervalArithmetic

8

Notetheequivalencewiththein�nitynorm

ofrealvectorsandmatrices.

ForV2Rn

andM

2Rm�

n

thenormskVk 1andkMk 1arede�nedasin

(2.3)and(2.4),respectively,andkMVk 1�kMk 1�kVk 1.

Nowwearereadytospecifywhatwemeanby\calculatingwithin-

tervals".Thenextsectiondiscussesarithmeticoperationswithsimple

intervals.

2.3.

RealIntervalArithmetic

Let

�denoteoneofthefourarithmeticoperators,+,�,�or=.For

A;B2I(R)wede�ne

A�B=

fa�bja2A

^b2Bg;

(2.5)

withA=B

unde�nedif02B.Thefourarithmeticoperationsarecon-

tinuousmappingsofR

2ontoRandAandBareboundedandclosed.

Therefore,alsoA�Bisaninterval;A�B2I(R).Thus,Equation(2.5)

de�nesfouroperatorsonI(R).Thede�nitionensuresthattheresult-

ingintervalA�Bcontainsallpossibleoutcomesfromapplying�with

operandsfromAandB.Thede�nitionisnotoperational,however;it

doesnotshowhowto�ndtheendpointsoftheintervalA�B,butitis

easyto�ndmaximumandminimumforthefourarithmeticoperations,

andthisleadsto

A+B

=

[a+b;a+b]

A�B

=

[a�b;a�b]

A�B=

[min(ab;ab;ab;ab);max(ab;ab;ab;ab))]

A=B

=

A�[1=b;1=b]:

(2.6)

Example2.4.

[0;1]+[1;2]=

[1;3],

[1;2]�[1;2]=

[�1;1],

[�3;2]�[1;2]=

[�6;4],

[1;2]=[3;4]=

[1=4;2=3].

Identifyingsingletonswithrealnumbersweseethattheoperationsare

extensionsoftheusualoperationsinR.Thereforewepermitourselves

towritee.g.3+[1;2]asshortfor[3;3]+[1;2].

9

2.IntervalArithmetic

Fromthede�nition(2.5)itfollowsimmediatelythat

A1�A2

and

B1�B2

)

A1�B1�A2�B2

(2.7)

Thispropertyofthefourintervaloperationsisimportant.Wesaythat

theoperationsareinclusionmonotonic.

2.4.

FloatingPointIntervalArithmetic

Whenperformingintervalarithmeticonacomputerwehavetorepresent

intervals,andwechoosetouse oatingpointnumbersfortheendpoints.

LetFdenotethesetof oatingpointnumbersandde�nethe oating

pointintervaloperationsby

(A�B)=

smallestintervalwithendpointsinF,

whichcontainsA�B:

(2.8)

Again,thisde�nitionisnotoperational.Itdependse.g.onthe oating

pointprocessorwhetheritispossibleto�ndthetwoendpoints,butthis

problemisoutsidethescopeofthisintroduction.

Thede�nition(2.8)ensuresthatalso oatingpointintervaloperations

areinclusionmonotonic.

Inthefollowingwedescribeanumberofmethodsusingrealintervals

andrealintervalarithmetic.Suchintervalcomputationsaredonewith

thehelpof oatingpointintervalarithmetic.Theresultingintervalsare

guaranteedby(2.8)tocontaintheintervalswewouldhavefoundbythe

equivalentrealoperations.Intheformulationofthemethodsweonly

needtouserealintervals,andthepracticalcomputationleadstoawider

interval.Thus,wehaveautomatedestimationofroundingerrors,but

{asindicatedbythefollowingexample{westillhavetobecautious

whenweformulatethealgorithms.

Example2.5.

FollowingRis(1972)itholdsthat

p332[5:744;5:745];

p292[5:385;5:386];

andcalculatingwithfoursigni�cantdigitsgives

p33+

p292[11:12;11:14];

p33�

p292[0:3580;0:3600]:

2.4.FloatingPointIntervalArithmetic

10

The�rstresultisquitegood,buttheotheristerrible.Itisthewellknown

numericalproblemaboutcancellationinconnectionwithsubtraction,and

asusualthecureistoreformulatethecalculation

p33�

p29=

4

p33+

p29

2

4

[11:12;11:14]2[0:3590;0:3598];

whichisamuchbetterresult.

Finally,considertherelationbetweenordinary oatingpointcalculation

and oatingpointintervalarithmetic.Ifwemakea oatingpointcalcu-

lation,howdoestheresultrelatetothecorrespondingcalculationwith

oatingpointintervalarithmetic?Toanswerthatquestion,remember

that

(a�b)=

oneofthetwonumbersinF;

whichareclosesttoa�b;

(2.9)

unlessa�b2F,inwhichcase (a�b)=

a�b.Combiningthisresult

with(2.8)weget

(a�b);a�b2 (A�B)

fora2A;b2B:

Therefore,theintervalobtainedby oatingpointintervalarithmeticcon-

tainsboththemathematicallytrueresultandtheresultwegetbyordinary

oatingpointcalculation.

Thisexplainstheresultintheaboveexample.Ifthe oatingpointcalcu-

lationisunstable,i.e.hasalargeroundingerror,thenthe oatingpoint

intervalarithmeticresultsinwideintervals.Theoppositeisunfortu-

natelynotthecase,andthisisprobablythereasonwhysomepeople

saythatintervalarithmeticistoopessimistic.

3.

IntervalAlgebra

InthischapterwelookattherulesthatarevalidforI(R)withthefour

associatedintervaloperations.

Itiseasytoprovethat+

and�arebothcommutativeandassociative,

ForA;B;C2I(R):

A+B

=

B+A;

(A+B)+C

=

A+(B+C);

A�B=

B�A;

(A�B)�C=

A�(B�C):

Moreover[0;0]=0and[1;1]=1areneutralwithrespecttoaddition

andmultiplication,respectively,

A+0=

0+A

=

A;

A�1=

1�A=

A:

Anondegenerateintervalhasnoinversewithrespectto

+

and

�

sinceeitherofthesewithanondegenerateoperandcannotresultinthe

neutralelement,whichisasingleton.

Thedistributiveruleisnotvalidingeneral.

Example3.1.

Asacounterexampleconsider

[1;2]�([1;2]�[1;2])=

[1;2]�[�1;1]=

[�2;2];

[1;2]�[1;2]�[1;2]�[1;2]=

[1;4]�[1;4]=

[�3;3]:

Insteadthesocalledsub-distributivityholds,

A(B+C)�AB+AC:

Theproofofthisisclearfromthede�nition(2.5).

Thelackofvalidityofthedistributiveruleoftencausestroublewhen

formulatingintervalcalculations.Itissounusualnotbeingallowedto

multiplyintoaparenthesis.

Thedistributiveruleisvalidforspecialintervals,e.g.

t(A+B)=

tA+tB

fort2R:

AverydetailedanalysisofotherspecialcasescanbefoundinRis(1972).

4.

IntervalFunctions

Intheintroductionwesawtheuseofintervalcalculationto�ndupper

andlowerboundsfortherangeoftheexponentialfunctionoveragiven

domain.Thisapplicationofintervalsandintervaloperationsisthesub-

jectofthischapterandmoretocome.Itisofinterestbyitselftobe

abletoestimateboundsoftherangeofafunction,buttheconceptspre-

sentedinthischapterwillproveusefullaterwhenweformulatemethods

forintegration,root-�ndingandoptimization.

4.1.

IntervalHulls

Considerthefunctionf:Rn

7!Rm

de�nedonadomainA�I(Rn),cf.

Section2.2.TheimageofAis

f(A)=

(f1(A);:::;fm(A))=

ff(x1;:::;xn)jxi2Aig:

Iffiscontinuous,thenthissetisclosedandcompact.Itisobviousthat

f(X)�f(A)forallX�A.Unfortunatelytheimageisnormallynot

anintervalvector.Thisisillustratedby

Example4.1.

Withf:R

2

7!R

2

de�nedbyy1

=

x1+x2,y2

=

x2 1,the

imagef([0;1];[0;1])istheshadedregionshownbelow.

1 01

2

Figure4.1.

Obviously,wecannotgivesucha�gureastheresultofacalculation.

Thereforeweenclosetheimageinanintervalvector,andusethisas

theresult.Thesmallestintervalvectorthatenclosesf(X)iscalledthe

intervalhulloffanddenotedf.Thus,

f(X)�f(X)

forX

�A:

(4.1)

13

4.IntervalFunctions

Inthespecialcaseofarealfunction,i.e.m=1,theimageisaninterval

inR,andinthiscase(4.1)holdswithequality.

Iffisoneofthefourarithmeticoperations,f(a;b)=

a�bthenthe

intervalhulloffisthecorrespondingintervaloperation,f(A;B)=

A�B.

4.2.

IntervalExtensions

AmappingF

:I(Rn)7!I(Rm)iscalledanintervalfunction.The

intervalhullofafunctionisanexampleofanintervalfunction.

IfwehaveafunctionfasaboveandanintervalfunctionFanditholds

that

f(X1;:::;Xn)�F(X1;:::;Xn)

forXi�Ai;

(4.2)

thenFissaidtobeanintervalextensionoff.Inparticular,theinterval

hullisanintervalextension.

Example4.2.

Wegivethreeexamplesofintervalextensions

f(x)=sinx;F(X)=[�1;1]

f(x)=x(1�x);F(X)=X�(1�X)

f(x1;x2)=(x1+x2;x

2 1);F(X1;X2)=(X1+X2;X1�X1)

The

last

function

was

also

treated

in

Example

4.1.

We

�nd

F([0;1];[0;1])

=

([0;2];[0;1]).

This

is

the

smallest

intervalthat

enclosestheimage,cf.Figure4.1,so

inthiscaseF(X)=f(X).

Figure4.2illustratesthesecond

ex-

ampleon

theintervalX

=

[0;

1 2].

Wegivethegraph

ofthefunction

f(x)=x(1�x)and

thesmallestbox

thatcontainsallpoints(x;y)forwhich

x2X

andy2F(X)=

[0;

1 2].

The

intervalhullisf(X)=[0;1 4

].

0.5 0

0.5

Figure4.2.

4.3.ExtensionsforRationalFunctions

14

4.3.

IntervalExtensionsforRationalFunctions

Forrationalfunctionsthereisasimple,generalmethodforobtainingan

intervalextension:Firstwritearationalexpressionde�ningtherational

function.Next,simplyreplacetherealoperationsbyintervaloperations

andthevariablesbyintervals.

Example4.3.

Wegivetwoexamples,

f(x)=

1�x

2

3+x2

;F(x)=(1�X�X)=(3+X�X)

f(x1;x2;x3)=x1�x2x3

;F(X1;X2;X3)=X1�X2�X3

Thereasonwhywecanusethissimplemethodtogetanintervalexten-

sionliesinthede�nition(2.5)ofintervaloperations.Wecanalsouse

theinclusionmonotonicitytoshowthatFisanintervalextension:Let

x=(x1;:::;xn)2(X1;:::;Xn),then

F([x1;x1];:::;[xn;xn])�F(X1;:::;Xn);

andsincef(x1;:::;xn)=

F([x1;x1];:::;[xn;xn]),itfollowsthat

f(x1;:::;xn)2F(X1;:::;Xn)andtherefore

f(X1;:::;Xn)�F(X1;:::;Xn):

Byuseofthealgebraicrulesforrealoperatorsarationalexpressioncan

beformulatedinmanyways.Note,however,thatwecannotmakethe

samereformulationsofintervalexpressions,e.g.becauseoftheinvalidity

ofthedistributiverule.Thisimpliesthatdi�erentformulationsofa

rationalfunctionwillleadtodi�erentintervalextensions.

Example4.4.

Consider

f(x)=

1�x+x

2�x

3+x

4�x

5

=

(1�x

6)=(1+x)

=

(1�x)(1+x

2+x

4)

15

4.IntervalFunctions

ontheintervalA=[2;3].Itiseasytoshowthatf0(x)<0forallx,and

therefore f

(A)=

[f(3);f(2)]=

[�182;�21]:

Thethreeformulationsoffabovegiverespectively

F1(A)=[�252;49],

F2(A)=

[�242

2 3;�15

3 4]andF3(A)

=

[�182;�21],i.e.,verydi�er-

entintervals.Laterweshallseethatamongthemanypossibleinterval

extensionssomestandoutasparticularlygood.

Withaneyeontheexampleonecouldaskwhetheritisalwayspossibleto

reformulatetherationalfunctionsothattheintervalextensionisequal

totheintervalhull.Alas,thisisnotthecase.Asacounterexample

considerarationalfunctionofonevariablede�nedonanintervalwith

rationalendpoints,andwithirrationalupperand/orlowerlimitofthe

image,e.g.x3�xon[0;1].Thenitisnotpossibleto�ndtheinterval

hullbya�nitenumberofrealoperationsontheendpoints.Thisshows

thatifwewishtocomputetheintervalhulltoarbitraryaccuracy,then

weneedmoretoolsthanjustthefourintervaloperations.Suchtoolsare

describedlater.

Inallimplementationsofintervalarithmeticyouhaveaccesstointerval

extensionsofextendedrationalfunctions.Thisisthesetoffunctions,

wherewecancomputetheexactintervalhull,andconsistoffunctions,

wherethesimplearithmeticoperatorsarecombinedwithstandardfunc-

tionslikesin(x),ex,etc.

5.

TheSetofIntervalsasMetricSpaces

Noteveryintervalextensionisequallyattractive.Insomesituations

wewishthattheextensionis\close"totheintervalhull,buttobe

meaningful,wemustbeabletomeasurethe\distance"betweentwo

intervals.Inotherwords,wehavetoorganizethesetofrealintervals

andthesetofrealintervalvectorsasmetricspaces.

First,wehavetode�neadistancefunction.ForA;B2I(R)wede�ne

thedistancebetweenthetwointervalsas

d(A;B)=

maxfja�bj;ja�bjg:

(5.1)

ItiseasytoshowthatdisametricinI(R),i.e.,itsatis�es

d(A;B)=

0

ifandonlyifA=B

d(A;B)�0

d(A;B)=

d(B;A)

symmetry

d(A;B)�d(A;C)+d(C;B)

insertionrule

(5.2)

Thede�nitionofddoesnotindicatehowfaranelementinAcanbe

from

anelementinB,asmeasuredbytheusualdistanceinR.This

isofinterest,e.g.whenAisacomputedintervalandB

isthedesired

interval,andweknowthatd(A;B)�",where">0.Inthatcase,how

farisanindividualelementinAfromtheelementsinB?

Inorderenableananswertosuchquestionswereformulatethede�nition

(5.1)to

Lemma5.1.LetA;B2I(R).Then

d(A;B)=

max

� max

a2A

fminb2B

ja�bjg;max

b2B

fmina2A

ja�bjg

Proof.

The�rstminimumcanbesimpli�edasfollows

minb2B

ja�bj=

8 < :0

if

a2B

ja�bjif

b<a

forallb2B

ja�bjif

b>a

forallb2B

17

5.MetricSpaces

andthemaximumwithrespecttoais

max

a2A

fminb2B

ja�bjg=

8 > > < > > :

0

if

A�B

j a�bj

if

A�B

ja�bj

if

A�B

maxfja�bj;ja�bjgif

A�B

ThenotationA�B

(respectivelyA�B)meansthatthereexistsno

a2Asothata<b(respectivelya>b)anda2jB.

Similarlyweget

mina2A

ja�bj=

8 < :0

if

b2A

ja�bjif

a<bforalla2A

ja�bjif

a>bforalla2A

and

max

b2B

fmina2A

ja�bjg=

8 > > < > > :

0

if

B�A

ja�bj

if

B�A

ja�bj

if

B�A

maxfja�bj;ja�bjgif

B�A

Finally,wecombineallthepossibilities,takingmaximumandignoring

thecasesthatexcludeeachother.Thisleadstomaxfja�bj;ja�bjg,

whichisthede�nition(5.1)ofthedistancebetweenAandB.

Wecanusethislemmatoansweroneofthequestionsweraisedatthe

beginningofthischapter:

Lemma5.2.LetA;B2I(R)and">0.Thenthestatement

1Æ

d(A;B)�"

isequivalentwith

2Æ

(�)

foralla2Athereexistsb2Bsuchthatja�bj�"

(�)

forallb2Bthereexistsa2Asuchthatja�bj�"

5.MetricSpaces

18

Proof.

WeneedLemma5.1,and�rstlookat1Æ

)2Æ.Sinced(A;B)�

",thenminb2

B

ja�bj�"foralla2Aandmina2

A

ja�bj�"forallb2B.Thus,

1Æ

implies(�)and(�).

Next,toshowthat2Æ

)1Æ,assumption(�)impliesthatfor�xedait

holdsthatminb2

B

ja�bj�",andthisinequalityisalsotruewhenwemax-

imizeovera2A.Similarly,weuseassumption(�)toensurethatthe

secondnumberinthemaximumexpressioninLemma5.1isboundedby

",andtherefored(A;B)�".

ForintervalvectorsA=(A1;:::;An)andB=(B1;:::;Bn)wede�ne

thedistanceas

d(A;B)=

maxi

d(Ai;Bi):

(5.3)

Thus,alsoI(Rn)isorganizedasametricspace.Itissimpletoextend

Lemma5.2tointervalvectors:

Lemma5.3.LetA;B2I(Rn)and">0.Thenthestatement

1Æ

d(A;B)�"

isequivalentwith

2Æ

(�)

foralla2Athereexistsb2Bsuchthatka�bk 1�"

(�)

forallb2Bthereexistsa2Asuchthatka�bk 1�"

Proof.

WeuseLemma5.2andde�nition(5.3).Theassumption

d(A;B)�"impliesthatd(Ai;Bi)�"foralli,andthus,forallai2Ai

thereexistb i2Bisuchthatja i�bij�".Therefore,foralla2Athere

existsb2B

suchthatka�bk 1�".Implication(�)isprovedthesame

way,and2Æ

)1Æ

isequallyeasy.

19

5.MetricSpaces

5.1.

ContinuityofIntervalFunctions

Lemmas5.2and5.3quantifyintermsoftheintervalelements,whatis

meantbysayingthattwointervalsareclose.Now,wewillusethemetric

conceptstode�nelimitsandcontinuityinI(Rn).

A

sequenceofintervalsfX(s)ginI(Rn)issaidtoconvergetoX

if

d(X(s);X)!0fors!1.Fromthede�nition(5.3)itfollowsthatthis

meanscomponentwiseconvergence,i.e.d(X(s)i;Xi)!0,andfromthe

de�nition(5.1)ofthescalardistancethismeansthatx(s)i!xiand

x(s)i!xi,i.e,theendpointsconvergetotheendpointsofthelimit

interval.

AnintervalfunctionF

:I(Rn)7!I(Rm)iscontinuousforX�A

2I(Rn)if li

ms!

1

F(X(s))=

F(X)

for

lims!

1

X(s)

=

X

:

(5.4)

ConstantintervalfunctionsandmappingsofI(Rn)intoI(Rm),where

eachcoordinateinX2I(Rn)iseithereliminatedorduplicatedarecon-

tinuous.Sincecompositemappingsingeneralmetricspacesarecontinu-

ous,weonlyneedtoshowthatthefourintervaloperationsarecontinuous

inordertoverifythatrationalintervalfunctionsarecontinuous.This

isdonewhenwehaveprovedthefollowinggeneraltheorem

aboutthe

intervalhullforacontinuousfunction,

Theorem

5.1.IffisacontinuousmappingofRn

intoRm,de�ned

onanintervalA�Rn,thentheintervalhullfisacontinuousinterval

functionfromI(Rn)intoI(Rm).

Proof.

Wehavetoshowthatthecontinuitycondition

kf(u)�f(v)k1

<"

foru;v2Aandku�vk 1<Æ

impliesthatd(f(U);f(V))<"forU;V�Aandd(U;V)<Æ.

First,weconsidertheimagesf(U)andf(V).Theyarecompact,and

themappingsoftheseundertheprojectionofRnontotheithcoordinate

isalsocompact,whichmeansthatYi=(f(U))iandZi=(f(V))iare

5.2.LipschitzContinuity

20

intervalsinR.Theseintervalsarewhatwegetbytakingtheithcoor-

dinateoftheintervalhull,Yi=(f(U))i,Zi=(f(V))i.Therefore,for

yi2Yithereexistsu2Usuchthatyi=f(u) i.Sinced(U;V)<Æ,Lemma

5.3guaranteestheexistenceofv2V

withku�vk 1<Æandtherefore

kf(u)�f(v)k1

<".Thenumberz iisanelementinZi,andwehave

shownthatcorrespondingtoyithereexistsanelementz iinZisuchthat

jy i�zij<".

Startingwithz iandproceedingthesamewaywegetthesameinequality,

andLemma5.2showsthatd(Yi;Zi)<"whend(U;V)<Æ.Thisholds

foreveryi,andwecandeducethatmaxid(Yi;Zi)=d(f(U);f(V))<".

ThefourordinaryoperationsarecontinuousmappingsofR

2

intoR,

andthereforethetheoremimpliesthatthefourintervaloperationsare

continuous.Further,thisisseentoimplythecontinuityofanyinterval

functionobtainedbythemethodofSection4.3.Inparticular,ifX(s)!

xwithx2Rn,thend(F(X(s));F(x))!0,andsincef(x)=F(x)for

theseintervalextensions,itfollowsthat

d(F(X(s));f(x))!0

for

s!1:

(5.5)

Inwords:ifX(s)isasmallargumentintervalcontainingx,thenF(X(s))

isclosetothefunctionvaluef(x).

The�rstfunctioninExample4.2showsthatthispropertydoesnothold

forallintervalextensions,eveniftheyarecontinuous.Theextension

ofsin(x)toF(X)=[�1;1]isevidentlycontinuousanditdoesnot

satisfy(5.5).

Inthenextchapterweshallseethatwithcertaintypesofintervalex-

tensionsitispossibletoquantifythespeedofconvergencein(5.5).This

isbasedonanestimateofd(F(X);f(X)).

5.2.

LipschitzContinuity

Nowwelookatapresumptiononintervalfunctionswhichisstronger

thancontinuity,viz.Lipschitzcontinuity.

Anordinaryfunctionf

:

Rn

7!Rm

de�nedonanintervalAissaidtobeLipschitzcontinuousif

21

5.MetricSpaces

thereexistsanumberK>0suchthat

d(f(x);f(y))�Kd(x;y)

for

x;y2A:

(5.6)

Thisde�nitionistransferredliterallytointervalfunctions:Aninterval

functionFfromA2I(Rn)intoI(Rm)issaidtobeLipschitzcontinuous

ifthereexistsanumberK>0suchthat

d(F(X);F(Y))�Kd(X;Y)

for

X;Y�A:

(5.7)

Regardingtheintervalhullwehave

Theorem

5.2.

Ifthefunctionf:A�Rn

7!Rm

isLipschitzcon-

tinuous,thenthesameistruefortheintervalhullf:A2I(Rn)7!

I(Rm).

Proof.

Wehavetoconsiderd(f(X);f(Y))forX;Y2A,andaswe

didbefore,welookattheithcoordinate.UsingLemma5.1weget

d(f(X) i;f(Y) i)=

max

� max

x2

Xfminy2

Y

jf(x) i�f(y) ijg;

maxy2

Yfminx2

X

jf(x) i�f(y) ijg:

Hereweusedthatforeveryelement�2f(X)thereisanx2Xsuchthat

�i=f(x) i.Next,fbeingLipschitzcontinuousimpliesthat

jf(x) i�f(y) ij�Kjxi�yij.ThefactorK

goesoutsideallthemaxesand

min

s,andanotherapplicationofLemma5.1leadsto

d(f(X) i;f(Y) i)�Kd(Xi;Yi)�Kd(X;Y):

Thisistrueforalli=1;:::;m

andthetheoremfollows.

Exceptingdivisionbyzero,thefourarithmeticoperationsareLipschitz

continuousfunctions.Therefore,thecorrespondingintervaloperations

areLipschitzcontinuous,andthroughthesamestepsthatwereusedto

showthatrationalintervalfunctionsarecontinuous,wecanshowthat

theyarealsoLipschitzcontinuous.Allweneedis

5.3.Width

22

Theorem

5.3.

IfFandGareLipschitzcontinuousintervalfunc-

tions,thenthecompositefunctionFÆGisLipschitzcontinuous.

Proof.

d(F(G(X));F(G(Y)))�K1d(G(X);G(Y))�

K1K2d(X;Y)withK1;K2>0,andthetheoremisproven.

Inconclusion,rationalintervalfunctionsareLipschitzcontinuous.In

particular,thisistruefortheintervalextensionsfromSection4.3.

5.3.

W

idthofanInterval

The�nalbasicmetricconcepttoconsideristhewidthofaninterval

(vector).Thiswasde�nedinChapter2,

w(A)=

(a�a

ifA2I(R);

maxifw(Ai)gifA2I(Rn):

ForA�B

thereisthefollowingimportantrelationbetweenthewidth

andthedistancede�nedinLemma5.1,

1 2(w(B)�w(A))�d(A;B)�w(B)�w(A):

(5.8)

Further,forA;B;C;D2I(R),a2Rwehavethefollowingrelationsfor

thewidth(thenormkAkisde�nedin(2.3))

w(A+B)=

w(A)+w(B);

w(A�B)=

w(A)+w(B);

w(aA)=

kakw(A);

maxfkBkw(A);kAkw(B)g�w(A�B)

�kBkw(A)+kAkw(B);

w(A=B)�

1 bb

� kBkw(A)+kAkw(B)� ;

(5.9)

andforthedistance

23

5.MetricSpaces

d(A+C;B+C)=

d(A;B);

d(A+B;C+D)�d(A;C)+d(B;D);

d(A�B;C�D)�d(A;C)+d(B;D);

d(A�B;C�D)�kAkd(B;C)+kCkd(A;D);

d(1=A;1=B)�

1

kAkkBk

d(A;B):

(5.10)

Weomittheproofoftheserelations.Itispurelytechnicalandrather

tedious.

Havingintroducedthewidthwecanproveanimportantpropertyofthe

intervalhullofaLipschitzcontinuousfunction,

Theorem

5.4.LetfbetheintervalhullofaLipschitzcontinuous

functionontheintervalA.Then,forX�AthereexistsK>0such

that

w(f(X))�Kw(X):

Proof.

Thereexistu;v2X

sothat

w(f(X))=

maxiw(f(X) i)

=

maxijf(u) i�f(v) ij

�K

maxijui�vij�Kw(X):

Againweexploitedthatallelementsintheithprojectionofthehullare

mappingsofelementsinX,andinparticularthisistruefortheend

pointsoff(X).

NotallLipschitzcontinuousintervalfunctionssatisfytheinequalityin

Theorem5.4.Thisimpliesthatimagesofsingletonsaresingletons,and

F(X)=[�1;1]isasimplecounterexample.

6.

LinearandQuadraticInterval

Extensions

Inthischapterwelookatdi�erenttypesofintervalextensions,and

assesstheirquality.Westartwithanexampletointroducetheconcepts

thatweshallworkwith.

Example6.1.

Therationalfunctionf(x)=x(1�x)hasanintervalexten-

sionF(X)=X(1�X).Weconcentrateontheintervals

X

=

[1 4

�r;1 4

+r]

for

0�r�1 4

;

andfrom

(5.5)weknowthatF(X)!f(1 4

)=

3 16

forr!0.Howfastis

theconvergence?To�ndtheanswer,wemakethefollowingcalculation,

F([1 4

�r;1 4

+r])=

[1 4

�r;1 4

+r]�(1�[1 4

�r;1 4

+r])

=

[3 16

+r

2�r;

3 16

+r

2�r]

=

3 16

+r

2+[�r;r]

ThefunctionfisincreasingonX,soitiseasyto�nditsimage,

f(X)=

[f(1 4

�r);f(1 4

+r)]=

3 16

�r

2+

1 2[�r;r];

andthedistancebetweenF(X)andf(X)is

d(F(X);f(X))=

1 2r+2r

2

=

O(r):

Weshallseethatthisisgenerallyvalid:Intervalextensionsobtainedsim-

plybyreplacingrealargumentsbytheirintervalshave\errors"thatgoto

zerolinearlywiththewidthoftheintervals.

Bymeansofthemeanvaluetheorem

wecangetabetterapproximation

tof.Forx2X

thetheorem

saysthat

f(x)=

f(1 4

)+(x�1 4

)f0(�);

forsome�betweenxand1 4

.Sincef0(�)=1�2�,weseethat

f(x)2FM

(X)�f(1 4

)+(X�1 4

)(1�2X);

i.e.,FM

isanintervalextensionoffontheintervalX.Asregardsthe

di�erencebetweenFM

(X)andf(X),we�nd

FM

([1 4

�r;1 4

+r])=

3 16

+[�r;r]�[

1 2

�2r;

1 2

+2r]

=

3 16

+1 2[�r;r]+2[�r

2;r

2];

25

6.IntervalExtensions

andd(FM

(X);f(X))=3r

2,whichisbetterthand(F(X);f(X))forsmall

valuesofr.WeshallseethatingeneralF(X)hasan\error"O(w(X)),

whilethe\error"withFM

(X)isO(w(X)2).

6.1.

LinearIntervalExtensions

IfF

isanintervalextensionoffonanintervalA�I(Rn)andthere

existsaK>0,independentofX,sothat

d(F(X);f(X))�Kw(X)

forallX

�A;

(6.1)

thenwesaythatF

isalinearintervalextension.Itisobviousthat

alinearintervalextensionsatis�estheconvergencecriterion(5.5)for

w(X)!0.

The�rst,simpleresultaboutlinearintervalextensionsis

Theorem

6.1.

IfF

isalinearintervalextensionofaLipschitz

continuousfunctionf,thenthereexistsaconstantK>0suchthat

w(F(X))�Kw(X):

Proof.

Wemakeuseoff(X)�F(X),Equations(5.8)and(6.1),and

Theorem5.4,

w(F(X))�2d(F(X);f(X))+w(f(X))

�2K1w(X)+K2w(X)=

Kw(X):

Next,welookatcompositefunctions,

Theorem

6.2.IfFandGarelinearintervalextensionsofLipschitz

continuousmappingsf:A�Rn7!Rm

andg:F(A)�Rm

7!Rp,re-

spectively,thenFÆGisalinearintervalextensionoffÆg.

Proof.

LetX�Aandusetheinsertionruletoget

6.1.LinearExtensions

26

d(G(F(X));fÆg(X))�

d(G(F(X));g(F(X)))+d(g(F(X));g(f(X)))

+d(g(f(X));g(f(x)))+d(g(f(x));fÆg(X));

wherexisanarbitrarypointinX.Wenowestimateeachofthefour

terms,

d(G(F(X));g(F(X)))�K1w(F(X))�K1K2w(X):

HereweusedthatGisalinearintervalextensionandthatFsatis�esthe

assumptionsofTheorem6.1.InthesecondtermweusetheLipschitz

continuityofgandthelinearityofF,

d(g(F(X));g(f(X)))�K3d(F(X);f(X))�K3K4w(X):

Inthethirdtermwemakeuseofg(f(x))2g(f(x)),w(g(f(x)))=0,the

relation(5.8)betweenwandd,andTheorem5.4,

d(g(f(X));g(f(x)))�w(g(f(X)))

�K5w(f(X))�K5K6w(X):

Finally,fÆgisLipschitzcontinuousandweapplythesametechniqueto

thelastterm,

d(g(f(x));fÆg(X))�w(fÆg(X))�K7w(X):

CombiningtheseestimatesweseethatFÆGsatis�es(6.1),andtheproof

is�nished.

Constantsintheformofsingletonsandthefourintervaloperationsare

linearintervalextensions.Inbothcasesthisisbecauseweworkwith

intervalhulls.Alsomappingsthatduplicatevariablesoreliminatethem

arelinearintervalextensions.Thisimpliesthattherationalinterval

functionstreatedinSection4.3arelinearintervalextensionsofextended

rationalfunctions.

Insomecasesthede�nition(6.1)ofalinearintervalextensionmaybe

reformulatedto:

27


ThereexistsE2I(Rn)with02Eandw(E)�Kw(X)suchthat

F(X)=

f(X)+E:

(6.2)

Inwords,everypointinF(X)isthesumoftwovectors,oneofwhichis

intheintervalhullandtheotherhaslimitedlength.

6.2.

MeanValueForms

Untilnowwehaveseenonegeneralmethodforconstructingintervalex-

tensions,viz.intervalextensionsforrationalfunctions.Now,weshallsee

howthemeanvaluetheoremcanbeusedtoconstructintervalextensions

foramuchwiderclassoffunctions,thedi�erentiablefunctions.

Considerf:Rn

7!R,whichisdi�erentiableinA�Rn.Forsucha

functionthemeanvaluetheoremsays

f(x)=

f(^x)+

n X i=1

@f

@xi

(�)(xi�^xi);

with^x;x2Aand�isapointonthelinesegmentbetween^xandx.Let

F0

beanintervalextensionofthegradient

f0

=

� @f

@x1

;:::;

@f

@xn

� ;

andletX�Abeanintervalthatcontainsboth^xandx.Then

f(x)2f(^x)+F0(^x)�(X�^x);

wherethemultiplicationinthelastterm

istheinnerproductoftwo

intervalvectors.Inthisreformulationwegotridof�,andtheexpression

onlyinvolvesintervalsandintervaloperations.Inorderforthistobe

anintervalextensionoffwemustaccompanytheintervalX

witha

speci�cationofthechoiceof^x.

Ifwechoose^x=m(X),themidpointofX,wesaythattheresulting

intervalfunction

FM

(X)=

f(m(X))+F0(X)�(X�m(X))

(6.3)

isameanvalueform.

6.2.MeanValueForms

28

Example6.2.

ForX

=[�r;r]withr>0ameanvalueformofthefunction

cosis F

M

(X)=

cos(0)+[�1;1]�(X�0)=

1+[�1;1]�X;

sincecos0(�)2[�1;1],andthisintervalconstantisthereforeaninterval

extensionofcos0.FM

isclearlyalinearintervalextension.

Themeanvalueformisnotconstructedsolelybythefourintervalopera-

tions,butinvolvesgettingthemidpointetc.Therefore,itisnotobvious

thatFM

isinclusionmonotonic,butthenexttheoremguaranteesthat

FM

hasthispropertyprovidedthatF0

does.

Theorem

6.3.IfF0

isaninclusionmonotonicintervalextensionof

f0

thenFM

isaninclusionmonotonicintervalextensionoff.

Proof.

LetX;Y�I(Rn)withX�Y.WehavetoshowthatFM

(X)�

FM

(Y).Toeasethenotationweintroducex=m(X)andy=m(Y),and

startbyusingthemeanvaluetheoremonfandtheinclusionmonotonic-

ityofF0, F

M

(X)=

f(x)+F0(X)(X�x)

=

f(y)+f0(�)(x�y)+F0(X)(X�x)

�f(y)+f0(�)(x�y)+F0(Y)(X�x)

with�2Y.Werewritethecurrentresultto

FM

(X)�f(y)+

n X i=1

� f0 i(�)(xi�yi)+F0 i(Y)(Xi�xi)� ;

andlookattheithcontributiontothesum,usingthatf0 i(�)2F0 i(Y),

xi=m(Xi),andyi=m(Yi):

f0 i(�)(xi�yi)+F0 i(Y)(Xi�xi)

=

f0 i(�)(xi�yi)+jF0 i(Y)jw(Xi)[�1 2;1 2]:

Itiseasilyseenthat

jf0 i(�)(xi�yi)j�jF0 i(Y)j(w(Yi)�w(Xi))=2;

29


andtherefore

f0 i(�)(xi�yi)+jF0 i(Y)jw(Xi)[�1 2;1 2]

�jF0 i(Y)j(w(Yi)�w(Xi)+w(Xi))[�1 2;1 2]

=

jF0 i(Y)jw(Yi)[�1 2;1 2]=

F0 i(Y)(Yi�yi):

Nowthetheoremfollowsimmediately.

Ifweuseagoodintervalextensionforf0

wecangetbetterresultsthan

illustratedinExample6.2.Thisisthesubjectofthenextsection,but

beforethatwegivetwoexamples.

Example6.3.

Asinthepreviousexampleweconsiderf(x)=

cos(x)on

X

=

[�r;r];r>0.Thederivativeiscos0=

�sin,andwecangetan

intervalextensionforcos0byusingameanvalueform

for�sin,i.e.

�sin(x)=

�sin(0)�(x�0)sin0(�)

=

�xcos(x)2X�[�1;1]:

Therefore,cos(x)21+X�X�[�1;1]:

ComparingthiswiththeTaylorexpansionofcos(x)around^x=0,cos(x)=

1�

1 2x

2+O(x

4);weseethatwehaveobtainedaquadraticapproximation

totheintervalhullofcosontheintervalX.

Example6.4.

AsinExample4.2weconsiderf(x)=x(1�x)ontheinterval

X

=[0;

1 2].Sincef0(x)=1�2x,wecanuseF0(X)=1�2X

=[0;1],

andwithm(X)=

1 4

,f(1 4

)=

3 16

weget

FM

(X)=

3 16

+[0;1][0;

1 2]=

[�

116

;7 16]:

Actually,F0(X)=f0(X),andtheideafrom

thepreviousexamplewould

giveusthesameintervalextensionoff0.InExample4.2wefoundF(X)=

[0;

1 2],andFM

(X)isseentobeabetterapproximationtotheinterval

hullf(X)=[0;1 4

].

Obviously,wecanextendtheresultstofunctionsf:Rn7!Rm

byusing

themeanvaluetheoremseparatelyoneachcoordinatefunction.

6.3.QuadraticExtensions

30

Finally,itshouldbementionedthatwhenweusethemeanvalueform

inpracticeonacomputer,thentherealvectorf(m(X))shouldberep-

resentedbyanintervalvectorwithboundsontheroundingerrorscon-

nectedwiththeevaluationoffatthepointm(X),i.e.f(m(X))should

berepresentedbyF([m(X);m(X)]).

6.3.

QuadraticIntervalExtensions

IfF

isanintervalextensionoffonanintervalA�I(Rn)andthere

existsaK>0,independentofX,sothat

d(F(X);f(X))�Kw(X)2

forallX

�A;

(6.4)

thenwesaythatF

isaquadraticintervalextension.Thisissimilar

tothede�nitioninSection6.1.Theadvantageofaquadraticinterval

extensioniswhenw(X)�1,inwhichcaseFisalmostindistinguishable

fromtheintervalhull.

ForquadraticintervalextensionsofLipschitzcontinuousfunctionswe

haveatheoremsimilartoTheorem6.1,

Theorem

6.4.IfFisaquadraticintervalextensionofaLipschitz

continuousfunctionf,thenthereexistsaconstantK>0suchthat

w(F(X))�Kw(X):

Proof.

SimilartotheproofofTheorem6.1.

Themostimportantexampleofaquadraticintervalextensionisviathe

meanvalueformwithF0

chosenasalinearintervalextension.

Theorem

6.5.Letf:A2I(Rn)7!Rbedi�erentiableandassume

thatthederivativesareLipschitzcontinuous.Further,letF0

bea

linearintervalextensionoff0.Then

FM

(X)=

f(m(X))+F0(X)(X�m(X))

isaquadraticintervalextensionoff.

31


Proof.

Weapply(6.2)totheintervalextensionF0,

F0(X)=

f0(X)+E;

whereE�I(Rn)withw(E)�K1w(X)and02E,implyingthatjEj�

w(E)�K1w(X).Nowwecanwrite

FM

(X)=

f(m(X))+( f

0(X)+E)(X�m(X));

showingthateveryelementy2FM

(X)hastheform

y=

f(m(X))+(f0(u)+e)(v�m(X))

withu;v2X

ande2E.Acorrespondingpointintheintervalhullf(X)

isy=f(v),andfromthemeanvaluetheoremweknowthatthereexists

a�2X

suchthat

y=

f(m(X))+f0(�)(v�m(X)):

Weproceedasfollows

d(FM

(X);f(X))�jy�yj

=

j� f0(u)�f0(�)+e�� v�m(X)� j

�n X i=

1� j(f0(u) i�f0(�) ij+je ij� jv i�m(X)j

�n X i=

1� K 2w(X)+K1w(X)� 1 2w(X)

=

n 2(K1+K2)w(X)2:

InthethirdreformulationweusedtheLipschitzcontinuityoff0

andthe

boundonE.Thus,(6.4)issatis�ed,andtheproofis�nished.

7.

AccurateComputationof

IntervalHulls

Inthepreviouschapterswehaveseensomegenerallyapplicableinterval

extensionsthatcanbeusedto�ndapproximationstotheintervalhull.

Undercertainconditionsweareguaranteedthattheerror

d(F(X);f(X))�K�w(X)�;

(7.1)

where�=1fortherationalintervalextensionsofSection4.3andlinear

intervalextensionsdiscussedinSection6.1,while�=2forthequadratic

intervalextensionsofSection6.3.Itispossibletoconstructinterval

extensionswithlarger�-values,butitreallydoesnothelp:wecannot

guaranteeanarbitrarilygoodapproximationifw(X)�1.

Ontheotherhand,ifw(X)�1,then(7.1)showsthatF(X)isvery

closetof(X),andthissuggestsanidea:DividetheintervalX

into

subintervals,eachofwhichhassmallerwidththanX,computeanin-

tervalextensiononeachsubinterval,andgathertheinformationabout

F(X)fromallsubintervals.Letustrytheideaonasimpleproblem.

Example7.1.

AsinExample4.2welookatthefunctionf(x)=x(1�x)

ontheintervalX

=[0;

1 2],withtheintervalextensionF(X)=X(1�X).

InFigure7.1weshowtheresultobtainedwhenwesplittheintervalinto

twoandfourequalparts.ComparingwithFigure4.2weseethatF(X)is

reducedfrom

[0;

1 2]to[0;

3 8]and[0;

5 16],respectively,whicharecloser

totherangeoff(X).

0.250.5 0

0.25

0.5

0.250.5 0

0.25

0.5

Figure7.1.Splittingtheinterval

33

7.AccurateIntervalHulls

Wereturntothisproblem

later.First,however,weformalizetheap-

proach.

Theorem

7.1.Letthefunctionfbede�nedinX�I(Rn),andlet

X(1);:::;X(p)beasubdivisionofX,i.e.,

X(j)�X

and

p [ j=1

X(j)

=

X

:

Further,letFbeanintervalextensionoffonX.Then

f(X)�

p [ j=1

F(X(j)):

Proof.

Ineachsubintervalitclearlyholdsthatf(X(j))�F(X(j)),and

thetheoremfollowsbytakingtheunionofbothsidesoftheinclusion.

Inotherwords,givenanintervalextensionoffandasubdivisionofthe

domainX,wegetanewintervalfunction

Fp(X)=

p [ j=1

F(X(j)):

(7.2)

Thisisanintervalextensionoff,providedthatwede�netheunion

oftwointervalsasthesmallestintervalthatcontainstheunion.This

intervalextensionisthesubjectof

Theorem

7.2.

LetF

beanintervalextensionofA�I(Rn)and

assumethatthereexistsaconstantK>0suchthat

d(F(X);f(X))�Kw(X)�

forallX

�A

with�=1or�=2.Then

d(F(X);f(X))�K

maxj

� w(X(j))�

:


34

Proof.

Wecan�ndtwosubintervalswithindiceslandusothat

Fp(X)=

p [ j=1

F(X(j))=

F(X(l))[F(X(u));

andbyusingthede�nitionofthedistanceinLemma5.1andthecon-

structionofFpwe�nd

d(F(X);f(X))�max

� d(F(X(l));f(X(l)));d(F(X(u));f(X(u)))

�K

max

� w(X(l))�;w(X(u))�

�K

max

p j=1

� w(X(j))�

:

Asaspecialcaseconsiderequidistantsubdivisionineachcoordinate

direction,i.e.w(X(j))=

1 qw(X),wherep,q,andthedimensionalityn

arerelatedthroughp=q�n.Thenthelasttheoremtellsusthat

d(F(X);f(X))�Kw(X)�(n=p)�:

(7.3)

Theright-handsideconvergestozero,i.e.Fp(X)!f(X)forp!1.

Thismeansthatnowwehaveamethodforcomputingtheintervalhull

toarbitraryaccuracy.Inpractice,ofcourse,whencarriedoutin oating

pointonacomputer,roundingerrorslimittheobtainableaccuracy.We

refertoSection2.4,andignorethislimitationinwhatfollows.

Example7.2.

Asinthepreviousexamplewelookat

f(x)=

x(1�x)

on

X

=

[0;

1 2]

with

f(X)=

[0;1 4

]:

Weconsiderthetwointervalextensions

F(X)=

X(1�X);

FM

(X)=

f(m(X))+(1�2X)(X�m(X));

anduseequidistantsubdivision,

X(j)

=

[j�1

p

;j p];

j=1;:::;p:

WeillustratedthechoiceF(X)inFigures4.4and7.1forp=1andp=2.

Below,wegiveresultsobtainedbysuccessivedoublingofp.

35


p

d(Fp(X);f(X))

d(FM

p(X);f(X))

1

0:250000

0:187500

2

0:125000

0:046875

4

0:062500

0:011719

8

0:031250

0:002930

16

0:015625

0:000732

32

0:007813

0:000183

64

0:003906

0:000046

128

0:001953

0:000011

. . .

. . .

. . .

Asexpected,Fp

showslinearconvergenceandFM

p

showsquadraticcon-

vergence:Eachdoublingofpreducestheerrorwithafactor2and4,

respectively.

TheproofofTheorem

7.2suggestsawayofreducingthenumberof

timesneededtorecomputetheintervalextensiononsubintervals.The

errorsatis�es

d(F(X);f(X))�K

max

� w(X(l))�;w(X(u))�

;

andtoreducethedistancebetweenFp(X)andf(X)itmaynotbenec-

essarytore�neallsubintervals,butonlyX(l)andX(u),thesubintervals

thatcurrentlyde�netheendpointsofFp(X).Thisideahasbeenused

bySkelboe(1974)todevelopanadaptiveversion,wherethesubdivision

ofX

ismonitoredbyF.

WewanttostoptheprocesswhenthecurrentapproximationissuÆ-

cientlyclosetotheintervalhull.Itissurprisinglysimpletogetasimple

estimateoftheerror:Becausef(X(l))�F(X(l))andf(X(u))�F(X(u))

we�ndthat

d(Fp(X);f(X))�max

� d(F(X(l));f(X(l)));d(F(X(u));f(X(u)))

�max

� w(F(X(l)));w(F(Xu))

:

Therefore,ifthesimplestoppingcriterion

max

� w(F(X(l)));w(F(Xu))

�Æ

issatis�edforthedesiredvalueofÆ,thend(Fp(X);f(X))�Æ.


36

IfthemeanvalueformFM

isused,thenwenotonlyhavetheadvantage

ofquadraticconvergence,butwealsoknowF0,i.e.wehaveinformation

aboutthepartialderivatives.Supposee.g.,thatF0(X(j)) i>0,thenf

ismonotonicincreasingonX(j)inthedirectionofxi,andifwereplace

theithcoordinateinX(j)withthelowerandupperbound,respectively,

thentheintervalhulloverX(j)iscontainedintheunionofFM

applied

tothetworeduced-dimensionintervals.Thus,monotonyinacoordinate

directiongivesfasterconvergenceinthatdirection,butwestillhave

tousenondegenerateintervalswhenextremaareattainedininterior

points.Moore(1975)andSkelboe(1977)reportconsiderablegainfrom

usingsuchinformationaboutthegradient.

WegivemoredetailsinChapter11,wherewefocuson�ndingthelower

boundoff(X).

8.

IntervalIntegration

Givenacontinuousfunctionf:R7!RontheintervalA=[a;b].We

wanttocomputetheintegral

T

=

Z b a

f(x)dx:

Thisproblemarisesinmanyconnections,andinsomecasesitiseasy:

Ifwecan�ndafunction'suchthat'0(x)=f(x),thenTissimply

evaluatedasT='(b)�'(a).Ifwecannot�ndsuchafunction{or

ifanimplementationofitisnotavailableanditisverycomplicated

todevelop,thenwecanresorttonumericalquadrature,andgetan

approximatione Ttotheintegral.Theapproximationwillbea�ected

bybothroundingerrorsandtruncationerrors,seeChapter1.Itmay

alsohappenthatthegivenfhasinputerror,i.e.itrepresentssome

theoreticalfunctionb finthesensethatf(x)=

b f(x)+e(x),andweare

givenboundsone.Whatisthee�ectofalltheseerrors,i.e.whatcan

wesayaboutjT�e Tj?

8.1.

BasicMethod

Itcancomeasnosurprisethatweo�erintervalanalysisasameansof

answeringallthesequestions.Themathematicalbackgroundisthead-

ditivepropertyofintegrationandthemeanvaluetheoremforintegrals.

Ifa�c�b,then

Z b a

f(x)dx

=

Z c a

f(x)dx+

Z b c

f(x)dx;

andthereexistsa�2X

=[x;x]suchthat

Z x x

f(x)dx

=

w(X)f(�):

Therefore,ifwesplittheintervalAintopsubintervalswithbreakpoints

fajg,

a=a0<a1<:::<ap�

1<ap=b;

(8.1)

8.1.BasicMethod

38

then

T

=

p X i=1

Z a i ai�

1

f(x)dx

=

p X i=1

w(Ai)f(�i);

whereAi=[ai�1;ai]and� i2Ai.

Now,letFbeanintervalextensionoffonA.Then

Ti

�w(Ai)f(�i)�Ri

�w(Ai)F(Ai);

andwehavederivedanintervalinclusionofT,

T

�R

�p X i=

1Ri:

(8.2)

ThewidthoftheintervalR

measureshowwellwehavedetermined

theintegral.IfwetakethemidpointtorepresentT,thenwehavethe

followingboundonthetruncationerror

jT�m(R)j�1 2w(R):

(8.3)

Example8.1.

Consider

T

=

Z 1 0

11+x

dxwiththesolutionT

=

ln2'

0:69314718.Theintegrandismonotonicdecreasingintheintegration

domainA=[0;1],sowecanuseF(X)=[

11+x

;

11+x

].Ifwedivide

Aintopsubintervalsofequalwidth,thenthebreakpointsareaj=j=p,

j=0;1;:::;p,w(Ai)=1=p,andwe�nd

Ri

=

1 p[

pp+i

;

p

p+i�1

];

R

=

[1 p+

1p+1

+��+

1 2p

;

1p�1

+1 p+��+

12p�1

]:

Thus,jT�m(R)j�

1 2w(R)=

1 2� 1 p

�1�

1 2p

� =1 4

p;showingthat

jT�m(R)j�0:005ifp�50.

39

8.IntervalIntegration

Inthissimpleexampleweusedtheintervalhulltorepresentthevariation

offoneachsubinterval.Ingeneral,assumethatFisalinearinterval

extensionoff.ThenTheorem6.1tellsusthat

w(F(Ai))�Kw(Ai)

forsomeconstantK>0,andbymeansof(5.9)and(8.2)we�nd

w(R)=

P iw(Ri)=

P iw(Ai)�w(F(Ai))

�P iw(Ai)�Kw(Ai)=

K�Piw(Ai)2:

(8.4)

Withequidistantbreakpointseachw(Ai)=(b�a)=p,and(8.4)leadsto

w(R)�K�p�� b�

ap

� 2=

K1 p

:

(8.5)

Thus,withagoodintervalextensionoffwecanexpectthatadoubling

ofpreducesthetruncationerrorbyafactor2.

Supposethatwecanignoreinputandroundingerrors,thenwecould

simplyincreasep(e.g.successivelydoubleit)untilw(R)issuÆciently

small.Often,however,aconsiderableamountofcomputationcanbe

savedbyfocusingonthosepartsoftherange,whereF(X)hasthe

greatestwidth.Theframeworkofsuchanadaptivealgorithm

couldbe

thefollowing\divide-and-conquer"algorithm,whichstopswhenwecan

guaranteethatjT�m(R)j�ÆwiththevalueofÆchosenbytheuser,

ok:=false

while

notok

R:=

P iRi

ifw(R)�2Æthen

ok:=true

else j

:=argmaxfw(Ri)g

SplitAjintotwosubintervalsofequalwidth

end

end

(8.6)

8.2.IntervalSimpson

40

8.2.

IntervalSimpson

Themethoddescribedabovebehavessimilartothesimplemethodsfor

\ordinary"numericalintegrationknownastheleftandrightRiemann

formulas,

e T (L)=

b�a

p

� f(a0)+f(a1)+��+f(ap�

1)� ;

e T (R)

=

b�a

p

� f(a1)+��+f(ap�

1)+f(ap)� :

Iftheintegrandisdi�erentiable,wecangetaccurateapproximations

withfewerfunctionevaluations.AsanexampleconsiderSimpson'sfor-

mula.Iffisfourtimescontinuouslydi�erentiable,then

Ti

=

w(Ai)

6

� f(ai)+4f(m(Ai))+f(ai)� �

w(Ai)5

2880

f(4)(�i);

where� i2Ai.IfwehaveintervalextensionsFandGoffandf(4),then

wecanusethesetoget

T

�S

�p X i=

1Si;

(8.7)

with

Si

=

w(Ai)

6

� F(ai)+4F(m(Ai))+F(ai)� �

w(Ai)5

2880

G(Ai):

NotethatthecallsofFwithsingletonargumentsreturnintervalsthat

caterfore�ectsofroundingerrorsanduncertaintyoftheintegrand.

Ignoringthese,andassumingthatG

isalinearintervalextensionof

f(4),weget w

(Si)=

12880

w(Ai)5w(G(Ai))

�

12880

w(Ai)5KG

w(Ai)=

K2w(Ai)6;

andsimilartothederivationof(8.5)weseethatinthecaseofequidistant

breakpointswegetw(S)�K3=p5,showingthatwecanexpectthata

doublingofpreducesthetruncationerrorbyafactor32.

41


Example8.2.

FortheproblemofthepreviousexamplewecanuseG(X)=

24=(1+X)5,andbelowwegivetheresultsforequidistantbreakpoints.

Notethatwithpsubintervalsweuse2p+1singletoncallsofFandpcalls

ofG.

p

m(S)

r(S)

1

0.69014756944444

4.04e-003

2

0.69308539735741

1.26e-004

4

0.69314610032219

3.94e-006

8

0.69314716308433

1.23e-007

16

0.69314718028433

3.85e-009

Eachtimepisdoubled,theradiusr(S)isreducedbythefactor32pre-

dictedabove.

Wecannotuse(8.7)onaproblemlikeT=

R 1 0pxdxbecausethefourth

derivativeoftheintegrandissingularatthelefthandendoftherange.

Wecoulduse(8.2),improvedby(8.6),butweshallassumethatsuch

asingularityoccursonlyatoneand/ortheotherend,andsuggestthe

followinghybridmethod,wheretheRiemannapproachisusedinthetwo

\boundaryintervals"andtheSimpsonapproachisusedintheinterior,

T

�H

�R1+Rp+

p�

1 X i=2

Si:

(8.8)

Wecancombinethiswiththedivide-and-conquerapproachof(8.6),with

specialhandlingofthecaseswherethecriticalsubintervalisabound-

aryinterval,j=1orj=p.Inthe�rstcaselet[a0;a1]bethecurrent

�rstintervalwithlength�=a1�a0.Wesplitthisintothenewbound-

aryinterval[a0;a0+�=32]andtheinteriorinterval[a0+�=32;a1].

Similarifthecriticalsubintervalisattheotherend.

IntheimplementationitisexploitedthatF(ai)=F(ai+1)andwhen

aninteriorintervalissplit,thenthemidpointisanendpointofboth

newsubintervals,soweonlyhavetoevaluateF

atthetwonewmid-

pointsandG

onthetwonewsubintervals.Alsothespecialhandling

oftheboundaryintervalscallsfor4evaluationsofanintervalfunc-

tion.Theinitiallayouthasp=3(oneinteriorinterval)andbreakpoints

a0:3=(a;a+�;b��;b),where�=(b�a)=32,andthe�rstH

needs

6evaluationsofanintervalfunction.

8.2.IntervalSimpson

42

Example8.3.

Forf(x)=

px=x

1=2

wecanuseG(X)=�

1516

X�

7=2,and

withthestoppingcriterionr(H)�10�

3

wefoundtheresult

H

=

[0.666333615;0.668563604]

obtainedwith40intervalfunctionevaluations.The�gureshowsthepoints

(x;F(x))thatwereusedinSimpson'sformula.Asexpected,theyconcen-

trateclosetothesingularityatx=0.

00.

10.

20.

30.

40.

50.

60.

70.

80.

91

0

0.2

0.4

0.6

0.81

Figure8.1.

Example8.4.

Forcomparison,wehavealsousedtheM

atlab6function

quadonthesquarerootproblem.Asourintervalalgorithm,thisisan

adaptivemethodbasedonSimpson'sformulaandbisectionof\critical"

subintervals.Thealgorithm

stopswhentwoconsecutiveapproximations

di�erlessthanÆ,thedesiredaccuracy.Inthetablebelowwegiveresults

forthenumberofsubintervalsandtheerrorsfordi�erentchoicesofthe

desiredaccuracy.

Thetwoalgorithmsuseaboutthesamenumberofsubintervals,butwhile

quaduses2p+1evaluationsofa oatingpointimplementationoff,the

intervalalgorithm

uses4p�6evaluationsofintervalfunctions,whichis

morethantwiceascostly.

Ontheotherhand,theintervalalgorithm

returnstheresulttothedesiredaccuracy,exceptforthelastexample.We

usedthe\safetyvalve"p�1000,whilequadmayproceeduntilthenumber

offunctionevaluationsexceeds10000.Inallcases(exceptforthe�rst,

wherep=2issuÆcient)theerroronthequadresultisuptoalmost10

timesthedesiredaccuracy.

Thetruevalueoftheintegral,T=

2 3

isinanintervalofwidth2�

53

'

1.11e-16betweentwo oatingpointnumbers.Withthatinmind,both

methodsachieveimpressiveaccuracy.

43


Interval

quad

Æ

p

jm(R)�Tj

p

je S�Tj

1e-02

6

2.71e-04

2

8.91e-03

1e-04

12

4.54e-05

8

3.94e-04

1e-06

23

7.07e-07

18

6.34e-06

1e-08

52

4.48e-09

48

3.58e-08

1e-10

120

2.86e-11

110

5.63e-10

1e-12

296

1.24e-13

280

3.34e-12

1e-14

805

6.66e-16

700

4.90e-14

1e-16

1000

4.44e-16

1740

7.77e-16

TheintervalversionofSimpson'smethodneedsagoodintervalextension

off(4)(x),ortheuseofautomaticdi�erentiation,seee.g.Neumaier

(2001)orCorlissetal(2002).Chapter8inMoore(1966)discusses

furtherintervalintegrationmethodsbasedonTaylorexpansionsofthe

integrand.Alsohere,automaticdi�erentiationisuseful,andhegivesa

shortintroductioninhisChapter11.

9.

RootsofFunctions

Lookingforarootofafunctionf:A�Rn

7!Rm

meansthatwewish

to�ndapointx�

2Asuchthatf(x�)=0.Variantsoftheproblemisto

�ndallrootsinA,etc,etc.

9.1.

BasicMethod

Whenweuseintervalmethodsto�ndarootweneedanalgorithmthat

cantelluswhetheragivenintervalX�I(Rn)containsnorootsandan

algorithmthatcangiveusintervalsthatcontainrootsandareasnarrow

aswewish.

The�rstalgorithmisprovidedbythesimple

Theorem

9.1.IfFisanintervalextensionforf:A�I(Rn)7!Rm

andX�Awith02jF(X),thenfhasnorootsinX.

Proof.

Straightforward,since02jF(X))02jf(X).

IfwecombinethisresultwiththesubdivisionschemeofChapter7,we

havetheframeworkforasimplealgorithm.DivideAintosubintervals,

A=

S p j=1X(j),andoneachsubintervalcheckwhether02F(X(j)).If

not,thenthissubintervalisdiscarded,andtheremainingX(j)arefurther

subdivided.IfFisagoodapproximationtofandthesetofrootsin

Aforfisbenign(e.g.�nite),thenthemethodresultsinacollectionof

arbitrarilynarrowintervalsthatmaycontainroots,andtheremainder

ofAisguaranteedtocontainnoroots.

Example9.1.

LetA=[�1;2],f(x)=x�x

2

andF(X)=X(1�X).We

getF(A)=[�2;4],sotheremayberootsinA.Wesplittheinterval

intoX(1)=[�1;

1 2],X(2)=[

1 2;2],and�ndthat0iscontainedinboth

F(X(1))andF(X(2)).Sowesplitagainandgettheresultstabulated

below.

45

9.RootsofFunctions

j

1

2

3

4

X(j)

[�1;�1 4

]

[�1 4

;1 2]

[1 2;

5 4]

[5 4;1]

F(X(j))

[�2;�

516

]

[�

516

;5 8]

[�

516

;5 8]

[�2;�

516

]

Thus,afterthecomputationof7intervalextensionswecandiscardX(1)

andX(4),whilebothofthetwoneighbouringintervalsX(2)andX(3)may

containroots.Furtheruseofthealgorithm

willisolatethetworootsoff,

0and1.

Example9.2.

Themethodalsoworksformultipleroots.Considere.g.

f(x)=x

2�4x+4=(x�2)2onA=[�4;8]withtheintervalextension

F(X)=X�(X�4)+4.Weget

j

1

2

3

X(j)

[�4;0]

[0;4]

[4;8]

F(X(j))

[4;36]

[�12;4]

[4;36]

WecandiscardX(1)

andX(3)

andsubdividetheintervalX(2),which

containsthedoubleroot.

Inthe�rstexampletheproblemissimpleandtheintervalextensionis

quitegood.Foramorecomplicatedproblem

and/oraworseapproxi-

mationtof(X)manysubdivisionsmaybeneededbeforeweareableto

discardrootfreesubintervals.

MooreandJones(1977)describehowthemethodworksinmultiple

dimensions.Theyhalveconsecutivelyineachcoordinatedirectionand

keepalistofintervalsthatstillhavenotbeendiscarded.

Inthecasen=m=1themethodcanbeextendedtoproveexistenceof

simpleroots:IffiscontinuousandFappliedtothetwoneighbouring

intervalsofX(j)hasoppositesign,thenX(j)containsatleastoneroot,

seeFigure9.1below.

TheexistencetestcanbemademoreeÆcientbyjustexaminingF

at

points{singletons.TheendpointsofF([x;x])re ecttherounding

errorsinthecomputationof (f(x)),butexceptforthatwedonothave

todistinguishbetweenFandf.Nickel(1967)usedthistoconstructan

intervalversionoftheclassicalbisectionalgorithm.

9.2.IntervalNewton

46

X(j−

1)X

(j)X

(j+1)

F(X

(j−1)

) >

0

F(X

(j+1)

) <

0

Figure9.1.Testofexistenceofroot

Example9.3.

TheexistencetestonlyworkswhenX(j)

containsanodd

numberofroots.Theproblem

from

Example9.2hastworoots(viz.the

doubleroot2)inX(2),andthereforeF(X(1))�F(X(3))>0.

Untilnowwehavepresentedvariantsofintervalversionsofbisection.

Thismethodisrobustbutratherslow.Inthenextsectionwepresent

afasterintervalalgorithm,butbeforethatwepresentahybridmethod.

Letexdenotean(approximate)root,computedinordinary oatingpoint

bymeansofanyroot�ndingalgorithm,andletÆdenoteanestimateof

theerror.Thismeansthatweexpectthatthereisarootx�

suchthat

jx�

�exj�Æ.Inotherwords,de�netheinterval[a;b]=ex+[�Æ;Æ],

andifF([a;a])�F([b;b])<0,thenweknowthat[a;b]containsat

leastoneroot.Ifthetestisnotsatis�ed,wecanextendtheintervalto

[a;b]=ex+2[�Æ;Æ],etc.

9.2.

IntervalVersionofNewton'sMethod

Inthissectionwediscussthecasen=m

=1.ConsideranintervalX

thatcontainsarootx�

off,wherefiscontinuouslydi�erentiable.Then

themeanvaluetheoremtellsusthatthereexistsa�2X

suchthat

0=

f(x�)=

f(x)+(x�

�x)f0(�):

Assumingthatf0(�)6=0,thisleadsto

x�

=

x�

f(x)

f0(�)

:

47

9.RootsofFunctions

Now,letF0

beanintervalextensionoff0

andassumethatF0(X)does

notcontain0.Then

x�

2N(x;X)�x�

f(x)

F0(X)

;

(9.1)

wherewehaveintroducedtheNewtonoperatorN(x;X).Therootisalso

containedinX,andthereforeitmustlieintheintersectionbetweenX

andN(x;X),

x�

2X\N(x;X):

Thus,wecanformulatetheintervalversionofNewton'smethod:Start

withX(0)

containingx�

andcomputeanestedsequenceofintervals

X(1);X(2);:::bytheformula

X(s+1)

=

X(s)\N(x(s);X(s))

with

x(s)2X(s);

s=0;1;::::

(9.2)

Theterm\nested"isusedbecauseeachnewintervaliscontainedinthe

previous,X(0)�X(1)��.Thisimpliesthatthewidthsdecrease,

w(X(0))�w(X(1))�:::,andsincealltheX(s)arecontainedinX(0),

theyarebounded.Therefore,thereexistsalimitX�

containingx�.

Example9.4.

Considerthefunctionf(x)=

x2�2withf0(x)=

2xand

X(0)

=[1;2].Wechoosex(s)

=m(X(s))andF0(X)=2X,sothatthe

Newtonoperatoris

N(x(s);X(s))=

m(X(s))�(m(X(s))2�2)=(2X(s)):

The�rststepwith(9.2)is

N(x(0);X(0))=

3 2

�1 4=[2;4]=

[2216;

2316];

X(1)

=[1;2]\[

2216;

2316]=

[2216;

2316]:

Below

weshow

resultsobtainedbytheM

atlab

ToolboxIntlab,and

displayedwithformat

long.

9.2.IntervalNewton

48

s

X(s)

1

[1.37499999999998;1.43750000000001]

2

[1.41406249999998;1.41441761363637]

3

[1.41421355929452;1.41421356594719]

4

[1.41421356237309;1.41421356237310]

TheconditionthatF0(X)doesnotcontain0isnecessaryfortheappli-

cationoftheNewtonoperator,andithasaninterestingimplication,

Lemma9.1.

Iff:R7!Riscontinuouslydi�erentiableonthe

intervalX

and02jF0(X),thenX

eithercontainsasimplerootx�

ornoroots.

Proof.

Theemptypartisobvious,soassumethatthereiseithera

multiplerootormorethanonerootinX.

Arootx�

offofmuliplicityqsatis�es

f(x�)=

f0(x�)=

��=

f(q�

1)(x�)=

0:

However,theconditiononF0(X)impliesthatf0(x)6=0forallx2X.

Therefore,X

cannotcontainarootofmultiplicityq�2.

Next,considertwo(ormore)separate,simplerootsx�;y�

2X,y�

6=x�.

Theymustbeseparatedbyapointz2X,wherefhasalocalextremum,

i.e.f0(z)=0.ButtheassumptiononF0

excludestheexistenceofsuch

apoint.

InExample9.4wehadaveryfastconvergencetowardsthesimpleroot

x�

=

p2.Thisexempli�edthatundertheconditionsofconvergence

thelimitX�

isthesingletonX�

=x�,irrespectiveofhowwechoosethe

pointsx(s)2X(s),

Theorem

9.2.

Ifx�

isarootoffinX(0)andF0(X(0))doesnot

contain0,thenthesequenceX(1),X(2),...de�nedby(9.2)converges

tox�.

49

9.RootsofFunctions

Proof.

From(9.2)andthede�nition(9.1)oftheNewtonoperatorwe

seethat

w(X(s+1))�w(N(x(s);X(s)))=

w(f(x(s))=F0(X(s)))

=

jf(x(s))jw(1=F0(X(s)))

�jf(x(s))jw(1=F0(X(0)))=

Kjf(x(s))j;

(9.3)

whereK

=w(1=F0(X(0)))isindependentofX(s).Sincex�

iscontained

ineachX(s),itispossibletochoosefx(s)gsothatx(s)!x�,inwhich

case(9.3)showsthatw(X(s))!0,X(s)!x�.

Wehavetoprovethattheconvergenceisobtainedforallchoicesof

x(s)2X(s),andaccordingto(9.3)thisisensuredifwecanshowthatitis

onlypossibletochoosea�nitenumberofx(s)forwhichjf(x(s))j>">0.

Suchapointsatis�es

� � � �f(x(s))

F0(X(s))

� � � ��jf(x(s))j

jF0(X(s))j�

jf(x(s))j

jF0(X(0))j

;

and

X(s+1)=X(s)\

� x (s)�

f(x(s))

F0(X(s))

� showsthatthedistancefrom

x(s)toX(s+1)isatleast"=jF0(X(0))j,andsincex(s)2X(s),thisimplies

thatw(X(s+1))�w(X(s))�"=jF0(X(0))j.Now,w(X(0))is�nite,sowe

canonlysubtractthepositivenumbera�nitenumberoftimes,andthis

concludestheproof.

TheNewtonoperatorcanhelpusansweringthequestionraisedinLemma

9.1abouttheexistenceornon-existenceofarootsinagiveninterval,

Theorem

9.3.

IfX

isanintervalwith02jF0(X)andthereexists

anx2X

suchthatX\N(x;X)=;,thenfhasnorootinX.

Proof.

Bycontradiction.IfX

containedaroot,thenitwouldalso

becontainedinN(x;X)andintheintersection.SinceX\N(x;X)is

empty,therecanbenorootinX.

9.2.IntervalNewton

50

Theorem

9.4.

Iffiscontinuouslydi�erentiable,X

isaninterval

with02jF0(X),andthereexistsanx2X

suchthatN(x;X)�X,

thenfhasarootinX.

Proof.

ThisproofismorediÆcult;itusesSchauder'stheoremabout

existenceof�xedpoints.

Considerthepointxandanotherpointy2X.Accordingtothemean

valuetheoremthereexistsa�betweenxandysothat

f(x)=

f(y)+f0(�)(x�y):

(9.4)

Sincef0(�)2F(X),itisnonzero,andwede�nethefunctiong,

g(y)=

y�

f(y)

f0(�)

:

(9.5)

Thisiscontinuous,andfrom(9.4)andtheassumptiononN(x;X)we

get

g(y)=

x�

f(x)

f0(�)2x�

f(x)

F0(X)

=

N(x;X)�X

:

Thus,gisacontinuousmappingofXintoitself.AccordingtoSchauder's

theoremghasa�xedpointinX,y�

=g(y�),and(9.5)showsthatthis

isarootoff.

NowwearereadytosketchanalgorithmthatexaminesanintervalA

forsimpleroots:

1.

SplitA

intosubintervalsA(1);:::;A(p),whereeachsubinterval

satis�eseither02jF0(A(j))orw(A(j))�Æ.Thesecondtypecon-

sistsofsuÆcientlynarrowintervalsaroundlocalextremaandmul-

tipleroots.

2.

LetX(0)=A(j)

beasubintervalwith02jF0(X(0)).

IfX(0)\

N(x(0);X(0))=

;,thenwecandiscardthissubinterval,other-

wiseproceedwith(9.2).

51

9.RootsofFunctions

IfanyoftheiteratesX(s)satis�esN(x(s);X(s))�X(s),thenweareguar-

anteedthatthereisarootinX(s),andthereforeinX(0).Depending

onthechoiceofx(s),someiterationstepsmaybeneededbeforethe

non-existenceortheguaranteedexistenceofarootisrevealed.

From

thede�nition(9.1)oftheNewtonoperatorandtheassumption

02jF0(X)itfollowsthatallpointsintheintervalN(x;X)areeitherto

theleftortotherightofthepointx.Thishastheimportantimplication

w(X\N(m(X);X))<

1 2w(X):

(9.6)

Thus,ifwetakex(s)asthemidpointofX(s),wehaveaguaranteed

reductionoftheintervalwidth.Ifthereisarootintheinterval,then

weachievefaster�nalconvergence:

Theorem

9.5.IfthereisarootinX�X(0),F0

isalinearinterval

extensionoff0,and02jF0(X),thenthereexistsaconstantK>0

suchthat

w((N(x;X))�Kw(X)2

forall

x2X

:

Proof.

From(9.3)weknowthatjw(N(x;X))j�K1jf(x)j,andfrom

themeanvaluetheoremandthelinearityofF0

weget

jf(x)j=

jf(x�)+f0(�)(x�x�)

�jf0(�)jw(X)�jF0(X)jw(X)�K2w(X)�w(X);

andthetheoremfollows.

Thistheoremshowsthatunderthegivenassumptionsthesequence(9.2)

satis�esw(X(s+1))�Kw(X(s))2,i.e.ithasquadraticconvergence,as

illustratedinExample9.4.

ThemethodcanbemademoreeÆcientifwerealizethatx(s)canbe

chosenfreelyinX(s).Theestimate(9.3)showsthatthewidthofX(s+1)

isspeciallysmallifwechoosex(s)closetotheroot,andtoreducethe

amountofintervalcomputationwecanuseanordinary oatingpointal-

9.3.NonlinearSystem

52

gorithmto�ndanapproximationextox�,andusex(s)=ex.Theensuing

computationmustbedoneinintervalarithmetic.

Example9.5.

Itiseasilyseenthat

f(x)=

x3+x

2+3x+1

hasarootbetween�1and0.WithX(0)=[�1;0],F0X)=3+X(2+3X)

and

x(s)=m(X(s))wegetthefollowingresultsfrom

(9.2),

s

X(s)

w(X(s))

1

[-0.43750000000001;-0.12499999999998]

3.13e-01

2

[-0.37936994154677;-0.34535656580493]

3.40e-02

3

[-0.36113616568634;-0.36106913433279]

6.70e-05

4

[-0.36110308055118;-0.36110308050612]

4.51e-11

5

[-0.36110308052865;-0.36110308052864]

5.55e-17

Again,wenotethequadraticconvergence.Thisismostclearlyseeninthe

columnwithw(X(s)).Roundingerrorshavedominatingin uenceinthe

laststep.

Threestepswiththeordinary oatingpointNewtonalgorithm

from

the

startingpoint�0:5givesex=�0:36110308052865,andusingtheinterval

NewtonoperatorN(ex;X(0))weget

X(1)

=

ex�f(ex)

F0(X(0)

=

[-0.36110308052865;-0.36110308052864]

w(X(1))=

3.0e-15

Thus,three oatingpointstepsplusoneintervalstepgivesabetterresult

thanfourintervalsteps.

9.3.

MultidimensionalNewton'sMethod

Inthissectionweconsidercontinuouslydi�erentiablefunctionsf:Rn

7!

Rn,andseekx�

2Rnsuchthatfi(x�)=0;i=1;:::;n.Thisisasystem

ofn(nonlinear)equationsinnunknowns,thecomponentsx� 1;:::;x� n

of

x�.

53

9.RootsofFunctions

Again,westartwiththemeanvaluetheorem,butthistimeinitsmul-

tidimensionalversion.Letx�

2X�I(Rn)bearootforx.Thereexist

vectors� [1];:::;� [n]2X

suchthat

0=

f(x�)=

f(x)+J(x;x�)(x�

�x);

whereJ2Rn�

n

istheJacobianwiththeelements

� J(x;x�)� i;j

=

@fi

@xj

(�[i]):

AssumethattheJacobianisnonsingularforallx2X.Thentheinverse

ofthismatrixexists,and

x�

=

x�

� J(x;x�)� �1f(x)2x�V(X)f(x);

(9.7)

whereVisanintervalmatrixcontainingallthepossibleJ�

1,

f� J(x;x�)� �1;x2Xg�V(X):

Thus,from

X(0)containingtherootx�

wecanconstructanestedse-

quenceofintervals,thatallcontainx�,

X(s+1)

=

X(s)\N(x(s);X(s));

s=0;1;:::

with

N(x;X)=

x�V(X)f(x);

x(s)2X(s):

(9.8)

Example9.6.

Considerthefunctionf:R

2

7!R

2

de�nedby

f(x)=

0 @f 1(x)

f2(x)

1 A =0 @x2 1+x

2 2�1

x1�x2

1 A

ontheintervalX(0)=([

1 2;1];[

1 2;1]).TheJacobianis

J(x;x�

)=

2� 11

2� 12

1

�1! ;

� [1]=(�11;� 12):

Thisisnonsingularif� 11+� 126=0,whichissatis�edforall� [1]2X(0).The

inverseis � J(x

;x�

)� �1=

1

� 11+� 12

1 2

� 12

1 2

��11

! ;

9.3.NonlinearSystem

54

andwecanuse

V(X)

=

1

X1+X2

1 2

X2

1 2

�X1

! .

Ifwechoosethe

midpointsx(s)=m(X(s))in(9.8),wegetthefollowingresults

s

X(s);1=X(s);2

w(X(s))

1

[0.68749999999998;0.71875000000001]

3.12e-02

2

[0.70703124999999;0.70720880681819]

1.78e-04

3

[0.70710677964726;0.70710678297359]

3.33e-09

4

[0.70710678118654;0.70710678118655]

1.11e-16

5

[0.70710678118654;0.70710678118655]

1.11e-16

Weseeafast(quadratic)convergencetothesolution,x� 1

=

x� 2

=

p0:5.

ThewidthofX(4)ishalfofthemachineaccuracy,andwecannotgetcloser

tox�

becauseofroundingerrors.

Example9.7.

Nowletf:R

2

7!R

2

bede�nedby

f(x)=

0 @x2 1+x

2 2�5

x1x2�2

1 A

ontheintervalX(0)

=

([1:6;2];[1;1:4]).Ithastherootx�

=

(2;1).

TheJacobianisJ(x;x�

)=

2�11

�2� 12

��22

� 21

! .ItisnonsingularinX(0)

andwecanuse

V(X)=

1

2(X1X1�X2X2)

X1

�2X2

�X2

2X1

! ;

Ifwetakex(s)

=

m(X(s))in(9.8),wegetthesequence(displayedwith

fewerdigitsthanbefore)

s

X(s);1

X(s);2

w(X(s))

1

[1.938666666;2]

[1;1.06133334]

6.13e-02

2

[1.998451819;2]

[1;1.001548181]

1.55e-03

3

[1.999999001;2]

[1;1.000000998]

9.98e-07

4

[1.999999999;2]

[1;1.000000001]

4.15e-13

5

[1.9999999999;2]

[1;1.000000001]

2.22e-16

6

[2;2]

[1;1]

0

Theonlynew

featureisthatnow

thecomponentsoftherootcanbe

representedin oatingpointwithouterror.

55

9.RootsofFunctions

If,insteadwetakex(0)=[2;1:4],weget

N(x(0);X(0))=

� [0.666666666;3.610666667]

[-1.10666667;2.093333334]� �

X(0):

Therefore,thesequencede�nedby(9.8)getsstuckatX(0),anditshows

thatthechoiceofx(0)2X(0)canhavegreatimportance.

Thelastexampleshowedthatthesequencede�nedby(9.8)doesnotnec-

essarilyconvergetoasingleton.ApossiblecauseisthatV(X)maycon-

tainsingularmatrices,inwhichcasetheintervalV(X(s))f(x(s))cancon-

tainthezerovectoreveniff(x(s))6=0,andx(s)2N(x(s);X(s))�X(s+1)

withx(s)6=x�.Ifwechoosex(s+1)=x(s),bothx(s)andx�

arecontained

inX�

=lims!

1

X(s),andthereforew(X�)>0.However,thefollowing

theoremisvalid,

Theorem

9.6.

Letx�

bearootoffinX(0)andassumethat

V

isinclusionmonotoniconX(0).Ifthefx(s)garechosensothat

lims!

1

x(s)=x�

thenlims!

1

X(s)=x�.

Proof.

X(s+1)�x(s)�V(X(s))f(x(s))�x(s)�V(X(0))f(x(s)),and

thetheoremfollowsfromf(x(s))!0.

Thistheorem

hasanimportantpracticalimplication:Wecanusean

ordinarynumericalmethodtocomputethesequencefx(s)gandcombine

itwith(9.8)toboundtheerror.Themethodcane.g.betheordinary

Newtonmethod,

x(s+1)

=

x(s)�

� J(x(s);x(s))� �1f(x(s));

(9.9)

whereJ(x(s);x(s))istheJacobianevaluatedatx(s).Ifx(s+1)computed

by(9.9)isnotcontainedinX(s+1)ascomputedby(9.8),thenwejust

replaceitbytheclosestpointinX(s+1).IftheNewtonprocessconverges,

sodoesthesequencefx(s)g,andthetheoremcanbeapplied.Asinthe

one-dimensionalcaseaconsiderableamountofintervalcomputationcan

besavedifwejustuse(9.8)toestimatetheerroroftheresultsfromthe

ordinary oatingpointalgorithm.

9.3.NonlinearSystem

56

AseriousproblemwiththemultidimensionalNewtonmethodishowto

�ndV

?Intheaboveexamplesitwasderivedfromanalyticalexpres-

sionsfor

� J(x;x�)� �1,buttoautomatethisapproachwewouldneeda

symboliclanguagelikee.g.Maple.Alternativelywecoulduseanin-

tervalextensionofJ(x;x�)andinvertthisnumerically,asdiscussedin

Chapter10.

Finally,Theorems9.3and9.4abouttheexistenceofarootgeneralize

tothemultidimensionalcase.

Theorem

9.7.IfX

isanintervalinwhichV(X)existsandwecan

�ndanx2X

sothatX\N(x;X)=;,thenthereisnorootinX.

Proof.

Followsfrom(9.7).

Theorem

9.8.Iffiscontinuouslydi�erentiable,Xisanintervalin

whichV(X)exists,andthereexistsanx2XsuchthatN(x;X)�X,

thenfhasarootinX.

Proof.

SimilartotheproofofTheorem9.4:Forthegivenxandany

y2X

thereexistsanonsingularmatrixJ(x;y)suchthat

f(x)=

f(y)+J(x;y)(x�y):

Next,wede�ne

g(y)=y�

� J(x;y)� �1f(y):Thisiscontinuousiny,

� J(x;y)� �12V(X),andweseethat

g(y)=

x�

� J(x;y)� �1f(x)2N(x;X)�X

:

TheremainingpartoftheproofisasintheproofofTheorem9.4.

57

9.RootsofFunctions

9.4.

Krawczyk'sVersionofNewton'sMethod

Krawczyk(1969)presentedaversionofNewton'smethodthatavoidsthe

inversionofintervalmatrices.Thismethodsharesthenicepropertiesof

Newton'smethod,butmayhaveslightlyslowerconvergence.

Asbefore,westartbylookingatanintervalX

containingarootx�

of

f.Letx2X

andletH

beanarbitrarymatrixinRn�

n.Then

x�

=

x�Hf(x)+(x�

�x)�H

� f(x�)�f(x)�

=

x�Hf(x)+(I�HJ(x;x�))(x�

�x)

2K(x;X)�x�Hf(x)+(I�HJ(X))(X�x):

(9.10)

Here,I=diag(1;:::;1)istheunitmatrix,andweintroducedtheso-

calledKrawczykoperatorK(x;X).IfwestartwithanintervalX(0)

containingarootx�,thentheformula

X(s+1)

=

X(s)\K(x(s);X(s))

with

x(s)2X(s)

s=0;1;:::

(9.11)

generatesanestedsequenceofintervals,allofwhichcontaintheroot

x�.ForthissequencewehavethefollowingequivalentstoTheorems9.7

and9.8,

Theorem

9.9.

IfthereexistsanxintheintervalX

andamatrix

H2Rn�

n

suchthatX\K(x;X)=;,thenthereisnorootinX.

Proof.

SimilartotheproofofTheorem9.3.

Theorem

9.10.IfthereexistsanxintheintervalX

andanonsin-

gularmatrixH2Rn�

n

suchthatK(x;X)�X,thenthereisaroot

inX.

9.4.Krawczyk'sMethod

58

Proof.

AsintheproofofTheorem9.4welookat�xedpoints.For

anyy2X

weseethat

y�Hf(y)=

x�Hf(x)+y�x�H(f(y)�f(x))

=

x�Hf(x)+(I�HJ(y;x))(y�x)

2x�Hf(x)+(I�HJ(X))(X�x)

=

K(x;X)�X

:

Thus,thecontinuousfunctiony�Hf(y)mapsX

intoitself,andthere

existsa�xedpointy�:y�

=y�

�Hf(y�).BecauseH

isnonsingular,

thisimpliesthatf(y�)=0,i.e.thereisarootforfinX.

Itfollowsfrom(9.11)thatifx(s)andH(s)canbechosensothatK(x(s);X(s))

isnarrow,thenwegetfastconvergence.Moreprecisely,from(9.11)we

�nd

w(X(s))�w(K(x(s);X(s)))

=

w((I�H(s)J(X(s)))(X(s)�x(s))

�kI�H(s)J(X(s))kw(X(s))�%sw(X(s)):

Thenorm

k�kofanintervalmatrixwasde�nedin(2.4).Wewant

tomake%s

assmallaspossible.ThisisachievedbytakingH(s)

=

� m(J(X(s)))

� �1,becausethenH(s)J(X(s))iscenteredaroundI.The

choice

x(s)

=

m(X(s))

H(s)

=

( anapproximationto

� m(J(X(s)))

� �1if%s�%s�

1

H(s�

1)

if%s>%s�

1

(9.12)

isthesubjectof

Theorem

9.11.LettheKrawczykoperatorusethechoice(9.12).If

K(x(0);X(0))�X(0)and%0<1,thenx�

2X(s),lims!

1

X(s)=x�

and

w(X(s))�%s 0w(X(0)).

59

9.RootsofFunctions

Proof.

Followsimmediatelyfrom

w(X(s))�%s�

1w(X(s�

1)),andthe

sequencef%sgismonotonicdecreasingbecauseoftheinclusionmono-

tonicityofJ(X)andthechoiceofH(s)in(9.12).

Thus,ifG(0)=I�H(0)J(X(0))satis�eskG(0)k<1,thenweareensured

atleastlinearconvergence.Underthisconditionwemayalsosimplify

theprocesswithoutloosingthelinearconvergence:Replace(9.11)by

X(s+1)

=

X(s)\

� x (s)�H(0)f(x(s))+G(0)(X(s)�x(s))� :(9.13)

ThenweavoidthecomputationofJ(X(s))andtheinversionofitsmid-

pointineachstep,buttypicallythissimpli�cationincreasesthenumber

ofstepsneededtoobtainasatisfactorynarrowinterval.

Example9.8.

Weseekarootforf(x)=

� x2 1+x

2 2�1

x2 1�x2

� ,andassumethat

weknowtheapproximaterootx(0)=(0:8;0:62),e.g.foundbya oating

pointmethod.WecanuseKrawczyk'smethodbothtotestwhetherthere

isarootclosetox(0),sayinX(0)=([0:7;0:9];[0:5;0:7]),andtogeta

betterapproximationifthisissatis�ed.

ItistrivialtoseethatJ(X)=

� 2X1

2X2

2X1

�1� ,

andwith

H

=

� 0:279

0:346

0:446

�0:446

� '� J(x

(0))� �1

weget

G0

=� [�0:1250;0:1250][�0:0446;0:0670]

[�0:1784;0:1784][�0:0704;0:1080]� ;

%0=0:2864<1;and

K(x(0);X(0))=

� [0:7657;0:8064];[0:5872;0:6481]� �X(0):

ThisshowsthatthereisarootinX(0)andwecanuse(9.13)to�ndit.In

thetablebelowwegivetheresultsfromthisalgorithmwithx(s)=m(X(s))

andthestoppingcriterionw(X(s))�10�

6.

9.4.Krawczyk'sMethod

60

s

X(s)1

X(s)2

w(X(s))

1

[0.7657323;0.8063645]

[0.5872375;0.6480857]

6.08e-02

2

[0.7815712;0.7907271]

[0.6111227;0.6249431]

1.38e-02

3

[0.7851161;0.7871866]

[0.6164709;0.6195970]

3.13e-03

4

[0.7859172;0.7863856]

[0.6176805;0.6183875]

7.07e-04

5

[0.7860984;0.7862044]

[0.6179540;0.6181140]

1.60e-04

6

[0.7861394;0.7861634]

[0.6180159;0.6180521]

3.62e-05

7

[0.7861486;0.7861541]

[0.6180298;0.6180381]

8.18e-06

8

[0.7861507;0.7861520]

[0.6180330;0.6180350]

1.85e-06

9

[0.7861512;0.7861516]

[0.6180337;0.6180342]

4.18e-07

ThetrueKrawczykmethod(9.11)withthechoices(9.12)convergescon-

siderablyfaster;quadratically.

s

X(s)1

X(s)2

w(X(s))

1

[0.7657323;0.8063645]

[0.5872375;0.6480857]

6.08e-02

2

[0.7850995;0.7872034]

[0.6164672;0.6196009]

3.13e-03

3

[0.7861485;0.7861542]

[0.6180298;0.6180382]

8.35e-06

4

[0.7861513;0.7861514]

[0.6180339;0.6180340]

5.93e-11

Example9.9.

Theone-dimensionalversionoftheKrawczykoperatoris

K(x;X)=

x�Hf(x)+(1�HF0(X))(X�x)

withH

'1=f0(x)asagoodchoice.Newton'smethod(9.2)canbeused

onlyif02jF0(X(s)),butthisversionofKrawczyk'smethodonlydemands

thatf0(x(s))6=0.

The1D

versionof(9.13)iscloselyrelatedtotheordinaryroot�nding

algorithm

knownasthe\saw-toothmethod",x(s+1)

=

x(s)�Gf(x(s)),

wheretheconstantGis(anapproximationto)1=f0(x(0)).

Finally,theremarkinExample9.9generalizes:withKrawczyk'smethod

wedonotdemandthatallmatricesinJ(X(s))beinvertible.Thetest

ofexistenceinTheorem

9.10onlydemandsthatH(s)isnonsingular,

andthetestinTheorem

9.9doesnotevenrequirethis.

Therefore,

Krawczyk'smethodcanbeusedtosearchregionsinRnforroots.Moore

andJones(1977)usedthiscombinedwithintervalsubdivisiontogetan

eÆcient,globalalgorithmforroot�nding.

10.

LinearSystemsofEquations

Thisisprobablythemostcommonnumericalproblem,bothinitsown

rightandaspartofaniterativeprocessaswesawinthepreviouschapter.

Inmatrix-vectornotationtheproblemis:GiventhecoeÆcientmatrix

A2Rn�

n

andright-handsidevectorb2Rn,�ndx2Rn

suchthat

Ax

=

b:

(10.1)

WeassumethatA

isnonsingular,inwhichcasethesolutionxexists

andisunique.Now,supposethattheelementsinAandbareuncertain,

butweknowboundsforeachofthem.Wecanusetheseboundsasend

pointsofintervals,andreplace(10.1)with

AX

=

b;

(10.2)

whereAandbaretheintervalmatrixandvectorasdescribedabove,

andweseekX2I(Rn)thatcontainsthesolutiontoeveryproblemen-

compassedin(10.2).WeassumethatAisregular,meaningthatevery

realmatrixcontainedinAisnonsingular.

Example10.1.

Considerthesystem

[2;3]X1+[�1;2]X2

=

[3;4]

[1;3]X1+[4;6]X2

=

[2;4]

Avectorx=(x1;x2)2X

mustsatisfybothequations.The�rstequation

canbereformulatedto

[2;3]x1

=

[3;4]�[�1;2]x2

;

whichhasthesolution

x1

=

8 > > > > < > > > > :[3 2

�x2;2+

1 2x2]

for

x2

�3 2

[1�

2 3x2;2+

1 2x2]for

0�x2<

3 2

[1+

1 3x2;2�x2]

for�3�x2

<0

[3 2

+1 2x2;2�x2]

for

x2

<�3

Thesetofpoints(x1;x2)givenbythisexpressionaremarkedbyhorizontal

hatchinginFigure10.1.Similarly,we�ndthatthepointssatisfyingthe

secondequationaregivenby

10.1.GaussianElimination

62

x2

=

8 > > > > < > > > > :[1 2

�3 4x1;

2 3

�1 6x1]for

x1�4

[1 2

�3 4x1;1�

1 4x1]for

2 3

�x1<4

[1 3

�1 2x1;1�

1 4x1]for0�x1<

2 3

[1 3

�1 6x1;1�

3 4x1]for

x1<0

Thesepointslieintheverticallyhatcheddomain.Thesetofsolutionsto

thesystemistheintersectionofthetwodomains,thestar-shaped,doubly

hatchedregion.

(6, 0

.9)

(0.4

, −4)

Figure10.1.

AsdiscussedinExample4.1,wecannotgivesucha�gureastheresult,

buthavetomakedowiththeintervalhull,markedbythedottedrectangle

inthe�gure.Thus,thesystem

hasthesolution

X

=

� [0:4;6]

[�4;0:9]� :

10.1.

GaussianElimination

Themethodsusedforordinary oatingpointsolutionoflinearsystems

ofequations(atleastforsmallandmediumsizedproblems,n< �200)

arevariantsofGaussianelimination,possiblywithpivoting.Togivethe

backgroundforintervalversionswestartwithaquitedetaileddescrip-

tion.

63

10.LinearSystems

Step1.Transform

Ax=btoUx=y,whereUisupper

triangular.ThenonzeroelementsofUoverwrite

theuppertriangleofAandyoverwritesb

fork=1;:::;n�1

pk=argmaxi=k;:::;njAi;kj

ifpk>kthenswaprowskandpk

inAandb

fori=k+1;:::;n

` i;k=Ai;k=Ak;k

Ai;j:=Ai;j�` i;kAk;j;

j=k+1;:::;n

b i:=b i�` i;kb k

end

end

Step2.SolveUx=ybybacksubstitution.

fori=n;n�1;:::;1

xi=

� b i�P n j

=i+1Ai;jxj

� =Ai;i

end

(10.3)

ThematrixA2Rn�

n

isnonsingulari�allthepivotsUk;k

(storedin

Ak;k)arenonzero.

Fortheintervalsystem

(10.2)thesimplestideaistoreplaceallvari-

ablesandoperationsintheGaussalgorithmbytheirintervalequivalent.

Unfortunately,thisapproachoftengivesverypessimisticresults{ex-

cessivelywideintervalsXj

{andtoooftenitbreaksdownbecauseof

thecurrentpivotAk;k

containing0.P.Wongwises(1975)madeavery

detailedstudyandafurtherdiscussioncanbefoundinNickel(1977).

Theessentialproblemisthatsomeofthematrixelementstakepartin

upton�1ofthetransformationsAi;j:=Ai;j�` i;kAk;j,andeachtime

therelativewidthoftheintervalcangrow.

Example10.2.

Fortheproblem

ofExample10.1,

� [2;3][�1;2]

[1;3]

[4;6]

� X

=

� [3;4]

[2;4]� ;

Step1oftheintervalversionof(10.3)withoutpivotinggives

` 2;1=[

1 3;

3 2];

U=

� [2;3][�1;2]

0

[1;7:5]� ;

y=

� [3;4]

[�4;3]� ;


64

andbybacksubstitutioninUX

=yweget

X

�� [�1:5;6]

[�4;3]

� :

BycomparisonwithExample10.1weseethatforthisproblem

wegetthe

correctvalueforx1

andx2,butinbothintervalstheotherendpointis

toopessimistic.

Example10.3.

Todemonstratetheover-pessimismaboutsingularityofthe

matrix,weneedaproblem

ofordern�3.Considertheproblem

(10.2)

with

A

=

0 @[2;4][0;2]

[�1;1]

1

[3;4][�:5;1:5]

[4;6][3;5]

[8;10]

1 A ;b=

0 @[8;10]

[10;12]

[8;10]

1 A :

Bymeansoftheintervalversionof(10.3)withoutpivotingwe�nd

` 2;1=[:25;:5];

` 3;1=[1;3];

` 3;2=[�1:5;2:5];

U=

0 @[2;4][0;2][�1;1]

0

[2;4][�1;2]

0

0

[0;16]

1 A ;e b=

0 @[5;10]

[�47;17]

[�64:5;27:5]1 A :

Since02U3;3

wecannotproceed.Wewillnowshow,thatthezerocan

notbeobtainedwithanyofthematricesinA.Itfollowsfrom

(10.3)and

theabovematrices,that

U3;3

=

A3;3�` 3;1A1;3�` 3;2U2;3

=

[8;10]�[1;3]�[�1;1]�[�1:5;2:5]�[�1;2]

=

[8;10]�[�3;3]�[�3;5]:

TogetthezeroinU3;3

wehavetocombinetheworstcaseinthetwo

subtrahends.TheyoccurforA1;3=1andU2;3=2,butthisendpointof

U2;3

isachievedonlyifA1;3=

�1,i.e.theotherendoftheintervalA1;3.

BymeansofthemethoddescribedinSection10.2belowwefoundthat

X

=

0 @[�2;10:0500]

[0:1200;5:7000]

[�10:1000;�0:0258]

1 A :

65

10.LinearSystems

Example10.4.

Welookatthesameproblem

asinthepreviousexample,

exceptthatwechangeA3;3to[8:5;10].Theonlychangeinthereduction

(Step1in(10.3))isthatnowU3;3=[0:5;16],andproceedingwithback

substitutionwe�nd

X

�0 @[�142;123:75]

[�44:5;99]

[�94;34]

1 A :

Thesetofallsolutionsisnowcontainedin

X

=

0 @[�1:8947;9:3914]

[0:2037;5:2632]

[�8:7827;�0:0258]1 A :

Theboundsobtainedbythesimplereplacementof oatingpointvariables

andoperationsbytheirintervalequivalentaremorethan10timeswider

thanthetrueinterval.

InordertoimprovetheaccuracywemaytryKrawczyk'smethodfrom

Section9.4.Asolutionx2X

to(10.2)isarootofthefunctionf(x)=

Ax�b,andtheKrawczykoperator(9.10)takestheform

K(x;X)=

x�H(Ax�b)+(I�HA)(X�x);

whereH

=� m(A)� �1istheoptimalchoiceforthematrixH.IfkI�

HAk<1andweknowaninitialX(0)thatcontainsasolution,thenthe

methodshouldwork.

Example10.5.

Fortheproblem

ofExamples10.1and10.2weget

H

=

� 2:50:5

2

5� �1

=

1 23

� 10�11

�45

� ;

I�HA

=

1 23

� [�6;6][�16;16]

[�7;7][�11;11]� ;

kI�HAk=

2223

<

1;

sothenecessaryconditionforconvergenceofKrawczyk'smethodissatis-

�ed.However,withtheintervalfoundinExample10.2,X(0)=

([�1:5;6];[�4;3])wehavenotbeenableto�ndanx2X(0)thatworks.

InallthecasestriedwegetK(x;X(0))�X(0),soKrawczyk'smethodis

stuckatX(0).


66

Example10.6.

Fortheproblem

ofExample10.4we�ndkI�HAk=

1:261>1,sowecannotuseKrawczyk'smethod.IfwereplaceAandbby

b A=m(A)+0:1r(A)=

0 @[2:9;3:1]

[0:9;1:1]

[�0:1;0:1]

1

[3:45;3:55]

[0:4;0:6]

[4:9;5:1]

[3:9;4:1]

[9:175;9:325]1 A ;

b b=m(b)+0:1r(b)=� [8:9;9:1];[10:9;11:1];[8:9;9:1]� ;

thenthemethodworks.WegetkI�HAk=0:1261<1,andwith

X(0)

=

0 @[1:7415;2:4394]

[2:4978;3:0292]

[�1:6906;�1:0396]1 A ;

obtainedbytheintervalversionof(10.3),andx(0)=m(X(0))weget

K(x(0);X(0))=

0 @[1:7429;2:4319]

[2:5204;2:9552]

[�1:5919;�1:0867]1 A ;

X(1)

=

X(0)\K(x(0);X(0))=

0 @[1:7429;2:4319]

[2:5204;2:9552]

[�1:5919;�1:0867]1 A ;

andproceedinglikethiswiththestoppingcriterion

d(X(s);X(s�

1))�10�

12

;

(10.4)

westopwith

X(12)

=

0 @[1:7519;2:4229]

[2:5271;2:9484]

[�1:5854;�1:0931]1 A :

ThisisbetterthanX(0),butstillquitefarfrom

thetruesolution,b X=

([1:7878;2:3937];[2:5479;2:9311];[�1:5783;�1:1314]).

IntheTablebelowwegiveresultsfortheproblemsgivenby

b A=m(A)+ r(A);

b b=m(b)+ r(b);

fordecreasingvaluesof .Foreachproblem

thevectorX(0)

istheresult

from

theintervalversionof(10.3),weusex(s)=m(X(s)),anditsisthe

numberofiterationstepsbefore(10.4)issatis�ed.

67

10.LinearSystems

kI�H

b Akits

d(Xits;b X)

10�

1

1.26e-01

12

3.83e-02

10�

2

1.26e-02

6

3.71e-04

10�

3

1.26e-03

4

3.70e-06

10�

4

1.26e-04

3

3.70e-08

10�

5

1.26e-05

3

3.70e-10

10�

6

1.26e-06

2

3.71e-12

10�

7

1.26e-07

2

3.86e-14

10�

8

1.26e-08

2

2.66e-15

10�

9

1.26e-09

2

2.66e-15

Notehoweachreductionofthewidthofthematrixandright-handside

byafactor10reducesthedistancebetweentheKrawczyksolutionand

thetruesolutionbyafactor100.Thelastthreeresultsforthedistance

areincreasinglya�ectedbyroundingerrors.

ComparedwiththediscussioninSection9.4theresultsinthelast

twoexamplesaresomewhatdisappointing.

However,weintroduced

Krawczyk'smethodforproblems,wherethetruncationwasthedom-

inatingerrorsource.

TheinitialerrorwassuÆcientlysmallforthe

methodtoconverge,anditwasreducedwiththewidthoftheinter-

val.Now,weapplythemethodonproblemswithinputerrors,thatstay

�xedthroughouttheiteration.Example10.6illustratesthatthemethod

performswellforproblemswithsmallinputerrors.Thiswasalsothe

conclusionofWongwises(1975),whousedthemethodtocomputeerror

boundsforcomputedsolutionstoproblemsoftheform(10.1),i.e.she

usedthemethodtoestimatethee�ectofroundingerrors.

10.2.

Sign-AccordAlgorithm

ThisalgorithmwaspresentedbyRohn(1989)andiscurrentlyconsidered

asthebestmethodforsolvingsystemsoflinearintervalequations.Our

presentationisbasedonMadsenandToft(1994).First,letX=X(A;b)

denotethesetofsolutionsto(10.2),i.e.

X=

� x2Rn� �� Ax=� bforreal� A2A;� b2b :

(10.5)

Figure10.1illustratesthatthissetisnotconvex.Thispropertyalso

followsfrom

thefollowing,surprisinglysimplecharacterizationofthe

solutionset,

10.2.Sign-AccordAlgorithm

68

Theorem

10.1.

LetA2I(Rn�

n),b2I(Rn).Thesolutionsetof

AX

=bis X

=

� x2Rn� � jm(A)x�m(b)j�r(A)jxj+r(b)

:(10.6)

Proof.

SeeOettliandPrager(1964).

Thenonconvexityfollowsfrom

thefactorjxj.Supposethatu;v2X,

i.e.eachofthem

satisfy(10.6),butitdoesnotfollowthatanylinear

combinationofthemase.g.x=u+vsatis�esit.

AmongalltheelementsinXthecornersolutionshavespecialinterest.

ThesubsetXCornconsistsofthevectors� A�

1� b,whereallelementsin� A

and� barechosenasoneoftheendpointsofthecorrespondinginterval

elementinA

andb,respectively.Thenumberofcornersolutionsis

2n2+n.

Example10.7.

Fortheproblem

ofExample10.1wehavecomputed250

elementsofX.64ofthesearethecornersolutions,markedby`+"in

Figure10.2below,andtheremaining186correspondtorandom

choices

of� Ai;j

intheintervalAandsimilarfor� b.

0.4

6

−4

0.9

Figure10.2.

69

10.LinearSystems

ThedottedlinebordersConv(X),theconvexhullofX.Notethateach

cornerofthispolygonisacornersolutionofthesystem.

Theobservationinthisexampleistrueingeneral:

Theorem

10.2.

LetA2I(Rn�

n)beregularandletb2I(Rn).

Then

Conv(XCorn)=

Conv(X):

(10.7)

Proof.

SeeToft(1992).

ThesolutionX

tothesystem(10.2)isanintervalvector,andasalready

mentioned,X

istheintervalhullofX,X=X.

Theorem

10.3.

LetA2I(Rn�


ThesolutiontoAX

=bis

X

=

XCorn:

(10.8)

Proof.

Theinclusions

X�Conv(X)�X

areobvious,and(10.8)followsfromX=Xand(10.7).

ThistheoremwasoriginallyprovedinBeeck(1972).Itshowsthatthe

problem

is�nite:wecan�ndX

viathecomputationofthecorner

solutions.Eachoftheseisfoundbysolvingarealsystem

ofordern,

andthecomplexityofthisapproachisO(n32n2+n),whichisprohibitive

evenforsmallvaluesofn,


70

n

2

5

10

n32n2+n

5:12�102

1:34�1011

1:30�1036

Wecandomuchbetter.Thepointswheretheinequalityin(10.6)turns

intoequalityarethesocalledextremesolutions,

XExtr

=

� x2Rn� � jm(A)x�m(b)j=r(A)jxj+r(b)

:(10.9)

Inconnectionwith(10.14)belowweshowthatXExtr�XCorn.Wein-

troducethesignvector

y=

sgn

� m(A)x�m(b)� :

ThisisanelementinSn,

Sn=

� y2Rn� � y i2f1;�1g;i=1;:::;ng:

Now,(10.9)canbereformulatedto

XExtr

=

� x2Rn� � m(A)x�m(b)=Dy(r(A)jxj+r(b))

forsomey2Sn

;

(10.10)

wherewehaveintroducedthediagonalmatrix

Dy

=

diag(y1;:::;yn):

(10.11)

ThenexttwotheoremsofRohn(1989)showthatwecanreducethe

problem

from

�ndingthe2n2+n

elementsinXCorn

to�ndingthe2n

elementsinXExtr.

Theorem

10.4.

LetA2I(Rn�


Foreachy2Snthenonlinearequationin(10.10)hasexactlyone

extremesolutionx=xy,and

Conv(XExtr)=

Conv

� x y� � y2Sn

=Conv(X):(10.12)

71

10.LinearSystems

Theorem

10.5.

LetA2I(Rn�


ThenthesetXExtrofextremesolutionsforAx=bsatis�es

XExtr

=

X:

(10.13)

Example10.8.

Fortheproblem

ofExamples10.1,10.2,10.5and10.7the

extremesolutionsare

y

(1;1)

(1;�1)

(�1;1)

(�1;�1)

xy

(20 9

;4 9)

(6;�4)

(0:4;0:8)

(1415;�0:2)

ThefxygarethecornerpointsofthepolygoninFigure10.2.

Thereductioninthenumberofpointsisnotforfree.Eachpointin

XExtristhesolutiontoanonlinearsystemofequations.From(10.10)

weseethatforagiveny2Snwehavetosolve

m(A)x�m(b)=

Dy

� r(A)jxj+r(b)� :

Introducingz2Snde�nedbyz=sgn(x)(i.e.jxj=Dzx)weseethat

thissystemisequivalentwith

Axyx

=

b y

whereAxy=m(A)�Dyr(A)Dx;b y=m(b)+Dyr(b):

(10.14)

ItisseenthatallelementsinAxy

andb yareatanendpointofthe

correspondingelementinAandb,respectively,sothesolutionisacorner

solution,andwehaveveri�ed(10.9).Further,ifthesolutionxsatis�es

sgn(x)=z,thenx=xy.

Now,wearereadytoformulatethealgorithmforcomputingthesolution

toAX

=b. X

:=;

foreachy2Sndo

�ndtheextremesolutionxy

X

:=X[fxyg

(10.15)


72

Theextremesolutionisfoundbythesign-accordalgorithm

duetoRohn

(1989),

giveny2Sn

chooseinitialz2Sn;

fd:=false

repeat

solveAxyx=b y

if

allz ixi�0then

xy:=x;

fd:=true

else

k:=argminjfzjxj<0g

z k:=�zk

end

untilfd

(10.16)

Thestoppingcriterionisthatthesignofxisinaccordancewithz.Rohn

(1989)showedthatthealgorithmstopsafteratmost2n

iterationsteps,

andrecommendstheinitialchoicez=sgn

� m(A)�1b y

� .

Inthebestcasethe�rstxisinsign-accordancewiththeinitialz,so

thecomplexityofalgorithm(10.15)isbetweenO(n32n)andO(n322n).

WerefertoMadsenandToft(1994)aboutfurtherenhancementsofthe

algorithm,aspectsof oatingpointarithmetic,andhowthealgorithm

canbesuccessfullyimplementedonaparallelcomputer.

11.

GlobalOptimization

Manytechnicalandeconomicproblemscanbeformulatedasmathe-

maticalprogrammingproblems,i.e.astheminimizationofacontinuous

nonlinearfunctionf:D7!R,whereD�Rn.Often,thefunctionfhas

severallocalminima,butweareonlyinterestedinthesmallestone

f�

=

inf� f(x)

� � x2D:

(11.1)

f�

istheglobalminimum

andtheproblemof�ndingf�

andthepoints

X�

=fx2Djf(x)=f�gwhereitisattainediscalledglobaloptimiza-

tion.Methodsforglobaloptimizationcanbedividedintotwoclasses,

deterministicmethodsandstochasticmethods,seee.g.Pint�er(1996).

Weshallpresentadeterministicmethodbasedonintervalanalysis.

11.1.

BasicMethod

Theproblemof�ndingtheglobalminimumiscloselyrelatedtothesub-

jectofChapter7:f�

isequaltothelowerboundoftheintervalhull

f(D),andweshallexpandtheideasoutlinedattheendofChapter7,

combinedwithrelevantalgorithmsfromChapters8and9.Thepresen-

tationisbasedonMadsen(1991)andproofofconvergencemaybefound

inMooreandRatschek(1988).

Thealgorithmisabranchandboundtypemethod.Atanystageofthe

algorithmwehavethecandidatesetC.Thisisa�nitesetofsubregions

C(j)�DwiththesetX�

ofminimizersoffcontainedintheunionof

fC(j)g.Theaimistoreducethecandidateset,andtodothat,letFbe

anintervalextensionoffandnotethat

min jfL(C(j))g�f�

�min jfU(C(j))g��;

(11.2)

wherethefunctionsLandUaretheboundsoftheintervalreturnedby

F,

L(C(j))=

lowerboundofF(C(j));

U(C(j))=

upperboundofF(C(j)):

(11.3)

Therefore,if

L(C(j))>�;

(11.4)

11.1.BasicMethod

74

thenC(j)canbediscardedfromthecandidateset.Otherwise,wecan

getsharperboundsbysplittingC(j)intotwo,smallersubregions,cf.the

algorithmsinChapters7{9.Ifn=1,thenthesplittingisdoneby

simplebisection,andinmultipledimensionswebisectinthedirection

ofthelargestcomponentoftheradiusofC(j).

Nowwearereadytosketchthebasicalgorithm.Wesplitthecandidate

setCintoaworksetSandaresultsetR,sothatX�

isalwayscontained

intheunionofSandR.If

w(F(S(j)))�Æ;

(11.5)

thentheelementS(j)ismovedfromStoR.

Algorithm

GlobalOptimization1

S(1):=D;S:=fS(1)g;�:=U(S(1));

R:=;

while

S6=;

i:=argminjfL(S(j))g

X

:=S(i);

removeS(i)fromS

splitX

intoX(1)andX(2)

^�:=minfU(X(1));U(X(2))g

if^�<�

then

�:=^�;

use(11.4)toreduceS

end

fork=1;2

ifL(X(k))��

then

ifw(F(X(k)))�Æthen

R:=R[X(k)

else

S:=S[X(k)

end

end

end

(11.6)

Example11.1.

Considerthefunctionf(x)=(1�x

2)cos(5x)onD=[0;2]

withtheintervalextensionF(X)=(1�X

2)cos(5X).

InFigure11.1wegiveselectedresultsfrom

theiterationwithAlgorithm

(11.6).kdenotesthenumberofrepeats.

75

11.GlobalOptimization

01

2

−202

k =

1

01

2

−202

k =

2

01

2

−202

k =

3

01

2

−202

k =

4

01

2

−202

k =

5

01

2

−202

k =

6

01

2

−202

k =

7

01

2

−202

k =

8

01

2

−202

k =

26

01

2

−202

k =

27

01

2

−202

k =

28

01

2

−202

k =

29

01

2

−202

k =

30

Figure11.1.

ThestartingpointisS=DwithF(D)=[�3;3].Fork=5theinterval

[1:75;2]satis�es(11.4)andisdiscarded.Fork=9;:::;25computationis

focusedontheregionaroundthetrueminimum,butwestillkeepthelocal

11.2.UseofGradient

76

minimum

around0.6inthecandidateset.Thenthelowerboundaround

theglobalminimum

hasbecomesolargethatthecandidatesaroundthe

localminimum

aresplitanddiscarded.

WithÆ=10�

2

ittakes20furtheriterationstepsbeforethecandidateset

isempty.Theresultlistholds24neighbouringintervalsspanningthe

intervalX

=

[1:3457;1:3926],whichcontainsX�

.Therelation(11.2)

takestheform

�0:7444�f�

��0:7352.

11.2.

UseofGradientInformation

SupposethatwenotonlyhaveaccesstotheintervalextensionF,but

alsotoandintervalextensionF0

ofthegradientf0

=� @f

@x1

;:::;

@f

@xn

� .

Thisinformationcanbeusedintwoways,

1.Monotonicity.If02j

� F0(S(s))� j,thenfismonotoneinthecom-

ponentxj

onthesubdomainS(s).Therefore,wecanreducethejth

componentoftheintervalvectorS(s)toitslower(respectivelyupper)

boundif

� F0(S(s))� j>0(respectively

� F0(S(s))� j<0).Furthermore,if

thisreducedcandidateisinteriortotheoriginaldomainD,thenwecan

discarditfromSbecauseaninteriorminimumisastationarypoint,i.e.

thegradientiszero.

2.Stationarypoints.Asjustmentioned,thegradientataninterior

minimizeriszero.Thismeansthataninteriorminimizerisarootofthe

equationf0(x)=0,andwecanuseoneofthemethodsfromChapter9

tolocatethisminimizer.Especially,ifwealsohaveanimplementationof

theJacobianoff0,i.e.theHessianf00

(oruseautomaticdi�erentiation

tocomputeit),thenwecanuseKrawczyk'sversionofNewton'smethod

(9.11)totryand�ndtheminimizer.Therearethreepossibleresultsfor

theintervalX

=S(i):

i)

Krawczyk'smethodshowsthatX

containsnoroot.IfX

is

interiortoD,thenwecandiscard(S(i))fromthecandidateset,

otherwisewecanreduceittotheunionofitsnon-interioredges.

ii)

Xcontainsaroot,andKrawczyk'smethodisusedto�nditwith

highaccuracy.S(i)canbereplacedbytheresultinginterval.

77

11.GlobalOptimization

iii)

Krawczyk'smethodstallsatX,maybebecausetheintervalis

toowide.Usesimplesplittingtoreducethewidth.

Now,wecanoutlinethealgorithm.Theupdatingofthethreshold�and

thesetsSandRisperformedasinAlgorithm11.6.

Algorithm

GlobalOptimization2

S(1):=D;S:=fS(1)g;�:=U(S(1));

R:=;

while

S6=;

i:=argminjfL(S(j))g

X

:=S(i);

removeS(i)fromS

ifMonotone(X)then

X

isreducedtoRX,see1.above

elseifNewton(X)worksthen

X

isreducedtoRX,seei)andii)above

else sp

lit:RX

:=fX(1);X(2)g

with

X

=X(1)[X(2)

end

useRX

toupdate�andS

withallelementsinRX

discardorstoreinSorR

end

end

(11.7)

Example11.2.

Wehaveappliedthisalgorithm

tothesameproblem

asin

Example11.1andgottheperformanceillustratedinFigure11.2.Astar

signi�esanelementintheResultsetR.

The�rstthreeiterationstepsareidenticalwiththeresultsfrom

Algo-

rithm

11.6,andfork=4the\best"subregionisX

=

[1:25;1:5],and

Krawczyk'smethodgivesustheglobalminimizerwithassociatedthresh-

old�=

�0:73979,whichalsodiscardsthecandidate[1;1:25].There-

mainingiterationstepssuccessivelydiscardtheremainingelementsinthe

candidateset.Theresultlistholdsoneelement,

x�

2X

=[1:36864016023728;1:36864016023729];

andtheminimumis

11.2.UseofGradient

78

01

2

−202

k =

1

01

2

−202

k =

2

01

2

−202

k =

3

01

2

−202

k =

4

01

2

−202

k =

5

01

2

−202

k =

6

01

2

−202

k =

7

01

2

−202

k =

8

01

2

−202

k =

9

01

2

−202

k =

10

Figure11.2.

f�

2F(X)

with

m(F(X))=�0:7397955,r(F(X))=1.55e-15:

Thus,Algorithm

11.7needsfeweriterationsandgivesamuchmoreaccu-

rateresultthanAlgorithm

11.6.

Asmanyotherproblemsglobaloptimizationsu�ersfromthe\curseof

dimensionality",i.e.thecomputationalworkgrowsfastwiththedimen-

sionn.Thereforeitisappropriatetouseparallelcalculationinorderto

decreasereal-time.Madsen(1991)describesaparallelversionofAlgo-

rithm(11.7).

79

References

References

[1]

H.

Beeck

(1972),

� Uber

die

Struktur

und

Absch�atzungen

der

L�osungsmengevonlinearenGleichungssystem

mitIntervalkoeÆzienten,

Computing10,pp231{244.

[2]

G.Corliss,C.Faure,A.Griewank,L.Hasco�etandU.Naumann(2002)

(editors),Automaticdi�erentiationofalgorithms,Springer,NewYork.

[3]

R.Krawczyk(1969),Newton-AlgorithmenzurBestimmungvonNul-

stellenmitFehlerschranken,Computing4,pp187{201.

[4]

K.Madsen(1991),Parallelalgorithmsforglobaloptimization,Report

91-07,IMM.DTU.

[5]

K.MadsenandO.Toft(1994),A

parallelmethodforlinearinterval

equations,IntervalComputations3,pp81{105.

[6]

R.E.Moore(1966),IntervalAnalysis.Prentice-Hall,EnglewoodCli�s,

N.J.

[7]

R.E.Moore(1976),Oncomputingtherangeofvaluesofarational

functionofnvariablesoveraboundedregion,Computing16,pp1{

15.

[8]

R.E.MooreandS.T.Jones(1977),Safestartingregionsforiterative

methods,SIAM

J.Numer.Anal.14,pp1051{1065.

[9]

R.E.MooreandH.Ratschek(1988),Inclusionfunctionsandglobalop-

timizationII,Math.Progr.41(3),Aseriespp341{356.

[10]

A.Neumaier(1990),Intervalmethodsforsystemsofequations,Cam-

bridgeUniversityPress.

[11]

A.Neumaier(2001),IntroductiontoNumericalAnalysis,Cambridge

UniversityPress,Cambridge.

[12]

K.Nickel(1967),DieVollautomatischeBerechnungeinerEinfachen

NullstellevonF(T)=

0einschliesslicheinerFehlerabsch�atzung,Com-

puting2,pp232{245.

[13]

K.Nickel(1977),Interval-Analysis.pp193{226inD.Jacobs(ed),

ProceedingsoftheConferenceontheStateoftheArtinNumerical

AnalysisheldattheUniversityofYork,April12th{15th,1976.

[14]

W.OettliandW.Prager(1964),Compatibilityofapproximatesolution

oflinearequationswithgivenerrorboundsforcoeÆcientsandright-

handsides,Numer.Math.6,pp405{409.

References

80

[15]

J.D.Pint�er(1996),Continuousglobaloptimization:abriefreview,Op-

tima52,pp1{8.Availableathttp://plato.la.asu.edu/gom.html.

[16]

F.N.Ris(1972),Intervalanalysisandapplicationstolinearalgebra,

D.Phil.Thesis,Oxford.

[17]

J.Rohn(1989),Systemsoflinearintervalequations,LinearAlgebra

Appl.126,pp39{78.

[18]

S.Skelboe(1974),Computationofrationalintervalfunctions,BIT14,

pp87{95.

[19]

S.Skelboe(1977),Trueworst-caseanalysisoflinearelectricalcircuits

byintervalarithmetic,ReportIT11,InstituteofCircuitTheoryand

Telecommunications,TechnicalUniversityofDenmark,Lyngby.

[20]

O.Toft(1992),Sequentialandparallelsolutionoflinearintervalequa-

tions,Master'sthesis,InstituteforNumericalAnalysis,TechnicalUni-

versityofDenmark,pp1{98.

[21]

P.Wonqwises(1975),ExperimentelleUntersuchungenzurnumerischen

Aufl�osungvonlinearenGleichungssystemenmitFehlererfassung.

pp

316{325inK.Nickel(ed),"IntervalMathematics",LectureNotesin

ComputerScience29,SpringerVerlag.

81

Notation

Notation

a;a

LowerandupperlimitofintervalA

p.6

jAj

UpperboundofintervalA

(2.1),(2.2)

kAk

NormofintervalA

(2.3),(2.4)

d(A;B)

DistancebetweenintervalsAandB

(5.1),(5.10)

f(X)

IntervalhulloffonX

(4.1)

fM

(X)

Meanvalueform

(6.3)

J(x;y)

Jacobian

p.52

K(x;X)Krawczykoperator

(9.10)

m(A)

MidpointofintervalA

(2.1),(2.2)

Mij

(i;j)thelementintheintervalmatrixM

p.7

N(x;X)Newtonoperator

(9.1)

r(A)

RadiusofintervalA

(2.1),(2.2)

w(A)

WidthofintervalA

(2.1),(2.3),(2.4),p.22

Xi

ithelementintheintervalvectorX

p.6

X(s)

sthelementinasequenceofintervals

pp.33,47,73

X

SetofsolutionstoAx=b

(10.5)

Index

82

Index

adaptivealgorithm,35,39

associativerule,11

automaticdi�erentiation,43,76

bisection,46

branchandbound,73

cancellation,10

candidateset,73

commutativerule,11

compositemapping,19

continuity,19

convergence,19

linear,35

quadratic,35,51,60

convexhull,69

cornersolution,68

curseofdimensionality,79

degenerateinterval,6

distance,16

distributiverule,11

divide-and-conquer,39

existenceofroot,46

extendedrationalfunctions,15

extremesolution,70

oatingpoint,34,54,72

Gaussianelimination,63

gradient,27,36,76

hybridmethod,41,46

inclusionmonotonicity,9,14,28,55

inputerrors,4,37,67

intervalextension,13,24,25,30,33

{function,13

{hull,12,15,21,62,69

intervalmatrix,7

{vector,6

inverseelement,11

Jacobian,52,53,76

Krawczykmethod,57,60,65,76

Lipschitzcontinuity,20

machineaccuracy,54

meanvalueform,27

{theorem,24,27,37,50

metric,16

midpoint,6,27

multipleroots,45,48

nestedsequence,47,57

neutralelement,11

Newton'smethod,47,55,76

norm,7

parallelcomputation,72,79

radius,6

rationalintervalfunction,22

regularmatrix,61

Riemannformulas,40

roundingerrors,3,30,34,37,54,67

saw-toothmethod,60

Simpson'sformula.,40

singleton,6,23,26

stoppingcriterion,35,42,59,66,74

sub-distributivity,11

Taylorexpansion,29

truncationerrors,4,37,38,67

width,6,22

IMM...IMM 20.11.2002 In tro duction to In terv al Analysis Ole Caprani Ka j Madsen Hans Bruun...

Documents

Transcript of IMM...IMM 20.11.2002 In tro duction to In terv al Analysis Ole Caprani Ka j Madsen Hans Bruun...