STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner...

31
-A1BU 623 STRONG CONSISTENCY OF ESTIMATION OF NMNER OF REGRESSION VARIABLES WHEN T (U) PITTSBURGH UNIV PA CENTER FOR MULTIVARIATE ANALYSIS Y WU JUN 87 TR-87-i5 UNCLASSIFIED AFOSR-TR-87-1245 F49620-85-C-088 F/G 12/3 M EEEEEEEohEoiE MEE

Transcript of STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner...

Page 1: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

-A1BU 623 STRONG CONSISTENCY OF ESTIMATION OF NMNER OFREGRESSION VARIABLES WHEN T (U) PITTSBURGH UNIV PACENTER FOR MULTIVARIATE ANALYSIS Y WU JUN 87 TR-87-i5

UNCLASSIFIED AFOSR-TR-87-1245 F49620-85-C-088 F/G 12/3 M

EEEEEEEohEoiEMEE

Page 2: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

111. 528 25

.1* ml Um 2.

111112.

-9(RCOPY RESOLUTION T

S0 , - 0 S O ! 0 0 0 -*O S S Sl * - 0 . zl .. ..@ ... ' ,.,

Page 3: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

unclassified,LL%.I4I f it A.A ,SI AISUN u,.61 rI A( L (UN., r, I. &', .C P

. "lREAD INSTRUCTIONS "

REPORT DOCUMENTATION PAGE m

I. 41P1'RT NUMBER 2 GOVT ACCESSION NO I. RECIPIENT'S CATALOG NUMBER

AFOSR-TR. 87- 9'4 5TIT L.& (anld Subi,,,e S. TYPE OF REPORT & PENIOU COVERED

N' Strong consistency of estimation of number of

regression variables when the errors are indep- 4l - June 1987endent and their expectations are not equal to 6, PERFORMING 0MG. RkPORT NUMULR

.a:h. nthpr- 87-15W AUTNOR(f) G. CONTRACT OR GRANT NUMER(.

oo Yuehua Wu F49620-85-C-0008

, PERFORMING ORGANIZATION NAME AND ADDRESS 10. PROGRAM ELEMENT. PRO) FCT. TASK

I Center for Multivariate Analysis 1 3A.!ORK UNIT NUMBERS

Univerpity of Pittsburgh, 515 Thackeray Hall 1 Q

i Pittsburgh, PA 15260CONTROLLING OFFICE NAME AND ADDRESS IS. REPORT DATE

-kir Force Office' of Scientific Research June 1987"Department of the Air Force IS. NUMBER OF PAGES

Bolling Air Force Base, DC 20332 22,' s14 MONITORING ACENCY NAME & ADDRESS01 dlilaoI nt 11omn Controlling 0 l h ce) I. SECURITY CLASS. (of thie ,pport)

"-C c"Z - C' S .

/ unclassifiedI-6 iS L S iii FI C A T5ON/DWNGR A DING

SCHEDULE

1. DISTHIBuTION STATEMENT (o Ihi Reporij

Approved for public release; distribution unlimited

17 DISTRIOWjTION STATEMENT (at the abstract enter~ed in Block 20. It diII.,.nt train Report)

I.°

'1~13 AUPPLEMENTARY NOTES

,,,: k OCT i 8tEll

1 KEY WORDS (COwtlnu ote#*veee olde II necessary avid Ifletilly by block number)

Linear model, model selection, regression coefficient, strong consistency

0 5O A I RAC I (C et..,en on revetso ad. i necessry and Ide1fiIJI by bl ock 1 2wmbh {)Consider the linear regression model Yi = x!5 + ei i = 1,2,..., where {x is

a sequence of known p-vectors, 5' = ).., is an unknown p-vector, known

as regression coefficients, {e.} is a sequence of random errors. It is ofinterest to test the hypothesis Hk: 8k+1 = . = 0, k = 0,1,...,p. We do

not assume that the random errors are identically distributed and have zero meanssince it is sometimes unrealistic. As a compensation for this relaxation, weassume the errors have a common bounded support [a,,a,]. Under certain

DD o.,- 1473SECURITY C't'1SMItAfiON OF THIS PAGE (Ol'ton Doe Entered)

- 1. r* '- r - . '

Page 4: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

unclassified%.CUNITY CLA' IIICATIUN OF TH8i PAC'(Whf' Deld En#8r0d)

conditions, we obtain the strongly consistent estimate of the number of k fdrwhich k 0 and gk+l . op 0, by using the information theoretical criterila

k+

,J

! '

.r.

*-p

Page 5: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

.1. AFOSR-TM!h 87-12 4'5

STRONG CONSISTENCY OF ESTIMATION OF',. NUMBER OF REGRESSION VARIABLES WHENTHE ERRORS ARE INDEPENDENT AND THEIR

EXPECTATIONS ARE NOT EQUAL TO EACH OTHER*4.

i' Yuehua WU

Center for Multivariate Analysis

University of Pittsburgh

Center for Multivariate Analysis

University of Pittsburgh

4M. '";-.'

OS.

Page 6: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

AFOSR.TK- 8 7- 1 24 5

STRONG CONSISTENCY OF ESTIMATION OFNUMBER OF REGRESSION VARIABLES WHENTHE ERRORS ARE INDEPENDENT AND THEIR

EXPECTATIONS ARE NOT EQUAL TO EACH OTHER*

Yuelua Wu

-r Center for Multivariate AnalysisUniversity of Pittsburgh

I? Accc;i,.i For

"~NTIS CiA&I

W".' -A

June 1987 // " "-.4.", . ... t -, . . ..... .. .. .

Technical Report No. 87-15

-1r

Center for Multivariate Analysis

Fifth Floor, Thackeray HallUniversity of PittsburghPittsburgh, PA 15260

*Research partially supported by the Air Force Office of Scientific Research (AFSC)under contract F49620-85-C-0008. The United States Government is authorized toreproduce and distribute reprints for governmental purposes notwithstanding anycopyright notation heron.

Page 7: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

1

Abstract

Consider: the linear regression model y = x: + e, i = 1, 2 .... where

{x I is a sequence of known p-vectors, 1' = (_ . .. 8 I is an unknown p-

vector, known as regression coefficients, {e)I is a sequence of random errors. It is

of interest to test the hypothesis H = = 0, k = 0, 1 p.. ,k +1 .

We do not assume that the random errors are identically distributed and have zero

* means, since it is sometimes unrealistic. As a compensation for this relaxation, we

assume the errors have a common bounded support [a . a 2. Under certain*1 - 12

conditions, we obtain the strongly consistent estimate of the number k for which J

- 0 and ... = = 0, by using the information theoretical criteria.k+1 P

J,.'"

/-

'F 3

, iI,

"° " ", 4%.,•. , '"" % '"% "% ''"% %,'- -% '" " ',- """ %"%-" - "*%"% - , , '% "% """ """,

Page 8: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

2

1. Introduction

Consider the linear model

y. = x.' + e.,i=1,2, n.

(1)

where x s are experiment points, n = Y.... )' is the regression coefficient

vector to be estimated, and e's are random errors. In the usual linear regression

model it is assumed that the random errors have vanishing expectations and common

variance. In this case, the famous least square estimation (LSE) method plays an

important role in making statistical inference upon the regression coefficient vector

a. In the literature, there are a lot of papers concerning with the LSE and many

important results are obtained (a part of work refers to [1],[2] and [3]).

However the unbiasedness and consistency (even the weak one) of LSE strongly

depend on the assumption that the expectations of errors are zero, and this

assumption is not realistic sometimes. It is of interest to find a consistent estimates

of the regression coefficients when the expectations of errors are not equal to

each other In [4] two methods for finding consistent estimates of the regression

coefficient vector a are proposed.

The first method is to use the measureJw.

I

' q () = max (y. - x.' ) Min (y. - x.'Bn, 1<1<n i i <l<11 i i

The estimator B of a is defined as the vector which minimizes Q (a) The estimate

* ~is temporarily called MD estimate of a in [4] (the estimate based on then

Maximum Difference between residuals)

The second method is to use the measure

W. ~nB Max lyi - x.'aln 1<<n I I

Denote by B the value of a which minimizes Q (B) Also. B is temporarily called

MA eatimate of ( (the estimate based on the Maximum Absolute values of residuals)

. .. .

E . . ..ql >..

...B.*- .

Page 9: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

1:'

3

. Under certain conditions, both B and B are shown to be strongly consistent

in [4],

Now let us consider the hypotheses

Hk: 6k+1 = Bk+2 O. = 0 and B k # 0

k = 0,1 . ,p-1.

It is of interest to determine the true model H by using the model selection

criteria Denote by B kn = (Bkin' kkn, 0 ... 0)' the vector which minimizes

Q (B) under the restriction B . - = =0 and denoted by B (Bn k+1 p kn kin

B 0 , 0Y the vector which minimizes Q (B) under the restrictionSkkn n

q= .. = =0O. Writek+1

p

:k - Qn{ (kn)

and

Q = Qn (0kn >

Choose a sequence of constants C, satisfying certain conditions which will be'V nspecified later, and define

"" Rk Q + kC

k k n

and

R -Q + kCk k n

Choose

i ... , .V

Page 10: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

4

k = ArgMin{R k e {0, p}}

k

and

k= ArgMin{Rk: k c {0, p}}-kwhere ArgMin denote the index which minimizes the quantities following the symbol

ArgMin.

In this paper we shall consider the consistency of k and k to the true model

k.0

2. Consistency of k

In this section, we make the following general assumptions:

Assumption 1. The errors e, i = 1,2. are independent.

Assumption 2. P{e c [a1, a J 0 and there is a positive constant 6 such

that for any c > 0 and any n, we have

P{e c [a , a + CD] > AE

and

P{e n [a2 - c, a2]} > Ae.

Assumption 3. For any a > 0, there exists a positive constant C such that for

any vector ot ft 0 it follows that

#{i < n, I (x.)-9(a) I < a} > Cn

for large n, hereafter (a) = a/1X1

.J%.

Page 11: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

5

Assumption 4. There exists a positive constant m such that

lxii > m, for i 1,2,

Now let us estimate Q (s). Definen n

E = i <n, -x!.B -B > 0}n I n

', (2) "E ( i < n, x!n >0}n I n

Split S = { x e RP: l xi 11 into d disjoint parts Z, . such that Vp 1' "' d

x. y ,x'y > 3/4. Lety Z , j = 1, . d. Define EJ = {i < n, Z(x)'y >' J J n -

3/4), j= 1 . .. d. By Assumtion 3, there exists S > 0 such that

# > 6 n, j = 1,2, ,d.

It is easy to see that -( n - a) e Z and i c EJ implies that

n n

and that 9Aj - B) c I and i c EJ implies thatnj n

x - ) > 0, i.e. i E "

Take r satisfying

r - 0 and nr /logn -=.. n n3

we have

.%

=,"

Page 12: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

6

P(Q n(n) < a2 - a1 - 2r n

<P(max a. r~ )r + P( lE a. +8 rn n

d --

dE~ P( E "a ) i a 2 - rn' 91(n jJ=1 icE

n

d+ E P( mi " > a I + r n c .

n.nnj=1 iEn

d

Z P( max e. < a - r 9( - IJ=1 i E I 2 n ' nj

ndA

+ " P( m i e. > a + r -

j':' nnoC icE1n

d

< . P( max e. < a - r )1 ileE J i - 2J=l C

nd

+ [ P( min e. >a + r,jl icEJ , n

n

< 2d(I-r ) 1 n< 2de n 1 < 2d/n 2

n

for large n. By Borel-Cantelli Lemma we have

Q (B >a - a - 2r, a.s.

when n is large enough.

Let k be the index of the true model and let 0 be the true parameterThen 0Then obviously we have .for p > k > k

-. , - -

04

Page 13: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

7

n ' ' n ' n Q n 'kn

:< Q( ) < Q (B) <a - an n 0-2 1

Thus

-,' )0 < Qn ( () < 2r , p > k > k

If we take C such that C n . C /r n, then for k > kk ' n n n l > 0

01,,.,[ Rk - Rk = k- kO )Cn + Q( n) - n( ) >0O,

f k knn, (2)

for all large n.

Next, we consider the case of k < k0. Denote>00

n =Ik 0~o > 0

and define

E = i < n. 2 (x.) + 9(. - Q), < 1/2}

E n {i < n, Ik (x.) - (k n B 0 < 1/2}

Split S into b disjoint parts 11 11 b such that V x. y c fl Ix -Yp 1 b,

1/4 Let II. j= 1, b DefineJ

Page 14: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

* 8

Fj = {i < n,J(x.) -(x < 1/41 j = 1 b.nJ

S.. By Assumption 3. there exists 6 > 0 such that2

#(F )> 52n, j = 1,2 ,b.

A,2

It is easy to see that

4,.-

-9( - O) 13 I. and i c Fj

-n 0 j n

which implies that

I9 x. ) + ( - < 1/2. i e. i c E.

Also,

"n c JI. and i C Fj

0 n

which implies that

.. (X- + -kI < 1/2, i.e. i E- ,kn 0n

For i E ,we haveWA3

,0-B ) xi 1 I kn-aOI 9' (xi) 9- ( n BO

. "kn%'=~

Ol0

.2

"A ..- . .- . . .; , . . , , . . . . . . . . . . : . . .. . .. .. . .. . . . ., . . - . - . , , .. . . . . - - - - . . . . . ." . . - " , n " " " " " " " " " - " - -" - " -" -

Page 15: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

9

:S -rT1 - 1/2) = -m/2.

Similarly for i c E , we have

x! ( mn > rTI/2.

Hence

Qn(kn) > ma.e i mine. + mTi E iEE-E

n n

Thus

P (Qn(kn) <a 2 - a, + m'r/2)

< P( ma e. < a2 - mrn/4) + P( min e. > a + mrT/'.)-icE icE

n n

E P( ma. e.<a2 -mn/4, -9 II..)j=1 i c E

n

b )+ Z P( min e > a + mn/4, 9( - B) I.)

-'.'-'.~ i E

n

bAE P( max e. < a - mTl/4, -9( - O) e: I.)

J=1 ie-" - 2 0jn

*OpP[ 4

Page 16: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

10

b+ P( min e. > a + mTI/4, A

j= ieFJ I - 1 9Jn

b

E P( may e < a 2 - m/4)j=1 i C FJ

n

b"4 + < Z P( min e. > a + mn/4)

.j 1 i CFI

n

n - AmI 6 n/ 4 2< 2b(I - <m/4) 2 < 2be 2 2b/n

for large n. By Borel-Cantelli Lemma, we have, with probability one,

Qn(a a 2 - 1 + m-l/2, for all large n.

Thus for k < kO, we have

Rk - R 0 = (Bkn) - Qn (ak ) o-k) nOn 0

> mr/2 - (k - k)C > 0,-- n

(3)

for large n, since C -. 0n

(2) and (3) imply that k is strongly consistent. Summarize the above

arguments, we get the following theorem.

Theorem 1. Choose C satisfyingn

%I

Page 17: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

0 - 11

,,(i) C ,

'4"'

(ii) nC /lognn

Suppose the four Assumptions given at the beginning of this section are true, then

k -, k, a.s

Proof. Use the arguments given before. We only need to note that for any

sequence of C satisfying (i) and (ii), we can always choose r such thatn n

(i)' r /C -b, 0,

(ii) nr /logn -

SQ. E. D.

3. Consistency of k

, In this section, we shall make the following general assumptions:

Assumptiom 1 The error e, i = 1, 2. ... are independent,

. %

Assumptiom 2 11 a, < a V in => P(e [a, a]) = 0 there is a positive2 n 1 2

constant L such that for any E > 0 and for any n, we have

P(e c [a - c. a ]) > AE

n 2 2-Assumptiom 3. Same as Assumptiom 3 in Section 2.

Assumptiom 4'. There exists a positive constant m such that

V,=,..

IxI > m, for i = 1,2, .[

Now let us estimate (. Define, , .n.

~ ;.,',. ,..y!. ~ . . p .. .... * ** . *,..* L*

Page 18: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

12

E { < n, x' > 0}n -nf

Split S into d disjoint parts ,......- such that V x, y c , x'y > 3/4.p 1 d J

Let yJ E 2., j = 1..... d. Define =n ={i < n, 9.(x )Y"J > 3/4}, j =1, . d. By

Assumption 3, there exists 6 > 0 such that

# > S n, j = 1, ,dn - 1'

It is easy to see that

S c E . and i c EJn J n

I nl n

Take r satisfyingn

rn -b 0, nr n/logn -n n"

We have

P(Q ( < a2 - r ) < P( max e < a 2 r nnn-2 n - 2EF nIcE

n

dI P( max e < a - r, 9 C)

J=1 icE J

d

SP( max e < a2 -r, -,(sSJ 2 n n j

n

Page 19: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

13

d

P max e. < a -r)-Ej I 2 n

- - n -Ar 6n 2< d (0 Ar ) 1 < de n 1 < d/n

n

f or large n. By Borel-Cantelli Lemma we have

> (~ a - r , a.s.nn -2 no

when n is large enough.

Let k be the index of the true model and let be the true parameter.0 0

Then obviously we have for p > k > k0

n n n pn -n kn

Qn (Bkn) - n 0-< 20

Thus

-n k n n kn -no 0'0

If we take C such that

C -. 0, C /r ccn n n

then for k >k 0

@4

Page 20: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

14

R - R (k - k ) Cn +Q( ) -Q( > 0k k 0n 0 kn n k 0

(4)

for all large n.

Next, we consider the case of k < k Denote0

,TIk 0 ~o > 0

and define

E- < n, .(x.) '9( -(B ) < -1/2}

Split S into b disjoint parts 11.....IV, such that V x, y c II, x'y > 1/2.P b

Let j = 1..... b. Define F as F i < n, (x)' > 275/280}, =J ~ n n

1,.... b. By Assumption 3, there exists > 0 such that2

ns- 2 = 1,

It is easy to see that -9(a -k O) 0 fl and i Fn imply thatkn 0 j n

For i c , we haven".

." A

Page 21: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

15A'.

X1! 1 5/

ix ( -kn 0) Ixii I kn-Yo l(x i ) ' k I > mT/2

Hence

Q(0 > maxe. + mTj/2i n

Thus

P (Qn (5) < a2 + mn/ 4 )

< P( max e2 < a2 - mTI/4)

2 2

L b __

< P( max e. < a2 - m,'1/ 4, -,2,Bkn- BO) cE I

' )J

-=I i 2kn0En

b -

< P( max e. < a2 - mT/ 4 )

nb -

SPA ma, e. < 2- m kn/4)cn

i F

P( maxme. < a n -bi42

'.

- - n 2<b( 0 mnI/ 4) 2 < b/n

for large n. By Borel-Cantelli Lemma, we have with probability one, when n large

enough

Page 22: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

16

Q(8k ) > a 2+ Mn4

Thus for k < k ,we have-~ 0

R -R Q8 )-( )-(k -kCk k n kn' n knk k0 0n 0 n

<mT/4 - (k 0 k)C n> 0,

p (5)

for largh n, since C ~0.In

(4) and (5) proves k is consistent. Summarize the above arguments, we get

the following theorem.

Theorem 2. Choose C nsatisf ying

nn

'U(ii) nC In/logn -

Suppose the four assumptions given at the beginning of this section are true, then

k -* k,as.

Proof. Use the arguments given before, we only need to notice that for any

sequence of C satisfying (i) and 60,) we can always choose r such thatnn

Wi' r /C -0r, n

60i' nr /logn -

Q.ED,

Z .. '~

Page 23: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

*17

4. General Case

N In this section we consider the same regression model (1) But the problem

we are going to solve is to determine the subset (or the model) J = {1 < j 1 <

< jk < p} such that B = 0 if and only if j c J We make the same

assumptions as given in previons sections

Of course, we can use the procedure described in section 2 and 3 to

determine the model J as follows: For each permutation Tr of = (

similarly rearranging (x I, .. x P), we get a new model Mt . Under this model,

using the approach given in section 2 and 3, we obtain estimates k = kA = min k-T T T

and k = k - = min k and let J ={t1), ... k)} andJ =J ={T( 1).....7T IT 11 1 1 1

(1k)}, we can easily prove that, by using Theorem 1 and 2, J J, a. s. and J

J, a. s

An alternative method to estimate J is given as follows: Suppose T is a

subset of { 1, - , p} Consider the model T:

y n x (T) (T) + e n, n n n

where x T) = lx c T) and B(T) 1. j c T)'. Letj j1 j

Q (T) min { max (y. x.(T)'B(T)B(T) ]<i<n

m in (y. - x. (T) B (T))}l<i<n

and

Q (T) min max Iy - x (T)' (T)I"ny

B(T) <i<n

Define

.4..

Page 24: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

18

SRT - In(T) + #(T)Cn

and

R] Q (T) + # (T) C2 n

Choose J such that2

R3 = minR2 T

and choose J such that2

R- = min R12 T T

We can also prove that J 2~ J, a. s. and *J 2 .J, a. s .However, there would be

too much computation involved when p is relatively large. In the first case. there are

=totally p! permutations whileas in the second there are 2Psubsets of'{ 1,.. p}.

In light of this, we propose another approach to estimate J which only involves p+

* 1 quantities to be computed.

Now let

B (j) - (B 1, - , j l . .. P

and def ine

Qn (j) -min { max (y. S x (j))a (j) i<i<n

-min (y. - X'BU))I<i<n

and

LIS'S * . - - - -'Vp .* ss.

Page 25: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

19

Q (j) minn max y - , x (j) I

n U() li<n

WNrite

R (n, j) =Q n(j) - Q - Cn

and

R (n,j)=Q n(j) - Q - C.n

* We choose

i ~ j n [j {jj; R (n, j) >0}

n

and

j n {j I =j {j: R(n~j) > 03n

Then we have the following theorems.

Theorem 3. Under the conditions of theorem 1, we have that

J ,-J , a. s.n

where model J ii .i is the true one.

Proof If c .J, by (3) with the replacement that k 0 p and k p- 1, we

have that with probability one, Rnj) > 0 for all large n. i. e., jc n Hence, when n

*. -**.r*

Page 26: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

20

large enough, J J Conversely, if j ' J, using the same argument as proving

theorem 1, we have

R(n,j) = Q (j) - Q - Cn p n

< O(logn/n) - C a. s.=1 n

which together with (ii) implies that

R(n,j) < 0, for large n,

i. e j ' J when n large enough. Therefore J J which completes the proof ofn n

Theorem 3.

Theorem 4. Under the conditions of theorem 2, we have that

J -. J, a. s.n

where model J = j is the true one1 k

Proof. If j c J, by (5) with the replacement that k = p and k p- 1, we

have that with probability one, R(n,j) > 0 for all large n, i. e., j c J . Hence, when n

nn'' =ilarge enough, Jn J. Conversely, if i ;J, using the same argument as proving

theorem 2, we have

R (n,j) = Q (j - Q - Cn p n

< O(logn/n) - Cn a. s.

which together with (ii) implies that

Page 27: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

21

R(n,j) < 0, for large n,

e 't J when n large enough. Therefore J J which completes the proof ofn n

Theorem 4.

. N

S.%

:-V

,xS'

Page 28: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

22

REFERENCES

1 Drygas, H (1976). Weak and strong consistency of the lease squaresestimators in regression models. Z. Wahrsch. Verw Gebiete, 34,119-127

2 Lai, T. L., Robbins, H and Wei, C. Z. (1979). Stong consistency of least" squares estimates in multiple regression II. Journal of Multivariate

Analysis, 9, 343-361

3, Oberhofer, W (1982) The Consistency of nonlinear regressionminimizing the L -norm. Ann. Statist. 10. 316-319

4. Wu, Y. (1986) On strong consistent estimates of regression

coefficients when the errors are not independently and identicallydistributed. Tech. Report No 86-06, Center for Multivariate Analysis,

*Univ. of Pittsburgh

* W.. o I

..-

"'".".'

,,

Page 29: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

unclassifiedL ,t. UI4 It CL A V IIIl Al IOt U L , tI I W A(L (I,.* l- ioi .j

REPORT DOCUMENTATION PAGE HEAD INSR1UCTIONS

I. HEP',RT NUMBER GOVT ACCESSION NO 3. RECIPIEN'S CATALOG NUMBER

87-15

b4. TITL (and Subttle) S. TYPE OF REPORT & PEFI1OU COVERED

Strong consistency of estimation of number ofregression variables when the errors are indep- technical - June 1987endent and their expectations are not equal to 6. PERFORMINGONG.RLPORT NUMULR

RarhothPr_ 87-157. AUTHOR(.) 4. CONTRACT OR GRANT NUMOER(.)

Yuehua Wu F49620-85-C-0008

., ""- - PERFORMING ORGANIZATION NAME AND ADDRESS 10. PROGRAM ELEMr:NT. PRO iE CT. TASKfor nalsisARE[A & WORK UNIT NUMBERSCenter for Multivariate Analysis

"-I..- University of Pittsburgh, 515 Thackeray HallPittsburgh, PA 15260

I. CQNTROLLING OFFICE NAME AND ADDRESS 12. REPQORT DATE

Air Force Office' of Scientific Research June 1987Department of the Air Force I. NUMBER OF PAGES

Bolling Air Force Base, DC 20332 2214. MONITORING AGENCY NAME & AODRESS(II dllerent froi,,n Cortroillng Ollce) I6. SECURITY CLASS. (of this report)

0 unclassified

SCH %EDULE.,, %' .._%.. . -I-S;.--ECL ASSI'FIC ATIO-NDOWN GR ADIN G'-SH UL

IS. DISTHIBUTION STATEMENT (ol Ihis Report)

Approved for public release; distribution unlimited

17. DISTRIB4JTION STATEMENT (of ithe abltact entered in Buck 20, if different (rom Report)

IIII. !1UPPLEMENTARY NOTES

I2 KEY WORDS (Coflintu ur, 80 ev6 e side i nIec**sawy wed i e(li, by block number)

Linear model, model selection, regression coefficient, strong consistency

* "20 AB3iI RAC T (CoIllnue o, reverse side11 necessry end dJdnerilty by block number)

Consider the linear regression model yi = x'O + e., i = 1,2,..., where {xi} is

a sequence of known p-vectors, 0' = (1,..,8 p ) is an unknown p-vector, known

as regression coefficients, {e.} is a sequence of random errors. It is ofinterest to test the hypothesis Hk: 0k+1 = ... = op = 0, k = 0,1,...,p. We do

not assume that the random errors are identically distributed and have zero meanssince it is sometimes unrealistic. As a compensation for this relaxation, weassume the errors have a common bounded support [a,,a,]. Under certainOH M

DD I FJAN 7 1473SECURITY CLASSIfI AO OF TNS PAGE -(.n Da, a-Ent ,rou

. " " " " "w.' * . .'".-.. .- " . ,--W " - " " .' . W '.;

"' " - " %

Page 30: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

unclassifiedOtCuHiTY CLAIJdlCAIIUN OF THIS PA(,Lg gf Doe fntere.)

conditions, we obtain the strongly consistent estimate of the number of k fdrwhich #0 and 5 ... 5p 0, by using the information theoretical criteriakk+lp

N

J..

J1a

2"

'.,.

: -'i " unc lass i f ied

% .

Page 31: STRONG CONSISTENCY OF ESTIMATION OF NMNER OF …-a1bu 623 strong consistency of estimation of nmner of regression variables when t (u) pittsburgh univ pa center for multivariate analysis

*1F14% S

]D~E c

/.7