Locating the Most Relevant Identification Information in ... · 21 It can be shown that the...

Post on 05-Aug-2020

0 views 0 download

Transcript of Locating the Most Relevant Identification Information in ... · 21 It can be shown that the...

Locating the Most Relevant Identification Locating the Most Relevant Identification Information in Spectral Cubes and Information in Spectral Cubes and

Similar Large Data SetsSimilar Large Data Sets

Robert K. McConnellWAY-2C Arlington, Massachusettsrkm@way2c.comhttp://www.way2c.com

WAYWAY--2C2C

2

Typical Object Detection StepsTypical Object Detection Steps

1. Choose the target class(es). 2. Choose attributes to be sensed.3. Acquire training data set (e.g. image).4. Train the system.5. Acquire a test data set.6. Classify the test data set.7. Locate blobs representing the

class(es) of interest.WAYWAY--2C2C

3

Typical Object Detection StepsTypical Object Detection Steps

1. Choose the target class(es). 2. Choose attributes to be sensed.3. Acquire training data set (e.g. image).4. Train the system5. Acquire a test data set.6. Classify the test data set.7. Locate blobs representing the

class(es) of interest.WAYWAY--2C2C

4

Example: hyperspectral “cube” Example: hyperspectral “cube”

WAYWAY--2C2C

Data Band n

Data Band 2

Data Band 1

...

Lake Training Region

Forest Training Region

Field Training Region

Find optimum data subset for classification

5

Classification ObjectivesClassification Objectives

WAYWAY--2C2C<5 bits/pixel

??

Training regions

ASAP

>1K bits/pixel

6

acquired bands and moreacquired bands and more

WAYWAY--2C2C

HSI bandsother

“bands”derived “bands”

7

Example: “Green Chips” Example: “Green Chips” Hyperspectral CubeHyperspectral Cube

WAYWAY--2C2C

Courtesy Resonon, Inc.

• 80 bands• 12 bits A/D/pixel/band • 16 total bits/pixel/band• ~1K bits/pixel

8

“True Color” Image From Cube“True Color” Image From Cube

WAYWAY--2C2C

64240R55025G45810B

nmBand

8 Classes8 ClassesRequired:Required:Band combination to differentiate reliably.Band combination to differentiate reliably.

Image Courtesy Resonon, Inc.

“Green Chips”

9

“High Contrast” Image From Cube“High Contrast” Image From Cube

WAYWAY--2C2C

76460*R74557*G50117*B

nmBand

8 classes can be differentiated reliably.8 classes can be differentiated reliably.

*some high order bits discarded

10

Pixel by pixel interpretation of high Pixel by pixel interpretation of high contrast “intermediate” image.contrast “intermediate” image.

WAYWAY--2C2C

11

Maximum likelihood classification Maximum likelihood classification based on ~30K pixel regions.based on ~30K pixel regions.

WAYWAY--2C2C

12

Not all bands are equally Not all bands are equally relevant.relevant.

WAYWAY--2C2C

13

Band 60Band 60 PSPA=67.88% PSPA=67.88%

WAYWAY--2C2C Base Image Courtesy Resonon, Inc.

14

Band 00Band 00 PSPA=14.01% PSPA=14.01%

WAYWAY--2C2C Base Image Courtesy Resonon, Inc.

15

Band 79Band 79 PSPA=27.19% PSPA=27.19%

WAYWAY--2C2C Base Image Courtesy Resonon, Inc.

16

Entropy as a measure of Entropy as a measure of uncertaintyuncertainty

WAYWAY--2C2C

ClassificationClassification→→

WAYWAY--2C2C

Before

A DCB

Pro

babi

lity

Class

“In mind’s eye”

*uncertainty

2bits=

After

A D

B

C

Pro

babi

lity

Class

uncertainty0 bits→

*2log ( )

J

j jj

u p p= −∑

18

Not all positions within Not all positions within “band” are equally relevant.“band” are equally relevant.

WAYWAY--2C2C

19

Most Relevant Information: Most Relevant Information: Data Window UncertaintyData Window Uncertainty

WAYWAY--2C2C

“Noise”Padding Relevant Information

10100 00110 110 000

UN

CER

TAIN

TY(E

NTR

OP

Y -B

ITS

)

0.0

3.0

0 10

1 2 9876543 13

12

11WINDOW SHIFT (BITS)

Data Window

20

Objective: Find optimum Objective: Find optimum combination of “bands” and combination of “bands” and data “windows” within those data “windows” within those

bands to differentiate the bands to differentiate the classes of interest.classes of interest.

WAYWAY--2C2C

21

It can be shown that the relevant information in the data set D for differentiating the class set C(known as the mutual information between C and D) is given by

relevance = H(C)+ H(D) - H(C&D) (3)

where:H(C) is the entropy of the class probability distribution,H(D) is the entropy of the data distribution andH(C&D) is the entropy of the joint distribution of C&D.

relevancerelevance

WAYWAY--2C2C

22

It can also be shown that information required to completely specify the class once the data is known, must be given by

uncertainty = H(C&D)- H(D) (6)

= 0 when C and D are perfectlycorrelated and

= H(C) for no correlation between C and D.

uncertainty uncertainty

WAYWAY--2C2C

Maximizing relevance minimizes uncertainty.

23

Finding Most Relevant Data Finding Most Relevant Data to Construct to Construct

Intermediate ImageIntermediate Imagefor Classificationfor Classification

WAYWAY--2C2C

24

Training RegionsTraining Regions

WAYWAY--2C2C Base Image Courtesy Resonon, Inc.

25

0

0.5

1

1.5

2

2.5

3

3.5

0 10 20 30 40 50 60 70 80 90

Band Number

Info

rmat

ion

(bits

)

ClassDataRelevant

WAYWAY--2C2C

Most relevant window: Most relevant window: measured single band measured single band

information contribution.information contribution.

S=5

26

Image constructed from Image constructed from single most relevant band single most relevant band

and window only.and window only.

WAYWAY--2C2C

76460*R

nmBandPlane

relrel = 2.318 bits= 2.318 bitsPSPA = 67.88%PSPA = 67.88%

S=5

27

0

1

2

3

4

5

6

0 10 20 30 40 50 60 70 80 90

Band Number

Info

rmat

ion

(bits

)

ClassData

Relevent

WAYWAY--2C2C

Relevance: (band & window) Relevance: (band & window) pair information contribution pair information contribution

(band 60 window fixed)(band 60 window fixed)

S=7

28

Image constructed from Image constructed from most relevant band pair.most relevant band pair.

WAYWAY--2C2C

76460*R50117*B

nmBandPlane

relrel = 2.742 bits= 2.742 bitsPSPA = 91.07%PSPA = 91.07%

29

2

3

4

5

6

7

0 10 20 30 40 50 60 70 80 90

Band Number

Info

rmat

ion

(bits

)

ClassDataRelevant

WAYWAY--2C2C

Relevance: (band & window) Relevance: (band & window) triplet information contribution triplet information contribution (bands 60,17 windows fixed)(bands 60,17 windows fixed)

S=7

30

Intermediate image from most Intermediate image from most relevant (band & window) triplet.relevant (band & window) triplet.

WAYWAY--2C2C

76460*R74557*G50117*B

nmBandPlane

relrel = 2.844 bits= 2.844 bitsPSPA = 97.74%PSPA = 97.74%

31

0

0 .1

0 .2

0 .3

0 .4

0 .5

0 .6

0 .7

0 .8

0 .9

1

0 10 20 30 40 50 60 70 80

B and N umb e r

Color 1Color 2Color3Color 4Color 5Color 6Color 7Background

1

3

2

Automatically Selected BandsAutomatically Selected Bands

WAYWAY--2C2C

32

Now, using only optimum data subset:Now, using only optimum data subset:

1. Construct an “intermediate” image.2. Train using the same class training

regions as for locating that subset.3. Perform maximum likelihood

interpretations on this and similar images based only on the optimum data subset.

WAYWAY--2C2C

33

Pixel by pixel interpretation of Pixel by pixel interpretation of most relevant band triplet image.most relevant band triplet image.

WAYWAY--2C2C

34

Pixel by pixel class # 1 extractionPixel by pixel class # 1 extraction

WAYWAY--2C2C

35

Blob location and processing.Blob location and processing.

WAYWAY--2C2C

36

“True Color” TERRAIN Image“True Color” TERRAIN Image

WAYWAY--2C2C

thick grass

road

thin grass

trees

bare soilHYDICE

bands 49,35,15

37

Terrain: Mean Spectra and Terrain: Mean Spectra and Selected BandsSelected Bands

WAYWAY--2C2C0

0.05

0.1

0.15

0.2

0.25

0 50 100 150 200

'road'

'bare soil'

'thin grass'

'thick grass'

'trees' 1

2

3

38

“Most Relevant” TERRAIN Image“Most Relevant” TERRAIN Image

WAYWAY--2C2C

thick grass

road

thin grass

trees

bare soilHYDICE

bands 199,52,17

> 98% data > 98% data reduction from reduction from

original original hyperspectral hyperspectral

cube.cube.

39

Interpreted TERRAIN ImageInterpreted TERRAIN Image

WAYWAY--2C2C

thick grass

road

thin grass

trees

bare soilBased on HYDICE

bands 199,52,17

uncertainty = 0.48 bitsPSPA = 71.8%

40

We define the PSPA, or predicted single pixel agreement (with training) by:

log(PSPA) = H(D) - H(C&D) (7)

Predicted Agreement Predicted Agreement

WAYWAY--2C2C

41

“Thin Grass”“Thin Grass”Training Region DetailTraining Region Detail

thick grass

road

thin grasstree

bare soil

Assigned training regions may not be “pure”, leading to system accuracy underestimation.WAYWAY--2C2C

42

“Thin Grass”“Thin Grass”Training Region ClassificationTraining Region Classification

WAYWAY--2C2C

thick grass

road

thin grasstree

bare soil

System may correctly classify but assumes it’s wrong, thus underestimating its own accuracy.

Locating the Most Relevant Identification Locating the Most Relevant Identification Information in Spectral Cubes and Information in Spectral Cubes and

Similar Large Data SetsSimilar Large Data Sets

WAYWAY--2C2C

EndEnd

Robert K. McConnellWAY-2C rkm@way2c.com

Technology AvailableUS Patent 8,918,347