Presentation (1.15 MB)

19
Workshop on Innovative Workshop on Innovative Data Mining Techniques Data Mining Techniques in Support of GEOSS: A in Support of GEOSS: A summary summary Roger L. King and Nicolas H. Younan Roger L. King and Nicolas H. Younan Mississippi State University Mississippi State University Mihai Datcu Mihai Datcu DLR DLR

Transcript of Presentation (1.15 MB)

Page 1: Presentation (1.15 MB)

Workshop on Innovative Data Workshop on Innovative Data Mining Techniques in Support Mining Techniques in Support

of GEOSS: A summaryof GEOSS: A summary

Roger L. King and Nicolas H. YounanRoger L. King and Nicolas H. Younan

Mississippi State UniversityMississippi State University

Mihai DatcuMihai Datcu

DLRDLR

Page 2: Presentation (1.15 MB)

Workshop logistics

August 31, 2009 – September 2, 2009Sinaia, RomaniaHosted by Romanian Space Agency (ROSA)Convened by ROSA (Pisa), MSU (King), DLR (Datcu)International audience

38 participantsRepresenting – users, researchers, governmentDenmark, Finland, Germany, Italy, Japan, Romania,

Switzerland, Turkey, United States

http://events.rosa-rc.ro

Page 3: Presentation (1.15 MB)

Defining terms

Define data mining as:A computer process of discovering unknown

knowledge from massive data sets.Data mining involves massive data sets, high-

performance computers, and mining algorithms.Earth observations, mainly through remote

sensing, have generated huge volumes of geospatial data.Vital for scientific research, socio-economic activities,

and military missions.Those massive data collections contain unknown

knowledge about the status of the earth system, functions and interactions of its components.

Page 4: Presentation (1.15 MB)

Flow of this presentation

The workshop consisted of 2.5 days of presentations by various experts with focused panel discussions that helped shape outcomes and recommendations

This summary will not be able to give many of the specifics made in the presentations. However, they will be incorporated into the final report.

Start with the GEOSS Data Sharing PrinciplesFocus will be on workshop panel discussions

Page 5: Presentation (1.15 MB)

Setting the stage

Marius-Ioan Pisa, CEO ROSAProvided rationale for meeting

Robert Koopman, GEO SecretariatGEOSS: Ongoing Developments and PrioritiesData sharing is a priority for GEOSS and serves as

the “lubrication of the GEOSS engine”Data mining is a cross cutting area reaching across all

9 GEOSS societal areas.

Page 6: Presentation (1.15 MB)
Page 7: Presentation (1.15 MB)
Page 8: Presentation (1.15 MB)

Workshop panel discussions

This presentation is being given to solicit more discussion among data mining professionals.

There will be a statement of fact followed by a series of questions for further discussion in the Q&A for this talk and for further discussion amongst participants during this conference.

The questions will be focused into three areas:OutreachTechnical challengesPolicy challenges

Page 9: Presentation (1.15 MB)

Panel discussions - Outreach

There appear to be many ongoing efforts going on worldwide with data mining and EO data. How do we better communicate amongst ourselves and the GEOSS user communities?

Should we recommend a community of practice for data mining? The concept of a community of practice (often

abbreviated as CoP) refers to the process of social learning that occurs and shared socio-cultural practices that emerge and evolve when people who have common goals interact as they strive towards those goals.

Page 10: Presentation (1.15 MB)

Panel discussions - Outreach

Should we consider an international conference on data mining in support of the GEOSS societal themes?

Is there a need for a formalized group for organizing data mining international activities (conferences, contests)? If so, how do we integrate IIMCG?

Page 11: Presentation (1.15 MB)

Panel discussions – Technology Challenges

Data mining is an important tool for discovering the knowledge contained in EO data sets. However, there are technological challenges that must be overcome before data mining can become a mainstream application for GEOSS societal benefit areas.

Page 12: Presentation (1.15 MB)

Panel discussions – Technology Challenges

How can we use data mining approaches to extract information from radar phase data?

What impact does speckle have on data mining of radar amplitude and phase data?

How is data mining achieved using multi-temporal data sets?

How do we handle gaps in data (spatial, spectral, and temporal)?

How do we handle problems of scale and resolution in data streams?

Page 13: Presentation (1.15 MB)

Panel discussions – Technology Challenges

How do we combine (fusion) data sets for better understanding?

How do we describe algorithms to the general user community?Do we train users a priori so they know of usefulness

of data mining for a particular sensor or application?

The following are some specific technical challenges for high resolution imagery identified by JRC.

Page 14: Presentation (1.15 MB)
Page 15: Presentation (1.15 MB)

Panel discussions – Policy Challenges

We need end users interested in pulling data mining technologies to use in their applications. Not the data mining research community producing papers and algorithms and trying to push our approaches into the application areas.

How do we integrate data mining algorithms into GEOSS community?

What is the success metric for a data mining algorithm? How is it measured?

Page 16: Presentation (1.15 MB)

Panel discussions – Policy Challenges

How does the research community work closer with the user community to better understand the problems they are experiencing?Need to engage user community early in the research.Need demonstration projects that show use of data

mining in user community.

Is an incremental approach a better way to work because it may be easier to make small changes in an operational community, than radical changes?

Page 17: Presentation (1.15 MB)

Panel discussions – Policy Challenges

How should national agencies be better aligned to deal with data streams being made available via the GEOSS data sharing principles?The example used in the workshop focused on

meteorological data

Could the GEO Data Sharing Principles deal with the topic of data formats, quality of data, and other issues to benefit the data mining community?

Page 18: Presentation (1.15 MB)

Conclusions

We are in the process of writing a report for the GEO Secretariat.

There are obviously more questions than answers.

Please send any written thoughts or comments on these questions to Roger King ([email protected])

Page 19: Presentation (1.15 MB)