Presentation (1.15 MB)
Transcript of Presentation (1.15 MB)
Workshop on Innovative Data Workshop on Innovative Data Mining Techniques in Support Mining Techniques in Support
of GEOSS: A summaryof GEOSS: A summary
Roger L. King and Nicolas H. YounanRoger L. King and Nicolas H. Younan
Mississippi State UniversityMississippi State University
Mihai DatcuMihai Datcu
DLRDLR
Workshop logistics
August 31, 2009 – September 2, 2009Sinaia, RomaniaHosted by Romanian Space Agency (ROSA)Convened by ROSA (Pisa), MSU (King), DLR (Datcu)International audience
38 participantsRepresenting – users, researchers, governmentDenmark, Finland, Germany, Italy, Japan, Romania,
Switzerland, Turkey, United States
http://events.rosa-rc.ro
Defining terms
Define data mining as:A computer process of discovering unknown
knowledge from massive data sets.Data mining involves massive data sets, high-
performance computers, and mining algorithms.Earth observations, mainly through remote
sensing, have generated huge volumes of geospatial data.Vital for scientific research, socio-economic activities,
and military missions.Those massive data collections contain unknown
knowledge about the status of the earth system, functions and interactions of its components.
Flow of this presentation
The workshop consisted of 2.5 days of presentations by various experts with focused panel discussions that helped shape outcomes and recommendations
This summary will not be able to give many of the specifics made in the presentations. However, they will be incorporated into the final report.
Start with the GEOSS Data Sharing PrinciplesFocus will be on workshop panel discussions
Setting the stage
Marius-Ioan Pisa, CEO ROSAProvided rationale for meeting
Robert Koopman, GEO SecretariatGEOSS: Ongoing Developments and PrioritiesData sharing is a priority for GEOSS and serves as
the “lubrication of the GEOSS engine”Data mining is a cross cutting area reaching across all
9 GEOSS societal areas.
Workshop panel discussions
This presentation is being given to solicit more discussion among data mining professionals.
There will be a statement of fact followed by a series of questions for further discussion in the Q&A for this talk and for further discussion amongst participants during this conference.
The questions will be focused into three areas:OutreachTechnical challengesPolicy challenges
Panel discussions - Outreach
There appear to be many ongoing efforts going on worldwide with data mining and EO data. How do we better communicate amongst ourselves and the GEOSS user communities?
Should we recommend a community of practice for data mining? The concept of a community of practice (often
abbreviated as CoP) refers to the process of social learning that occurs and shared socio-cultural practices that emerge and evolve when people who have common goals interact as they strive towards those goals.
Panel discussions - Outreach
Should we consider an international conference on data mining in support of the GEOSS societal themes?
Is there a need for a formalized group for organizing data mining international activities (conferences, contests)? If so, how do we integrate IIMCG?
Panel discussions – Technology Challenges
Data mining is an important tool for discovering the knowledge contained in EO data sets. However, there are technological challenges that must be overcome before data mining can become a mainstream application for GEOSS societal benefit areas.
Panel discussions – Technology Challenges
How can we use data mining approaches to extract information from radar phase data?
What impact does speckle have on data mining of radar amplitude and phase data?
How is data mining achieved using multi-temporal data sets?
How do we handle gaps in data (spatial, spectral, and temporal)?
How do we handle problems of scale and resolution in data streams?
Panel discussions – Technology Challenges
How do we combine (fusion) data sets for better understanding?
How do we describe algorithms to the general user community?Do we train users a priori so they know of usefulness
of data mining for a particular sensor or application?
The following are some specific technical challenges for high resolution imagery identified by JRC.
Panel discussions – Policy Challenges
We need end users interested in pulling data mining technologies to use in their applications. Not the data mining research community producing papers and algorithms and trying to push our approaches into the application areas.
How do we integrate data mining algorithms into GEOSS community?
What is the success metric for a data mining algorithm? How is it measured?
Panel discussions – Policy Challenges
How does the research community work closer with the user community to better understand the problems they are experiencing?Need to engage user community early in the research.Need demonstration projects that show use of data
mining in user community.
Is an incremental approach a better way to work because it may be easier to make small changes in an operational community, than radical changes?
Panel discussions – Policy Challenges
How should national agencies be better aligned to deal with data streams being made available via the GEOSS data sharing principles?The example used in the workshop focused on
meteorological data
Could the GEO Data Sharing Principles deal with the topic of data formats, quality of data, and other issues to benefit the data mining community?
Conclusions
We are in the process of writing a report for the GEO Secretariat.
There are obviously more questions than answers.
Please send any written thoughts or comments on these questions to Roger King ([email protected])