Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe...

19
DM_PPT_NP_v02 Data Are from Mars, Tools Are from Venus H. Joe Lee ([email protected]) The HDF Group This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C All images used in this presentation are from autodraw.com for public use. https://ntrs.nasa.gov/search.jsp?R=20170007438 2020-02-12T05:24:49+00:00Z

Transcript of Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe...

Page 1: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

Data Are from Mars,

Tools Are from Venus

H. Joe Lee ([email protected])

The HDF Group

This work was supported by NASA/GSFC under

Raytheon Co. contract number NNG15HZ39C

All images used in this presentation are from

autodraw.com for public use.

https://ntrs.nasa.gov/search.jsp?R=20170007438 2020-02-12T05:24:49+00:00Z

Page 2: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

2

No “Earth” in title?

“Men Are from Mars, Women are from Venus”

- John Gray

Page 3: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

3

Are data from Mars?

Why can’t I use Earth tools?Correct geo-referencing

Page 4: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

4

Are tools from Venus?

Why can’t I open Earth data?

Page 5: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

5

Data Producers from Mars

• I want my data slim and efficient.

• How can I save money in managing data?

Tool Developers from Venus• I make my tool work for popular data first.

• Can I make money by supporting your data?

Page 6: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

6

Result: Frustrated Users

Page 7: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

7

We (HDFEOS.org) can help.

Page 8: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

8

• Hierarchical Data Format (HDF)

• Network Common Data Format (netCDF)

• Geospatial Tagged Image File Format

(GeoTIFF)

• Keyhole Markup Language (KML) /

zipped KML (KMZ)

• Comma-separated values (CSV)

• etc.

We identify gaps in File Formats

Page 9: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

9

• hdf

• netcdf-C

• netcdf-Java

• HDF – Earth Observing System (hdf-eos)

• Climate and Forecast Metadata (CF) conventions

• Geospatial Data Abstraction Library (GDAL)

• etc.

We identify gaps in Libraries

Page 10: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

10

• Microsoft Excel

• Esri ArcGIS

• Google Earth

• MATLAB

• Python

• Interactive Data Language (IDL)

• Panoply

• Integrated Data Viewer (IDV)

• HDFView

• h5dump

• Etc.

We identify gaps in Tools

Page 11: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

11

• Open-source Project for a Network Data

Access Protocol (OPeNDAP)

• Web Map Service (WMS)

• Web Map Tile Service (WMTS)

• Web Coverage Service (WCS)

• etc.

We identify gaps in Services

Page 12: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

12

• File conversion

• Libraries and tools usage

• NASA HDF product specific examples

• Demo services (e.g., Hyrax*, THREDDS**)

AND we provide Solutions…

*Hyrax is the data server from OPeNDAP.

**Thematic Real-time Environmental Distributed Data Services

Page 13: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

13

• Make HDF5 data work with

– GDAL

– netCDF

– Hyrax/THREDDS

• Don’t forget a few key CF conventions.

• Follow DIWG* recommendations.

Suggestions for data producers

*Data Interoperability Working Group

Page 14: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

14

• Download and test NASA HDF products.

• Support them natively.

• Support augmentation.

– VRT* in GDAL

– NcML** in netCDF

• Support 3D visualization for data in the air.

Suggestions for tool developers

*Virtual Dataset in XML format

**netCDF Markup Language

Page 15: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

15

• Try OPeNDAP first. CSV may be enough.

• Try netCDF conversion / augmentation.

• Correct metadata with NcML / VRT.

• Try GEE* instead of GDAL.

• Use CMR** wisely.

Suggestions for end-users

*GDAL Enhancement for ESDIS project

** Core Metadata Repository

Page 16: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

16

• Hadoop / Spark (streaming) / Dask

• Parquet / Arrow

• Elastic Search / Kibana

How about Big (fast) data?

Page 17: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

17

• scikit-learn / keras / h2o.ai

• Please contact us at

[email protected] if you’d like to see

examples on machine learning.

Future: (Deep) Machine Learning?

Page 18: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

18

This work was supported by

NASA/GSFC under Raytheon Co.

contract number NNG15HZ39C

Page 19: Data Are from Mars, Tools Are from Venus - NASA · Data Are from Mars, Tools Are from Venus H. Joe Lee (hyoklee@hdfgroup.org) The HDF Group This work was supported by NASA/GSFC under

DM_PPT_NP_v02

19

All images used in this presentation are from

autodraw.com for public use.