An Open Source GIS Architecture Connected and Linked Data

11
Open Source GIS Architect for Connected and Linked Data Jerry Hayes Frank Hardisty (Advisor)

description

Connected Data in GIS Today Many use cases in GIS Creates logical model to represent topology. Storage of logical model is not optimal. Problem: Relational database abstraction impedes performance and scalability.

Transcript of An Open Source GIS Architecture Connected and Linked Data

Page 1: An Open Source GIS Architecture Connected and Linked Data

An Open Source GIS Architecture for

Connected and Linked Data

Jerry HayesFrank Hardisty (Advisor)

Page 2: An Open Source GIS Architecture Connected and Linked Data

Problem: Relational database abstraction impedes performance and scalability.

Connected Data in GIS Today

Many use cases in GIS

Creates logical model to represent topology.

Storage of logical model is not optimal.

Page 3: An Open Source GIS Architecture Connected and Linked Data

Problem: Vast potential of the Semantic Web is unrealized in GIS!

Linked Data in GIS Today

Links described by semantic relationships.

Semantics enable data discovery.

Early stages of adoption in GIS.

Page 4: An Open Source GIS Architecture Connected and Linked Data

Having trouble “visualizing” the network ? … so do machines!

How GIS Stores Connected Data Uses relational database tables

Abstraction introduces unnecessary overhead.

Bad for large datasets!

Page 5: An Open Source GIS Architecture Connected and Linked Data

Much easier to visualize network … machines are happier too!

Graph Databases for Connected Data Stores connected data in its native format.

Removes unnecessary overhead.

Good for large datasets!

Page 6: An Open Source GIS Architecture Connected and Linked Data

Performance comparisons are difficult. … how “connected is the connected data?

Preprocessing data helps mitigate issues. … ESRI’s preprocessed logical network model.

In general … i) RDBMS are optimized for aggregation queries ii) Graph databases are optimized for traversing.

Database Performance Comparisons

Page 7: An Open Source GIS Architecture Connected and Linked Data

Two basic properties define graph databases.

Graph Database Characteristics

Native Graph StorageNati

ve G

raph

Pro

cess

ing

Page 8: An Open Source GIS Architecture Connected and Linked Data

Connects data to data on the Web

Uses Resource Descriptive Framework (RDF).

Creating quality linked data is challenging!

Linked Data … the Next Frontier

Only useful in sufficient quality and quantity.

Page 9: An Open Source GIS Architecture Connected and Linked Data

Many RDF datasets are now available

Data quality, availability and stability concerns.

Tools are available for accessing RDF models.

LinkedGeoData for GIS applications.

Accessing Linked Data

Page 10: An Open Source GIS Architecture Connected and Linked Data

Server side is stateless.

PostGIS used for .. • Storing physical model.• Data visualization.

Neo4j used for …• Storing logical model• Graph traversals

Open Source System Architecture

Implemented in the IBM Cloud

Page 11: An Open Source GIS Architecture Connected and Linked Data

Provides RESTful API.

Enables spatial analytics

Enables “data” discovery.

Integrates physical and logical model processing.

Implemented in the IBM Cloud

Servlet Architecture