Online info2013 reconciliation
-
Upload
tony-hirst -
Category
Technology
-
view
107 -
download
0
description
Transcript of Online info2013 reconciliation
Reconciling ourselves to what's out there: how one dataset talks to another
Tony HirstDept of Computing and Communications,
The Open University UKO:I
I play with other people’s
data….
Clustering and Approximate
Matching
OpenRefine.org
Metaphone3 (soundalike)
metaphone( 'Epic Garments Limited’)EPKKRMNTSLMTT
metaphone( 'EPOCH GARMENTS LTD’)EPXKRMNTSLTT
Metaphone
Levenshtein (edit distance)
You know computers can do this anyway…
..it’s just that no-one’s told you how you can
do it on your computer with your data…
Reconcile your data
http://schoolofdata.org/2013/10/18/in-support-of-the-bangladeshi-garment-industries-data-expedition/
http://bit.ly/ScoDa-bg-reconcile
opencorporates.com
http://opencorporates.com/reconcile
cell.recon.match.name
cell.recon.match.id
In this way, we can make our data linkable…
Reconcile your data with what’s
out there
And why not have a go at
clustering too…?
Can you match your
data to itself?
O:iblog.ouseful.info
@psychemedia