Visualizing DocGraph in Gephi, June 2013
![Page 1: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/1.jpg)
Analyzing DocGraph in Gephi
Janos G. Hajagos, Stony Brook School of Medicine
NYC Open Data Meetup, June 24, 2013
![Page 2: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/2.jpg)
DocGraph
• Based on a FOIA request to CMS by Fred Trotter
• Medicare providers (more than doctors)
• CY 2011 dates of service
• Providers are linked when they share 10 or more patients in a 30-day forward window
• Initial access was restricted to MedStartr funders, but as of June 2013 access is open
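The "10 or more patients in a 30-day forward window" rule can be sketched in a few lines of Python. This is only one reading of the rule, not the actual CMS procedure: the `visits` tuples, the function name `shared_patient_edges`, and the handling of same-day visits are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import date, timedelta


def shared_patient_edges(visits, window_days=30, min_patients=10):
    """Return {(from_provider, to_provider): shared patient count}.

    visits: iterable of (patient_id, provider_id, visit_date) tuples.
    This input layout is hypothetical; the real claims data differs.
    """
    by_patient = defaultdict(list)
    for patient, provider, day in visits:
        by_patient[patient].append((day, provider))

    window = timedelta(days=window_days)
    shared = defaultdict(set)  # (from, to) -> set of distinct patients
    for patient, events in by_patient.items():
        events.sort()  # chronological order per patient
        for i, (day_a, prov_a) in enumerate(events):
            for day_b, prov_b in events[i + 1:]:
                if day_b - day_a > window:
                    break  # later visits fall outside the forward window
                if prov_a != prov_b:
                    shared[(prov_a, prov_b)].add(patient)

    # Keep only pairs that reach the patient threshold.
    return {pair: len(p) for pair, p in shared.items() if len(p) >= min_patients}
```

Because the window only looks forward in time, the resulting pairs are ordered, which is consistent with DocGraph being a directed graph.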
![Page 3: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/3.jpg)
Geographic Visualization
http://isurfsoftware.com/blog/2012/12/13/visualizing-geographic-connections-between-us-doctors/
![Page 4: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/4.jpg)
DocGraph by the numbers
• Directed graph
• Average total degree 52.8
• 940,492 providers (graph nodes/vertices)
• 49,685,810 shared edges
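As a quick check on these figures, the 52.8 value matches the number of edges divided by the number of nodes (the mean out-degree, which equals the mean in-degree in a directed graph). Counting in- and out-edges together would give twice that. A minimal sketch of the arithmetic, using only the numbers on this slide:

```python
nodes = 940_492       # providers
edges = 49_685_810    # directed shared-patient edges

edges_per_node = edges / nodes        # mean out-degree (= mean in-degree)
in_plus_out = 2 * edges / nodes       # mean of in-degree + out-degree

print(round(edges_per_node, 1))  # 52.8
print(round(in_plus_out, 1))     # 105.7
```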
![Page 5: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/5.jpg)
DocGraph Data
![Page 6: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/6.jpg)
![Page 7: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/7.jpg)
NPPES
• National Plan and Provider Enumeration System
• Source of NPI (National Provider Identifier)
• Information is entered and updated by the provider
• CSV file with 314 columns
• MySQL load script generated by a Python script to normalize the database
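With 314 columns, writing the load script by hand is impractical, which is why it is generated. The sketch below shows the general idea only: it turns a CSV header into a `CREATE TABLE` plus a `LOAD DATA LOCAL INFILE` statement. The function names, the table name, the blanket `VARCHAR(255)` type, and the sample header fields are assumptions for illustration, and the normalization step mentioned on the slide is not covered.

```python
import re


def column_name(header_field):
    """Turn a CSV header label into a lowercase SQL-friendly identifier."""
    return re.sub(r"[^0-9a-z]+", "_", header_field.lower()).strip("_")


def build_load_script(header, table, csv_path):
    """Build MySQL statements that create a flat table and bulk-load the CSV.

    Every column is typed VARCHAR(255) here as a simplifying assumption.
    """
    columns = [column_name(h) for h in header]
    defs = ",\n".join(f"  `{c}` VARCHAR(255)" for c in columns)
    create = f"CREATE TABLE `{table}` (\n{defs}\n);"
    load = (
        f"LOAD DATA LOCAL INFILE '{csv_path}'\n"
        f"INTO TABLE `{table}`\n"
        "FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '\"'\n"
        "LINES TERMINATED BY '\\n'\n"
        "IGNORE 1 LINES;"  # skip the header row
    )
    return create + "\n\n" + load


# Illustrative header fields; the real file has 314 of them.
sample_header = ["NPI", "Entity Type Code", "Provider Organization Name"]
script = build_load_script(sample_header, "nppes_flat", "nppes.csv")
```

In practice the header would be read from the first row of the NPPES file with Python's `csv` module rather than typed in.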
![Page 8: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/8.jpg)
Selecting a sub-graph
![Page 9: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/9.jpg)
Core nodes
![Page 10: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/10.jpg)
Leaf nodes
![Page 11: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/11.jpg)
Core-to-core edges
![Page 12: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/12.jpg)
Core-to-leaf edges
![Page 13: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/13.jpg)
Leaf-to-leaf edges
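The sub-graph slides above survive only as titles, so the following is one plausible reading rather than the presenter's actual code. It assumes core nodes are the providers picked by some selection rule (for example, a zip code), and leaf nodes are the remaining providers that share an edge with at least one core node. The function name `split_subgraph` and the input layout are hypothetical.

```python
def split_subgraph(edges, core):
    """Classify edges around a chosen set of core nodes.

    edges: iterable of (source, target) pairs from the full graph.
    core:  collection of node ids selected as the core.
    Returns (leaf_nodes, groups), where groups maps an edge class to a list.
    """
    edges = list(edges)
    core = set(core)

    # Leaf nodes: non-core nodes directly connected to a core node.
    leaf = set()
    for source, target in edges:
        if source in core and target not in core:
            leaf.add(target)
        elif target in core and source not in core:
            leaf.add(source)

    groups = {"core-to-core": [], "core-to-leaf": [], "leaf-to-leaf": []}
    for source, target in edges:
        if source in core and target in core:
            groups["core-to-core"].append((source, target))
        elif source in core or target in core:
            groups["core-to-leaf"].append((source, target))
        elif source in leaf and target in leaf:
            groups["leaf-to-leaf"].append((source, target))
        # Edges touching neither core nor leaf nodes are left out.
    return leaf, groups
```

Edges between two nodes outside the core and leaf sets are dropped, which keeps the sub-graph small enough to lay out interactively.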
![Page 14: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/14.jpg)
Generating GraphML
• XML-based file format for graphs
• Readable by a large number of tools
  – Gephi
  – Mathematica
  – igraph (R)
• The NetworkX Python library for graphs can easily export to GraphML
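To show what the format itself looks like, here is a minimal sketch that writes a directed, weighted GraphML document using only the Python standard library. It is not the script used for the talk; with NetworkX the same result comes from building a `DiGraph` and calling `networkx.write_graphml`. The function name `edges_to_graphml` and the choice of `weight` as the edge attribute are assumptions.

```python
import xml.etree.ElementTree as ET

GRAPHML_NS = "http://graphml.graphdrawing.org/xmlns"


def edges_to_graphml(edges):
    """Serialize {(source, target): weight} as a GraphML string."""
    ET.register_namespace("", GRAPHML_NS)  # emit GraphML as the default namespace

    def tag(name):
        return f"{{{GRAPHML_NS}}}{name}"

    root = ET.Element(tag("graphml"))
    # Declare an integer edge attribute named "weight".
    ET.SubElement(root, tag("key"), {
        "id": "d0", "for": "edge", "attr.name": "weight", "attr.type": "int",
    })
    graph = ET.SubElement(root, tag("graph"), {"id": "G", "edgedefault": "directed"})

    # One <node> per distinct endpoint, then one <edge> per pair.
    for node in sorted({n for pair in edges for n in pair}):
        ET.SubElement(graph, tag("node"), {"id": node})
    for (source, target), weight in sorted(edges.items()):
        edge = ET.SubElement(graph, tag("edge"), {"source": source, "target": target})
        data = ET.SubElement(edge, tag("data"), {"key": "d0"})
        data.text = str(weight)

    return ET.tostring(root, encoding="unicode")
```

The returned string can be saved with a `.graphml` extension and opened in Gephi.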
![Page 15: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/15.jpg)
![Page 16: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/16.jpg)
Gephi
![Page 17: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/17.jpg)
Subset defined from two Brooklyn zip codes (11215, Park Slope, and 11212, Brownsville)
![Page 18: Visualizing doc graph in gephi june 2013](https://reader033.fdocuments.in/reader033/viewer/2022042623/54c643ae4a7959f5368b4569/html5/thumbnails/18.jpg)
Links
http://strata.oreilly.com/2012/11/docgraph-open-social-doctor-data.html (information)
https://github.com/jhajagos/DocGraph (code)
https://github.com/ftrotter/DocGraph (data)
https://groups.google.com/forum/#!forum/docgraph (mailing list)
http://bit.ly/1459NXn (sample Brooklyn GraphML file)
http://strataconf.com/rx2013/public/schedule/detail/29840 (StrataRX workshop with Fred Trotter)