The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

33
he Census GSS Initiative: panded Partnerships Leading to Improved Data Qualit Kevin Holmes Geographer US Census Bureau 2014 Ohio GIS Conference September 22 - 24, 2014 | Hyatt Regency Columbus| Columbus, Ohio

description

The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality. Kevin Holmes Geographer US Census Bureau. 2014 Ohio GIS Conference September 22 - 24, 2014 | Hyatt Regency Columbus| Columbus, Ohio. The MAF/TIGER Database (MTDB). M aster A ddress F ile (MAF) - PowerPoint PPT Presentation

Transcript of The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Page 1: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

The Census GSS Initiative:Expanded Partnerships Leading to Improved Data Quality

Kevin Holmes

Geographer

US Census Bureau

2014 Ohio GIS ConferenceSeptember 22 - 24, 2014 | Hyatt Regency Columbus| Columbus, Ohio

Page 2: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

The MAF/TIGER Database (MTDB)

Master Address File (MAF) Topologically Integrated Geographic Encoding

and Referencing (TIGER) Developed to support 1990 Census

Digitized from USGS quads

Page 3: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Supporting Current/Future Censuses & Surveys

Geographic Support System Initiative (GSS-I) Integrated program utilizing partnerships for:

Improved address coverage Continual spatial feature updates Enhanced quality assessment & measurement (QI)

May allow for a targeted address canvassing operation in 2019 (cost avoidance)

Better quality throughout the decade to support intercensal surveys (American Community Survey)

http://www.census.gov/geo/www/gss/

Page 4: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

4

GSS-I Partnership Progress

2011• Program Definition• Formation of GSS-I

Working Groups

2012• Working Group

recommendations • Established Integrated

Project Teams (IPT)• 1st Address Summit• Developed system for

acquiring, tracking, and evaluating partner files

• Began QI development

2013• Partner file acquisition,

evaluation, and MAF/TIGER update begins

• Began Community TIGER development

• Second Address Summit• Statistical model

development begins

Page 5: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

5

GSS-I Partnership Progress

Partner file acquisition, evaluation, and MAF/TIGER update continues

Feedback development & disbursement Conflation development Developed qualitative change detection methodology Further Community TIGER development Refined QIs Refined statistical models Address Canvassing Recommendation

2014

Page 6: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

6

GSS-I Partner Data Acquisition

Data as of May 5, 2014

Partners Contacted

Partners Providing Files

Address List

Acquired

Structure Coordinates

Acquired

Street Centerlines

Acquired

Partner Files Processed

TOTAL 434 304 181 648 714* 996**

* Some counties provided multiple partial-coverage street centerline datasets (i.e., cities vs. balance of county)

** Includes feature and address files processed through the MAF/TIGER system update process

Page 7: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

7

Page 8: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

8

OGRIP LBRS Datasets Address Points and Road Centerlines

Page 9: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

9

US Census BureauPhiladelphia RegionalOffice Territory

Page 10: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

10

  

A. In order to perform a match to existing MAF addresses, the submitted record must include:  • Complete Address Number • Complete Street Name

 

and AT LEAST ONE OF THE FOLLOWING: • Address Coordinate • ZIP Code • Postal City and State • Census 2010 Tabulation State, County, Tract and Block Code

 

Minimum Address Guidelines: Address Matching

Page 11: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

11

B. In order to update the location (geocode) for an existing MAF address, the submitted record must meet the requirements of “A” above, and either:

• Address Coordinate or • Census 2010 Tabulation State, County, Tract and Block Code

 C. In order to ADD new records to the MAF, the submitted record must meet the requirements of “A” and “B” above, and must include an Address Feature Type indicator identifying the address as

residential, commercial, utility, etc.

Minimum Address Guidelines: Address Geocoding/Adding

Page 12: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

12

Minimum Address Guidelines: Etc.

D. Within Structure Identifiers (Apt 3, etc). If not available:• # of units • Multi-unit structure flag

 

E. Group Quarters:• NAME (i.e. Shady Acres Retirement Home)

• TYPE (i.e. Hospital, Prison, College Dormitory)  

F. Cannot process only Non-City-Style Addresses, such as: • Rural Route Addresses (i.e. RR 3 Box 725 Anytown, NC 28999) • Post Office Box Addresses (i.e. P.O. Box 12374 Anytown, NC 28999) • General Delivery (i.e. General Delivery Anytown, NC 28999) • Location Descriptions (i.e. Brick House at intersection of 1st and Main

Streets)

Page 13: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

13

Partner File Processing Overview

Page 14: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

14

Of 42,111,361 Partner Addresses…

(1,675,765)

(40,435,596)

Page 15: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

15

Of 40,435,596 UnduplicatedPartner Addresses…

(1,675,765)

(34,884,631)

Page 16: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

16

Of 34,884,631 MatchedPartner Addresses…

(32,589,844)

(1,782,125)

(492, 573)

(15,685,628)

(681,243)

(18,517,760)

Page 17: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

17

Of 5,550,965 UnmatchedPartner Addresses…

(1,320,507)

(4,230,458)

Page 18: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

18

Street Centerline Updates

13,601 Miles of new roads added

40,385 Miles of updated roads

53,986 total miles of feature updates- 2x Earth’s Circumference!

Page 19: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

19

GSS-I Feedback After processing completes, GSS-I partner receives:

“Thank You” letter Detailed Feedback Report GSS-I Summary Address Report

RO and WAH geographers in the initial review of the GSS-I feedback products

Total partners provided feedback to date: Addresses: 139 (15%) Centerlines: 91 (14%)

Page 20: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

GSS-I Address Feedback: Reports Figure 1. Detailed Feedback Report Sample for Florence Co South Carolina

(A)State

(B)County

(C)Tract

(D)Block

(E)GEOID

(F)Total

Addresses

(G)Total

Residential

(H)Total

Nonresidential

(I)Total Other

(J)Total

Matched

(K)Total

Added

(L)Total

Coordinates Added

(M)Total Not Accepted

(N)Total Not Accepted Duplicate

(O)Total Not Accepted

Incomplete

(P)Total Not Accepted

Other

(Q)Total

Currently in MAF

45 041 1.01 1000 450410001011000 5 0 1 4 4 0 3 1 0 0 1 0

45 041 1.01 1001 450410001011001 8 0 0 8 5 0 0 3 0 0 3 1

45 041 1.01 1002 450410001011002 21 1 0 20 10 0 10 11 1 0 10 2

45 041 1.01 1003 450410001011003 0 0 0 0 0 0 0 0 0 0 0 0

45 041 1.01 1004 450410001011004 0 0 0 0 0 0 0 0 0 0 0 0

45 041 1.01 1005 450410001011005 89 19 2 68 38 0 28 51 0 3 48 38

Figure 2. Summary Address Report Sample for Florence Co South Carolina

Column Address Data Submitted to the Census Bureau TotalF Total Addresses 64,929 G Total Residential Addresses 51,852H Total Nonresidential Addresses 277I Total Other 12,800 Address Actions taken by the Census Bureau J Total Matched 51,002K Total Added 24L Total Coordinates Added 44,953M Total Not Accepted 13,866N Total Not Accepted Duplicate 1,389O Total Not Accepted Incomplete 537P Total Not Accepted Other 11,940Q Total Currently in MAF 61,500

Page 21: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

21

GSS-I Feature Feedback: Shapefiles

Adding a date of last update field to partnership shapefiles:

Page 22: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

22

What’s Next?2015

• Continue partner file acquisition and MAF/TIGER update using partner data

• Automated partner feedback • Enhance conflation processes• Enhance change detection methodologies• Integrate Community TIGER into MAF/TIGER update • Communicate address canvassing recommendation• Research on data improvements for Puerto Rico and

Group Quarters• Transaction based partner file acquisition and evaluation

environment

Page 23: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

23

Community TIGER

Proof of Concept collaborative project with ESRI Web (cloud) based data exchange and data management portal Phased and iterative project Leverages COTS technology, existing systems and proven

workflows Utilizes and builds upon the next generation ESRI Community

Maps Local govt. GSS-I Phase 2 beta testers

Page 24: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Crosswalk

Analyze

Overlay

Feedback

CommunityTIGER

Page 25: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Address Data Migration

Page 26: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

26

Spatial Matching with TIGER

Page 27: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

27

Spatial Data Reviewer

Page 28: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Contribution Feedback

Page 29: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Geographic Products

TIGERWEB – WMS & REST map services

Cartographic Boundary Files

TIGER/Line Shapefiles & Geodatabases w/ Demographic Data

TIGER Geodatabases

TIGER/Line Shapefiles

KML Prototype Files

Full detail, extensive attributes, current vintages

Generalized detail, limited attributes, limited vintages

Generalized detail, limited attributes, limited vintages

Detailed, extensive attributes, current vintages

Full detail, limited attributes, current vintages

Full detail, limited attributes, limited vintages

NEW

8/19/14

Page 30: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Geocoding Service

Single Input, Batch, REST, API

Page 31: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

ACS 5-Year Decennial

High School Graduates (or more) by tract, Philadelphia, PA

Foreign Born by County, Nation

Other Data/Mapping Resources - Census Explorer -

http://www.census.gov/censusexplorer/

Page 32: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Other Data/Mapping Resources- LEHD OnTheMap -

http://lehd.did.census.gov/

Page 33: The Census GSS Initiative: Expanded Partnerships Leading to Improved Data Quality

Thank You

[email protected]

http://www.census.gov/geo/www/gss/

2014 Ohio GIS ConferenceSeptember 22 - 24, 2014 | Hyatt Regency Columbus| Columbus, Ohio