An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle...

29
An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law [email protected] http://swdb.berkeley.edu

Transcript of An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle...

Page 1: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

An Introduction to theStatewide Database

September 24, 2012

Karin Mac Donald & Nicole Boyle

Berkeley Law

[email protected]

http://swdb.berkeley.edu

Page 2: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

A little history:

A bipartisan agreement after Prop. 140 in 1990 moved creation of redistricting database from Legislature to independent body – UCB selected

Goal: to create a database that is non-partisan, free from political influence, academic, public, free and always available

Not cheap: Legislature invests millions of dollars each decade to create and distribute; but cheaper than lawsuits!

Page 3: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

The Statewide Database today:

The Legislature shall take all steps necessary to ensure that a complete and accurate computerized database is available for redistricting, and that procedures are in place to provide the public ready access to redistricting data and computer software for drawing maps. Upon the commission's formation and until its dissolution, the Legislature shall coordinate these efforts with the commission.

(Government Code Sec. 8253 (7) (b))

Page 4: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

What is in the Official Redistricting Dataset?

Same data that were produced for 2001 redistricting Census Geography Census Data (PL94-171) Statement of Registration Data for 10 years Statement of Vote for 10 years

Added for 2011 Citizen Voting Age Population (CVAP) separately Selected Older Elections Documentation

Page 5: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

What is NOT in the Official Dataset?

Does not include American Community Survey (CVAP separate) Adjusted Census Data (i.e. Prisons) Racially Polarized Voting Analysis Local Election Results Community of Interest database Neighborhood database

Page 6: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Census Data in the SWDB Basic Information/ Data that jurisdictions are required by

law to use for Redistricting Census 2010 Redistricting Data (Public Law 94-171, or

PL94-171) contains the count of the U.S. population BLOCK-LEVEL dataset Includes data on people's race and ethnicity, for both the

total and the voting age population Information is based on answers to the questions in the

Census 2010 Short-Form questionnaire.

Page 7: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Census Geography: 2010 - 2020

58 Counties 1,523 Census Places 8,057 Census Tracts 23,212 Census Block Groups 710,145 Census Blocks

Page 8: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Census Geography

Page 9: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

California is divided into counties…. Counties are divided into tracts…

Census Geography

Tracts are divided into block groups… And block groups are divided into blocks

Page 10: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

5 detailed tables in the PL94-171

Subject Table # RaceTotal population . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . P1

Hispanic or Latino, and Not Hispanic or Latino by RaceTotal population . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . P2

Race for the Population 18 Years and OverTotal population 18 years and over . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .P3

Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 Years and OverTotal population 18 years and over . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .P4

Occupancy StatusHousing units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . H1

Page 11: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

P1. RaceUniverse: Total populationTotal:

Population of one race:White aloneBlack or African American aloneAmerican Indian and Alaska Native aloneAsian aloneNative Hawaiian and Other Pacific Islander aloneSome other race alone

Repeats for the Population of two or more races…..

P2. Hispanic or Latino, and Not Hispanic or Latino by RaceUniverse: Total populationTotal: Hispanic or Latino Not Hispanic or Latino:

Population of one race: White alone

Black or African American aloneAmerican Indian and Alaska Native aloneAsian aloneNative Hawaiian and Other Pacific Islander aloneSome other race alone

Repeats for the Population of two or more races…..

Census Data

Page 12: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

P3. Race For The Population 18 Years and Over P1 variables are repeated for the Population 18 Years and Over

P4. Hispanic or Latino, and Not Hispanic or Latino By Race For The Population 18 Years And Over P2 variables are repeated for the Population 18 Years and Over

H1. Occupancy StatusUniverse: Housing unitsTotal:

OccupiedVacant

Census Data

Page 13: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Aside from the Redistricting Block Level Dataset: what’s in the SWDB?

Registration and Statement of Vote Data by precinct

Datasets available starting in 1992 Datasets for Statewide General Elections For 1990s some Primaries processed For 2000s all Primaries processed

Page 14: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Processing Registration Data: Voters are geocoded to individually assign them to their

proper census block.

Page 15: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

 REG ABS POLLV VOTE

CaliforniaStatewide

codebookby rgprecby rrprecby srprec

codebook by rgprec

by rrprecby srprec

codebookby rgprecby rrprecby srprec

codebookby rgprecby rrprecby srprec

County 001(Alameda)

by rgprecby rrprecby srprec

by rgprecby rrprecby srprec

by rgprecby rrprecby srprec

by rgprecby rrprecby srprec

County 003(Alpine)

by rgprecby rrprecby srprec

by rgprecby rrprecby srprec

mail-ballotcounty

by rgprecby rrprecby srprec

County 005(Amador)

by rgprecby rrprecby srprec

by rgprecby rrprecby srprec

by rgprecby rrprecby srprec

by rgprecby rrprecby srprec

Registration Data in the SWDB

Page 16: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Precinct Types:

RG = Original Registration Precincts (designated by County Registrar)

RR = Registration Precincts (geographic unit constructed for statistical merging purposes by SWDB)

SV = Original Voting Precincts (designated by County Registrar)

SR = Consolidated Precinct (geographic unit constructed for statistical merging purposes by SWDB)

Page 17: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Variable Start End Format DescriptionTOTREG 16 20 F5.0 Total registrationDEM 21 25 F5.0 Democratic Party RegistrationREP 26 30 F5.0 Republican Party RegistrationAIP 31 35 F5.0 American Independent Party RegistrationPAF 36 40 F5.0 Peace and Freedom Party RegistrationMSC 41 45 F5.0 Miscellaneous RegistrationLIB 46 50 F5.0 Libertarian Party RegistrationNLP 51 55 F5.0 Natural Law Party RegistrationGRN 56 60 F5.0 Green Party RegistrationREF 61 65 F5.0 Reform Party RegistrationDCL 66 70 F5.0 Declined to State RegistrationMALE 91 95 F5.0 MaleFEMALE 96 100 F5.0 Female

Statement of Registration

Page 18: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Variable Start End Format DescriptionHISPDEM 101 105 F5.0 Latino DemocratsHISPREP 106 110 F5.0 Latino RepublicansHISPDCL 111 115 F5.0 Latino Declined to StateHISPOTH 116 120 F5.0 Latino Other Party

REPEATS for:JewishKoreanJapaneseChineseAsian IndianVietnameseFilipino

Statement of Registration

Page 19: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Statement of Registration

Variable Start End Format DescriptionDEMMUNK 291 295 F5.0 Dem Male age unknownDEMM1824 261 265 F5.0 Dem Male age 18-24DEMM2534 266 270 F5.0 Dem Male age 25-34DEMM3544 271 275 F5.0 Dem Male age 35-44DEMM4554 276 280 F5.0 Dem Male age 45-54DEMM5564 281 285 F5.0 Dem Male age 55-64DEMM65PL 286 290 F5.0 Dem Male age 65 PlusDEMFUNK 326 330 F5.0 Dem female age unknownDEMF1824 296 300 F5.0 Dem female age 18-24DEMF2534 301 305 F5.0 Dem female age 25-34DEMF3544 306 310 F5.0 Dem female age 35-44DEMF4554 311 315 F5.0 Dem female age 45-54DEMF5564 316 320 F5.0 Dem female age 55-64DEMF65PL 321 325 F5.0 Dem female age 65 Plus

REPEATS FOR:RepublicanDeclined to State Other Party

Page 20: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Statement of Registration

Variable Start End Format DescriptionRREG1G 541 545 F5.0 REP registered 1 cycle RREG2G 546 550 F5.0 REP registered 2 cyclesRREG3G 551 555 F5.0 REP registered 3 cyclesRREG4G 556 560 F5.0 REP registered 4 cyclesRREG5G 561 565 F5.0 REP registered 5 cyclesRREG6G 566 570 F5.0 REP registered 6 cyclesRREG7G 571 575 F5.0 REP registered 7 cyclesRREG8G 576 580 F5.0 REP registered 8 cyclesRREG9G 581 585 F5.0 REP registered 9+ cycles

REPEATS FOR:DemocratDeclined to State Other Party

Page 21: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Statement of Vote

Electoral data processed primarily for racially polarized voting analyses (RPV)

Individualized data not available Precincts are the smallest unit of analysis for electoral

data and change from election to election Precincts do not correspond to census geography Converted precinct data to census geography (for

redistricting dataset) Between redistrictings, data are available on the precinct

level only SWDB provides conversion files to move data from

precinct to precinct or from precinct to census geography

Page 22: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Statement of Vote

2008 California General Election 62,404 Precincts in the State

Page 23: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Statement of Vote

Precincts do not correspond to census geography

Page 24: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Statement of Vote For each election, the database includes:

Turnout Statewide Contests Legislative Contests Propositions

see for example: http://swdb.berkeley.edu/d00/g10.html for codebook see here:http://swdb.berkeley.edu/pub/data/G10/c001/001.codes

Page 25: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Also available on the SWDB site: Reports Census Reports Statewide Electoral and Registration Reports Zipcode Reports Miscellaneous Reports California Official Statements of Registration 1962-2008 California Official Statements of Vote 1990-2008

Web GIS for new and old districts! Lots of maps

Page 26: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

What’s new at SWDB?

1. New in the p12: reg data for for reg voters that voted via forced mail ballot

precincts reg data for reg voters that did not vote g08 and g10 sov data by p12 reg precinct

2. pdf maps of the new districts and district summaries for the new districts,

3.rg to rr to sr precinct conversion files for the 2003S to 2010G elections.

4. reports of the 2010G - 2004G data for the new districts for the top two candidates in each race (i.e.no 3rd party candidate data for the new districts)

5. precinct to 2010 census block conversion files

Page 27: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

What’s in the works?

new district reports with 3rd party candidate data new and old district overlay reports and maps ESRI based webgis

Page 28: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

What are the Advantages of the SWBD? Everyone uses the same database

Saves $$$ Transparency builds trust! “Other side” issues eliminated More time for discussions about lines than data

Availability to other jurisdictions in CA Saves $$$ Cities, counties, special districts – better data!

Debugging over time by users Error goes down; issues resolved promptly

Allows public to familiarize themselves with data Can be used between redistrictings

Avoids building at extreme time pressure Cuts down on data collection issues Leaves time to deal with ‘surprises’

Page 29: An Introduction to the Statewide Database September 24, 2012 Karin Mac Donald & Nicole Boyle Berkeley Law karin@cain.berkeley.edu .

Questions?