
Data Science 100 · Lecture 27: Ethics in Data Science

Slides by: Joshua Kroll, kroll@berkeley.edu

Gender Shades

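Gender Shades (Buolamwini and Gebru, 2018) audited commercial gender classifiers and found far higher error rates for darker-skinned women than for lighter-skinned men. The key evaluation move is disaggregation: report accuracy per subgroup rather than one overall number. A minimal sketch, with made-up predictions and hypothetical column names:

```python
import pandas as pd

# Hypothetical audit set: one row per face image, with the classifier's
# predicted gender, the true label, and the subject's subgroup.
results = pd.DataFrame({
    "true_gender": ["F", "F", "M", "M", "F", "M", "F", "M"],
    "pred_gender": ["M", "F", "M", "M", "M", "M", "F", "M"],
    "subgroup":    ["darker_F", "lighter_F", "darker_M", "lighter_M",
                    "darker_F", "darker_M", "lighter_F", "lighter_M"],
})

results["correct"] = results["true_gender"] == results["pred_gender"]

# A single overall accuracy can hide large gaps...
print("overall accuracy:", results["correct"].mean())

# ...so disaggregate: accuracy within each subgroup.
print(results.groupby("subgroup")["correct"].mean())
```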

Data science is everywhere…

Scoring Systems · Lotteries · Criminal Justice & National Security · News · Lending · Hiring


Data Science Life Cycle

• Context & question: refine the question to one answerable with data
• Design, Data Collection, Data Cleaning
• Modeling: Test-Train Split, Loss Function Choice, Feature Engineering (Transformations, Dummy Variables)
• Model Selection: Best Subset Regression, Cross-Validation
• Model Evaluation: Prediction Error

BIAS! It can enter at any of these stages. (The modeling steps above are sketched in code below.)
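To make the modeling steps concrete, here is a minimal pandas/scikit-learn sketch: a test-train split, dummy variables for a categorical feature, a fit under squared-error loss, and prediction error on held-out data. The dataset and column names are hypothetical, not from the lecture.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Hypothetical cleaned data: one numeric feature, one categorical feature.
df = pd.DataFrame({
    "income":      [40, 85, 60, 30, 95, 70, 55, 45],
    "region":      ["N", "S", "N", "W", "S", "W", "N", "S"],
    "loan_amount": [10, 30, 22, 8, 35, 25, 18, 12],
})

# Feature engineering: dummy variables for the categorical feature.
X = pd.get_dummies(df[["income", "region"]], columns=["region"], drop_first=True)
y = df["loan_amount"]

# Test-train split before any model fitting.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# Modeling: ordinary least squares, i.e. squared-error loss.
model = LinearRegression().fit(X_train, y_train)

# Model evaluation: prediction error on the held-out test set.
print("test MSE:", mean_squared_error(y_test, model.predict(X_test)))
```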

Context & Refinement

Proxies
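A proxy is a feature that carries much of the same information as a sensitive attribute, so dropping the sensitive attribute does not remove its influence on the model. One quick diagnostic, sketched here with hypothetical columns (zip_code, income, group), is to check how well the remaining features predict the sensitive attribute itself:

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Hypothetical applicant data: 'group' is the sensitive attribute we might
# drop from a model; 'zip_code' may act as a proxy for it.
df = pd.DataFrame({
    "zip_code": ["94110", "94610", "94110", "94703", "94610", "94610",
                 "94703", "94610", "94110", "94703", "94610", "94110"],
    "income":   [40, 90, 45, 60, 85, 38, 62, 95, 41, 58, 88, 44],
    "group":    [1, 0, 1, 0, 0, 1, 0, 0, 1, 0, 0, 1],
})

# How well do the *other* features predict the sensitive attribute?
X = pd.get_dummies(df[["zip_code", "income"]], columns=["zip_code"])
y = df["group"]

scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=3)
print("mean CV accuracy predicting the sensitive attribute:", scores.mean().round(2))

# High accuracy here means dropping 'group' from a downstream model
# would not actually remove its influence: zip_code stands in for it.
```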

Data Collection

http://bit.ly/ds100-sp18-eth

From Lum, Kristian, and William Isaac. "To predict and serve?" Significance 13, no. 5 (2016): 14-19.
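Lum and Isaac argue that predictive policing trained on recorded arrests can create a feedback loop: patrols go where past arrests were recorded, which produces more recorded arrests there, independent of where the underlying behavior actually occurs. A toy simulation of that loop (purely illustrative, all numbers made up):

```python
import numpy as np

rng = np.random.default_rng(0)

# Two neighborhoods with the same true rate of the underlying behavior,
# but neighborhood 0 starts with more recorded (historical) arrests.
true_rate = np.array([0.05, 0.05])
recorded = np.array([30.0, 10.0])

for step in range(10):
    # Allocate patrols in proportion to recorded arrests so far.
    patrol_share = recorded / recorded.sum()
    # New recorded arrests depend on both the true rate and where police look.
    new_arrests = rng.poisson(1000 * true_rate * patrol_share)
    recorded += new_arrests
    print(f"step {step}: patrol share = {np.round(patrol_share, 2)}, "
          f"recorded = {recorded.astype(int)}")

# Despite identical true rates, the neighborhood that starts with more
# recorded arrests keeps drawing more patrols, and the recorded gap grows.
```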

Data Cleaning

Test-Train Split

Loss Function Choice
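Choosing a loss function is a substantive decision, not a detail: the constant prediction that minimizes squared error is the mean, while the one that minimizes absolute error is the median, so the two losses respond very differently to extreme values. A small worked illustration with made-up numbers:

```python
import numpy as np

# Hypothetical outcomes with one large outlier.
y = np.array([1, 2, 2, 3, 50])

candidates = np.linspace(0, 50, 501)
squared_loss  = [np.mean((y - c) ** 2) for c in candidates]
absolute_loss = [np.mean(np.abs(y - c)) for c in candidates]

best_sq  = candidates[np.argmin(squared_loss)]   # ~ the mean (pulled by the outlier)
best_abs = candidates[np.argmin(absolute_loss)]  # ~ the median (robust to it)

print(f"mean = {y.mean():.1f}, best constant under squared loss = {best_sq:.1f}")
print(f"median = {np.median(y):.1f}, best constant under absolute loss = {best_abs:.1f}")
```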

So why does ProPublica think COMPAS is biased?
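ProPublica's analysis compared error rates across groups: among defendants who did not reoffend, Black defendants were roughly twice as likely as white defendants to have been labeled high risk (a false positive). The calculation itself is simple; a sketch with hypothetical, made-up audit data:

```python
import pandas as pd

# Hypothetical audit data: risk label, whether the person actually
# reoffended, and group membership. Numbers are invented for illustration.
df = pd.DataFrame({
    "high_risk":  [1, 1, 0, 1, 0, 0, 1, 1, 1, 0, 1, 0],
    "reoffended": [1, 0, 0, 1, 0, 0, 1, 0, 0, 0, 1, 1],
    "group":      ["A", "A", "A", "A", "A", "A",
                   "B", "B", "B", "B", "B", "B"],
})

# False positive rate: P(labeled high risk | did not reoffend), per group.
no_reoffense = df[df.reoffended == 0]
print(no_reoffense.groupby("group")["high_risk"].mean())

# Two models can have the same overall accuracy (or the same calibration)
# and still have very different false positive rates across groups.
```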

Feature Engineering

Model Selection
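Best subset regression and cross-validation both aim to choose a model by estimated out-of-sample error rather than training error. A small sketch on synthetic data, comparing a few hand-picked feature subsets by cross-validated MSE (a stand-in for a full best-subset search):

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n = 60
df = pd.DataFrame({
    "x1": rng.normal(size=n),
    "x2": rng.normal(size=n),
    "noise_feature": rng.normal(size=n),
})
df["y"] = 2 * df["x1"] - df["x2"] + rng.normal(scale=0.5, size=n)

# Candidate feature subsets to compare.
subsets = [["x1"], ["x1", "x2"], ["x1", "x2", "noise_feature"]]

for cols in subsets:
    # 5-fold cross-validated mean squared error for this subset.
    mse = -cross_val_score(LinearRegression(), df[cols], df["y"],
                           scoring="neg_mean_squared_error", cv=5).mean()
    print(f"{cols}: CV MSE = {mse:.3f}")
```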

Model Evaluation

So what do we do about this?

Ethical Lessons

1. Know what the data are telling you
• Understand your model, and validate it
• Involve domain experts; "raw data" is an oxymoron
• Communicate not just model capabilities, but also limits and assumptions

2. Data about people are about people
• The data do not exist on their own; someone made them

3. Always consider why your model might be wrong in systematic ways
• Don't just trust the technology; make it trustworthy
• Close the data science lifecycle: the world changes, and so should your models

Ethics beyond the Data Science Lifecycle

Should we deploy technology at all?

When is it ethical to collect data?

When is it ethical to collect sensitive data?

• Without knowing sensitive attributes, it's not possible to know how to make good decisions.
  E.g., in a credit-granting context, there's no reason to believe that qualified minority applicants will be similar to qualified members of the majority.
• But having sensitive data means it can be repurposed for other uses.

Inclusion and Representation

Resources

• Take a course at Berkeley on people and technology
• Visit https://fatml.org and https://fatconference.org
• Reading list
  • Principles for Accountable Algorithms
• Books
  • Weapons of Math Destruction, by Cathy O'Neil
  • How to Lie With Statistics, by Darrell Huff
• Courses around the world: tinyurl.com/ethics-classes

“Remember that all models are wrong; the practical question is how wrong do they have to be to not be useful.”

- George E. P. Box