Trusting AI with important decisions

@louisdorard

March 26, 2016

AI is everywhere

Amazon for David Jones (@d_jones, see source)

http://twitter.com/d_jones

http://www.slideshare.net/papisdotio/david-jones-49416956?qid=e41d3bb8-8359-43bc-8b2b-4facca320b2d&v=default&b=&from_search=1

Lars Trieloff (@trieloff, see source)

http://www.slideshare.net/lars3loff/business-reasons-for-predictive-applications

http://twitter.com/trieloff


@louisdorard

http://twitter.com/louisdorard

ChurnSpotter.io

http://churnspotter.io

• Startups pitch

• AI asks each startup questions live

• AI assigns score

• Startup with highest score wins 100,000 €

AI Startup Battle at PAPIs.io

http://papis.io

Preseries

How does it work?

Data + Machine Learning

Bedrooms  Bathrooms  Surface (foot²)  Year built  Type       Price ($)
3         1          860              1950        house      565,000
3         1          1012             1951        house      -
2         1.5        968              1976        townhouse  447,000
4         -          1315             1950        house      648,000
3         2          1599             1964        house      -
3         2          987              1951        townhouse  790,000
1         1          530              2007        condo      122,000
4         2          1574             1964        house      835,000
4         -          -                2001        house      855,000
3         2.5        1472             2005        house      -
4         3.5        1714             2005        townhouse  -
2         2          1113             1999        condo      -
1         -          769              1999        condo      315,000

ML is a set of AI techniques where “intelligence” is built from examples
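To make "built from examples" concrete, here is a minimal sketch that estimates a missing price from the most similar priced rows. It assumes the housing rows read as shown in the comments (only rows with every value present are used). The nearest-neighbour method, the feature scaling and the name `predict_price` are illustrative choices, not the technique used in the talk.

```python
# Complete rows as read from the housing data above (an assumption about
# how the flattened table splits into rows):
# (bedrooms, bathrooms, surface in sq ft, year built) -> price in $.
EXAMPLES = [
    ((3, 1.0, 860, 1950), 565_000),
    ((2, 1.5, 968, 1976), 447_000),
    ((3, 2.0, 987, 1951), 790_000),
    ((1, 1.0, 530, 2007), 122_000),
    ((4, 2.0, 1574, 1964), 835_000),
]


def _scales(examples):
    """Range of each feature, used to put features on a comparable scale."""
    columns = list(zip(*(features for features, _ in examples)))
    return [(max(col) - min(col)) or 1 for col in columns]


def predict_price(features, examples=EXAMPLES, k=2):
    """Average the prices of the k most similar examples (illustrative k-NN)."""
    scales = _scales(examples)

    def distance(other):
        return sum(((a - b) / s) ** 2 for a, b, s in zip(features, other, scales))

    nearest = sorted(examples, key=lambda ex: distance(ex[0]))[:k]
    return sum(price for _, price in nearest) / len(nearest)


# A row without a price: 3 bedrooms, 2 bathrooms, 1599 sq ft, built 1964.
estimate = predict_price((3, 2.0, 1599, 1964))
print(f"Estimated price: ${estimate:,.0f}")
```

No pricing rule is written by hand: the estimate comes entirely from the examples, and adding more priced rows changes the answer.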

Use cases

• Real-estate: property → price (Zillow)

• Spam filtering: email → spam indicator (Gmail)

• City bikes: location & context → #bikes (V3 predict)

• Startup competition: startup → success indicator (Preseries)

• Reduce churn: customer → churn indicator (ChurnSpotter)

• Optimize pricing: product & price → #sales (Amazon)

• Anticipate demand: context → demand (Blue Yonder)

RULES

“Weak AI” vs. “Strong AI”

Decisions from predictions

1. Descriptive

2. Predictive

3. Prescriptive

Phases of data analysis

1. Show churn rate against time

2. Predict which customers will churn next

3. Suggest what to do about each customer (e.g. propose to switch plan, send promotional offer, etc.)

Churn analysis

“Suggest what to do about each customer” → prioritised list of actions, based on…

• Customer representation + context

• Churn prediction & action prediction

• Uncertainty in predictions

• Revenue brought by customer & Cost of actions

• Constraints on frequency of solicitations
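One way to combine the ingredients listed above into a prioritised list is sketched here. The `Candidate` fields, the 14-day contact rule, the sample numbers and the pessimistic treatment of uncertainty are all hypothetical. In a real system the probabilities would come from trained churn and action models.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    customer: str
    action: str
    churn_prob: float        # predicted P(churn) if we do nothing
    churn_prob_after: float  # predicted P(churn) if we take the action
    uncertainty: float       # margin of error on the predicted reduction
    revenue: float           # revenue lost if the customer churns
    cost: float              # cost of taking the action
    days_since_contact: int  # time since the last solicitation


def expected_gain(c: Candidate) -> float:
    """Pessimistic expected gain: shrink the predicted reduction by its uncertainty."""
    reduction = max(0.0, c.churn_prob - c.churn_prob_after - c.uncertainty)
    return reduction * c.revenue - c.cost


def prioritise(candidates, min_days_between_contacts=14):
    """Best action per eligible customer, sorted by expected gain (highest first)."""
    best = {}
    for c in candidates:
        if c.days_since_contact < min_days_between_contacts:
            continue  # frequency constraint: do not solicit too often
        gain = expected_gain(c)
        if gain > 0 and (c.customer not in best or gain > expected_gain(best[c.customer])):
            best[c.customer] = c
    return sorted(best.values(), key=expected_gain, reverse=True)


# Made-up candidates for illustration only.
candidates = [
    Candidate("alice", "switch plan", 0.60, 0.30, 0.05, 1200.0, 50.0, 30),
    Candidate("alice", "promo offer", 0.60, 0.40, 0.05, 1200.0, 20.0, 30),
    Candidate("bob", "promo offer", 0.80, 0.50, 0.10, 300.0, 20.0, 45),
    Candidate("carol", "switch plan", 0.70, 0.20, 0.05, 2000.0, 50.0, 3),
]
plan = prioritise(candidates)
for c in plan:
    print(c.customer, c.action, round(expected_gain(c), 2))
```

Here carol is skipped despite the largest potential gain, because she was contacted three days ago: the constraint overrides the prediction.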

Churn analysis

Pricing optimisation

Again, from David Jones (@d_jones, see source)

http://twitter.com/d_jones

http://www.slideshare.net/papisdotio/david-jones-49416956?qid=e41d3bb8-8359-43bc-8b2b-4facca320b2d&v=default&b=&from_search=1

Decide price given product and context…

• For several price candidates (within constrained range):

• Predict # sales given product, context, price

• Multiply by price to estimate revenue


Decide price given product and context…

• For several price candidates (within constrained range):

• Predict 95%-confidence lower bound on # sales given product, context, price

• Multiply by price to estimate revenue
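The two variants of this procedure (expected sales, then a lower bound on sales) can be sketched as follows. `toy_demand` is a hypothetical stand-in for a trained sales model, and its "margin" only imitates the gap between the expected value and a 95%-confidence lower bound. The candidate prices are made up.

```python
def best_price(candidates, predict_sales, lower_bound=False):
    """Return (price, estimated revenue) maximising price * predicted sales."""
    def revenue(price):
        mean, margin = predict_sales(price)
        sales = max(0.0, mean - margin) if lower_bound else mean
        return price * sales

    price = max(candidates, key=revenue)
    return price, revenue(price)


def toy_demand(price):
    """Hypothetical model output: (expected # sales, margin below it)."""
    mean = max(0.0, 100.0 - 4.0 * price)
    margin = 2.0 * price  # assumption: predictions are less certain at high prices
    return mean, margin


prices = [8, 10, 12, 14, 16]  # candidates within the constrained range

print(best_price(prices, toy_demand))                    # uses expected sales
print(best_price(prices, toy_demand, lower_bound=True))  # uses the lower bound
```

With these made-up numbers the lower-bound variant picks a lower price than the expected-value variant: it gives up some upside for a revenue estimate that is less likely to be missed.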


1. Show past demand against calendar

2. Predict demand for [product] at [store] in next 2 days

3. Suggest how much to ship

• Trade-off: cost of storage vs risk of lost sales

• Constraints on order size, truck volume, capacity of people putting stuff into shelves
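The trade-off and constraints above can be sketched as a small search over order sizes. The demand scenarios, unit costs, batch size and capacity below are made-up values, and the function names are hypothetical.

```python
def expected_cost(order, demand_scenarios, storage_cost, lost_sale_cost):
    """Average cost over (demand, probability) scenarios for a given order size."""
    total = 0.0
    for demand, prob in demand_scenarios:
        leftover = max(0, order - demand)   # units that must be stored
        shortfall = max(0, demand - order)  # sales lost to an empty shelf
        total += prob * (leftover * storage_cost + shortfall * lost_sale_cost)
    return total


def suggest_order(demand_scenarios, storage_cost, lost_sale_cost, max_order, batch=1):
    """Cheapest order size among multiples of `batch` up to `max_order`."""
    options = range(0, max_order + 1, batch)
    return min(
        options,
        key=lambda q: expected_cost(q, demand_scenarios, storage_cost, lost_sale_cost),
    )


# Hypothetical predicted demand for the next 2 days: (units, probability).
scenarios = [(20, 0.2), (30, 0.5), (40, 0.3)]

order = suggest_order(scenarios, storage_cost=1.0, lost_sale_cost=4.0,
                      max_order=50, batch=10)
print("Suggested shipment:", order)
```

Because a lost sale costs more than storing a unit in this example, the suggestion leans towards the high end of predicted demand. Lowering `max_order` (for instance, a smaller truck) caps it.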

Replenishment

AI vs humans

Who performs better?

Star Wars: The Flat Awakens by Filipe de Carvalho

https://www.behance.net/gallery/29238821/Star-Wars-The-Flat-Awakens

AI performs better: Chess

AI performs better: Go

AI + Human perform better: Chess

Humans perform better: football

AI performs better: replenishment

Decisions are faster, cheaper, and better

AI alone performs better: replenishment

Again, from Lars Trieloff @trieloff (see source)

[Chart: decision quality for Status Quo, Predictive, Prescriptive, Automation]

http://twitter.com/trieloff


1. Descriptive analysis

2. Predictive analysis

3. Prescriptive analysis

4. Automated decisions

Beyond prescriptive analysis

Can we trust AI to be autonomous?

• Spam filter → decide to skip inbox

• Autonomous Vehicles → decide who to kill

Autonomous decision-making systems

⇒ “Tool AI” vs “High-stakes autonomous AI”

Autonomous Vehicles

• Morality in decision-making algorithm:

• Minimize loss of life

• Account for probabilities of survival, age of occupants… → optimal formula?

• Sacrifice owner?

• “People are in favor of cars that sacrifice the occupant to save other lives—as long they don’t have to drive one themselves.”

Autonomous Vehicles

• Need wide acceptance to get adoption and deliver benefits (e.g. save lives with AVs)

• “The public is much more likely to go along with a scenario that aligns with their own views”

• What will the public tolerate? → experimental ethics

• Similar issues whenever AI decides for us and impacts many

⇒ Additional rules in decision making

High-stakes autonomous AIs

Performance guarantees?

“construction worker in orange safety vest is working on road”

95%-accurate scene description

Performance guarantees

“black and white dog jumps over bar”


“a young boy is holding a baseball bat”



“a young boy is holding a baseball bat” → “weapon”

SIR, DROP THE WEAPON!

1. A robot may not injure a human being or, through inaction, allow a human being to come to harm.

2. A robot must obey the orders given it by human beings, except where such orders would conflict with the First Law.

3. A robot must protect its own existence as long as such protection does not conflict with the First or Second Laws.

Defining desired and acceptable behavior

• Performance of predictions → monitor accuracy

• Decisions

• Monitor AI with other AI (e.g. anomaly detection)

• Define desired and acceptable behavior → objectives and constraints/bounds
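A minimal sketch of two of the safeguards above: tracking prediction accuracy over a recent window, and keeping decisions within acceptable bounds. It uses a simple accuracy threshold rather than a second AI such as an anomaly detector. The class name, window size, threshold and fallback value are hypothetical.

```python
from collections import deque


class AccuracyMonitor:
    """Tracks accuracy over the most recent predictions whose outcome is known."""

    def __init__(self, window=100, min_accuracy=0.9):
        self.outcomes = deque(maxlen=window)  # oldest results drop out automatically
        self.min_accuracy = min_accuracy

    def record(self, predicted, actual):
        self.outcomes.append(predicted == actual)

    def accuracy(self):
        return sum(self.outcomes) / len(self.outcomes) if self.outcomes else None

    def healthy(self):
        acc = self.accuracy()
        # Assumption: with no feedback yet, the system is treated as healthy.
        return acc is None or acc >= self.min_accuracy


def bounded(decision, low, high):
    """Keep a numeric decision within the acceptable range."""
    return min(max(decision, low), high)


def decide(proposed, monitor, low, high, fallback):
    """Use the AI's bounded proposal while it is healthy, else a fallback."""
    return bounded(proposed, low, high) if monitor.healthy() else fallback


monitor = AccuracyMonitor(window=4, min_accuracy=0.75)
for predicted, actual in [(1, 1), (0, 0), (1, 1), (1, 0)]:
    monitor.record(predicted, actual)

print(decide(120, monitor, low=0, high=100, fallback=50))  # proposal is clipped
monitor.record(0, 1)  # one more mistake pushes accuracy below the threshold
print(decide(120, monitor, low=0, high=100, fallback=50))  # fallback is used
```

The fallback could equally be "hand the decision to a human". What matters is that acceptable behavior is written down as explicit bounds that can be checked.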

Ensuring performance of autonomous AI systems

• Context

• Predictions

• Uncertainty in predictions

• Constraints (i.e. acceptable behavior)

• Costs / benefits

• Objectives (i.e. desired behavior)

Decisions are based on…

• Trusting decisions when we can’t even interpret them

• Who is responsible when things go wrong?

• …

• Issues are not linked to the AI being weak or strong!

Other issues

Original article at stories.papis.io (with references and links)

Learn more

http://stories.papis.io

http://meetup.com/Bordeaux-Machine-Learning-Meetup/

@louisdorard

http://twitter.com/louisdorard

Thank you!
