Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop
Uploaded by crowdsourcing-week
![Page 1: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/1.jpg)
Humans to the Rescue: Troubleshooting
AI Systems with Human-in-the-loop
Ece Kamar
Senior Researcher, Microsoft Research AI
![Page 2: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/2.jpg)
Exciting Times
![Page 3: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/3.jpg)
AI and the Crowd
training data
accuracy
test data
![Page 4: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/4.jpg)
Power of Data
[Banko&Brill, 2001]
![Page 5: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/5.jpg)
In the Wild
![Page 6: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/6.jpg)
In the Wild
![Page 7: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/7.jpg)
Hybrid Intelligence
Human Intelligence
AI Systems
![Page 8: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/8.jpg)
AI Applied to Critical Domains
![Page 9: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/9.jpg)
Power of the Hybrid
[Courtesy of Murray Campbell]
![Page 10: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/10.jpg)
Troubleshooting of ML Systems
training data
accuracy
test data
query
system
response
execution data
In the lab
In the wild
What is the performance in the wild?
How does the system fail?
Why does the system fail?
How can the system be improved?
![Page 11: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/11.jpg)
Biases in ML
[Lakkaraju, K., Caruana, Horvitz; AAAI 2017]
![Page 12: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/12.jpg)
Biases in ML
[Lakkaraju, K., Caruana, Horvitz; AAAI 2017]
![Page 13: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/13.jpg)
Biases in ML
[Lakkaraju, K., Caruana, Horvitz; AAAI 2017]
![Page 14: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/14.jpg)
Where do Blind Spots Come From?
M
cats
dogs
cat (conf = 0.96)
Unknown unknowns: Data points with confident but incorrect predictions.
Blind spots: Regions of the feature space with a high concentration of unknown unknowns.
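The two definitions above can be made concrete with a small sketch. Everything here is illustrative: the `Prediction` record, the 0.9 confidence threshold, the color-based regions, and the sample data are assumptions for the example, not details from the talk.

```python
from dataclasses import dataclass

@dataclass
class Prediction:
    features: tuple    # hypothetical descriptive features, e.g. ("white", "dog")
    predicted: str     # label output by the model
    confidence: float  # model confidence in the predicted label
    actual: str        # true label, e.g. supplied by a human

def is_unknown_unknown(p, threshold=0.9):
    """Confident but incorrect prediction (threshold is an assumed cutoff)."""
    return p.confidence >= threshold and p.predicted != p.actual

def unknown_unknowns(preds, threshold=0.9):
    return [p for p in preds if is_unknown_unknown(p, threshold)]

def blind_spot_rates(preds, key, threshold=0.9):
    """Fraction of unknown unknowns in each region; key(p) names the region."""
    totals, hits = {}, {}
    for p in preds:
        region = key(p)
        totals[region] = totals.get(region, 0) + 1
        if is_unknown_unknown(p, threshold):
            hits[region] = hits.get(region, 0) + 1
    return {r: hits.get(r, 0) / totals[r] for r in totals}

# Hypothetical sample data.
preds = [
    Prediction(("white", "dog"), "cat", 0.96, "dog"),  # confident and wrong
    Prediction(("white", "cat"), "cat", 0.97, "cat"),
    Prediction(("black", "dog"), "dog", 0.95, "dog"),
    Prediction(("white", "dog"), "cat", 0.93, "dog"),  # confident and wrong
    Prediction(("black", "cat"), "dog", 0.55, "cat"),  # wrong, but not confident
]

rates = blind_spot_rates(preds, key=lambda p: p.features[0])
```

In this toy data the "white" region holds both unknown unknowns, so it would be the candidate blind spot; the low-confidence mistake in the "black" region does not count because the model already signals its uncertainty there.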
![Page 15: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/15.jpg)
Blind-spots Detection
execution data
Beat the Machine [Attenberg, Ipeirotis, Provost, 2011]
Exploration of Unknown Unknowns [Lakkaraju, K., Caruana, Horvitz, 2017]
Step 1: Descriptive Space Partitioning
execution data
Step 2: Multi-armed Bandit based Exploration
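As a rough illustration of Step 2, the sketch below treats each partition from Step 1 as a bandit arm and spends a limited labeling budget where unknown unknowns turn up most often. It uses the generic UCB1 rule as a stand-in; the slides do not say which bandit policy the cited work uses. The `oracle` callback (a human check of one data point), the partition names, and the sample data are hypothetical.

```python
import math

def ucb1_explore(partitions, oracle, budget):
    """Query up to `budget` points, favoring partitions rich in unknown unknowns.

    partitions: dict mapping partition name -> list of data points
    oracle: function(point) -> True if the point is an unknown unknown
            (stands in for a human judgment)
    """
    pools = {name: list(items) for name, items in partitions.items()}
    pulls = {name: 0 for name in pools}
    rewards = {name: 0 for name in pools}
    found = []

    for t in range(1, budget + 1):
        candidates = [n for n in pools if pools[n]]
        if not candidates:
            break  # every partition is exhausted

        def score(name):
            if pulls[name] == 0:
                return float("inf")  # try every partition at least once
            mean = rewards[name] / pulls[name]
            return mean + math.sqrt(2 * math.log(t) / pulls[name])

        arm = max(candidates, key=score)
        point = pools[arm].pop()
        hit = oracle(point)
        pulls[arm] += 1
        rewards[arm] += int(hit)
        if hit:
            found.append(point)
    return found, pulls

# Hypothetical execution data: partition "B" hides most unknown unknowns.
partitions = {
    "A": [("A", False) for _ in range(20)],
    "B": [("B", i % 5 != 0) for i in range(20)],
}
found, pulls = ucb1_explore(partitions, oracle=lambda point: point[1], budget=20)
```

With this data most of the budget goes to partition "B", while "A" is still sampled occasionally so that a partition is never written off after a single unlucky query.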
![Page 16: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/16.jpg)
Troubleshooting Complex Systems
![Page 17: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/17.jpg)
Challenge
Possible fixes for each component
Limited development time
Where to invest development time for the biggest impact?
![Page 18: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/18.jpg)
Human-assisted troubleshooting methodology
Diagram labels: system (Component 1, Component 2, Component 3), I/O, I/O, output, Evaluation, Failures, Fixes
[Nushi, K., Kossmann, Horvitz, 2017]
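One way to read the methodology is as a what-if experiment: substitute a corrected version of one component at a time, re-run the rest of the system, and see how much the evaluated output quality improves. The sketch below is a toy under stated assumptions, not the system or procedure from the cited work. The three text-processing components, the "ideal" replacements that stand in for human fixes, and the dataset are all hypothetical.

```python
import string

def run_pipeline(components, x, fixes=None):
    """Run components in order; fixes maps a component index to a corrected stand-in."""
    fixes = fixes or {}
    value = x
    for i, component in enumerate(components):
        # A fix replaces the component on the same input it would have received.
        value = fixes[i](value) if i in fixes else component(value)
    return value

def quality(components, dataset, fixes=None):
    """Fraction of examples whose final output matches the expected output."""
    correct = sum(run_pipeline(components, x, fixes) == y for x, y in dataset)
    return correct / len(dataset)

def rank_fixes(components, dataset, candidate_fixes):
    """Quality gain from fixing each component alone, relative to the baseline."""
    baseline = quality(components, dataset)
    gains = {
        i: quality(components, dataset, {i: fix}) - baseline
        for i, fix in candidate_fixes.items()
    }
    return baseline, gains

# Hypothetical three-component system: count distinct words in a text.
def tokenize(text):          # flaw: punctuation stays attached to words
    return text.split()

def normalize(tokens):       # flaw: forgets to lowercase
    return [t.strip() for t in tokens]

def count_unique(tokens):
    return len(set(tokens))

# Idealized stand-ins for human fixes (assumed, for illustration only).
def ideal_tokenize(text):
    return [w.strip(string.punctuation) for w in text.split()]

def ideal_normalize(tokens):
    return [t.lower() for t in tokens]

components = [tokenize, normalize, count_unique]
dataset = [
    ("cat dog cat", 2),
    ("cat, dog. cat", 2),
    ("Cat cat dog", 2),
    ("dog. dog cat", 2),
]
baseline, gains = rank_fixes(components, dataset, {0: ideal_tokenize, 1: ideal_normalize})
best = max(gains, key=gains.get)
```

Here the simulated fix to the first component yields the larger gain, which is the kind of evidence that could guide where limited development time is spent.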
![Page 19: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/19.jpg)
Complex Issues
Fairness
Biases
Transparency
Responsibility
Good vs. Bad
Policy & Law
![Page 20: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop](https://reader031.fdocuments.in/reader031/viewer/2022030318/5a650e0b7f8b9ac75b8b4a1f/html5/thumbnails/20.jpg)
Complex challenges
require collective efforts
No AI is perfect