Gerrit Gruben: Limits of Data Science and other ethical considerations

Ethics for Data Scientists: The Limits of ML
Munich DataGeeks
Gerrit Gruben, 31 January 2018

about.me
•Freelance DS, before worked as DS/SWE.
•Training people in a 3-month boot camp to be DS → datascienceretreat.com
•Org. of Kaggle Berlin meetup
•ML PhD Dropout @ Potsdam
•Degrees in Math. & CS, going for Laws (sic!)

Goals

Main points
•No data positivism in ML
  •Inductive bias is always there.
  •IID assumption is idealistic.
•Can't predict everything
•ML systems are prone to manipulation (fragility)

Limits & Biases

Benevolent or evil?

“Absence of Evidence is not Evidence of Absence” --- Data Scientist’s Proverbs

Source: http://www.gpmfirst.com/books/exploiting-future-uncertainty/risk-concepts

“I beseech you, in the bowels of Christ, think it possible that you may be mistaken” --- Oliver Cromwell
Dennis Lindley (Cromwell's rule): avoid prior probabilities of 0 and 1.

Problem of Induction
•More general than the black swan problem.
•ML models have an inductive bias.
“The process of inferring a general law or principle from the observation of particular instances.” --- Oxford's Dictionary (direct opposite of deduction)

“When you have two competing theories that make exactly the same predictions, the simpler one is the better.” --- Ockham’s Razor

Technical Things
What goes wrong often…

Multiple Testing
Repeating tests until one "hits" the significance level by chance.
Solution: Bayesian methods, a correction (e.g. Bonferroni correction), or a different experimental design.
Data Snooping: http://bit.ly/2iWoFrV
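A minimal sketch of why repeated testing "hits" by chance and what the Bonferroni correction does about it. It assumes independent tests; the function names and the p-values are made up for illustration.

```python
def family_wise_error_rate(alpha: float, m: int) -> float:
    """Chance of at least one false positive in m independent tests at level alpha."""
    return 1.0 - (1.0 - alpha) ** m


def bonferroni_reject(p_values, alpha=0.05):
    """Reject a null hypothesis only if its p-value is at most alpha / m."""
    threshold = alpha / len(p_values)
    return [p <= threshold for p in p_values]


# Hypothetical p-values from five tests run on the same data set.
p_values = [0.001, 0.012, 0.03, 0.04, 0.2]

# With 20 uncorrected tests at alpha = 0.05, a spurious "hit" is more likely than not.
print(f"FWER for 20 tests: {family_wise_error_rate(0.05, 20):.2f}")

# Uncorrected, four of the five p-values fall below 0.05; after correction only one survives.
print(bonferroni_reject(p_values))
```

Bonferroni is conservative: it controls the family-wise error rate at the price of statistical power, which is one reason the slide also lists a different experimental design as an alternative.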

Statistical Power

Simpson's Paradox
Let's try it at: https://vudlab.com/simpsons/
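For readers without access to the interactive demo, the paradox can be sketched with a small table. The counts below are illustrative numbers chosen for this sketch, not data from the talk: treatment A wins in every subgroup and still loses overall, because the subgroups have very different sizes.

```python
# Illustrative counts: (successes, patients) per treatment and subgroup.
counts = {
    "A": {"small": (81, 87), "large": (192, 263)},
    "B": {"small": (234, 270), "large": (55, 80)},
}


def rate(successes, total):
    return successes / total


def overall_rate(groups):
    """Success rate after pooling all subgroups of one treatment."""
    successes = sum(s for s, _ in groups.values())
    total = sum(n for _, n in groups.values())
    return successes / total


for subgroup in ("small", "large"):
    a = rate(*counts["A"][subgroup])
    b = rate(*counts["B"][subgroup])
    print(f"{subgroup}: A={a:.3f}  B={b:.3f}")

print(f"pooled: A={overall_rate(counts['A']):.3f}  B={overall_rate(counts['B']):.3f}")
```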

Frequentist vs Bayesian

"P-hacking"

"P-hacking" II
“When a measure becomes a target, it ceases to be a good measure” --- Goodhart's law

selection ≠ evaluation

Paper: http://bit.ly/2gBIR1M
Prefer to call it “over-selection”.
In “Learning with Kernels” by Smola & Schölkopf, ex. 5.10 is named “overfitting on the test set”.
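A small simulation of over-selection, under stated assumptions: every "model" here is a fixed table of coin-flip predictions with no skill at all, and the sizes and names are invented for the sketch. Picking the best of many such models on one data set yields a score that looks good on that set and falls back to chance on fresh data.

```python
import random

rng = random.Random(0)
N_MODELS, N_SELECT, N_FRESH = 200, 50, 2000


def coin_flips(n):
    return [rng.random() < 0.5 for _ in range(n)]


def accuracy(predictions, labels):
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


# Labels are pure noise, so the true accuracy of any model is 0.5.
y_select = coin_flips(N_SELECT)
y_fresh = coin_flips(N_FRESH)

# Each model: (predictions on the selection set, predictions on fresh data).
models = [(coin_flips(N_SELECT), coin_flips(N_FRESH)) for _ in range(N_MODELS)]

# Selection: keep the model that scores best on the selection set.
best = max(models, key=lambda m: accuracy(m[0], y_select))

select_acc = accuracy(best[0], y_select)  # reused for evaluation: optimistic
fresh_acc = accuracy(best[1], y_fresh)    # data that played no part in selection

print(f"accuracy on the selection set: {select_acc:.2f}")
print(f"accuracy on fresh data:        {fresh_acc:.2f}")
```

The gap between the two numbers is the selection effect; it grows with the number of candidates and shrinks with the size of the selection set.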

Empirical Risk Minimization

Empirical Loss

Empirical Risk Minimization II
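The formulas on these slides did not survive in the transcript. As a reminder only, the standard textbook definitions are sketched below; the notation (loss $\ell$, hypothesis class $\mathcal{F}$, sample $(x_i, y_i)$ drawn IID from $P$) is an assumption and may differ from what the slides used.

```latex
% True risk: expected loss under the data distribution P
R(f) = \mathbb{E}_{(x,y) \sim P}\,\bigl[\ell(f(x), y)\bigr]

% Empirical risk (empirical loss): average loss on the n training examples
\hat{R}_n(f) = \frac{1}{n} \sum_{i=1}^{n} \ell\bigl(f(x_i), y_i\bigr)

% Empirical risk minimization: choose the hypothesis with the lowest empirical risk
\hat{f}_n = \operatorname*{arg\,min}_{f \in \mathcal{F}} \hat{R}_n(f)
```

The choice of $\mathcal{F}$ is where the inductive bias enters, and $\hat{R}_n$ approximates $R$ only to the extent that the IID assumption holds.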

Bias / Variance

Source: http://bit.ly/2vDfoLp

Source: University of Potsdam

Source: University of Potsdam

Nested CV
From Quora: http://bit.ly/2wvz2aZ
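A minimal pure-Python sketch of nested cross-validation, assuming a toy 1-D k-nearest-neighbour classifier and synthetic data; all names, fold counts and candidate values are illustrative. The inner loop selects the hyperparameter, the outer loop evaluates on data that took no part in that selection.

```python
import random


def make_data(n, rng):
    """Synthetic 1-D two-class data: class 1 is shifted to the right."""
    data = []
    for _ in range(n):
        y = rng.randint(0, 1)
        data.append((rng.gauss(1.5 * y, 1.0), y))
    return data


def knn_predict(train, x, k):
    """Majority vote among the k nearest training points (k is odd)."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:k]
    return 1 if 2 * sum(y for _, y in nearest) > k else 0


def accuracy(train, test, k):
    return sum(knn_predict(train, x, k) == y for x, y in test) / len(test)


def folds(data, n_folds):
    """Yield (train, held_out) pairs for n_folds-fold cross-validation."""
    for i in range(n_folds):
        held_out = data[i::n_folds]
        train = [p for j, p in enumerate(data) if j % n_folds != i]
        yield train, held_out


def cv_score(data, k, n_folds):
    scores = [accuracy(tr, te, k) for tr, te in folds(data, n_folds)]
    return sum(scores) / len(scores)


def nested_cv(data, candidate_ks, outer_folds=5, inner_folds=4):
    outer_scores = []
    for outer_train, outer_test in folds(data, outer_folds):
        # Inner loop: model selection sees the outer training part only.
        best_k = max(candidate_ks, key=lambda k: cv_score(outer_train, k, inner_folds))
        # Outer loop: evaluation on data untouched by the selection step.
        outer_scores.append(accuracy(outer_train, outer_test, best_k))
    return sum(outer_scores) / len(outer_scores)


data = make_data(200, random.Random(0))
estimate = nested_cv(data, candidate_ks=[1, 5, 15])
print(f"nested CV accuracy estimate: {estimate:.3f}")
```

Note that the result estimates the performance of the whole procedure (selection plus fitting), not of one fixed hyperparameter value; different outer folds may pick different values of k.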

Messing up your experiments
•Data split strategy is part of the experiment.
•Mainly care for:
  •Class distribution
  •Problem-domain-relevant issues such as time
“Validation and Test sets should model nature and nature is not accommodating.” --- Data Scientist’s Proverbs
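The two concerns named above can be sketched as two split strategies. This is an illustration under assumptions, not the speaker's code: the helper names and the toy records of the form (timestamp, label) are made up.

```python
import random
from collections import defaultdict


def stratified_split(records, label_of, test_fraction, rng):
    """Random split that keeps the class distribution equal in both parts."""
    by_class = defaultdict(list)
    for record in records:
        by_class[label_of(record)].append(record)
    train, test = [], []
    for members in by_class.values():
        members = members[:]  # copy so the caller's data is not reordered
        rng.shuffle(members)
        n_test = round(len(members) * test_fraction)
        test.extend(members[:n_test])
        train.extend(members[n_test:])
    return train, test


def time_split(records, time_of, test_fraction):
    """Chronological split: train on the past, test on the most recent part."""
    ordered = sorted(records, key=time_of)
    cut = len(ordered) - round(len(ordered) * test_fraction)
    return ordered[:cut], ordered[cut:]


# Toy records (timestamp, label) with a 10% positive class.
records = [(t, 1 if t % 10 == 0 else 0) for t in range(100)]

train, test = stratified_split(records, lambda r: r[1], 0.2, random.Random(0))
past, future = time_split(records, lambda r: r[0], 0.2)

print(sum(label for _, label in test), "positives among", len(test), "test records")
print("latest training time:", past[-1][0], "| earliest test time:", future[0][0])
```

The two strategies can conflict: a chronological split does not guarantee matching class distributions, so which one to use depends on how the model will meet data in deployment.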

“Model evaluation, model selection…” by Sebastian Raschka: http://bit.ly/2p6PGY0
“Approximate Statistical Tests For Comparing Supervised Classification Learning Algorithms” (Dietterich 98): http://bit.ly/2wyItF6

Gallery of Fails

Courier/Terrorist detection in Pakistan
Source: http://bit.ly/1KY4SQE

Feedback loops abused
Tay.ai was a chat bot deployed on Twitter by Microsoft for just a day. Trolls "subverted" the bot, "teaching" it to be politically incorrect through focused exposure to extreme content.

Moral Machine
http://moralmachine.mit.edu

Smaller tips for ML
•Always model uncertainty.
•Read this
•Don’t mock values of a non-existent predictive model.

Other Links
•https://www.ma.utexas.edu/users/mks/statmistakes/StatisticsMistakes.html
•Quantopian Lecture Series: p-Hacking and Multiple Comparison bias https://www.youtube.com/watch?v=YiDfbYtgUPc
•David Hume: A Treatise of Human Nature: http://www.davidhume.org/texts/thn.html

Thanks! Questions?
Github: github.com/uberwach
