Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Make Machine Learning Boring Again: Best Practi...

szilard
July 20, 2019
100

Make Machine Learning Boring Again: Best Practices for Using Machine Learning in Businesses - LA Data Science Meetup - Playa Vista, August 2019

szilard

July 20, 2019
Tweet

More Decks by szilard

Transcript

  1. Make Machine Learning Boring Again: Best Practices for Using Machine

    Learning in Businesses Szilard Pafka, PhD Chief Scientist, Epoch LA Data Science Meetup Aug 2019
  2. Disclaimer: I am not representing my employer (Epoch) in this

    talk I cannot confirm nor deny if Epoch is using any of the methods, tools, results etc. mentioned in this talk
  3. *

  4. 10x

  5. ML training: lots of CPU cores lots of RAM limited

    time ML scoring: separated servers
  6. “people that know what they’re doing just use open source

    [...] the same open source tools that the MLaaS services offer” - Bradford Cross
  7. already pre-processed data less domain knowledge (or deliberately hidden) AUC

    0.0001 increases "relevant" no business metric no actual deployment models too complex no online evaluation no monitoring data leakage
  8. Aggregation 100M rows 1M groups Join 100M rows x 1M

    rows time [s] time [s] “Motherfucka!”
  9. AI?