Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Big Data, Big Quality? by Irene Gonzálvez at Bi...

Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017

Insights can only be as good as the data. The data quality domain is enormously large, so you need to understand your company pain points to know what to focus on first.

https://www.bigdataspain.org/2017/talk/big-data-big-quality

Big Data Spain 2017
November 16th - 17th Kinépolis Madrid

Big Data Spain

December 05, 2017
Tweet

More Decks by Big Data Spain

Other Decks in Technology

Transcript

  1. TC4D: Test Certified for Data Level 1: Set-up, monitoring, alerting

    and documentation Level 2: Data management and Unit tests Level 3: Build your defenses
  2. What’s next? Build an algorithm library for anomaly detection (ML4ALL)

    Provide the infrastructure to ‘plug&play’ more algorithms Provide parameter recommendations to tweak the algorithms
  3. What’s next? Spotify-wide strategy • Have metrics to understand when

    a dataset qualifies as ‘good’ quality. • Identify which datasets are critical/ central to Spotify and make them of ‘good’ quality