Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Data Analysis with Kotlin Notebook, DataFrame, ...

Data Analysis with Kotlin Notebook, DataFrame, and Kandy

Anton Arhipov

September 16, 2024
Tweet

More Decks by Anton Arhipov

Other Decks in Programming

Transcript

  1. De fi ne objective Collect data Clean data Explore data

    Analyze data Interpret the results Communicate fi ndings Implement decisions Monitor decisions
  2. De fi Collect data Clean data Analyze data Interpret the

    results Communicate fi Implement decisions Monitor decisions Exploratory Data Anasysis (EDA) Explore data This is what we are focusing on in this presentation
  3. The streams count is identi fi ed as String. This

    is clearly wrong in this case!
  4. Top 20 artists Plot is an extension function for DataFrame

    Map the data to axes Add some colors
  5. Let's try fi nding correlations between the di ff erent

    attributes related to the streams count
  6. A bit brighter spots are the points of interest The

    outcome: Danceability is useful And you need to have at least some energy levels to be popular
  7. The static accessor was generated for the new column created

    by the aggregate operation It looks like the most popular songs are contained within 90 to 130 BPM range The overall median is 121 BPM The most popular BPM is 120 - 39 occurances
  8. Some (very naive) outcomes: BPM matters: not too slow, not

    too fast, 120 is good The most popular key is C# Is probably G, G#, and D is acceptable too Having at least some energy levels in the song is useful