Road 2. Gather the resource, and Identify its Characteristic 3. Do the Plan, and Evaluate it 4. Market to Public, and Regather the Opinion Preparation Stage Data Related Stage Drive the Solution Stage
action? Impression • Number of Visit • Number of Active Visitors Budget • Cost Per Product • Cost of Acquisition Users • Number of Retain Users • Satisfaction Score
As a farmer Choose planting method: Hydroponics or Aquaponics As a Data Scientist Choose analytics method: Predictive Modeling (ML) or Diagnostic Analysis
If the question is to determine probabilities of an action • Use a Predictive model If the question is to show relationships • Use a Descriptive model If the question requires a yes/no answer • Use a Classification model
a farmer Get to know what is happening on the soil, plants, and the environment As a Data Scientist Get to know, what does the data tell us about the problem, and visualize it
Visualization Discover data trend, pattern, and any other relevancies accordingly Descriptive Statistics Decipher the aggregate information, such as average median, mean, missing value, etc Funnel Analysis Uncover the hidden information
action? As a farmer Prepare the suitable soil for the selected plants, set the growing medium well As a Data Scientist Handle data problem, such as missing values, duplicates, and other
data to solve the problem? Choose ML Model Determine the model based on expected output (prediction or regression) Model Iteration Iterate the modeling process by K- fold Ensemble Model Combine ML models to gain better model accuracy
need to be improved? As a farmer Inspect the plants, is it free from pest/disease? As a Data Scientist Do the model has good fitting accuracy, should it be enchanted?
need to be improved? Model Interpretation Interpret the model result to be understood by other people Model Evaluation Validate the model performance by its problem type (Accuracy, Precision, Recall, RMSE, etc)
As a farmer Gather suggestions/ comments from our customer As a Data Scientist Take many feedbacks from various entity such as end- user, stakeholders, etc.
METRICS BEFORE AFTER DEPLOYMENT ML MODEL Daily Active Users 1000 1600 (+60%) Cost Spent 1 mio/month 500k/month (-50%) Revenue Gain 10 mio 30 mio (+300%) SLA 3 days 2 days (-33%)
Choose the Road • What problem do you want to take action? • Which lane do you prefer to take? Gather the resource, and Identify its Characteristic • What kind of resource do you need? • How do you collect the resource? • What have the resource tell you? • What have to do before doing an action? Do the Plan, and Evaluate it • How do you make a model from your data to solve the problem? • Have the model already answer the problem or need to be improved? Market to Public, and Regather the Opinion • Can you apply the model to the real life? • Is there any input to your business solution?
and Choose the Road • What problem do you want to take action? • Which lane do you prefer to take? Gather the resource, and Identify its Characteristic • What kind of resource do you need? • How do you collect the resource? • What have the resource tell you? • What have to do before doing an action? Do the Plan, and Evaluate it • How do you make a model from your data to solve the problem? • Have the model already answer the problem or need to be improved? Market to Public, and Regather the Opinion • Can you apply the model to the real life? • Is there any input to your business solution?
and Choose the Road • What problem do you want to take action? • Which lane do you prefer to take? Gather the resource, and Identify its Characteristic • What kind of resource do you need? • How do you collect the resource? • What have the resource tell you? • What have to do before doing an action? Do the Plan, and Evaluate it • How do you make a model from your data to solve the problem? • Have the model already answer the problem or need to be improved? Market to Public, and Regather the Opinion • Can you apply the model to the real life? • Is there any input to your business solution?
and Choose the Road • What problem do you want to take action? • Which lane do you prefer to take? Gather the resource, and Identify its Characteristic • What kind of resource do you need? • How do you collect the resource? • What have the resource tell you? • What have to do before doing an action? Do the Plan, and Evaluate it • How do you make a model from your data to solve the problem? • Have the model already answer the problem or need to be improved? Market to Public, and Regather the Opinion • Can you apply the model to the real life? • Is there any input to your business solution?
of how our customer get impression until producing revenue stream Develop Persistently Focus on dropping channel, constantly evaluate with the whole complexity (engineer, ux, data, etc) Data Driven! They evaluate those channel based on data. Analytics is needed to enhance the decision making process here.
belief to double check progress, start from a helicopter view, end to the ant view. Exploration to Action Focus on explore the situation first, define hypothesis based on pain points, develop product to solve, deliver to evaluate Data Driven! From the beginning till the end, they use data to tell the story about our customers
all funnels of business process, set the subjects of every key points to ensure reliability Customer Satisfaction Aside from the streams, this model also focus on customer growth, such how to maintain the relationship, how to segment them Data Driven! Data is always needed to recap every key points of this model
that focuses on continuous releases and incorporating customer feedback with every iteration Scrum and Kanban Scrum is focused on fixed-length project iterations, Kanban is focused on continuous releases. Data Driven! In order to track the process, data is needed to evaluate the process
to do an AB Test” “The Most Significant Data” Aggregated/Interconnected data, which acted as main metrics of the company Retention Cohort of Total Customer from city X Budget Allocation of Product X based on Customer Group
C tomorrow!” “The Data Guru” Interconnected of most significant findings, as a funnel to answer missing gap (Funnel Type) Retention Cohort based on User Type, Country of Origin, etc Combination of Product Type, Location, and Revenue