Data Science 101
Ronojoy Adhikari
September 29, 2015
Presentation at the Data Science 101 workshop at Orangescape.
Transcript
Data Science 101: insight, not numbers
Ronojoy Adhikari, The Institute of Mathematical Sciences, Chennai, India
Orangescape, Chennai, India
Wednesday, 30 September 2015
"The purpose of computing is insight, not numbers." (Richard Hamming)
What is the purpose of data science? Insight, not numbers!
Data science
Data; Domain knowledge; Data curation; Mathematical model; A/B testing; Machine learning; Machine inference; Value from data
1. Problem or question?
"Let the data speak for themselves!" (Ronald Fisher)
"The data cannot speak for themselves; and they never have, in any real problem of inference." (Edwin Jaynes)
Classification: predict class, given attributes
Regression: predict values, given other values
Clustering: group similar things together
Dimensionality reduction: keeping only the relevant variables
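The four task types on this slide can be illustrated with a small standard-library sketch. Nothing below comes from the talk: the toy data, the function names, and the particular methods (1-nearest-neighbour, a least-squares line, two-cluster k-means in one dimension, and dropping constant columns) are assumptions chosen only to make each one-line description concrete.

```python
from statistics import mean, pvariance

# Toy data, invented for illustration only.
# Each row: (height_cm, weight_kg, constant_sensor_reading)
rows = [
    (150.0, 50.0, 1.0),
    (152.0, 53.0, 1.0),
    (180.0, 80.0, 1.0),
    (182.0, 83.0, 1.0),
]
labels = ["small", "small", "large", "large"]


def classify(x, train_x, train_y):
    """Classification: predict a class from attributes (1-nearest neighbour)."""
    def sq_dist(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    nearest = min(range(len(train_x)), key=lambda i: sq_dist(x, train_x[i]))
    return train_y[nearest]


def fit_line(xs, ys):
    """Regression: least-squares slope and intercept for y = a*x + b."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    a = sxy / sxx
    return a, my - a * mx


def two_means(values, iterations=10):
    """Clustering: 1-D k-means with k=2, started from the min and max.

    Assumes the values are not all identical, so neither group is empty.
    """
    c0, c1 = min(values), max(values)
    for _ in range(iterations):
        g0 = [v for v in values if abs(v - c0) <= abs(v - c1)]
        g1 = [v for v in values if abs(v - c0) > abs(v - c1)]
        c0, c1 = mean(g0), mean(g1)
    return sorted(g0), sorted(g1)


def drop_constant_columns(data, tol=1e-12):
    """Dimensionality reduction: keep only the columns that actually vary."""
    cols = list(zip(*data))
    keep = [j for j, col in enumerate(cols) if pvariance(col) > tol]
    return [tuple(row[j] for j in keep) for row in data], keep
```

For instance, `classify((151.0, 52.0, 1.0), rows, labels)` picks the label of the closest training row, and `drop_constant_columns(rows)` discards the third column because it never changes. Real projects would normally reach for a library rather than hand-written versions like these.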
3. Frame a hypothesis (mathematical models)
Bayesian: probability is a state of knowledge
Frequentist: probability is a frequency
Blackbox: ML as a toolbox for processing data
Causal: ML as learning generative models of data
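The contrast between the two readings of probability can be made concrete with a coin-tossing sketch. This example is not from the talk: the function names and the uniform Beta(1, 1) prior are assumptions made for illustration. The frequentist estimate is the observed frequency of heads, while the Bayesian answer is a posterior distribution, here summarised by its mean.

```python
from fractions import Fraction


def frequentist_estimate(heads, tosses):
    """Probability as a frequency: the observed fraction of heads."""
    return Fraction(heads, tosses)


def bayesian_posterior(heads, tosses, prior_a=1, prior_b=1):
    """Probability as a state of knowledge.

    Starting from a Beta(prior_a, prior_b) prior on the chance of heads
    (uniform by default, an assumption of this sketch), the posterior after
    the data is Beta(prior_a + heads, prior_b + tails). Returns the two
    posterior parameters and the posterior mean.
    """
    a = prior_a + heads
    b = prior_b + (tosses - heads)
    return a, b, Fraction(a, a + b)
```

After three heads in three tosses, `frequentist_estimate(3, 3)` is exactly 1, whereas `bayesian_posterior(3, 3)` has mean 4/5: the prior keeps the estimate from claiming certainty on so little data.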
We are building a causal learning and inference engine that will beat the current state of the art!
Thank you for your attention!