Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Software Engineers, the New Data Scientists
Search
Christophe Bourguignat
June 15, 2016
Technology
1
140
Software Engineers, the New Data Scientists
scikit-learn day / PyData Paris / 15.06.16
Christophe Bourguignat
June 15, 2016
Tweet
Share
More Decks by Christophe Bourguignat
See All by Christophe Bourguignat
Adding Neurons to your Assistants
kriss
1
360
"Haute Couture" and "Prêt-à-Porter" Data Science
kriss
0
460
Machine Learning for Chief Future Officers
kriss
1
130
Whitening The Blackbox : Why And How To Explain Machine Learning Predictions ?
kriss
1
1.2k
Building a Data Science Team
kriss
2
410
Lean Machine Learning
kriss
5
770
Kaggle Criteo Challenge and Online Learning
kriss
1
280
The #FrenchData landscape
kriss
0
490
Other Decks in Technology
See All in Technology
DatabricksにOLTPデータベース『Lakebase』がやってきた!
inoutk
0
150
【あのMCPって、どんな処理してるの?】 AWS CDKでの開発で便利なAWS MCP Servers特集
yoshimi0227
6
780
american airlines®️ USA Contact Numbers: Complete 2025 Support Guide
supportflight
1
120
公開初日に Gemini CLI を試した話や FFmpeg と組み合わせてみた話など / Gemini CLI 初学者勉強会(#AI道場)
you
PRO
0
1k
関数型プログラミングで 「脳がバグる」を乗り越える
manabeai
2
220
Operating Operator
shhnjk
1
660
QuickSight SPICE の効果的な運用戦略~S3 + Athena 構成での実践ノウハウ~/quicksight-spice-s3-athena-best-practices
emiki
0
260
[SRE NEXT] ARR150億円_エンジニア140名_27チーム_17プロダクトから始めるSLO.pdf
satos
5
2.3k
Amplify Gen2から知るAWS CDK Toolkit Libraryの使い方/How to use the AWS CDK Toolkit Library as known from Amplify Gen2
fossamagna
1
280
Enhancing SaaS Product Reliability and Release Velocity through Optimized Testing Approach
ropqa
1
260
SEQUENCE object comparison - db tech showcase 2025 LT2
nori_shinoda
0
290
データ基盤からデータベースまで?広がるユースケースのDatabricksについて教えるよ!
akuwano
3
160
Featured
See All Featured
Scaling GitHub
holman
460
140k
Balancing Empowerment & Direction
lara
1
440
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
281
13k
The World Runs on Bad Software
bkeepers
PRO
69
11k
BBQ
matthewcrist
89
9.7k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
29
9.6k
Embracing the Ebb and Flow
colly
86
4.7k
Thoughts on Productivity
jonyablonski
69
4.7k
The Invisible Side of Design
smashingmag
301
51k
YesSQL, Process and Tooling at Scale
rocio
173
14k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
8
700
Evolution of real-time – Irina Nazarova, EuRuKo, 2024
irinanazarova
8
830
Transcript
Christophe Bourguignat @chris_bour / @zelrosHQ scikit-learn day / PyData Paris
/ 15.06.16
None
MODERN EXAMPLE (1936)
SOLUTION 1 : USE EXCEL (easy)
SOLUTION 2 : CODE A SIMPLE BUT ROBUST ALGORITHM (medium)
SOLUTION 3 : DIVE INTO THEORY AND IMPLEMENT (hard) Gilles
Louppe
Hopefully Scikit-Learn Does Exist (and this changes everything)
Machine Learning Without Learning the Machinery
100 000
“I put one of my courses online and it reached
an audience of 100 000 students”
250
“To reach a comparable sized audience I would have had
to teach at Stanford for 250 years”
Andrew Ng
https://www.google.fr/trends/explore#q=kaggle Kaggle Creation
Scikit-learn Moocs Kaggle Time for a Venn Diagram !
Machine Learning Has Become a Meritocracy
WHO IS THE MASS POPULATION BEST PLACED TO LEVERAGE THIS
MERITOCRACY ?
WHO IS THE MASS POPULATION BEST PLACED TO LEVERAGE THIS
MERITOCRACY ?
None
Gartner, 2016
Many Scikit-Learn contributors are themselves software engineers Analysis of 40
scikit-learn contributors Linkedin Profiles
Who will be the New New Data Scientist ?
Citizen Data Scientist
None
TOP 5%
We will all be kind of data scientists (in a
way or an other)
None
We are hiring Data Scientists Software Engineers
[email protected]
Wanting to
become
@chris_bour / @zelrosHQ