Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Food Image Object Detection and Classification
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Leszek Rybicki
February 16, 2017
Research
15k
2
Share
Food Image Object Detection and Classification
Part 1: Detection
Leszek Rybicki
February 16, 2017
More Decks by Leszek Rybicki
See All by Leszek Rybicki
Let's talk about Fakes
lunardog
0
160
How to Patch Image Classifiers
lunardog
0
2.6k
Towards Realistic Predictors - EN
lunardog
0
2.5k
Towards Realistic Predictors
lunardog
1
2.4k
Deep Learning Hot Dog Detector
lunardog
0
300
Finding beans in burgers: paper reading notes
lunardog
0
1.8k
Kelner: Serve Your Models
lunardog
0
140
Image Analysis at Cookpad
lunardog
1
1.9k
Kelner: serve your models
lunardog
1
410
Other Decks in Research
See All in Research
データサイエンティストの業務変化
datascientistsociety
PRO
0
350
機械学習で作った ポケモン対戦bot で 遊ぼう!
fufufukakaka
0
120
The mathematics of transformers
gpeyre
0
180
生成AI による論文執筆サポート・ワークショップ 論文執筆・推敲編 / Generative AI-Assisted Paper Writing Support Workshop: Drafting and Revision Edition
ks91
PRO
0
170
[SITA2025 Workshop] 空中計算による高速・低遅延な分散回帰分析
k_sato
0
140
【SIGGRAPH Asia 2025】Lo-Fi Photograph with Lo-Fi Communication
toremolo72
0
140
都市交通マスタープランとその後への期待@熊本商工会議所・熊本経済同友会
trafficbrain
0
180
【NICOGRAPH2025】Photographic Conviviality: ボディペイント・ワークショップによる 同時的かつ共生的な写真体験
toremolo72
0
210
COFFEE-Japan PROJECT Impact Report(海ノ向こうコーヒー)
ontheslope
0
1.2k
SREのためのテレメトリー技術の探究 / Telemetry for SRE
yuukit
13
3.5k
教師あり学習と強化学習で作る 最強の数学特化LLM
analokmaus
2
1k
Multi-Agent Large Language Models for Code Intelligence: Opportunities, Challenges, and Research Directions
fatemeh_fard
0
150
Featured
See All Featured
Music & Morning Musume
bryan
47
7.1k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.7k
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
11
880
Bash Introduction
62gerente
615
210k
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
Chrome DevTools: State of the Union 2024 - Debugging React & Beyond
addyosmani
10
1.1k
Easily Structure & Communicate Ideas using Wireframe
afnizarnur
194
17k
Building AI with AI
inesmontani
PRO
1
860
The SEO identity crisis: Don't let AI make you average
varn
0
430
Java REST API Framework Comparison - PWX 2021
mraible
34
9.2k
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
1
470
Paper Plane (Part 1)
katiecoart
PRO
0
6.4k
Transcript
Food Image Object Detection and Classification Challenges and Solutions
Part 1: Detection
自己紹介 • リビツキ レシェック • ポーランド出身 • 2016~ クックパッド • github:
lunardog
Warning! This presentation contains images that may cause severe drooling
and stomach grumbling. @cookpad
History 歴史
ImageNet KWWSLPDJHQHWRUJ
ImageNet Large Scale Visual Recognition Competition KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
ILSVRC 2010 task Classification )RUHDFKLPDJHDOJRULWKPV ZLOOSURGXFHDOLVWRIDWPRVW REMHFWFDWHJRULHVLQWKH GHVFHQGLQJRUGHURI FRQILGHQFH KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
ILSVRC 2011 tasks 1. Classification 2. *Classification with localization *tester
task
KWWSFVQVWDQIRUGHGXV\OODEXVKWPO Classification + Localization
ILSVRC 2012 tasks 1. Classification 2. Classification with localization 3.
Fine-grained classification
Fine-grained classification KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
AlexNet ,PDJHQHWFODVVLILFDWLRQZLWKGHHSFRQYROXWLRQDOQHXUDOQHWZRUNV $.UL]KHYVN\,6XWVNHYHU*(+LQWRQ$GYDQFHVLQQHXUDOLQIRUPDWLRQ SURFHVVLQJV\VWHPV
ILSVRC 2013 tasks 1. Detection 2. Classification 3. Classification with
localization
ILSVRC 2014 tasks 1. Detection 2. Classification 3. Classification with
localization
Object Detection KWWSFVQVWDQIRUGHGXV\OODEXVKWPO
Deep Learning KWWSVGHYEORJVQYLGLDFRP
ILSVRC 2015 tasks 1. Object detection 2. Object localization 3.
*Object detection from video 4. *Scene classification
ILSVRC 2016 tasks 1. Object localization 2. Object detection 3.
Object detection from video 4. Scene classification 5. Scene parsing
Cookpad 2016
画像データセット 1997年~ レシピ数:国内約260万 + 国外 + つくれぽ + 手順写真 17言語、60カ国
※数字は2017年02月時点のものです
画像解析の研究関心 • これは料理ですか? • どの料理ですか? • 料理はどこですか? • 。。。 Part
2
Where is the food? 料理はどこですか?
ゴール )LQGIRRGLQWKHLPDJHGUDZ DERXQGLQJER[DURXQGWKH IRRGLWHPLQFOXGLQJWKH GLVKLIYLVLEOH
,IWKHUHDUHPXOWLSOHLWHPV GUDZDERXQGLQJER[ DURXQGHDFKRQH ゴール
ground truth bounding box > 0.9 We count it as
a positive detection if Intersection over Union ratio is greater than 0.9. ƴ
QXPEHURIWUXHSRVLWLYHV QXPEHURIJURXQGWUXWKER[HV ƴ ƴ ƴ QXPEHURIWUXHSRVLWLYHV QXPEHURIJHQHUDWHGER[HV 再現率 (precision) (recall)
ƴ ƴ
Methods
1. Build a classifier 2. Pick Regions of Interest 3.
Run classifier on each region 4. Remove duplicate detections IDEA
Fast, Faster R-CNN 5LFKIHDWXUHKLHUDUFKLHVIRUDFFXUDWHREMHFWGHWHFWLRQDQGVHPDQWLFVHJPHQWDWLRQ 5RVV*LUVKLFN-HII'RQDKXH7UHYRU'DUUHOO-LWHQGUD0DOLN )DVWHU5&117RZDUGV5HDO7LPH2EMHFW'HWHFWLRQZLWK5HJLRQ3URSRVDO1HWZRUNV 6KDRTLQJ5HQ.DLPLQJ+H5RVV*LUVKLFN-LDQ6XQ
)DVW5&11 5RVV*LUVKLFN
問題 1. Computational cost 2. Context is important 3. ...but
context can be confusing. KDQG IRRG JUDVV IRRG KWWSSL[DED\FRP
Single Shot Detector 66'6LQJOH6KRW0XOWL%R['HWHFWRU :HL/LX'UDJRPLU$QJXHORY'XPLWUX(UKDQ&KULVWLDQ6]HJHG\ 6FRWW5HHG&KHQJ<DQJ)X$OH[DQGHU&%HUJ
Either The Least Or Most Employable Person Ever 7KH+XIILQJWRQ3RVW JLWKXEFRPSMUHGGLH
SMUHGGLHFRPGDUNQHW ZZZNDJJOHFRPSMUHGGLH Joseph Redmon
You Only Look Once <RX2QO\/RRN2QFH8QLILHG 5HDO7LPH2EMHFW'HWHFWLRQ -RVHSK5HGPRQ6DQWRVK'LYYDOD5RVV *LUVKLFN$OL)DUKDGL 'HF
<2/2%HWWHU)DVWHU 6WURQJHU -RVHSK5HGPRQ$OL)DUKDGL
<RX2QO\/RRN2QFH8QLILHG5HDO7LPH2EMHFW'HWHFWLRQ -RVHSK5HGPRQ6DQWRVK'LYYDOD5RVV*LUVKLFN$OL)DUKDGL YOLO in Context
None