Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Food Image Object Detection and Classification
Search
Leszek Rybicki
February 16, 2017
Research
2
15k
Food Image Object Detection and Classification
Part 1: Detection
Leszek Rybicki
February 16, 2017
Tweet
Share
More Decks by Leszek Rybicki
See All by Leszek Rybicki
Let's talk about Fakes
lunardog
0
140
How to Patch Image Classifiers
lunardog
0
2.2k
Towards Realistic Predictors - EN
lunardog
0
2.1k
Towards Realistic Predictors
lunardog
1
2.2k
Deep Learning Hot Dog Detector
lunardog
0
270
Finding beans in burgers: paper reading notes
lunardog
0
1.6k
Kelner: Serve Your Models
lunardog
0
120
Image Analysis at Cookpad
lunardog
1
1.8k
Kelner: serve your models
lunardog
1
390
Other Decks in Research
See All in Research
時系列データに対する解釈可能な 決定木クラスタリング
mickey_kubo
2
1k
GPUを利用したStein Particle Filterによる点群6自由度モンテカルロSLAM
takuminakao
0
340
PhD Defense 2025: Visual Understanding of Human Hands in Interactions
tkhkaeio
1
240
When Submarine Cables Go Dark: Examining the Web Services Resilience Amid Global Internet Disruptions
irvin
0
320
Combinatorial Search with Generators
kei18
0
880
AWSで実現した大規模日本語VLM学習用データセット "MOMIJI" 構築パイプライン/buiding-momiji
studio_graph
2
630
RHO-1: Not All Tokens Are What You Need
sansan_randd
1
180
2021年度-基盤研究B-研究計画調書
trycycle
PRO
0
330
AIスパコン「さくらONE」のLLM学習ベンチマークによる性能評価 / SAKURAONE LLM Training Benchmarking
yuukit
2
640
集合間Bregmanダイバージェンスと置換不変NNによるその学習
wasyro
0
150
単施設でできる臨床研究の考え方
shuntaros
0
3k
【輪講資料】Moshi: a speech-text foundation model for real-time dialogue
hpprc
3
720
Featured
See All Featured
Rails Girls Zürich Keynote
gr2m
95
14k
Side Projects
sachag
455
43k
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
9
570
Performance Is Good for Brains [We Love Speed 2024]
tammyeverts
12
1.1k
Faster Mobile Websites
deanohume
310
31k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
285
14k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
52
5.6k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
234
17k
Mobile First: as difficult as doing things right
swwweet
224
9.9k
The Language of Interfaces
destraynor
162
25k
Easily Structure & Communicate Ideas using Wireframe
afnizarnur
194
16k
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
Transcript
Food Image Object Detection and Classification Challenges and Solutions
Part 1: Detection
自己紹介 • リビツキ レシェック • ポーランド出身 • 2016~ クックパッド • github:
lunardog
Warning! This presentation contains images that may cause severe drooling
and stomach grumbling. @cookpad
History 歴史
ImageNet KWWSLPDJHQHWRUJ
ImageNet Large Scale Visual Recognition Competition KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
ILSVRC 2010 task Classification )RUHDFKLPDJHDOJRULWKPV ZLOOSURGXFHDOLVWRIDWPRVW REMHFWFDWHJRULHVLQWKH GHVFHQGLQJRUGHURI FRQILGHQFH KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
ILSVRC 2011 tasks 1. Classification 2. *Classification with localization *tester
task
KWWSFVQVWDQIRUGHGXV\OODEXVKWPO Classification + Localization
ILSVRC 2012 tasks 1. Classification 2. Classification with localization 3.
Fine-grained classification
Fine-grained classification KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
AlexNet ,PDJHQHWFODVVLILFDWLRQZLWKGHHSFRQYROXWLRQDOQHXUDOQHWZRUNV $.UL]KHYVN\,6XWVNHYHU*(+LQWRQ$GYDQFHVLQQHXUDOLQIRUPDWLRQ SURFHVVLQJV\VWHPV
ILSVRC 2013 tasks 1. Detection 2. Classification 3. Classification with
localization
ILSVRC 2014 tasks 1. Detection 2. Classification 3. Classification with
localization
Object Detection KWWSFVQVWDQIRUGHGXV\OODEXVKWPO
Deep Learning KWWSVGHYEORJVQYLGLDFRP
ILSVRC 2015 tasks 1. Object detection 2. Object localization 3.
*Object detection from video 4. *Scene classification
ILSVRC 2016 tasks 1. Object localization 2. Object detection 3.
Object detection from video 4. Scene classification 5. Scene parsing
Cookpad 2016
画像データセット 1997年~ レシピ数:国内約260万 + 国外 + つくれぽ + 手順写真 17言語、60カ国
※数字は2017年02月時点のものです
画像解析の研究関心 • これは料理ですか? • どの料理ですか? • 料理はどこですか? • 。。。 Part
2
Where is the food? 料理はどこですか?
ゴール )LQGIRRGLQWKHLPDJHGUDZ DERXQGLQJER[DURXQGWKH IRRGLWHPLQFOXGLQJWKH GLVKLIYLVLEOH
,IWKHUHDUHPXOWLSOHLWHPV GUDZDERXQGLQJER[ DURXQGHDFKRQH ゴール
ground truth bounding box > 0.9 We count it as
a positive detection if Intersection over Union ratio is greater than 0.9. ƴ
QXPEHURIWUXHSRVLWLYHV QXPEHURIJURXQGWUXWKER[HV ƴ ƴ ƴ QXPEHURIWUXHSRVLWLYHV QXPEHURIJHQHUDWHGER[HV 再現率 (precision) (recall)
ƴ ƴ
Methods
1. Build a classifier 2. Pick Regions of Interest 3.
Run classifier on each region 4. Remove duplicate detections IDEA
Fast, Faster R-CNN 5LFKIHDWXUHKLHUDUFKLHVIRUDFFXUDWHREMHFWGHWHFWLRQDQGVHPDQWLFVHJPHQWDWLRQ 5RVV*LUVKLFN-HII'RQDKXH7UHYRU'DUUHOO-LWHQGUD0DOLN )DVWHU5&117RZDUGV5HDO7LPH2EMHFW'HWHFWLRQZLWK5HJLRQ3URSRVDO1HWZRUNV 6KDRTLQJ5HQ.DLPLQJ+H5RVV*LUVKLFN-LDQ6XQ
)DVW5&11 5RVV*LUVKLFN
問題 1. Computational cost 2. Context is important 3. ...but
context can be confusing. KDQG IRRG JUDVV IRRG KWWSSL[DED\FRP
Single Shot Detector 66'6LQJOH6KRW0XOWL%R['HWHFWRU :HL/LX'UDJRPLU$QJXHORY'XPLWUX(UKDQ&KULVWLDQ6]HJHG\ 6FRWW5HHG&KHQJ<DQJ)X$OH[DQGHU&%HUJ
Either The Least Or Most Employable Person Ever 7KH+XIILQJWRQ3RVW JLWKXEFRPSMUHGGLH
SMUHGGLHFRPGDUNQHW ZZZNDJJOHFRPSMUHGGLH Joseph Redmon
You Only Look Once <RX2QO\/RRN2QFH8QLILHG 5HDO7LPH2EMHFW'HWHFWLRQ -RVHSK5HGPRQ6DQWRVK'LYYDOD5RVV *LUVKLFN$OL)DUKDGL 'HF
<2/2%HWWHU)DVWHU 6WURQJHU -RVHSK5HGPRQ$OL)DUKDGL
<RX2QO\/RRN2QFH8QLILHG5HDO7LPH2EMHFW'HWHFWLRQ -RVHSK5HGPRQ6DQWRVK'LYYDOD5RVV*LUVKLFN$OL)DUKDGL YOLO in Context
None