Food Image Object Detection and Classification
Leszek Rybicki
February 16, 2017
Part 1: Detection
Transcript
Food Image Object Detection and Classification Challenges and Solutions
Part 1: Detection
About me • Leszek Rybicki • From Poland • At Cookpad since 2016 • github: lunardog
Warning! This presentation contains images that may cause severe drooling
and stomach grumbling. @cookpad
History
ImageNet http://image-net.org
ImageNet Large Scale Visual Recognition Competition http://www.image-net.org/challenges/LSVRC/
ILSVRC 2010 task: Classification. For each image, algorithms will produce a list of at most 5 object categories in the descending order of confidence. http://www.image-net.org/challenges/LSVRC/
ILSVRC 2011 tasks 1. Classification 2. *Classification with localization (*taster task)
Classification + Localization http://cs231n.stanford.edu/syllabus.html
ILSVRC 2012 tasks 1. Classification 2. Classification with localization 3.
Fine-grained classification
Fine-grained classification http://www.image-net.org/challenges/LSVRC/
AlexNet: ImageNet classification with deep convolutional neural networks. A. Krizhevsky, I. Sutskever, G. E. Hinton. Advances in Neural Information Processing Systems
ILSVRC 2013 tasks 1. Detection 2. Classification 3. Classification with
localization
ILSVRC 2014 tasks 1. Detection 2. Classification 3. Classification with
localization
Object Detection http://cs231n.stanford.edu/syllabus.html
Deep Learning https://devblogs.nvidia.com
ILSVRC 2015 tasks 1. Object detection 2. Object localization 3.
*Object detection from video 4. *Scene classification
ILSVRC 2016 tasks 1. Object localization 2. Object detection 3.
Object detection from video 4. Scene classification 5. Scene parsing
Cookpad 2016
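Image dataset: since 1997. Recipes: about 2.6 million in Japan + overseas + tsukurepo (cooking reports) + step-by-step photos. 17 languages, 60 countries.
*Figures are as of February 2017.
Research interests in image analysis • Is this food? • Which dish is it? • Where is the food? • ... Part 2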
Where is the food?
Goal: Find food in the image and draw a bounding box around the food item, including the dish if visible.
Goal: If there are multiple items, draw a bounding box around each one.
Ground truth vs. bounding box: IoU > 0.9. We count a detection as positive if the Intersection over Union (IoU) ratio between the generated box and the ground-truth box is greater than 0.9.
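The Intersection over Union criterion can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not code from the talk: the `(x_min, y_min, x_max, y_max)` box format and the function names `iou` and `is_positive` are hypothetical; only the 0.9 threshold comes from the slide.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes.

    Boxes are assumed to be (x_min, y_min, x_max, y_max) tuples.
    """
    # Corners of the overlapping rectangle (if any).
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])

    # Width and height are clamped at zero when the boxes do not overlap.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0


def is_positive(detected, ground_truth, threshold=0.9):
    """True if the detection overlaps the ground truth strictly above threshold."""
    return iou(detected, ground_truth) > threshold
```

With a threshold this strict, even a detection that misses one fifth of the ground-truth box is rejected, for example `is_positive((2, 0, 10, 10), (0, 0, 10, 10))` is false because the IoU is 0.8.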
Recall (再現率) = number of true positives / number of ground-truth boxes
Precision = number of true positives / number of generated boxes
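Precision and recall for one image can be computed along these lines. This is a sketch under assumptions rather than the evaluation code behind the slides: boxes are assumed to be `(x_min, y_min, x_max, y_max)` tuples, each ground-truth box is matched to at most one generated box (a greedy matching chosen here for simplicity), and the names `iou` and `precision_recall` are hypothetical.

```python
def iou(a, b):
    """Intersection over Union of two (x_min, y_min, x_max, y_max) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def precision_recall(generated, ground_truth, threshold=0.9):
    """Return (precision, recall) for lists of generated and ground-truth boxes."""
    matched = set()        # indices of ground-truth boxes already claimed
    true_positives = 0

    for box in generated:
        best_idx, best_iou = None, threshold
        for idx, gt in enumerate(ground_truth):
            if idx in matched:
                continue   # a ground-truth box counts only once
            score = iou(box, gt)
            if score > best_iou:
                best_idx, best_iou = idx, score
        if best_idx is not None:
            matched.add(best_idx)
            true_positives += 1

    # Empty lists yield 0.0 instead of dividing by zero.
    precision = true_positives / len(generated) if generated else 0.0
    recall = true_positives / len(ground_truth) if ground_truth else 0.0
    return precision, recall
```

Under this matching rule a duplicate detection of an already matched item counts as a false positive, so it lowers precision without changing recall.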
Methods
IDEA: 1. Build a classifier 2. Pick Regions of Interest 3. Run classifier on each region 4. Remove duplicate detections
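The last step, removing duplicate detections, is commonly done with greedy non-maximum suppression. The slides do not say which method was used, so the sketch below is an assumption: the `(box, score)` input format, the 0.5 overlap threshold, and the names `iou` and `non_max_suppression` are all hypothetical.

```python
def iou(a, b):
    """Intersection over Union of two (x_min, y_min, x_max, y_max) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(detections, iou_threshold=0.5):
    """Keep the highest-scoring box among heavily overlapping detections.

    detections: list of (box, score) pairs, e.g. classifier outputs per region.
    Returns the kept pairs, ordered from highest to lowest score.
    """
    ordered = sorted(detections, key=lambda d: d[1], reverse=True)
    kept = []
    for box, score in ordered:
        # Keep a box only if it does not overlap any already kept box too much.
        if all(iou(box, kept_box) <= iou_threshold for kept_box, _ in kept):
            kept.append((box, score))
    return kept
```

A lower `iou_threshold` suppresses more aggressively, which can merge neighbouring dishes into one detection; a higher one leaves more near-duplicates.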
Fast, Faster R-CNN. Rich feature hierarchies for accurate object detection and semantic segmentation. Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun
Fast R-CNN. Ross Girshick
Problems 1. Computational cost 2. Context is important 3. ...but context can be confusing. (Image labels: hand, food, grass, food) http://pixabay.com
Single Shot Detector. SSD: Single Shot MultiBox Detector. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg
Joseph Redmon: "Either The Least Or Most Employable Person Ever" (The Huffington Post). github.com/pjreddie, pjreddie.com/darknet, www.kaggle.com/pjreddie
You Only Look Once. You Only Look Once: Unified, Real-Time Object Detection. Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi. Dec
YOLO9000: Better, Faster, Stronger. Joseph Redmon, Ali Farhadi
YOLO in Context. You Only Look Once: Unified, Real-Time Object Detection. Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi