Upgrade to Pro — share decks privately, control downloads, hide ads and more …

画像検索 (特定物体認識) — 古典手法、マッチング、深層学習、Kaggle

@smly
July 27, 2018

画像検索 (特定物体認識) — 古典手法、マッチング、深層学習、Kaggle

(7/24) 大阪大学大学院情報科学研究科、ビッグデータ解析のゲストスピーカー担当分講義の資料をアップしました。画像検索とコンテストの話です。

@smly

July 27, 2018
Tweet

More Decks by @smly

Other Decks in Technology

Transcript

  1. ࣗݾ঺հ ϦΫϧʔτςΫϊϩδʔζͷϥϘͰ%"3ͷݚڀ։ൃΛ͍ͯ͠·͢ɻ ݚڀ։ൃ΍ٕज़ΞυόΠβʔͷձࣾΛܦӦ͍ͯ͠·͢ɻ ⾣ ,BHHMFྺ ೥ (SBOENBTUFS )JHIFTUSBOLUI $VSSFOUUI 

    ⾣ ,BHHMF 5PQpOJTIFTY 1SJ[FY ˞ࠓ೥5PQ͸ճ ୯ಠճ  ⾣ 5PQ$PEFS.BSBUIPO.BUDI 8*/4  ⾣ "$.,%% ,%%$VQTUQSJ[FXJOOFS 2/125
  2. "HFOEB ը૾ݕࡧ ಛఆ෺ମೝࣝ ʹ͍ͭͯ঺հ͠·͢ɻ ⾣ 0WFSWJFX$MBTTJDBM"QQSPBDI   ⾣ -PDBM%FTDSJQUPS

    ⾣ *OEFY4FBSDI ⾣ .BUDIJOH ⾣ %FFQ-FBSOJOH5SFOET   ⾣ $//CBTFE-PDBM%FTDSJQUPS ⾣ $//CBTFE(MPCBM%FTDSJQUPS ⾣ $713`84PG-BSHFTDBMF-BOENBSL3FUSJFWBM$IBMMFOHF 4/125 全体的に基礎的な話題。
 後半に少しだけコンテストと
 発展的な話題。
  3. ⾣ 0WFSWJFX$MBTTJDBM"QQSPBDI   ⾣ -PDBM%FTDSJQUPS ⾣ *OEFY4FBSDI ⾣ .BUDIJOH

    ⾣ %FFQ-FBSOJOH5SFOET   ⾣ $//CBTFE-PDBM%FTDSJQUPS ⾣ $//CBTFE(MPCBM%FTDSJQUPS ⾣ $713`84PG-BSHFTDBMF-BOENBSL3FUSJFWBM$IBMMFOHF "HFOEB ը૾ݕࡧ ಛఆ෺ମೝࣝ ʹ͍ͭͯ঺հ͠·͢ɻ 5/125
  4. ݹయతͳಛఆ෺ମೝࣝͷΞϓϩʔν ⾣  ը૾ू߹͔Βہॴಛ௃ -PDBMEFTDSJQUPS Λநग़ ⾣  ہॴಛ௃͔ΒݕࡧΠϯσοΫεΛ࡞੒ ⾣

     ΫΤϦը૾͔Βہॴಛ௃Λநग़ͯ͠ΠϯσοΫε͔ΒީิΛऔಘ ⾣  ہॴతͳಛ௃ͷ഑ஔʹزԿతͳ੔߹ੑ͕ͱΕ͍ͯΔ͜ͱΛ֬ೝ DB DB ը૾ू߹ ΫΤϦը૾ ݕࡧ
 ΠϯσοΫε ఏࣔީิ ݕࡧ
 ΠϯσοΫε     ہॴతͳಛ௃Λநग़ͯ͠ݕࡧΠϯσοΫεߏஙɻ<4JWJD -PXF +ÉHPV ʜ> 15/125
  5. ݹయతͳಛఆ෺ମೝࣝͷΞϓϩʔν ⾣  ը૾ू߹͔Βہॴಛ௃ -PDBMEFTDSJQUPS Λநग़ ⾣  ہॴಛ௃͔ΒݕࡧΠϯσοΫεΛ࡞੒ ⾣

     ΫΤϦը૾͔Βہॴಛ௃Λநग़ͯ͠ΠϯσοΫε͔ΒީิΛऔಘ ⾣  ہॴతͳಛ௃ͷ഑ஔʹزԿతͳ੔߹ੑ͕ͱΕ͍ͯΔ͜ͱΛ֬ೝ DB DB ը૾ू߹ ΫΤϦը૾ ݕࡧ
 ΠϯσοΫε ఏࣔީิ ݕࡧ
 ΠϯσοΫε     ہॴతͳಛ௃Λநग़ͯ͠ݕࡧΠϯσοΫεߏஙɻ<4JWJD -PXF +ÉHPV ʜ> 16/125
  6. -PDBM%FTDSJQUPSͷݕग़ͱهड़ ը૾͔Βہॴతͳؔ৺ྖҬͷ࠲ඪʢΩʔϙΠϯτʣΛݕग़͠ɺ
 ؔ৺ྖҬͷύλʔϯɾಛ௃Λهड़͢Δɻ x0 = (72 9 73 57 79

    4 38 11 69 57 . . . ) x1 = (21 15 42 3 99 97 64 16 97 88 . . . ) x2 = (11 87 80 12 30 15 92 91 28 1 . . . ) x3 = (38 77 67 98 43 2 9 16 17 32 . . . ) ݕग़͞Εͨؔ৺ྖҬɾ࠲ඪ ؔ৺ྖҬͷهड़ キーポイント Key point 記述・特徴量量 Descriptor 17/125
  7. ͲͷΑ͏ͳؔ਺GΛ༻͍Δ͔ʁ εέʔϧෆมͳؔ਺͸࣍ͷੑ࣭͕͋Δͱ๬·͍͠ɿ ⾣ ࡱӨ৚݅ͷมԽʹରͯ͠ɺؔ਺͕҆ఆతͰ͋Δ ⾣ ୯ҰͷӶ͍ϐʔΫͷ͋ΔࢁͷܗΛͨؔ͠਺ ୯ๆੑ  ⾣ ಉ͡ہॴྖҬͰಉ͡εέʔϧ͕ಘΒΕΔ

    <4DINJE>Ͱ͸ݕग़ثͱͯ͠-BQMBDJBOPG(BVTTJBO -P( ɺ͢ͳΘͪ Ψ΢γΞϯͷϥϓϥγΞϯͷ৞ࠐΈॲཧ͕ྑ͍ͱධՁݚڀ͕͞Ε͍ͯΔɻ $4DINJE 3.PIS BOE$#BVDLIBHF &WBMVBUJPOPG*OUFSFTU1PJOU%FUFDUPST  *OUFSOBUJPOBM+PVSOBMPG$PNQVUFS7JTJPO   QQ  30/125
  8. -BQMBDJBOPG(BVTTJBO -P( Ψ΢γΞϯʹରͯ͠ϥϓϥε࡞༻ૉͰྠֲΛڧௐͨ͠΋ͷɻ Ψ΢γΞϯ % ೖྗʹରͯ͠৞ΈࠐΈԋࢉΛ༻͍Δɻ Ψ΢γΞϯΛͰภඍ෼ͯ͠ɺ ֊ภඍ෼͢Δͱ ʹରͯ͠΋ಉ༷ʹ֊ภඍ෼ͯ͠·ͱΊΔͱɺ *ೖྗը૾


    ͸৞ΈࠐΈԋࢉ ʢೋ֊ภඍ෼͢Δ͜ͱͰɺ໌҉ͷ͕ࠩɺࢁͱ୩ʹͳΔʣ [G ] LoG ⌘ [G (x, y)] = @2 @x2 G (x, y) + @2 @y2 G (x, y) = x2 + y2 2 2 4 e (x2+y2) 2 2 G (x, y) = 1 p 2⇡ 2 e( x2+y2 2 2 ) 導出 @ @x G (x, y) = @ @x e (x2+y2) 2 2 = x 2 e (x2+y2)2 2 @2 @2x G (x, y) = x2 4 e (x2+y2) 2 2 1 2 e (x2+y2) 2 2 = x2 2 4 e (x2+y2) 2 2 x y [G (x, y) ⇤ I] = [ G (x, y)] ⇤ I = LoG ⇤ I 31/125
  9. ิ଍ɿը૾ %σʔλ ͷ৞ΈࠐΈԋࢉ -P(͸৞ΈࠐΈԋࢉʢίϯϘϦϡʔγϣϯʣΛߦ͏͜ͱͰը૾શମʹ ద༻͢Δɻؔ਺,Λ࠲ඪҠಈ͠ͳ͕Βؔ਺*ͱॏͶ଍͠߹ΘͤΔ̈ɻ ৞ΈࠐΈΛߦ͏ؔ਺, Y Z ೖྗը૾* Y

    Z ରԠ͢Δཁૉ͝ͱʹੵΛͱΓɺ
 ݁ՌΛ଍͠߹ΘͤΔɻ [G ] ͢΂ͯͷ஫໨ըૉ
 Y Z Ͱܭࢉ I22 I23 I24 I32 I33 I34 I42 I43 I44 K11 K12 K13 K21 K22 K23 K31 K32 K33 ݁Ռ4 Y Z ৞ΈࠐΈԋࢉʢίϯϘϦϡʔγϣϯʣ ʢ̈৞ΈࠐΈԋࢉ͸ɺ্Լࠨӈ͕qJQͨ͠ΧʔωϧͱͷॏͶ߹Θͤɻֶश͢Δ্Ͱ͸ͲͪΒ΋ҧ͍ͳ͍ͨΊɺ؆ུԽͷͨΊʹqJQ͠ͳ͍DSPTTDPSSFMBUJPOͰ࣮૷͞ΕΔ͜ͱ͕ଟ͍ʣ S33 = I22K33 + I32K23 + I42K13 + . . . S(x, y) = (I ⇤ K)(x, y) = X m X n I(x m, y n)K(m, n) 33/125
  10. Input image = 1.00 = 20.0 = 15.7 = 3.11

    = 7.33 = 11.5 スケール係数を変えて LoG 適⽤用。 スケール空間を作る。 4UFQεέʔϧۭؒͷ࡞੒ 48/125
  11. Input image Foot-print = 1.00 = 20.0 = 15.7 =

    3.11 = 7.33 = 11.5 1FBL Y Z М 1FBL      4UFQہॴ࠷େ஋Λݟ͚ͭΔ 50/125
  12. Input image Foot-print = 1.00 = 20.0 = 15.7 =

    3.11 = 7.33 = 11.5 1FBL Y Z М 1FBL      4UFQہॴ࠷େ஋Λݟ͚ͭΔ 51/125
  13. Input image Foot-print = 1.00 = 20.0 = 15.7 =

    3.11 = 7.33 = 11.5 1FBL Y Z М 1FBL      4UFQہॴ࠷େ஋Λݟ͚ͭΔ 51/125
  14. Input image Foot-print = 1.00 = 20.0 = 15.7 =

    3.11 = 7.33 = 11.5 1FBL Y Z М 1FBL      4UFQہॴ࠷େ஋Λݟ͚ͭΔ 52/125
  15. Input image Foot-print = 1.00 = 20.0 = 15.7 =

    3.11 = 7.33 = 11.5 1FBL Y Z М 1FBL      4UFQہॴ࠷େ஋Λݟ͚ͭΔ 54/125
  16. Input image Foot-print = 1.00 = 20.0 = 15.7 =

    3.11 = 7.33 = 11.5 1FBL Y Z М 1FBL      4UFQہॴ࠷େ஋Λݟ͚ͭΔ 55/125
  17. Input image Foot-print 1FBL Y Z М 
 1FBL Y

    Z М 
 1FBL Y Z М 
 1FBL Y Z М 
 1FBL Y Z М 
 1FBL Y Z М 
 ʜ しきい値以上の「座標、スケール」を選択 Results = 1.00 = 20.0 = 15.7 = 3.11 = 7.33 = 11.5 4UFQہॴ࠷େ஋Λݟ͚ͭΔ
  18. 4*'5͸ΞϑΟϯෆมͰ͸ͳ͍ 4*'5͸ࣹӨมԽͨ͠ը૾ʹϩόετͰ͸ͳ͍ɻ ΞϑΟϯྖҬΛݕग़ͯ͠ɺݕग़ͨ͠ΩʔϙΠϯτͷྖҬΛਅԁʹਖ਼ن Խͯ͠ಛ௃Λهड़͢Δ͜ͱͰΞϑΟϯෆมͷੑ࣭ΛಘΔ͜ͱ͕Ͱ͖Δɻ ݩը૾ʹͯΞϑΟϯྖҬ
 Λݕग़ͯ͠ਅԁʹਖ਼نԽ ճస ಛ௃Λهड़ ⾣ ,.JLPMBKD[ZLBOE$4DINJE

    
 l4DBMFBOE"⒏OFJOWBSJBOUJOUFSFTUQPJOUEFUFDUPSTz*O*+$7    ⾣ 1FSEPDI .BOE$IVN 0BOE.BUBT +
 l&⒏DJFOU3FQSFTFOUBUJPOPG-PDBM(FPNFUSZGPS-BSHF4DBMF0CKFDU3FUSJFWBM*OQSPDPG$713` ⾣ 4PGUXBSFl)FTTJBO"⒏OFEFUFDUPSXJUI4*'5EFTDSJQUPSz
 IUUQDNQGFMLDWVUD[dQFSEPNDPEFJOEFYIUNM 64/125
  19. ݹయతͳಛఆ෺ମೝࣝͷΞϓϩʔν ⾣  ը૾ू߹͔Βہॴಛ௃ -PDBMEFTDSJQUPS Λநग़ ⾣  ہॴಛ௃͔ΒݕࡧΠϯσοΫεΛ࡞੒ ⾣

     ΫΤϦը૾͔Βہॴಛ௃Λநग़ͯ͠ΠϯσοΫε͔ΒީิΛऔಘ ⾣  ہॴతͳಛ௃ͷ഑ஔʹزԿతͳ੔߹ੑ͕ͱΕ͍ͯΔ͜ͱΛ֬ೝ DB DB ը૾ू߹ ΫΤϦը૾ ݕࡧ
 ΠϯσοΫε ఏࣔީิ ݕࡧ
 ΠϯσοΫε     ہॴతͳಛ௃Λநग़ͯ͠ݕࡧΠϯσοΫεߏஙɻ<4JWJD -PXF +ÉHPV ʜ> 69/125
  20. #BHPG7JTVBM8PSET ը૾ݕࡧʹ͓͚Δ4*'5ͳͲͷώετάϥϜهड़͸࣍ͷ࣮਺ϕ ΫτϧɻݕࡧΠϯσοΫεͱͯ͠ొ࿥͢Δʹ͸େ͖͗͢Δɻ visual word: ID=1 ͋Β͔͡Ί,NFBOT ֊૚,NFBOTͳͲͰΫϥελϦϯάͯ͠ɺ
 ֤Ϋϥελͷத৺ΛWJTVBMXPSEͱͯ͠ఆٛ͢Δɻ
 ώετάϥϜهड़͸࠷ۙ๣ͷWJTVBMXPSEʹׂΓ౰ͯΔʢϕΫτϧྔࢠԽʣ

    ࠷ۙ๣Ϋϥελͷத৺఺ visual word: ID=2 x0 = (72 9 73 57 79 4 38 11 69 57 . . . ) x1 = (21 15 42 3 99 97 64 16 97 88 . . . ) x2 = (11 87 80 12 30 15 92 91 28 1 . . . ) ࠷ۙ๣Ϋϥελͷத৺఺ ࠷ۙ๣Ϋϥελͷத৺఺ 71/125
  21. సஔΠϯσοΫε ௥Ճ visual word visual word visual word λςɿWJTVBMXPSE*%ɺϤίɿJNBHF*% 72/125

    4MJEFDSFEJU:BOOJT"WSJUIJTzIUUQTTJGEMWHJUIVCJPTMJEFTJOEFYQEG
  22. సஔΠϯσοΫε ݕࡧ visual word visual word visual word λςɿWJTVBMXPSE*%ɺϤίɿJNBHF*% 73/125

    4MJEFDSFEJU:BOOJT"WSJUIJTzIUUQTTJGEMWHJUIVCJPTMJEFTJOEFYQEG
  23. సஔΠϯσοΫε ݕࡧ visual word visual word visual word λςɿWJTVBMXPSE*%ɺϤίɿJNBHF*% 74/125

    4MJEFDSFEJU:BOOJT"WSJUIJTzIUUQTTJGEMWHJUIVCJPTMJEFTJOEFYQEG
  24. సஔΠϯσοΫε ݕࡧ visual word visual word visual word λςɿWJTVBMXPSE*%ɺϤίɿJNBHF*% ランキングされた


    検索候補 76/125 4MJEFDSFEJU:BOOJT"WSJUIJTzIUUQTTJGEMWHJUIVCJPTMJEFTJOEFYQEG
  25. ࢀߟࢿྉ ߨٛࢿྉ lMFDUVSFNBUDIJOHBOEJOEFYJOH EFFQ MFBSOJOHGPSWJTJPOz %FFQ-FBSOJOHGPS7JTJPO
 :BOOJT"WSJUIJTઌੜ
 IUUQTTJGEMWHJUIVCJPTMJEFTJOEFYQEG lCJMMJPOTDBMFͷۙࣅ࠷ۙ๣୳ࡧz
 দҪ༐༎ઌੜ


    IUUQZVTVLFNBUTVJNFQSPKFDUTVSWFZ@QREPDBOO@CJMMJPO@QEG ॻ੶ lίϯϐϡʔλϏδϣϯ޿͕Δཁૉٕज़ͱԠ༻z
 ୈষɿۙࣅ࠷ۙ๣୳ࡧ 79/125
  26. ݹయతͳಛఆ෺ମೝࣝͷΞϓϩʔν ⾣  ը૾ू߹͔Βہॴಛ௃ -PDBMEFTDSJQUPS Λநग़ ⾣  ہॴಛ௃͔ΒݕࡧΠϯσοΫεΛ࡞੒ ⾣

     ΫΤϦը૾͔Βہॴಛ௃Λநग़ͯ͠ΠϯσοΫε͔ΒީิΛऔಘ ⾣  ہॴతͳಛ௃ͷ഑ஔʹزԿతͳ੔߹ੑ͕ͱΕ͍ͯΔ͜ͱΛ֬ೝ DB DB ը૾ू߹ ΫΤϦը૾ ݕࡧ
 ΠϯσοΫε ఏࣔީิ ݕࡧ
 ΠϯσοΫε     ہॴతͳಛ௃Λநग़ͯ͠ݕࡧΠϯσοΫεߏஙɻ<4JWJD -PXF +ÉHPV ʜ> 80/125
  27. http://www.cse.psu.edu/~rtc12/CSE486/lecture15.pdf ྫɿઢܗճؼϞσϧ Ϟσϧɿઢܗճؼ ZBY C ɺύϥϝʔλɿB Cɺαϯϓϧ਺ɿ̎ y = ax

    + b $SFEJU3PCFSU$PMMJOTIUUQXXXDTFQTVFEVdSUD$4&MFDUVSFQEG ࠷খೋ৐๏ʹΑΓࢉग़ 83/125
  28. ⾣ ϞσϧύϥϝʔλΛܭࢉ͢ΔͨΊͷσʔλΛαϯϓϦϯά ⾣ Ϟσϧύϥϝʔλͷਪఆ ⾣ Ϟσϧͱ੔߹ੑͷ͋͏αϯϓϧΛJOMJFSTͱͯ͠Χ΢ϯτ͢Δ ⾣ ܁Γฦ͠ɻJOMJFST͕࠷େͱͳΔϞσϧύϥϝʔλΛճ౴͢Δ inliers count

    = 162 x0 = Ax ϞσϧɿΞϑΟϯม׵
 ύϥϝʔλɿΞϑΟϯࣸ૾ߦྻ"
 αϯϓϧ਺ɿ 103/125 あらかじめ決めた繰り返し回数に 到達するまで繰り返す
  29. ͦͷଞͷ޻෉ɿΫΤϦ֦ு2VFSZ&YQBOTJPO ΫΤϦը૾ʹΑΔݕࡧ݁Ռ͔Βɺ৽ͨʹݕࡧΫΤϦΛ࡞੒͢Δɻ ը૾ؒͰʮ෦෼తʹϚον͢Δؔ܎ʯ͕ଟ͍৔߹͸ಛʹ༗ޮɻ 52& 5SBOTJUJWF2VFSZ&YQBOTJPO <$IVN>ɺ
 "2& "WFSBHF2VFSZ&YQBOTJPO <$IVN>ɺ
 %2&

    %JTDSJNJOBUJWFRVFSZFYQBOTJPO <"SBOEKFMPWJD>ɺ
 Ћ2& ЋXFJHIUFERVFSZFYQBOTJPO <3BEFOPWJ㶛B>ɺ
 %'4 HSBQICBTFERVFSZFYQBOTJPOEJ⒎VTJPO <*TDFO>ͳͲ ݕࡧ ݕࡧ ݕࡧ 105/125
  30. ΞδΣϯμ ը૾ݕࡧ ಛఆ෺ମೝࣝ ʹ͍ͭͯ঺հ͠·͢ɻ 106/125 ⾣ 0WFSWJFX$MBTTJDBM"QQSPBDI   ⾣

    -PDBM%FTDSJQUPS ⾣ *OEFY4FBSDI ⾣ .BUDIJOH ⾣ %FFQ-FBSOJOH5SFOET   ⾣ $//CBTFE-PDBM%FTDSJQUPS ⾣ $//CBTFE(MPCBM%FTDSJQUPS ⾣ $713`84PG-BSHFTDBMF-BOENBSL3FUSJFWBM$IBMMFOHF
  31. ਂ૚ֶशʹΑΔ໠Җ *-473$͸೥͔Β࢝·ͬͨେن໛zը૾ೝࣝzͷίϯςετɻ 〜 深層学習⼀一般の話題と画像認識(多クラス分類類)の⽂文脈での話 〜     

     2010 2011 2012 2013 2014 2015 Classification error (%) 0 10 20 30 CNN for ImageNet Krizhevsky et al. ೥ʹେن໛ը૾ೝࣝʹ͓͚Δ $//ʹΑΔϒϨΠΫεϧʔͷಥഁ Ͱେ͖͘஫໨ΛूΊͨɻ ͷେ෯ͳվળ 107/125
  32. 1Z5PSDI࣮૷ɿ 7(( YY 1PPMJOHͷࣜɿ      

    *OQVU͸࠷ޙͷ$POWPMVUJPOMBZFSͷ
 0VUQVUͰYY 41P$<#BCFOLP> F.avg_pool2d(x, (x.size(-2), x.size(-1))) ˞આ໌ͷศ্ٓɺ7((Λ༻͍͍ͯΔ LFSOFMTJ[F ݸʑͷ,FSOFMͷશମ Y $POWPMVUJPOMBZFSʹΑͬͯಘͨہॴతͳಛ௃Λ·ͱΊΔɻ
 ˠLݸͷ,FSOFMͷ൓ԠΛผʑʹ%ʹQPPMJOH͢Δ 4QBUJBMlTVNzQPPMJOH  ʢTVNQPPMJOHͳͷͰɺͦΕͧΕͷʹΑΔ,FSOFMʹΑΔ൓Ԡͷස౓ͷΑ͏ͳදݱʣ 111/125 *NBHF$SFEJU:BOOJT"WSJUIJTzIUUQTTJGEMWHJUIVCJPTMJEFTJOEFYQEG
  33. ."$<5PMJBT> 1Z5PSDI࣮૷ɿ 7(( YY F.max_pool2d(x, (x.size(-2), x.size(-1))) 1PPMJOHͷࣜɿ  

        *OQVU͸࠷ޙͷ$POWPMVUJPOMBZFSͷ
 0VUQVUͰYY Y $POWPMVUJPOMBZFSʹΑͬͯಘͨہॴతͳಛ௃Λ·ͱΊΔɻ
 ˠLݸͷ,FSOFMͷ൓ԠΛผʑʹ%ʹQPPMJOH͢Δ 4QBUJBMNBYQPPMJOH  ʢNBYQPPMJOHͳͷͰɺͦΕͧΕͷʹΑΔ,FSOFMʹΑΔ൓Ԡͷ༗ແͷΑ͏ͳදݱʣ LFSOFMTJ[F ݸʑͷ,FSOFMͷશମ ˞આ໌ͷศ্ٓɺ7((Λ༻͍͍ͯΔ 112/125 *NBHF$SFEJU:BOOJT"WSJUIJTzIUUQTTJGEMWHJUIVCJPTMJEFTJOEFYQEG
  34. 1Z5PSDI࣮૷ɿ 1PPMJOHͷࣜɿ ࠷ޙͷ$POWPMVUJPOMBZFSͷPVUQVUʹରͯ͠ɺ
 4QBUJBMHFOFSBMJ[FENFBOQPPMJOHɻ-QOPSNͷΑ͏ͳQΛύϥϝʔλͱͨ͠ҰൠԽɻ (F.<3BEFOPWJ㶛> F.avg_pool2d(
 x.clamp(min=eps).pow(p),
 (x.size(-2), x.size(-1))
 ).pow(1./p)

          gem 7(( YY Q@L͸ֶशՄೳͳϞσϧύϥϝʔλ ˞આ໌ͷศ্ٓɺ7((Λ༻͍͍ͯΔ 113/125 *NBHF$SFEJU:BOOJT"WSJUIJTzIUUQTTJGEMWHJUIVCJPTMJEFTJOEFYQEG
  35. 2010 2011 2012 2013 2014 2015 CNN for ImageNet Krizhevsky

    et al. 2016 2017 2018 2004 2005 2006 2007 2008 2009 2003 Video Google Sivic and Zisserman 2002 Hamming Embedding
 Jegou et al. CNN off-the-shelf
 Razavian et al. DELF Noh et al. GeM Radenović et al. SPoC Babenko et al. R-MAC VLAD
 Jegou et al. SIFT-based CNN-based MAC Tolias et al. REMAP
 Bober-Irizar et al. ? (manuscript submitted) 2019? ͜Ε·ͰͷาΈ 115/125
  36. $713XPSLTIPQίϯϖςΟγϣϯ ⾣ ϥϯυϚʔΫ ཱྀߦऀͳͲʹͱͬͯ৔ॴΛಛఆ͢Δͷʹ໾ཱͭ໨ҹ Λର৅ͱ ͨ͠େن໛ը૾ݕࡧίϯςετɻ ⾣ ڝٕظؒɿdͷ໿ϲ݄ ⾣ ධՁࢦඪɿN"1!

    ⾣ ϥϯυϚʔΫ਺ɿສઍछྨ ⾣ σʔλ਺ɿΫΤϦը૾ ΠϯσοΫεը૾(# ສຕ  ⾣ ΠϯσοΫεը૾͔Β4*'5Λநग़͢Δͱ.ݸ ԯݸ  ⾣ 3PPU4*'5ʹม׵͢Δͱ.  (# 117/125 $713`XPSLTIPQͷίϯϖςΟγϣϯͱͯ͠,BHHMFͰ։࠵ɻ ݱ࣌఺Ͱ࠷΋େن໛ͳɺಛఆ෺ମೝࣝͷධՁσʔληοτɻ
  37. ্Ґ͸$//CBTFE(MPCBMEFTDSJQUPS 
 4JBNFTF5SJQMFUOFUXPSLʹΑΔ'JOFUVOJOH͕ओྲྀ     mAP@100 0 20

    40 60 3BEFOPWJD (F. '5 1$"X &VDMJEJBOTFBSDI   3BEFOPWJD (F. '5 1$"X &VDMJEJBOTFBSDI %J⒎VTJPO #PCFS*SJ[BS &OTFNCMF 1$"X %#"VH &VDMJEJBO 2& JUFSBUJPOT  #PCFS*SJ[BS 3&."1 '5 1$"X &VDMJEJBOTFBSDI 2& JUFSBUJPOT #PCFS*SJ[BS 3&."1 '5 &VDMJEJBOTFBSDI  0[BLJ %&-' 52& 41ˠ3FTDPSJOHCZ(F. '5 %J⒎VTJPO 0[BLJ %&-' 52& 41  0[BLJ (F. QSFUSBJOFE 1$"X &VDMJEJBOTFBSDI .JTILJO )FT"⒎/FU)BSE/FU 41 
 2VFSZFYQBOTJPOCZ%J⒎VTJPO ͸1VCMJD-#TDPSFʹ͓͚Δ਺ࣈͰ͋ΔͨΊɺଞͱएׯҟͳΔՄೳੑ͕͋Δ 118/125 -PDBMEFTDSJQUPSʹΑΔݕࡧΛϕʔεͱͨ݁͠Ռ 3&."1ɻ
 %J⒎VTJPOΛ༻͍ͣ&VDMJEJBOTFBSDI 2&͚ͩͰ
 ඇৗʹߴ͍ਫ਼౓Λୡ੒͍ͯ͠Δɻ TU UI UI UI
  38. 3&."1<$74417JTVBM"UPNT5FBN NBOVTDSJQUTVCNJUUFE > ⾣ -BOENBSL3FUSJFWBM$IBMMFOHFͰॳग़ͷ(MPCBMEFTDSJQUPSˍ405"ɻ ⾣ δϟʔφϧ౤ߘதʹ͖ͭɺৄࡉ͸$713`84ͷϓϨθϯςʔγϣϯͱ,BHHMF'PSVNͰͷ৘ ใ͕࠷΋ৄ͍͠ɻ ⾣ ʮ͢΂ͯͷ$POWPMVUJPO૚͔ΒʯͦΕͧΕ30*1PPMʹΑͬͯSFHJPOGFBUVSFTΛऔΓग़͢ɻ

    &OUSPQZϕʔεͷXFJHIUJOHΛ༩͑ͯTVNQPPMJOHͯ͠࿈݁͢Δɻ ⾣ ೖྗ͕ߴղ૾ˍ෦෼తͳࣸਅ͕ͨ͘͞Μ͋ΔͷͰɺ30*1PPM͸ͱͯ΋༗ޮͦ͏ʂ ⾣ 0YGPSE 1BSJTͳͲ,BHHMFҎ֎ͷධՁσʔληοτͰ΋ൺֱ͞Ε͓ͯΓଞΛѹ౗͍ͯ͠Δɻ ⾣ RVFSZ NBUDI OPONBUDI ͷUSJQMFUͰ3&."1Λܭࢉͯ͠USJQMFUMPTTʹΑΔpOFUVOJOH Λߦ͏ɻ 3FGIUUQTXXXLBHHMFDPNDMBOENBSLSFUSJFWBMDIBMMFOHFEJTDVTTJPO 119/125
  39. ·ͱΊʗࡶײ w ݹయతΞϓϩʔνʹΑΔख๏ͱ$//ϕʔεͷΞϓϩʔν w 4*'5͸εέʔϧෆมɾճసෆมɾর໌ෆมͷੑ࣭Λ࣋ͭ w 3"/4"$͸PVUMJFSʹରͯ͠ϩόετʹϞσϧύϥϝʔλΛࢉग़Մೳ w $//ϕʔεͷ(MPCBMEFTDSJQUPS͸୯ҰͷϕΫτϧͰߴ͍ݕࡧਫ਼౓ w

    4JBNFTF5SJQMFUOFUXPSLͰ'JOFUVOJOHʹΑΔਫ਼౓ఈ্͕͛Մೳ w ίϯςετΛ௨ͯ͡ͷࡶײ w େن໛ʹͳΔͱ-PDBMEFTDSJQUPS͸σʔλ਺͕๲େʹͳΔɻඞཁͱ͢Δܭࢉίετ΍Ϧιʔ ε΋େ͖͘ͳΔͷͰ೉఺Λײ͡Δɻ w زԿతͳݕূʹΑͬͯಘΒΕΔ݁Ռ͸ඇৗʹਫ਼౓͕ߴ͍ͨΊɺ-PDBMEFTDSJQUPSΛ࢖Θͣ (MPCBMEFTDSJQUPS͚ͩͰྑ͍είΞΛୡ੒Ͱ͖Δ͜ͱʹ͸େมڻ͍ͨɻ w (MPCBMEFTDSJQUPSͷ݁ՌͰ͸ࢁͳͲͷࣗવ؀ڥͷࣸਅʹର͢Δ3FDBMM͕-PDBMEFTDSJQUPS ͱൺֱͯ͠ߴ͘ɺ͜Ε͕શମͷείΞʹӨڹΛ༩͑ͨͷͰ͸ͳ͍͔ͱࢥΘΕΔ ཁݕূ 125/125 ը૾ݕࡧʢಛఆ෺ମೝࣝʣͷݹయख๏ͱ$//ϕʔεͷख๏Λ঺հͨ͠ɻ