Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
AI最新技術Update会 8月
Search
M.Inomata
August 05, 2020
Research
510
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
AI最新技術Update会 8月
イベント発表資料です。
https://deeplearning-b.connpass.com/event/181528/
M.Inomata
August 05, 2020
More Decks by M.Inomata
See All by M.Inomata
20221005_AI勉強会
ecoopnet
0
580
20220112_AI勉強会
ecoopnet
0
590
AI最新論文読み会2021年11月
ecoopnet
0
370
AI最新論文読み会2021年5月
ecoopnet
0
580
AI最新論文読み会2020年12月
ecoopnet
0
450
医療ディープラーニング勉強会 DL勉強会 第3回 2020.4
ecoopnet
0
560
Kotlin Coroutines ことはじめ
ecoopnet
2
380
ハマった時に役立つ通信系デバッグの話
ecoopnet
3
1.9k
モバイルアプリ開発に使える設計の話
ecoopnet
1
1.7k
Other Decks in Research
See All in Research
National high-resolution cropland classification of Japan with agricultural census information and multi-temporal multi-modality datasets
satai
3
270
Using our influence and power for patient safety
helenbevan
0
360
「行ける・行けない表」による地域公共交通の性能評価
bansousha
0
160
第12回人と環境にやさしい交通をめざす全国大会/熊本都市圏「車1割削減、渋滞半減、公共交通2倍」をめざして
trafficbrain
0
110
YOLO26_ Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
satai
3
780
存立危機事態の再検討
jimboken
0
290
[BlackHatAsia2026] Hidden Telemetry: Uncovering TraceLogging ETW Providers You're Not Using (Yet)
asuna_jp
1
510
社内データ分析AIエージェントを できるだけ使いやすくする工夫
fufufukakaka
1
1.1k
羽田新ルート運用6年の検証
1manken
0
160
AY 2026 Guide to Academic Writing Using Generative AI - Workshop
ks91
PRO
0
120
NLP colloquium: AI Safety Survey
kanekomasahiro
0
540
SAKURAONE:An Open Ethernet-based AI HPC System And Its Observed Workload Dynamicsin a Single-Tenant LLM Development Environment
yuukit
1
300
Featured
See All Featured
The Spectacular Lies of Maps
axbom
PRO
1
790
How to Get Subject Matter Experts Bought In and Actively Contributing to SEO & PR Initiatives.
livdayseo
0
130
sira's awesome portfolio website redesign presentation
elsirapls
0
270
BBQ
matthewcrist
89
10k
Marketing Yourself as an Engineer | Alaka | Gurzu
gurzu
0
210
Bridging the Design Gap: How Collaborative Modelling removes blockers to flow between stakeholders and teams @FastFlow conf
baasie
0
580
Typedesign – Prime Four
hannesfritz
42
3.1k
Mind Mapping
helmedeiros
PRO
1
240
What does AI have to do with Human Rights?
axbom
PRO
1
2.2k
Fashionably flexible responsive web design (full day workshop)
malarkey
408
66k
Thoughts on Productivity
jonyablonski
76
5.2k
How to Align SEO within the Product Triangle To Get Buy-In & Support - #RIMC
aleyda
2
1.5k
Transcript
AI࠷৽ٕज़Updateձ 8݄ ᷂tech vein ழມ ॆԝ
ࣗݾհ ழມ ॆԝ (͍ͷ·ͨ ΈͭͻΖ) גࣜձࣾ tech vein දऔక ݉
σϕϩούʔ twitter: @ino2222 IUUQTXXXUFDIWFJODPN
ΞδΣϯμ Archive Sanity (arxiv-sanity.com) ͔ΒϐοΫΞο ϓͨ͠ɺarxiv.org ͷաڈ1ϲ݄ؒͷจհɻ ɾtop recentͷจτοϓ10 ɾtop
hype ͷจτοϓ10 ɾҰ൪ؾʹͳͬͨจͷհ
Archive Sanity? https://www.arxiv-sanity.com/top
Arxiv Sanity Top recent: Best10
ᶃTransfromerRNNͰ͢ɻઢܗҙͷߴࣗݾճؼτϥϯεϑΥʔϚʔ (ݪจ: Transformers are RNNs: Fast Autoregressive Transformers with Linear
Attention) Transformer͍͔ͭ͘ͷλεΫͰݦஶͳੑೳΛୡ͍ͯ͠Δ͕ɼ ೖྗͷ͞ʹରͯ͠ೋ࣍తͳෳࡶ͞Λ࣋ͭͨΊɼඇৗʹ͍γʔ έϯεͰ๏֎ʹ͍ɽ͜ͷ੍ݶʹରॲ͢ΔͨΊʹɼզʑࣗݾ ҙΛΧʔωϧಛྔϚοϓͷઢܗੵͱͯ͠දݱ͠ɼߦྻੵͷ ࿈ੑͷੑ࣭Λར༻ͯ͠ɼෳࡶ͞Λ O(N^2) ͔Β O(N) ʹݮΒ͢ ͜ͱΛࢼΈΔɽ͜ͷఆࣜԽʹΑΓɺࣗݾճؼܕTransformerΛܶత ʹߴԽ͠ɺϦΧϨϯτɾχϡʔϥϧɾωοτϫʔΫͱͷؔΛ ໌Β͔ʹ͢Δ෮࣮͕ՄೳͰ͋Δ͜ͱΛࣔ͢ɻզʑͷઢܗม ثόχϥมثͱಉ༷ͷੑೳΛୡ͠ɺඇৗʹ͍γʔέϯε ͷࣗݾճؼత༧ଌʹ͓͍ͯ࠷େ4000ഒͷΛୡͨ͠ɻ http://arxiv.org/abs/2006.16236v2
None
None
ᶄ ҟৗݕग़ͷͨΊͷσΟʔϓϥʔχϯά.ϨϏϡʔ (ݪจ: Deep Learning for Anomaly Detection: A Review)
ҟৗݕग़ɺผ໊ʮ֎Εݕग़ʯɺेલ͔Β༷ʑͳݚڀίϛϡχςΟ ʹ͓͍ͯɺӬଓతͰ͋Γͳ͕Β׆ൃͳݚڀͱͳ͍ͬͯ·͢ɻ·ͩ· ͩɺߴͳΞϓϩʔνΛඞཁͱ͢Δಠಛͷͷෳࡶ͞ͱ՝͕͋Γ· ͢ɻۙɺσΟʔϓϥʔχϯάΛར༻ͨ͠ҟৗݕग़ɺ͢ͳΘͪσΟʔϓ ҟৗݕग़͕ॏཁͳํੑͱͯ͠ු্͖͍ͯͯ͠ΔɻຊจͰɺਂҟ ৗݕͷݚڀΛɺ3ͭͷߴϨϕϧΧςΰϦͱ11ͷࡉԽΧςΰϦʹ͚ͯ ཏతͳλΫιϊϛΛ༻͍ͯϨϏϡʔ͢ΔɻຊߘͰɺ͜ΕΒͷख๏ͷओ ཁͳ؍ɺతؔɺجૅͱͳΔԾఆɺॴͱॴΛϨϏϡʔ͠ɺલड़ͷ ՝ʹͲͷΑ͏ʹରॲ͍ͯ͠Δ͔Λٞ͢Δɻ͞ΒʹɺকདྷͷՄೳੑͱ՝ ʹରॲ͢ΔͨΊͷ৽ͨͳࢹʹ͍ͭͯٞ͢Δɻ http://arxiv.org/abs/2007.02500v2
ܽؕݕग़ͷྫ (MVTec Adσʔληοτ) https://www.mvtec.com/company/research/datasets/mvtec-ad/
None
ᶅNVAE: ਂ֊తมࣗಈΤϯίʔμʔ (ݪจ: NVAE: A Deep Hierarchical Variational Autoencoder) ਖ਼نԽϑϩʔɺࣗݾճؼϞσϧɺมࣗಈΤϯίʔμʔʢVAEʣɺσΟʔϓΤωϧΪʔϕʔεϞ
σϧɺσΟʔϓੜֶशͷͨΊͷڝ߹͢ΔϕʔεͷϑϨʔϜϫʔΫͷҰͭͰ͋Δɻ͜Ε ΒͷதͰɺߴͰѻ͍͍͢αϯϓϦϯάͱΞΫηε͍͢͠ූ߸ԽωοτϫʔΫͷར͕ ͋Δɻ͔͠͠ɺݱࡏͷͱ͜Ζɺਖ਼نԽϑϩʔࣗݾճؼϞσϧͷΑ͏ͳଞͷϞσϧʹྼͬͯ ͍ΔɻVAEͷݚڀͷେ෦౷ܭతͳ՝ʹযΛ͍ͯͯΔ͕ɺզʑ֊Խ͞ΕͨVAEͷ ͨΊͷχϡʔϥϧΞʔΩςΫνϟΛ৻ॏʹઃܭ͢Δͱ͍͏ަ͢ΔํੑΛ୳Δɻզʑɺਂ ͞ํͷՄೳͳΈࠐΈͱόονਖ਼نԽΛ༻͍ͯը૾ੜͷͨΊʹߏங͞Εͨਂ֊ܕ VAEͰ͋ΔNouveau VAEʢ̣̫̖̚ʣΛఏҊ͢ΔɻNVAEਖ਼نͷࠩύϥϝʔλԽΛඋ ͓͑ͯΓɺֶशεϖΫτϧਖ਼ଇԽʹΑͬͯ҆ఆԽ͞Ε͍ͯΔɻMNIST, CIFAR-10, CelebA HQ ͷσʔληοτʹ͓͍ͯɺNVAE͕ඇࣗݾճؼతϞσϧͷதͰ࠷ઌͷ݁ՌΛୡ͠ɺ FFHQͷڧྗͳϕʔεϥΠϯΛఏڙ͢Δ͜ͱΛࣔ͢ɻྫ͑ɺCIFAR-10ͰɺNVAE1࣍ݩ͋ ͨΓͷϏοτΛ2.98͔Β2.91ʹԡ্͛͠ɺCelebA HQͰਤ1ʹࣔ͢Α͏ʹߴ࣭ͷը૾Λ ੜ͠·͢ɻࢲͨͪͷΔݶΓͰɺNVAEɺ256x256ϐΫηϧͷେ͖͞ͷࣗવը૾ʹద༻ ͞Εͨ࠷ॳͷޭͨ͠VAEͰ͢ɻ http://arxiv.org/abs/2007.03898v1
෮शɿVAE https://qiita.com/shionhonda/items/e2cf9fe93ae1034dd771
෮श:VAE(2) • ಛZΛͣΒ͍ͯ͘͠ͱ݁Ռ͕มΘ͍ͬͯ͘
NVAEͷੜྫ
ˡͷS3FTJEVSBM/FUXPSL ˣ3FTJEVSBM/FUXPSLͷߏ
ᶆϞσϧϕʔεͷڧԽֶश.ௐࠪ (ݪจ: Model-based Reinforcement Learning: A Survey) ϚϧίϑܾఆϓϩηεʢMDPʣ࠷దԽͱͯ͠ҰൠతʹܗࣜԽ͞Ε͍ͯΔஞ࣍తҙࢥܾఆ ɺਓೳͷॏཁͳ՝Ͱ͋Δɻ͜ͷʹର͢Δ2ͭͷॏཁͳΞϓϩʔνɺڧԽֶ श(RL)ͱϓϥϯχϯάͰ͋ΔɻຊߘͰɺϞσϧϕʔεͷڧԽֶशͱͯ͠ΑΓΑ͘ΒΕ
͍ͯΔ྆ͷ౷߹ʹ͍ͭͯͷௐࠪΛհ͢ΔɻϞσϧϕʔεRLʹ2ͭͷओཁͳεςο ϓ͕͋ΔɻୈҰʹɺ֬ੑɺෆ࣮֬ੑɺ෦తͳ؍ଌՄೳੑɺ࣌ؒతͳநԽͳͲͷ՝ ΛؚΉྗֶϞσϧֶशͷΞϓϩʔνΛମܥతʹѻ͏ɻୈೋʹɺܭըͱֶशͷ౷߹ͷମܥ తͳྨΛఏࣔ͠ɺͲ͔͜ΒܭըΛ։࢝͢Δ͔ɺܭըͱ࣮σʔλऩूʹͲͷΑ͏ͳ༧ࢉΛ ׂΓͯΔ͔ɺͲͷΑ͏ʹܭը͢Δ͔ɺֶशͱߦಈͷϧʔϓʹܭըΛͲͷΑ͏ʹ౷߹͢Δ ͔ɺͳͲͷଆ໘ΛؚΉɻ͜ΕΒ2ͭͷॏཁͳηΫγϣϯͷޙʹɺσʔλޮͷ্ɺ λʔήοτΛߜͬͨ୳ࡧɺ҆ఆੑͷ্ͳͲɺϞσϧϕʔεͷRLͷજࡏతͳརʹ͍ͭͯ ٞ͢ΔɻௐࠪʹԊͬͯɺ֊తRLୡͳͲͷؔ࿈͢ΔRLɺߦಈ৺ཧֶͳͲͷ ଞͷݚڀͱͷؔ࿈ੑ͍ࣔͯ͠·͢ɻશମͱͯ͠ɺຊௐࠪͰɺMDP࠷దԽͷͨΊ ͷܭըֶशͷΈ߹Θͤʹ͍ͭͯɺ෯͍֓೦తͳ֓ཁΛఏ͍ࣔͯ͠Δɻ http://arxiv.org/abs/2006.16712v2
෮श:MDP(Ϛϧίϑܾఆաఔ) https://ja.wikipedia.org/wiki/%E3%83%9E%E3%83%AB%E3%82%B3%E3%83%95%E6%B1%BA%E5%AE%9A%E9%81%8E%E7%A8%8B 4ঢ়ଶ "ߦಈ
None
None
ᶇ χϡʔϥϧωοτϫʔΫͷάϥϑߏ (ݪจ: Graph Structure of Neural Networks) χϡʔϥϧωοτϫʔΫɺχϡʔϩϯؒͷଓͷάϥϑͱͯ͠දݱ͞ΕΔ͜ͱ͕ ଟ͍ɻ͔͠͠ɺ͘ΘΕ͍ͯΔʹ͔͔ΘΒͣɺχϡʔϥϧωοτϫʔΫͷάϥ
ϑߏͱ༧ଌੑೳͱͷؔʹ͍ͭͯɺ΄ͱΜͲཧղ͞Ε͍ͯͳ͍ͷ͕ݱঢ়Ͱ͢ɻ ͜͜ͰɺχϡʔϥϧωοτϫʔΫͷάϥϑߏ͕༧ଌੑೳʹͲͷΑ͏ͳӨڹΛ༩ ͑Δͷ͔Λܥ౷తʹௐΔɻ͜ͷతͷͨΊʹɺզʑؔάϥϑͱݺΕΔ χϡʔϥϧωοτϫʔΫͷ৽͍͠άϥϑϕʔεͷදݱΛ։ൃͨ͠ɻ͜ͷදݱΛ༻͍ ͯɺҎԼͷ͜ͱΛࣔ͢ɻ(2)χϡʔϥϧωοτϫʔΫͷੑೳɺͦͷؔάϥϑͷΫ ϥελϦϯάͱฏۉύεͷΒ͔ͳؔͰ͋Δ͜ͱɺ(3)զʑͷݟଟ͘ͷ ҟͳΔλεΫσʔληοτʹ͓͍ͯҰ؏͍ͯ͠Δ͜ͱɺ(4)εΠʔτεϙοτޮ తʹಛఆͰ͖Δ͜ͱɺ(5)τοϓύϑΥʔϚϯεͷχϡʔϥϧωοτϫʔΫɺ࣮ ࡍͷੜֶతχϡʔϥϧωοτϫʔΫͷͦΕʹڻ͘΄ͲࣅͨάϥϑߏΛ͍࣋ͬͯ Δ͜ͱΛࣔ͢ɻզʑͷݚڀɺχϡʔϥϧΞʔΩςΫνϟͷઃܭͱχϡʔϥϧωο τϫʔΫҰൠͷཧղʹ৽ͨͳํੑΛ։͘ͷͰ͋Δɻ http://arxiv.org/abs/2007.06559v1
άϥϑߏ
None
ᶈ PyTorch3DʹΑΔ3DσΟʔϓϥʔχϯάͷߴԽ (ݪจ: Accelerating 3D Deep Learning with PyTorch3D) http://arxiv.org/abs/2007.08501v1
˒1JDL6Q
ᶉ ϥϕϧԽ͞Ε͍ͯͳ͍σʔλ͍ͯ͢͠Θ͚Ͱͳ͍ɻڭࢣֶ͖श ʹ͓͚ΔσʔλͷॏΈֶ͚श (ݪจ: Not All Unlabeled Data are Equal:
Learning to Weight Data in Semi-supervised Learning) طଘͷڭࢣֶ͖शʢSSLʣΞϧΰϦζϜɺϥϕϧ͖ྫͱϥϕϧ ͳ͠ྫͷଛࣦͷόϥϯεΛͱΔͨΊʹ୯ҰͷॏΈΛ༻͍ͯ͠·͢ɺ͢ ͳΘͪɺͯ͢ͷϥϕϧͳ͠ྫ͕͘͠ॏΈ͚͞Ε͍ͯ·͢ɻ͔͠ ͠ɺϥϕϧ͚͞Ε͍ͯͳ͍σʔλ͕ͯ͘͢͠ͳΔΘ͚Ͱͳ ͍ɻ͜ͷจͰɺϥϕϧ͚͞Ε͍ͯͳ͍ྫ͝ͱʹҟͳΔॏΈΛ ༻͢Δํ๏Λݚڀ͍ͯ͠·͢ɻ͜Ε·ͰͷݚڀͰߦΘΕ͍ͯͨΑ͏ͳɺ ͯ͢ͷॏΈΛखಈͰௐ͢Δ͜ͱෆՄೳͰ͋ΔɻͦͷΘ ΓʹɺզʑӨڹؔʹج͍ͮͨΞϧΰϦζϜΛ༻͍ͯॏΈΛௐ͢ Δɻ͜ͷΞϓϩʔνΛޮతʹ͢ΔͨΊʹɺզʑӨڹؔͷߴͰ ޮՌతͳۙࣅΛఏҊ͢Δɻ͜ͷख๏͕ɺڭࢣ͖ը૾͓Αͼݴޠ ྨλεΫʹ͓͍ͯɺ࠷ઌͷख๏ΑΓ༏Ε͍ͯΔ͜ͱΛ࣮ূ͢Δɻ http://arxiv.org/abs/2007.01293v1
ᶃ ᶄ ᶅ
ᶊϦΧϨϯτχϡʔϥϧωοτϫʔΫʹ͓͚Δτοϓμϯ৴߸ͱϘτϜΞοϓ ৴߸ͷΈ߹ΘͤͷֶशͱϞδϡʔϧ্Ͱͷҙשى (ݪจ: Learning to Combine Top-Down and Bottom-Up Signals
in Recurrent Neural ݎ࿚ͳ֮ɺϘτϜΞοϓͱτοϓμϯͷ྆ํͷ৴߸ʹґଘ͍ͯ͠·͢ɻϘτϜΞοϓ ৴߸ɺײ֮Λ௨ͯ͠؍͞ΕΔͷͰ͋Δɻτοϓμϯͷγάφϧɺաڈͷܦݧ ظهԱʹجͮ͘৴೦ظɺྫ͑ʮϐʔφοπόλʔͱʙ...ʯͱ͍͏ϑϨʔζ͕Ͳͷ Α͏ʹ͢Δ͔ͱ͍͏Α͏ͳͷͰ͢ɻϘτϜΞοϓใͱτοϓμϯใͷ࠷దͳ Έ߹ΘͤະղܾͷͰ͋Δ͕ɺͦͷΈ߹ΘͤํಈతͰ͋Γɺจ຺λεΫʹґଘ͠ ͍ͯͳ͚ΕͳΒͳ͍ɻར༻Մೳͳજࡏతͳτοϓμϯใͷ๛͞ΛޮՌతʹར༻͠ɺ ํΞʔΩςΫνϟͰͷ৴߸ͷࠞ͟Γ߹͍ͷෆڠԻΛ͙ͨΊʹɺใͷྲྀΕΛ੍ݶ ͢ΔϝΧχζϜ͕ඞཁͰ͋ΔɻզʑɺϘτϜΞοϓͱτοϓμϯͷ৴߸͕ҙΛͬͯ ಈతʹ݁߹͞ΕΔσΟʔϓϦΧϨϯτχϡʔϥϧωοτΞʔΩςΫνϟΛ୳ٻ͍ͯ͠ΔɻΞʔ ΩςΫνϟͷϞδϡʔϧੑɺใͷڞ༗ͱ௨৴Λ͞Βʹ੍ݶ͢ΔɻҙͱϞδϡʔϧੑ ใͷྲྀΕΛ༠ಋ͠ɺ֮ͱݴޠλεΫʹ͓͚Δ৴པੑͷߴ͍ੑೳ্ΛͨΒ͠ɺಛʹ ҙࢄອϊΠζͷଟ͍σʔλʹର͢ΔϩόετੑΛ্ͤ͞ΔɻຊݚڀͰɺݴޠϞσϦϯ άɺஞ࣍ը૾ྨɺϏσΦ༧ଌɺڧԽֶशͳͲͷ༷ʑͳϕϯνϚʔΫʹ͓͍ͯɺʮํੑʯ ใͷྲྀΕ͕ڧྗͳϕʔεϥΠϯΑΓ݁ՌΛ্ͤ͞Δ͜ͱΛ࣮ূͨ͠ɻ http://arxiv.org/abs/2006.16981v1
Bidirectional Recurrent Independent Mechanisms (BRIMs)
47(-14UPDIBTUJD7JEFP(FOFSBUJPOXJUIB-FBSOFE1SJPS IUUQTBSYJWPSHQEGQEG
None
ᶋ ϩόετੑͷଟ໘ੑɻ֎ҰൠԽͷ൷తੳ (ݪจ: The Many Faces of Robustness: A Critical
Analysis of Out-of-Distribution Generalization) ຊݚڀͰɺը૾ελΠϧɺཧతҐஔɺΧϝϥૢ࡞ͳͲͷࣗવൃੜత ͳมԽ͔ΒͳΔ3ͭͷ৽͍͠ϩόετੑϕϯνϚʔΫΛհ͢Δɻ͜ ͷϕϯνϚʔΫΛ༻͍ͯɺҎલʹఏҊ͞Εͨ֎ϩόετੑʹؔ͢Δ ԾઆΛݕূ͠ɺͦΕΒΛݕূ͠·͢ɻ͜Ε·ͰͷݚڀͰͷओுʹͯ͠ɺ ΑΓେ͖ͳϞσϧͱ߹σʔλͷ૿ڧΛ༻͢Δ͜ͱͰɺ࣮ੈքͷ γϑτʹର͢Δϩόετੑ͕վળ͞ΕΔ͜ͱ͕Θ͔Γ·ͨ͠ɻ͜ΕΛ ͖͔͚ͬʹɺզʑ࠷ઌͷٕज़Λਐาͤ͞ɺ1000ഒҎ্ͷϥϕϧ͚ ͞ΕͨσʔλͰࣄલֶश͞ΕͨϞσϧΛ྇կ͢Δ৽͍͠σʔλ૿ڧ๏Λ ಋೖ͠·ͨ͠ɻͦͷ݁Ռɺ͍͔ͭ͘ͷख๏ςΫενϟہॴతͳը૾ ౷ܭʹ͓͚ΔͷมԽʹҰ؏ͯ͠༗ޮͰ͋Δ͕ɺཧతมԽͷΑ͏ ͳଞͷͷมԽʹ༗ޮͰͳ͍͜ͱ͕Θ͔ͬͨɻࠓޙͷݚڀͰɺෳ ͷγϑτΛಉ࣌ʹݚڀ͠ͳ͚ΕͳΒͳ͍ͱ͚͍݁ͮͯ·͢ɻ http://arxiv.org/abs/2006.16241v1
৽͍͠ϩόετωεࢦඪͷհ
ImageNet-R ࣮͚ͩͰͳ͘࡞ؚΉ
• DeepFassion Remixed దͳΧϝϥઃఆʢζʔϜɾαΠζɾϑΥʔΧεͳ ͲʣͰࡱӨͨ͠ෳͷҥྉͷϚϧνϥϕϧྨλ εΫͷͨΊͷσʔληοτɻ • SVSF • ετϦʔτϏϡʔͷళͷ֎؍Λళฮͷۀछ͝ͱʹ
ྨͨ͠σʔλ(ඇެ։)
طଘจͷ7ͭͷϩόετੑԾઆͷݕূ(Ұ෦ൈਮ): ʮDiverse Data AugmentationࣗવͳϩόετੑΛॿ͚ͳ͍આʯ DeepAugumentͨ͠σʔλͰݕূ → ImageNet-R͚ͩޮՌత
ᶌਆܦϓϩάϥϜʹ͓͚ΔڧྗͳҰൠԽͱޮੑ (ݪจ: Strong Generalization and Efficiency in Neural Programs) ຊݚڀͰɺχϡʔϥϧϓϩάϥϜ༠ಋͷΈͷதͰɺڧྗʹҰൠԽ͢Δޮత
ͳΞϧΰϦζϜΛֶश͢ΔΛݚڀ͍ͯ͠·͢ɻχϡʔϥϧϞσϧͷೖग़ྗΠϯ λϑΣʔεΛ৻ॏʹઃܭ͠ɺ฿͢Δ͜ͱͰɺҙͷೖྗαΠζʹରͯ͠ਖ਼͍݁͠ ՌΛग़͢ϞσϧΛֶश͠ɺڧྗͳҰൠԽΛ࣮ݱ͢Δɻ·ͨɺڧԽֶशΛར༻͢Δ͜ ͱͰɺϓϩάϥϜͷޮࢦඪΛ࠷దԽ͠ɺ฿Ͱ༻ͨ͠ڭࢣΛ͑Δ৽͍͠Ξϧ ΰϦζϜΛൃݟ͢Δ͜ͱ͕Ͱ͖Δɻ͜ΕʹΑΓɺզʑͷΞϓϩʔνɺιʔτɺॱ ং͖ϦετͰͷݕࡧɺNPશͳ0/1φοϓβοΫͰςετͨ͠Α͏ʹɺ༷ʑ ͳʹରͯ͠ΧελϜهड़͞ΕͨղΛ্ճΔੑೳΛֶश͢Δ͜ͱ͕Ͱ͖ɺχϡʔ ϥϧϓϩάϥϜ༠ಋͷͰ͖͢ϚΠϧετʔϯΛઃఆ͍ͯ͠ΔɻϋΠϥΠ τͱͯ͠ɺզʑͷֶशϞσϧɺզʑ͕ςετͨ͠ͲͷΑ͏ͳೖྗσʔλαΠζͰ ɺO(n log n)ͷෳࡶ͞ͰιʔτΛᘳʹ࣮ߦ͢Δ͜ͱ͕Ͱ͖ɺҰํͰɺֶशதʹ ݟΒΕͨϦεταΠζΛΔ͔ʹ͑ΔϦεταΠζͰɺΫΠοΫιʔτΛؚΉ ϋϯυίʔυԽ͞ΕͨΞϧΰϦζϜΛૢ࡞ճͰ্ճΔ͜ͱ͕Ͱ͖·ͨ͠ɻ http://arxiv.org/abs/2007.03629v2
a=f(s) ͷ f ΛҰൠԽֶͭͭ͠श͢Δ • ྫ) a=ιʔτ݁Ռσʔλɺs=ιʔτରσʔλ G T
B ֶश
Arxiv Sanity Top hype: Best10
ᶃ GShard: ͖݅ܭࢉͱࣗಈγϟʔσΟϯάΛ༻͍ͨڊେϞσϧͷεέʔϦ ϯά (ݪจ: GShard: Scaling Giant Models with
Conditional Computation and Automatic Sharding) χϡʔϥϧωοτϫʔΫͷεέʔϦϯάɺେͳྔͷֶशσʔλͱܭࢉྔΛ࣋ͭଟ͘ ͷ࣮ੈքͷػցֶशΞϓϦέʔγϣϯʹ͓͍ͯɺϞσϧͷ࣭Λ্ͤ͞ΔͨΊʹॏཁ ͳׂΛՌ͖ͨͯ͠·ͨ͠ɻ͜ͷΑ͏ͳεέʔϦϯάͷɺϞσϧ࣭Λ্ͤ͞ ΔͨΊͷ࣮֬ͳΞϓϩʔνͰ͋Δ͜ͱ͕֬ೝ͞Ε͍ͯ·͕͢ɺͦͷಓͷΓʹɺܭࢉί ετɺϓϩάϥϛϯάͷ༰қ͞ɺฒྻσόΠε্Ͱͷޮతͳ࣮ͳͲͷ՝͕͋Γ· ͢ɻGShardɺܰྔͳΞϊςʔγϣϯAPIͱXLAίϯύΠϥͷ֦ுػೳ͔ΒͳΔϞ δϡʔϧͰ͢ɻGShardɺطଘͷϞσϧίʔυΛ࠷খݶʹมߋ͢Δ͚ͩͰɺ෯͍ฒྻ ܭࢉύλʔϯΛදݱ͢ΔΤϨΨϯτͳํ๏Λఏڙ͠·͢ɻGShardɺࣗಈγϟʔσΟϯ άΛ༻͍ͯɺεύʔεϦʔɾήʔςουɾϛοΫενϟʔɾΦϒɾΤΩεύʔτʹΑΔ ଟݴޠχϡʔϥϧػց༁τϥϯεϑΥʔϚʔϞσϧΛ6000ԯݸͷύϥϝʔλΛ͑ͯ εέʔϧΞοϓ͢Δ͜ͱΛՄೳʹ͠·ͨ͠ɻ͜ͷΑ͏ͳڊେͳϞσϧΛ2048TPU v3Ξ ΫηϥϨʔλ্Ͱ4ؒͰޮతʹֶश͠ɺ100ݴޠ͔Βӳޠͷ༁ʹ͓͍ͯɺैདྷ ͷٕज़ͱൺֱͯ͠Δ͔ʹ༏Ε࣭ͨΛୡͰ͖Δ͜ͱΛ࣮ূ͠·ͨ͠ɻ http://arxiv.org/abs/2006.16668v1
ᶄᷖճతMCMCɿ౷ҰతͳϑϨʔϜϫʔΫ (ݪจ: Involutive MCMC: a Unifying Framework) Ϛϧίϑ࿈ϞϯςΧϧϩ๏ʢMCMCʣɺਪɺ౷߹ɺ࠷దԽɺγϛϡϨʔ γϣϯͳͲͷجຊతͳʹର͢ΔܭࢉΞϓϩʔνͰ͢ɻ͜ͷͰɺ͞· ͟·ͳΞϧΰϦζϜ͕։ൃ͞Ε͓ͯΓɺͦͷಈػɺద༻ํ๏ɺαϯϓϦϯάͷ
ޮੑͳͲ͕ҟͳΔɻͯ͢ͷҧ͍ʹ͔͔ΘΒͣɺͦΕΒͷଟ͘ಉ͡ίΞ ݪཧΛڞ༗͓ͯ͠ΓɺզʑͦΕΛInvolutive MCMC (iMCMC)ϑϨʔϜϫʔ Ϋͱͯ͠౷Ұ͍ͯ͠·͢ɻ͜Εʹج͍ͮͯɺզʑiMCMCͷ؍͔Βൣғͷ MCMCΞϧΰϦζϜΛهड़͠ɺ৽͍͠MCMCΞϧΰϦζϜΛ։ൃ͢ΔͨΊͷઃ ܭݪཧͱͯ͠༻Ͱ͖Δ͍͔ͭ͘ͷʮτϦοΫʯΛఆࣜԽ͢Δɻ͜ͷΑ͏ʹɺ iMCMCଟ͘ͷطͷMCMCΞϧΰϦζϜΛ౷ҰతʹݟΔ͜ͱ͕Ͱ͖ɺڧྗ ͳ֦ுػೳͷ։ൃΛ༰қʹ͠·͢ɻզʑɺطͷՄٯMCMCΞϧΰϦζϜΛ ΑΓޮతͳෆՄٯMCMCΞϧΰϦζϜʹม͢Δ2ͭͷྫΛ༻͍ͯɺޙऀΛ ࣮ূ͠·͢ɻ http://arxiv.org/abs/2006.16653v1
෮शɿMCMC? • Ϛϧίϑ࿈ • લͷঢ়ଶ͔Β࣍ͷঢ়ଶ͕ܾ·Δ • ϞϯςΧϧϩ๏ • ֬తΞϧΰϦζϜɻ •
ϥϯμϜʹσʔλΛूΊͨΒɺਖ਼͍͠ॴʹσʔ λ͕ଟ͘ͳΔɻ
https://qiita.com/kenmatsu4/items/55e78cc7a5ae2756f9da
iMCMC ͰఆࣜԽ͞Εͨ Trickྫ σʔλೱ͕ೱ͍ʢ͕֬ߴ͍ʣͱ͜Ζ͔Βബ ͍ʢ͍ʣͱ͜ΖͷҠಈΛڋ൱͢Δ αϯϓϧιʔε: https://github.com/necludov/iMCMC
ᶅ ݴޠϞσϧগਫ਼Ӷͷֶशऀ (ݪจ: Language Models are Few-Shot Learners) ࠷ۙͷݚڀͰɺେنͳςΩετͷίʔύεͰࣄલֶशΛߦͬͨޙɺಛఆͷλεΫͰඍௐΛߦ͏͜ͱͰɺ ଟ͘ͷNLPλεΫϕϯνϚʔΫͰେ͖ͳՌ͕ಘΒΕ͍ͯΔ͜ͱ͕࣮ূ͞Ε͍ͯΔɻΞʔΩςΫνϟతʹ
λεΫʹͱΒΘΕͳ͍ͷ͕ҰൠతͰ͕͢ɺ͜ͷํ๏Ͱઍ͔ΒສͷྫͷλεΫݻ༗ͷඍௐσʔληο τ͕ඞཁͱͳΓ·͢ɻରরతʹɺਓؒҰൠతʹɺ৽͍͠ݴޠλεΫΛݸͷྫ؆୯ͳ໋ྩ͔Β࣮ߦ͢Δ͜ ͱ͕Ͱ͖·͕͢ɺ͜ΕݱࡏͷNLPγεςϜͰ͍·ͩʹࠔͳ͜ͱͰ͢ɻ͜͜ͰզʑɺݴޠϞσϧΛε έʔϧΞοϓ͢Δ͜ͱͰɺλεΫʹґଘ͠ͳ͍ɺγϣοτͷੑೳΛେ෯ʹ্ͤ͞ɺ࣌ʹઌߦ͢Δ࠷ઌ ͷඍௐΞϓϩʔνʹඖఢ͢Δੑೳʹୡ͢Δ͜ͱΛ͍ࣔͯ͠Δɻ۩ମతʹɺ1,750ԯݸͷύϥϝʔλΛ࣋ͭ ࣗݾճؼతݴޠϞσϧͰ͋ΔGPT-3Λֶशͤ͞ɺͦͷੑೳΛγϣοτͷઃఆͰςετͨ͠ɻͯ͢ͷλεΫ ʹ͓͍ͯɺGPT-3ޯͷߋ৽ඍௐΛҰߦΘͣʹద༻͞ΕɺλεΫͱγϣοτͷσϞϞσϧͱͷς ΩετΠϯλϥΫγϣϯͷΈͰࢦఆ͞ΕͨɻGPT-3ɺ༁ɺ࣭ԠɺΫϩʔδϯάͳͲͷଟ͘ͷNLPσʔ ληοτʹՃ͑ͯɺ୯ޠͷεΫϥϯϒϧղআɺจதͷ৽͍͠୯ޠͷ༻ɺ3ܻͷԋࢉͳͲɺͦͷͰͷਪ ྖҬదԠΛඞཁͱ͢Δ͍͔ͭ͘ͷλεΫʹ͓͍ͯߴ͍ੑೳΛୡ͍ͯ͠Δɻಉ࣌ʹɺGPT-3ͷൃֶश͕ະ ͩʹۤઓ͍ͯ͠ΔσʔλɺେنͳΣϒίʔύε্Ͱͷֶशʹؔ࿈ͯ͠ํ๏తͳΛ๊͍͑ͯΔσʔ λΛ͍͔ͭ͘ڍ͛ͨɻ࠷ޙʹɺGPT-3ਓ͕ؒॻ͍ͨهࣄͱਓ͕ؒॻ͍ͨهࣄΛ۠ผ͢Δͷ͕͍͠χϡʔε هࣄͷαϯϓϧΛੜ͢Δ͜ͱ͕Ͱ͖Δ͜ͱΛൃݟͨ͠ɻຊݚڀͰɺ͜ͷൃݟͱGPT-3ͷҰൠతͳࣾձతӨ ڹʹ͍ͭͯٞ͢Δɻ http://arxiv.org/abs/2005.14165v4 ઌ݄ɾઌʑ݄ͱॏෳ
ᶆઐՈͱͯ͠ͷࣄ࣮ɻతͳࣝͷ্ʹదԠՄೳͰղऍՄೳͳਆܦه Ա (ݪจ: Facts as Experts: Adaptable and Interpretable Neural
Memory over Symbolic Knowledge) େنͳݴޠϞσϧݱͷNLPϞσϦϯάͷத֩Ͱ͋Γɺେͳྔͷৗ ࣝతͰࣄ࣮ʹج͍ͮͨใΛΤϯίʔυ͢Δ͜ͱ͕ࣔ͞Ε͍ͯ·͢ɻ͔͠ ͠ɺͦͷࣝϞσϧͷજࡏతͳύϥϝʔλͷதʹ͔͠ଘࡏͤͣɺݕࠪ ղऍʹΞΫηε͢Δ͜ͱͰ͖·ͤΜɻ·ͨɺύϥϝʔλͱͯ͠อଘ͞Ε ͨࣝɺඞવతʹݪࢿྉʹࡏ͢Δͯ͢ͷόΠΞεΛࣔ͢͜ͱʹͳ Δɻ͜ΕΒͷʹରॲ͢ΔͨΊʹɺզʑɺه߸తʹղऍՄೳͳࣄ࣮ ใͱαϒγϯϘϧతͳਆܦࣝͱͷؒͷ໌ࣔతͳΠϯλʔϑΣʔεΛؚΉ ਆܦݴޠϞσϧΛ։ൃͨ͠ɻ͜ͷϞσϧ͕ɺࣝूతͳ2ͭͷ࣭Ԡ λεΫͷύϑΥʔϚϯεΛܶతʹ্ͤ͞Δ͜ͱΛࣔ͢ɻ͞Βʹڵຯਂ͍ ͜ͱʹɺ͜ͷϞσϧɺͦͷه߸දݱΛૢ࡞͢Δ͜ͱͰɺ࠶܇࿅ͳ͠ʹߋ ৽͢Δ͜ͱ͕Ͱ͖Δɻಛʹ͜ͷϞσϧɺैདྷͷϞσϧͰෆՄೳͩͬͨ ৽͍͠ࣄ࣮ͷՃطଘͷࣄ࣮ͷ্ॻ͖ΛՄೳʹ͢Δɻ http://arxiv.org/abs/2007.00849v1
Facts as Experts Ϟσϧߏ
ᶇFathomNet. ւ༸୳ࠪɾൃݟͷͨΊͷਫதը૾τϨʔχϯάσʔλϕʔε (ݪจ: FathomNet: An underwater image training database for
ocean exploration and discovery) ԕִૢ࡞ंʢROVʣͦͷଞͷਫதࢿ࢈͔Βɺؒઍ࣌ؒʹٴͿւ༸ϏσΦσʔλ͕ऩ ू͞Ε͍ͯ·͢ɻ͔͠͠ɺݱࡏͷख࡞ۀʹΑΔղੳํ๏ͰɺROVେنͳੜଟ༷ੑղ ੳͷͨΊͷϦΞϧλΠϜΞϧΰϦζϜͷͨΊʹऩूͨ͠σʔλΛेʹ׆༻͢Δ͜ͱ͕Ͱ͖ ·ͤΜɻFathomNetɺ࠷৽ͷΠϯςϦδΣϯτͰࣗಈԽ͞Εͨਫதը૾ղੳͷ։ൃΛՃ ͢ΔͨΊʹ࠷దԽ͞Εͨɺ৽͍͠ϕʔεϥΠϯը૾τϨʔχϯάηοτͰ͢ɻࢲͨͪͷγʔ υσʔληοτɺ26,000࣌ؒҎ্ͷϏσΦςʔϓɺ680ສճͷΞϊςʔγϣϯɺ4,349ޠͷ ࣝϕʔε͔ΒͳΔɺઐՈʹΑΔऍ͖ͷܧଓతͳσʔλϕʔε͔Βߏ͞Ε͍ͯ·͢ɻ FathomNetɺ͜ͷσʔληοτΛ׆༻ͯ͠ɺػցֶशΞϧΰϦζϜͷ։ൃΛՄೳʹ͢Δͨ Ίʹɺਫதͷ֓೦ͷը૾ɺϩʔΧϦθʔγϣϯɺΫϥεϥϕϧΛఏڙ͍ͯ͠·͢ɻݱࡏ·Ͱ ʹɺதਫҬఈੜੜΛؚΉ233ͷҟͳΔΫϥεʹ͍ͭͯɺ80,000Ҏ্ͷը૾ͱ106,000Ҏ ্ͷϩʔΧϥΠθʔγϣϯ͕ఏڙ͞Ε͍ͯ·͢ɻզʑͷ࣮ݧͰɺऑڭࢣ͖ఆҐɺը૾ϥ ϕϦϯάɺମݕग़ɺྨͳͲͷΞϓϩʔνΛ༻͍ͯɺ༷ʑͳσΟʔϓϥʔχϯάΞϧΰϦζ ϜΛֶश͕ͤͨ͞ɺ͜ΕΒ༗Ͱ͋Δ͜ͱ͕Θ͔ͬͨɻ͜ͷ৽͍͠σʔληοτͰͷ༧ଌ ݁ՌྑͰ͕͋ͬͨɺ࠷ऴతʹւ༸୳ࠪͷͨΊͷΑΓେ͖ͳσʔληοτ͕ඞཁͰ͋Δ ͜ͱΛ͍ࣔͯ͠Δɻ IUUQTBSYJWPSHBCTW
None
None
ᶈBlazePose.ΦϯσόΠεͰϦΞϧλΠϜʹମͷϙʔζΛτ ϥοΩϯά (ݪจ: BlazePose: On-device Real-time Body Pose tracking) զʑɺϞόΠϧσόΠε্ͰͷϦΞϧλΠϜਪͷͨΊʹௐ͞Ε
ͨɺਓؒͷϙʔζਪఆͷͨΊͷܰྔͳΈࠐΈχϡʔϥϧωοτϫʔ ΫΞʔΩςΫνϟͰ͋ΔBlazePoseΛհ͠·͢ɻਪதɺ͜ͷωο τϫʔΫҰਓͷਓؒʹରͯ͠33ͷମΩʔϙΠϯτΛੜ͠ɺ Pixel 2ܞଳి্Ͱຖඵ30ϑϨʔϜҎ্ͷͰಈ࡞͠·͢ɻ͜ͷ ͨΊɺϑΟοτωετϥοΩϯάखೝࣝͷΑ͏ͳϦΞϧλΠϜͷ Ϣʔεέʔεʹಛʹద͍ͯ͠·͢ɻզʑͷओͳߩݙɺ৽͍͠ମτ ϥοΩϯάιϦϡʔγϣϯͱɺώʔτϚοϓͱΩʔϙΠϯτ࠲ඪͷ ճؼͷ྆ํΛ༻͢ΔܰྔͳମਪఆχϡʔϥϧωοτϫʔΫͰ͢ɻ http://arxiv.org/abs/2006.10204v1
None
None
طଘݚڀͱͷൺֱ BlazePose vs OpenPose
Ϟσϧߏʢখ͍͞ʂʣ
ᶉີͳςΩετݕࡧͷͨΊͷۙࣅ࠷ۙෛͷରরֶश (ݪจ: Approximate Nearest Neighbor Negative Contrastive Learning for Dense
Text Retrieval) ֶश͞ΕͨີͳදݱۭؒΛ༻͍ͯςΩετݕࡧΛߦ͏͜ͱɺεύʔεݕࡧʹൺͯ ଟ͘ͷڵຯਂ͍ར͕͋Δɻ͔͠͠ɺີͳֶशදݱۭؒͰͷςΩετݕࡧͷ༗ޮੑΛ ߴΊΔͨΊʹɺεύʔεݕࡧͱͷΈ߹Θ͕ͤඞཁͱͳΔ͜ͱ͕ଟ͍ɻຊจͰ ɺֶशϝΧχζϜʹϘτϧωοΫ͕͋Δ͜ͱΛ໌Β͔ʹ͠ɺֶशʹ༻͞ΕΔෛͷ Πϯελϯε͕ςετͰແؔͳจॻΛද͍ͯ͠ͳ͍͜ͱΛ໌Β͔ʹͨ͠ɻຊ จͰɺίʔύεͷۙࣅ࠷ۙʢANNʣΠϯσοΫε͔ΒωΨςΟϒΛߏங͠ɺֶ शϓϩηεͱฒߦͯ͠ߋ৽͢Δ͜ͱͰɺΑΓݱ࣮తͳωΨςΟϒֶशΠϯελϯεΛ બ͢ΔֶशϝΧχζϜͰ͋Δۙࣅ࠷ۙωΨςΟϒɾίϯτϥετਪఆʢANCEʣ ΛఏҊ͢Δɻ͜ΕʹΑΓɺDRͷֶशͱςετͰ༻͞ΕΔσʔλͷෆҰக͕ࠜ ຊతʹղܾ͞ΕΔɻզʑͷ࣮ݧͰɺANCEBERT-Siamese DRϞσϧΛϒʔετ ͠ɺڝ߹͢Δີͳݕࡧͱૄͳݕࡧͷͯ͢ͷϕʔεϥΠϯΛ্ճΔੑೳΛࣔͨ͠ɻ ANCEͰֶशͨ͠දݱۭؒʹ͓͚ΔυοτϓϩμΫτΛ༻͍ͨεύʔεݕࡧ͓Αͼ BERT࠶ϥϯΩϯάͷਫ਼ͱ΄΅Ұக͠ɺ΄΅100ഒͷߴԽΛ࣮ݱͨ͠ɻ http://arxiv.org/abs/2007.00808v1
None
ᶊ͍ϕΠζχϡʔϥϧωοτϫʔΫͷਖ਼֬ͳࣄޙ (ݪจ: Exact posterior distributions of wide Bayesian neural networks)
࠷ۙͷݚڀͰɺσΟʔϓϕΠδΞϯχϡʔϥϧωοτϫʔΫ(BNN)ʹΑͬ ͯ༠ى͞Εͨؔͷࣄલɺશͷ෯͕େ͖͘ͳΔʹͭΕͯΨεա ఔ(GP)ͷΑ͏ʹৼΔ͏͜ͱ͕ࣔ͞Ε͍ͯΔɻ͔͠͠ɺଟ͘ͷBNNΞϓϦ έʔγϣϯͰɺBNNۭؔؒͷࣄޙॲཧ͕ͱͳ͍ͬͯ·͢ɻNeal (1996)MatthewsΒ(2018)ͷΦϦδφϧͷݚڀͰࣄޙऩଋͷ͍͔ͭ͘ͷ ܦݧతূڌ͕ఏڙ͞Ε͍ͯ·͕͢ɺͦΕBNNࣄޙۙࣅͷਖ਼֬͞Λऔಘ͠ ͯݕূ͢Δ͜ͱͷѱ໊ߴ͍ࠔ͞ͷͨΊʹɺখ͞ͳσʔληοτΞʔΩ ςΫνϟʹݶఆ͞Ε͍ͯ·͢ɻզʑɺਖ਼֬ͳBNNࣄޙۙࣅ͕ɺࣄલ ͷGPۃݶʹΑͬͯ༠ى͞ΕΔͷʹ(ऑ͘)ऩଋ͢Δͱ͍͏ܽམͨ͠ཧత ূ໌Λఏڙ͢Δɻ࣮ূతݕূͷͨΊʹɺখ͞ͳσʔληοτ্ͷ༗ݶBNN ͔ΒڋઈαϯϓϦϯάΛ༻͍ͯਖ਼֬ͳαϯϓϧΛੜ͢Δํ๏Λࣔ͢ɻ http://arxiv.org/abs/2006.10541v1
d͕େ͖͘ͳΔ΄Ͳແݶͷ෯ݶքͷࣄલॲཧͱಉ͡ͷࣄޙॲཧʹऩଋ͢Δ͜ͱΛূ໌͠ ͨɻແݶ෯ݶքࣄޙॲཧͷܭࢉ͕༰қͰ͋ΕɺύϥϝʔλۭؒࣄޙॲཧͷධՁ͕ࠔͰ͋ͬ ͯɺۭؔؒਪ͕༰қʹͳΔಓ͕։͚Δɻ
ᶋֶश͞Εͨදݱͷઢܗࣝผੑʹ͍ͭͯ (ݪจ: On Linear Identifiability of Learned Representations) ࣝผՄೳੑ౷ܭϞσϧͷ·͍͠ಛੑͰ͋Γɺेͳܭࢉࢿݯͱσʔλ͕ ͋ΕɺਅͷϞσϧύϥϝʔλΛҙͷਫ਼ͰਪఆͰ͖Δ͜ͱΛҙຯ͠·͢ɻ
զʑදݱֶशͷจ຺ͰࣝผՄೳੑΛݚڀ͍ͯ͠·͢ɿԼྲྀͷλεΫʹؔͯ͠ ࠷దͳඇઢܗσʔλදݱΛൃݟ͢Δ͜ͱͰ͢ɻσΟʔϓχϡʔϥϧωοτϫʔ Ϋͱͯ͠ύϥϝʔλԽ͞Εͨ߹ɺͦͷΑ͏ͳදݱؔɺઃܭ্ύϥϝʔλ ͕աʹઃఆ͞Ε͍ͯΔͨΊɺҰൠతʹύϥϝʔλۭؒͰͷࣝผੑΛ͍͍ܽͯ ·͢ɻ͜ͷจͰɺ࠷ۙͷඇઢܗICAͷਐาʹج͍ͮͯɺେنͳࣝผϞσ ϧͷϑΝϛϦʔ͕ɺઢܗෆ֬ఆੑ·ͰۭؔؒͰ࣮ࡍʹࣝผՄೳͰ͋Δ͜ͱ Λࣔ͢͜ͱʹΑͬͯɺࣝผՄೳੑΛճ෮ͤ͞Δ͜ͱΛతͱ͍ͯ͠Δɻදݱֶ शͷͨΊͷଟ͘ͷϞσϧɺςΩετɺը૾ɺԻͳͲɺ͞·͟·ͳྖҬͰ͜ ͷҙຯͰࣝผՄೳͰ͋Γɺൃද࣌ʹ࠷ઌͷͷͰͨ͠ɻզʑɺઢܗࣝ ผՄೳੑͷͨΊͷेͳ݅Λಋग़͠ɺγϛϡϨʔτ͞Εͨσʔλͱ࣮ੈք ͷσʔλͷ྆ํͰ͜ͷ݁ՌΛ࣮ূతʹࢧ࣋͢Δɻ http://arxiv.org/abs/2007.00810v3
ࣝผՄೳੑʁ • ࣝผՄೳੑ౷ܭϞσϧͷ·͍͠ಛੑͰ͋ Γɺेͳܭࢉࢿݯͱσʔλ͕͋Εɺਅͷ ϞσϧύϥϝʔλΛҙͷਫ਼ͰਪఆͰ͖Δ ͜ͱɻ
ࣝผՄೳੑΛٻΊΒΕΔϞσϧ ͷ݅(ಡ·ͳ͍͍ͯ͘Ͱ͢)
ࣝผՄೳੑΛٻΊΒΕΔͱخ͠ ͍͜ͱ • ֶशͨ͠࠷దͳදݱ͕࠶ݱ͍͔͢͠Ͳ͏͔Λ ༧ଌ͢Δͷʹཱͭˠ࠷దͳ݁ՌΛ࠶ݱ͢Δͷ ʹ(ݪଇͱͯ͠)ֶ͚̍ͩश͢ΕΑ͘ͳΔɻ • ূ໌Մೳͳ࠷దͳ܇࿅ϞσϧϥΠϒϥϦʹஔ͖ ͑ΒΕΔΑ͏ʹͳΔɻ •
࣮ߦ͢ΔܭࢉϦιʔε͕ݮΔͷͰɺίετɾࢿ ݯͷݮʹͳΔɻ
ᶌ ਆܦϓϩάϥϜʹ͓͚ΔڧྗͳҰൠԽͱޮੑ (ݪจ: Strong Generalization and Efficiency in Neural Programs)
ຊݚڀͰɺχϡʔϥϧϓϩάϥϜ༠ಋͷΈͷதͰɺڧྗʹҰൠԽ͢Δޮత ͳΞϧΰϦζϜΛֶश͢ΔΛݚڀ͍ͯ͠·͢ɻχϡʔϥϧϞσϧͷೖग़ྗΠϯ λϑΣʔεΛ৻ॏʹઃܭ͠ɺ฿͢Δ͜ͱͰɺҙͷೖྗαΠζʹରͯ͠ਖ਼͍݁͠ ՌΛग़͢ϞσϧΛֶश͠ɺڧྗͳҰൠԽΛ࣮ݱ͢Δɻ·ͨɺڧԽֶशΛར༻͢Δ͜ ͱͰɺϓϩάϥϜͷޮࢦඪΛ࠷దԽ͠ɺ฿Ͱ༻͢ΔڭࢣΛ͑Δ৽͍͠Ξϧ ΰϦζϜΛൃݟ͠·͢ɻ͜ΕʹΑΓɺզʑͷΞϓϩʔνɺιʔτɺॱং͖Ϧε τͰͷݕࡧɺNPશͳ0/1φοϓβοΫͰςετͨ͠Α͏ʹɺ༷ʑͳʹର ͯ͠ΧελϜهड़͞ΕͨղΛ্ճΔੑೳΛֶश͢Δ͜ͱ͕Ͱ͖ɺχϡʔϥϧϓϩά ϥϜ༠ಋͷͰ͖͢ϚΠϧετʔϯΛઃఆ͍ͯ͠ΔɻϋΠϥΠτͱͯ͠ɺ զʑͷֶशϞσϧɺզʑ͕ςετͨ͠ͲͷΑ͏ͳೖྗσʔλαΠζͰɺ$O(n log n)$ͷෳࡶ͞ͰιʔτΛᘳʹ࣮ߦ͢Δ͜ͱ͕Ͱ͖ɺҰํͰɺֶशதʹݟΒΕͨ ϦεταΠζΛΔ͔ʹ͑ΔϦεταΠζͰɺΫΠοΫιʔτΛؚΉϋϯυ ίʔυԽ͞ΕͨΞϧΰϦζϜΛૢ࡞ճͰ্ճΔ͜ͱ͕Ͱ͖·ͨ͠ɻ http://arxiv.org/abs/2007.03629v2 5PQSFDFOU/Pͱ ॏෳ
My favorite
ᶈ PyTorch3DʹΑΔ3DσΟʔϓϥʔχϯάͷߴԽ (ݪจ: Accelerating 3D Deep Learning with PyTorch3D) σΟʔϓϥʔχϯάɺ2Dͷը૾ೝࣝΛେ෯ʹվળ͖ͯͨ͠ɻ3Dʹ֦ு͢Δ͜ͱͰɺࣗߦंɺԾ
ݱ࣮֦ுݱ࣮ɺ3DίϯςϯπͷΦʔαϦϯάɺ͞Βʹ2DೝࣝͷվળͳͲɺଟ͘ͷ৽͍͠ΞϓϦέʔ γϣϯ͕ਐల͢ΔՄೳੑ͕͋Γ·͢ɻ͔͠͠ɺؔ৺͕ߴ·͍ͬͯΔʹ͔͔ΘΒͣɺ3DσΟʔϓϥʔχϯ ά·ͩൺֱతະ։ͳঢ়ଶʹ͋Γ·͢ɻ͜ͷΑ͏ͳ֨ࠩɺҟछσʔλͷޮతͳॲཧɺάϥϑΟο Ϋεૢ࡞ΛࠩผԽ͢ΔͨΊͷϦϑϨʔϛϯάͳͲɺ3DσΟʔϓϥʔχϯάʹؔΘΔֶతͳ՝ʹىҼ ͍ͯ͠Δͱߟ͑ΒΕ͍ͯ·͢ɻզʑɺ3DσΟʔϓϥʔχϯάͷͨΊͷϞδϡʔϧԽ͞ΕͨޮతͰඍ ՄೳͳԋࢉࢠͷϥΠϒϥϦͰ͋ΔPyTorch3DΛಋೖ͢Δ͜ͱͰɺ͜ΕΒͷ՝ʹରॲ͍ͯ͠·͢ɻ͜ͷϥ ΠϒϥϦʹɺϝογϡͱ܈ͷͨΊͷߴͰϞδϡʔϧࣜͷඍՄೳͳϨϯμϥʔؚ͕·Ε͓ͯΓɺ߹ ʹΑΔղੳΞϓϩʔνΛՄೳʹ͍ͯ͠·͢ɻଞͷඍՄೳͳϨϯμϥʔͱൺֱͯ͠ɺPyTorch3DΑΓ ϞδϡʔϧԽ͞Ε͓ͯΓɺޮతͰ͋ΔͨΊɺϢʔβʔΑΓ؆୯ʹ֦ு͢Δ͜ͱ͕Ͱ͖ɺେ͖ͳϝο γϡը૾ͷεέʔϦϯά༰қͰ͢ɻPyTorch3D ͷԋࢉࢠͱϨϯμϥʔΛଞͷ࣮ͱൺֱ͠ɺͱ ϝϞϦͷେ෯ͳվળΛ࣮ূ͍ͯ͠·͢ɻ·ͨɺPyTorch3DΛ༻ͯ͠ɺShapeNet্ͷ2Dը૾͔Βͷڭ ࢣͳ͠3Dϝογϡͱ܈༧ଌͷͨΊͷ࠷৽ٕज़Λվળ͠·ͨ͠ɻPyTorch3DΦʔϓϯιʔεͰ͋Γɺ 3DσΟʔϓϥʔχϯάͷݚڀΛՃͤ͞ΔҰॿͱͳΔ͜ͱΛظ͍ͯ͠·͢ɻ http://arxiv.org/abs/2007.08501v1
2Dը૾͔Βͷ3DγϧΤοτ༧
None
https://pytorch3d.org/
PyTorch3Dͷओཁͳػೳ • ࡾ֯ܗϝογϡͷอଘͱૢ࡞ͷͨΊͷσʔλ ߏ • ࡾ֯ܗϝογϡͷޮతͳૢ࡞ʢࣹӨมɺ άϥϑΈࠐΈɺαϯϓϦϯάɺଛࣦؔ) • ඍՄೳͳϝογϡϨϯμϥʔ
PyTorch3DPyTorchͱ౷߹Ͱ͖ ΔΑ͏ʹͳ͍ͬͯΔɻ • ͯ͢ͷPyTorch3DΦϖϨʔλҎԼΛຬͨ͢ɻ • PyTorchςϯιϧΛ࣮ͬͯ͞Ε͍ͯΔɻ • ҟछσʔλͷϛχόονΛѻ͏͜ͱ͕Ͱ͖Δɻ • ඍՄೳɻ
• ΞΫηϥϨʔγϣϯʹGPUΛར༻Ͱ͖Δɻ
ҟछσʔλͷόονྫ
ެ։3DֶशϞσϧ Mesh R-CNN https://github.com/facebookresearch/meshrcnn
Ұؾʹ̏DσΟʔϓϥʔχϯ άͷݚڀ͕Ճͦ͠͏
Special Thanks
DeepL Translator (deepl.com) https://www.deepl.com/en/translator