

[Journal club] Scalable Diffusion Models with Transformers

More Decks by Semantic Machine Intelligence Lab., Keio Univ.


Transcript

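  1. Related work: U-Net is widely used as the backbone of diffusion models
    • U-Net's multi-scale skip connections → unnecessary use of computational resources
    Method | Overview
    DALL-E 2 [Ramesh+, 22] | Uses CLIP to align text and images
    Stable Diffusion [Rombach+, CVPR22] | Latent diffusion model
    Figures: U-Net [Ronneberger+, MICCAI]; Stable Diffusion [Rombach+, CVPR22]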
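  2. Proposed method: handling conditional inputs (conditioning)
    • In conditional diffusion models, additional information is supplied together with the noisy image (e.g. timestep, class label, natural language, etc.)
    • To process these conditional inputs, this work proposes the following four distinct designs:
      • In-context conditioning
      • Cross-attention block
      • Adaptive layer norm (adaLN) block
      • adaLN-Zero block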
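  3. Proposed method: adaLN-Zero block
    • Introduces the adaLN mechanism into the ViT self-attention block
    • The adaLN scale coefficients and the scale coefficient applied just before the residual connection are added as parameters → the conditioning information is reflected more strongly in the image
    • In the adaLN-Zero block these are initialized to zero → the block behaves close to the identity function in the early phase of training → training is stabilized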
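The zero-initialization idea on this slide can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the names `gamma`, `beta`, `alpha`, `zero_init`, and `adaln_zero_block` are assumptions introduced here, the `(1 + gamma)` scaling convention is one common choice, and `sublayer` stands in for the attention or MLP sub-layer.

```python
import math

def layer_norm(x, eps=1e-6):
    """Normalize one token vector to zero mean and unit variance (no learned affine)."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def linear(weight, bias, c):
    """Compute weight @ c + bias, with weight given as a list of rows."""
    return [sum(w * ci for w, ci in zip(row, c)) + b for row, b in zip(weight, bias)]

def zero_init(d, c_dim):
    """Zero-initialized weights for the layer that regresses (gamma, beta, alpha) from c."""
    return [[0.0] * c_dim for _ in range(3 * d)], [0.0] * (3 * d)

def adaln_zero_block(x, c, weight, bias, sublayer):
    """Return x + alpha * sublayer((1 + gamma) * LN(x) + beta).

    gamma, beta (adaLN scale and shift) and alpha (scale applied just before
    the residual connection) are all regressed from the conditioning vector c.
    """
    d = len(x)
    mod = linear(weight, bias, c)  # length 3 * d
    gamma, beta, alpha = mod[:d], mod[d:2 * d], mod[2 * d:]
    h = [(1.0 + g) * v + b for g, v, b in zip(gamma, layer_norm(x), beta)]
    h = sublayer(h)
    return [xi + a * hi for xi, a, hi in zip(x, alpha, h)]

# Hypothetical inputs: one 3-dimensional token and a 2-dimensional conditioning vector.
x = [0.5, -1.0, 2.0]
c = [1.0, 0.3]
weight, bias = zero_init(d=3, c_dim=2)
# With zero-initialized modulation, alpha is 0, so the block returns x unchanged.
out = adaln_zero_block(x, c, weight, bias, sublayer=lambda h: [2.0 * v for v in h])
```

Because `alpha` starts at zero, the residual branch contributes nothing at initialization, which is the "close to the identity function" behaviour described above. The block only starts to use the conditioning once training moves the weights away from zero.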
  4. Experimental setup
    • Dataset
      • Class-conditional ImageNet [Deng+, CVPR]
    • Architecture
      • As with ViT, four model sizes (S, B, L, XL) are prepared
      • Patch size p
      • DDPM sampling steps
    • Evaluation metrics
      • FID, sFID, IS, Precision, Recall
      • Gflops
    • Training
      • TPU pod, batch size
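As a rough illustration of why the patch size p matters in the setup above: in a ViT-style patchify step, a square input of side `image_size` is cut into p × p patches, so the sequence length is `(image_size / p)^2`. The function name and the input size of 32 below are assumptions chosen for illustration, not values taken from the slide.

```python
def num_tokens(image_size: int, patch_size: int) -> int:
    """Number of tokens produced by cutting a square input into square patches."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    return (image_size // patch_size) ** 2

# Hypothetical 32x32 input: halving the patch size quadruples the token count.
for p in (8, 4, 2):
    print(f"patch size {p}: {num_tokens(32, p)} tokens")
```

Since the transformer processes every token, a smaller p increases compute without changing the parameter count by much.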
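  5. Summary
    • Background
      • Progress in video generation with diffusion models (e.g. Sora)
      • Transformers have seen little use in diffusion models
    • Proposed method
      • Proposes the Diffusion Transformer (DiT), a transformer-based diffusion model
    • Results
      • DiT scales well: the larger the Gflops, the lower the FID → strong correlation between compute and output image quality
      • The DiT-XL model outperformed prior U-Net-based diffusion models on class-conditional ImageNet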
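  6. Appendix: Gflops
    • Flops: the number of floating-point operations
    • Gflops = 10^9 Flops
    • In image generation tasks, architecture complexity is commonly evaluated by parameter count
      • Parameter count does not account for image resolution at all, even though resolution strongly affects performance
      • It can therefore be an insufficient measure of model complexity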
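The point of this appendix can be checked with a small worked example: for a linear layer applied to every token, the parameter count is independent of the input resolution, while the Flops grow with the number of tokens. The sketch below assumes a multiply-add counts as 2 Flops and uses hypothetical sizes (width 384, patch size 2, inputs of side 32 and 64); none of these values come from the slide.

```python
def linear_params(d_in: int, d_out: int) -> int:
    """Weights plus biases of one linear layer."""
    return d_in * d_out + d_out

def linear_gflops(image_size: int, patch_size: int, d_in: int, d_out: int) -> float:
    """Gflops of one linear layer applied to every token (multiply-add = 2 Flops)."""
    tokens = (image_size // patch_size) ** 2
    flops = 2 * tokens * d_in * d_out
    return flops / 1e9

# Same layer, two hypothetical input sizes.
params = linear_params(384, 384)
small = linear_gflops(32, 2, 384, 384)
large = linear_gflops(64, 2, 384, 384)
print(f"params: {params}, Gflops at 32: {small:.4f}, Gflops at 64: {large:.4f}")
```

Doubling the input side quadruples the Gflops while `params` stays the same, which is why parameter count alone can understate model complexity.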