Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Deep Learning Image Manipulation
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Leszek Rybicki
May 18, 2017
Research
230
2
Share
Deep Learning Image Manipulation
Illustrated guide to some image manipulation methods, with demonstration.
Leszek Rybicki
May 18, 2017
More Decks by Leszek Rybicki
See All by Leszek Rybicki
Let's talk about Fakes
lunardog
0
170
How to Patch Image Classifiers
lunardog
0
2.7k
Towards Realistic Predictors - EN
lunardog
0
2.5k
Towards Realistic Predictors
lunardog
1
2.4k
Deep Learning Hot Dog Detector
lunardog
0
310
Finding beans in burgers: paper reading notes
lunardog
0
1.9k
Kelner: Serve Your Models
lunardog
0
150
Image Analysis at Cookpad
lunardog
1
1.9k
Kelner: serve your models
lunardog
1
430
Other Decks in Research
See All in Research
Unified Audio Source Separation (Defense Slides)
kohei_1979
1
610
進学校の生徒にはア行の苗字が多いのか
ozekinote
0
420
NLP colloquium: AI Safety Survey
kanekomasahiro
0
530
英語教育 “研究” のあり方:学術知とアウトリーチの緊張関係
terasawat
1
970
2026 東京科学大 情報通信系 研究室紹介 (すずかけ台)
icttitech
0
3.7k
AIエージェント時代のLLM-jpモデルのあるべき姿
k141303
0
430
LOSの検討(λ Kansai 2026 in Winter)
motopu
0
130
羽田新ルート運用6年の検証
1manken
0
160
AGI4OPT:自然言語から数理最適化を導くエ ージェントスキル Translating Human Intent into Mathematical Optimization
mickey_kubo
0
130
LLM Compute Infrastructure Overview
karakurist
2
1.4k
PGDM: Physically Guided Diffusion Model for L Downscaling
satai
0
240
Dual Quadric表現を用いた動的物体追跡とRGB-D・IMU制約の密結合によるオドメトリ推定
nanoshimarobot
0
400
Featured
See All Featured
Lightning talk: Run Django tests with GitHub Actions
sabderemane
0
190
JAMstack: Web Apps at Ludicrous Speed - All Things Open 2022
reverentgeek
1
460
How to make the Groovebox
asonas
2
2.2k
Rails Girls Zürich Keynote
gr2m
96
14k
Ecommerce SEO: The Keys for Success Now & Beyond - #SERPConf2024
aleyda
1
2k
The Art of Programming - Codeland 2020
erikaheidi
57
14k
Un-Boring Meetings
codingconduct
0
310
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
4.3k
Build The Right Thing And Hit Your Dates
maggiecrowley
39
3.2k
Building a A Zero-Code AI SEO Workflow
portentint
PRO
0
550
Practical Orchestrator
shlominoach
191
11k
Darren the Foodie - Storyboard
khoart
PRO
3
3.4k
Transcript
%FFQ-FBSOJOH *NBHF.BOJQVMBUJPO BOJMMVTUSBUFEHVJEF .-,JUDIFO
"CPVUNF w -FT[FL3ZCJDLJ w HJUIVC!MVOBSEPH w CPSOJO1PMBOE w .-3FTFBSDIFSBU$PPLQBE w
*MJLFOBUUP
DBSFFST!DPPLQBEDPN 8BOUUPXPSLXJUIVT
$POWPMVUJPOBM "SJUINFUJD OCIKE
*NBHFTUPGFBUVSFT
$POWPMVUJPO http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html input output input output kernel
4USJEF http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px 2px 2px
1BEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px
4USJEF QBEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
5SBOTQPTFE http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html simulated here with padding also called “deconvolution” “fractional
stride”
%PXOTBNQMJOH features or small resolution image convolutional layer or layers
RGB image input output
6QTBNQMJOH upsampling CNN layer or layers RGB image features or
small resolution image input output
&ODPEFS%FDPEFS D E image in Decoder Encoder image out feature
space
'VMMZ$POOFDUFE $MBTTJpFS approve loan reject class data or features also
called “Dense” layer
$//$MBTTJpFS food person plant other AlexNet, LeNet, VGG…
'PPE/FU ™ food not food
@teenybiscuit
None
@teenybiscuit
@teenybiscuit
@teenybiscuit
@teenybiscuit
@teenybiscuit
(FOFSBUJWF "EWFSTBSJBM /FUXPSLT
Generator Discriminator https://speakerdeck.com/lunardog/deep-convolutional-voight-kampf-test “Couple of bots studying for the Turing
Test”
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434
Generator Discriminator G MPPLTMFHJU UPUBMMZTIPQQFE D
G SFBM GBLF D D(G(noise)) ˠ real (FOFSBUPSUSBJOJOH Discriminator acts
as the teacher
G SFBM GBLF D SFBM GBLF D D(G(noise)) ˠ fake
D(photo) ˠ real %JTDSJNJOBUPSUSBJOJOH Generator provides negative examples
None
https://www.youtube.com/watch?v=rs3aI7bACGc ©Yota Ishida
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434
$POEJUJPOBM ("/T
G NBMF GFNBMF DIJME FMEFSMZ G(noise | conditions) $POEJUJPOBM(FOFSBUPS
SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D $POEJUJPOBM%JTDSJNJOBUPS
SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D SJHIU XSPOH NBMF
GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D D
SJHIU XSPOH D $POEJUJPOBM("/ https://arxiv.org/abs/1411.1784 Conditional Generative Adversarial Nets Mehdi
Mirza, Simon Osindero (Submitted on 6 Nov 2014) Generator Discriminator NBMF GFNBMF DIJME FMEFSMZ G NBMF GFNBMF DIJME FMEFSMZ same condition
G NBMF GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME
FMEFSMZ D $POEJUJPOBM("/ Discriminator Generator
https://www.faceapp.com/ Disclaimer: FaceApp authors don’t disclose their method. This is
only my guess. It may have nothing to do with GANs. original
original https://www.faceapp.com/
https://www.faceapp.com/ original
"SUJTUJD4UZMF5SBOTGFS Improved!
https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://arxiv.org/abs/1603.08155 transformation network loss network Gram matrices in feature space
pre-trained content image style image
“Gram matrices in feature space” https://en.wikipedia.org/wiki/Gramian_matrix
https://www.youtube.com/watch?v=xVJwwWQlQ1o
$ZDMF("/
https://github.com/junyanz/CycleGAN
https://github.com/junyanz/CycleGAN
https://github.com/junyanz/CycleGAN
(FOFSBUPS transformation network https://arxiv.org/abs/1603.08155
GBLF IPSTF GBLF IPSTF … %JTDSJNJOBUPS fully convolutional judges patches
of the input image https://arxiv.org/abs/1603.08155
"EWFSTBSJBM-PTT X F G Y GBLF [FCSB GBLF [FCSB …
GBLF IPSTF GBLF IPSTF … X(F(horse)) ˠ classify as zebra Y(F(zebra)) ˠ classify as horse
$ZDMF-PTT G F G(F(image))ˠ the same image F G F(G(image))ˠ
the same image
https://www.youtube.com/watch?v=9reHvktowLY
5IF&OE