Deep Learning Image Manipulation

%FFQ-FBSOJOH *NBHF.BOJQVMBUJPO BOJMMVTUSBUFEHVJEF .-,JUDIFO

"CPVUNF w -FT[FL3ZCJDLJ w HJUIVC!MVOBSEPH w CPSOJO1PMBOE w .-3FTFBSDIFSBU$PPLQBE w
*MJLFOBUUP

DBSFFST!DPPLQBEDPN 8BOUUPXPSLXJUIVT

$POWPMVUJPOBM "SJUINFUJD OCIKE

*NBHFTUPGFBUVSFT

$POWPMVUJPO http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html input output input output kernel

4USJEF http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px 2px 2px

1BEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px

4USJEF QBEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html

5SBOTQPTFE http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html simulated here with padding also called “deconvolution” “fractional
stride”

%PXOTBNQMJOH features or small resolution image convolutional layer or layers
RGB image input output

6QTBNQMJOH upsampling CNN layer or layers RGB image features or
small resolution image input output

&ODPEFS%FDPEFS D E image in Decoder Encoder image out feature
space

'VMMZ$POOFDUFE $MBTTJpFS approve loan reject class data or features also
called “Dense” layer

$//$MBTTJpFS food person plant other AlexNet, LeNet, VGG…

'PPE/FU ™ food not food

@teenybiscuit

(FOFSBUJWF "EWFSTBSJBM /FUXPSLT

Generator Discriminator https://speakerdeck.com/lunardog/deep-convolutional-voight-kampf-test “Couple of bots studying for the Turing
Test”

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434

Generator Discriminator G MPPLTMFHJU UPUBMMZTIPQQFE D

G SFBM GBLF D D(G(noise)) ˠ real (FOFSBUPSUSBJOJOH Discriminator acts
as the teacher

G SFBM GBLF D SFBM GBLF D D(G(noise)) ˠ fake
D(photo) ˠ real %JTDSJNJOBUPSUSBJOJOH Generator provides negative examples

https://www.youtube.com/watch?v=rs3aI7bACGc ©Yota Ishida

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434

$POEJUJPOBM ("/T

G NBMF GFNBMF DIJME FMEFSMZ G(noise | conditions) $POEJUJPOBM(FOFSBUPS

SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D $POEJUJPOBM%JTDSJNJOBUPS

SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D SJHIU XSPOH NBMF
GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D D

SJHIU XSPOH D $POEJUJPOBM("/ https://arxiv.org/abs/1411.1784 Conditional Generative Adversarial Nets Mehdi
Mirza, Simon Osindero (Submitted on 6 Nov 2014) Generator Discriminator NBMF GFNBMF DIJME FMEFSMZ G NBMF GFNBMF DIJME FMEFSMZ same condition

G NBMF GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME
FMEFSMZ D $POEJUJPOBM("/ Discriminator Generator

https://www.faceapp.com/ Disclaimer: FaceApp authors don’t disclose their method. This is
only my guess. It may have nothing to do with GANs. original

original https://www.faceapp.com/

https://www.faceapp.com/ original

"SUJTUJD4UZMF5SBOTGFS Improved!

https://prisma-ai.com/

https://prisma-ai.com/ https://prisma-ai.com/

https://arxiv.org/abs/1603.08155 transformation network loss network Gram matrices in feature space
pre-trained content image style image

“Gram matrices in feature space” https://en.wikipedia.org/wiki/Gramian_matrix

https://www.youtube.com/watch?v=xVJwwWQlQ1o

$ZDMF("/

https://github.com/junyanz/CycleGAN

(FOFSBUPS transformation network https://arxiv.org/abs/1603.08155

GBLF IPSTF GBLF IPSTF … %JTDSJNJOBUPS fully convolutional judges patches
  of the input image https://arxiv.org/abs/1603.08155

"EWFSTBSJBM-PTT X F G Y GBLF [FCSB GBLF [FCSB …
GBLF IPSTF GBLF IPSTF … X(F(horse)) ˠ classify as zebra Y(F(zebra)) ˠ classify as horse

$ZDMF-PTT G F G(F(image))ˠ the same image F G F(G(image))ˠ
the same image

https://www.youtube.com/watch?v=9reHvktowLY

5IF&OE

Deep Learning Image Manipulation

Deep Learning Image Manipulation

More Decks by Leszek Rybicki

Other Decks in Research

Featured

Transcript