Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Deep Learning Image Manipulation
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Leszek Rybicki
May 18, 2017
Research
230
2
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Deep Learning Image Manipulation
Illustrated guide to some image manipulation methods, with demonstration.
Leszek Rybicki
May 18, 2017
More Decks by Leszek Rybicki
See All by Leszek Rybicki
Let's talk about Fakes
lunardog
0
170
How to Patch Image Classifiers
lunardog
0
2.7k
Towards Realistic Predictors - EN
lunardog
0
2.6k
Towards Realistic Predictors
lunardog
1
2.5k
Deep Learning Hot Dog Detector
lunardog
0
310
Finding beans in burgers: paper reading notes
lunardog
0
1.9k
Kelner: Serve Your Models
lunardog
0
150
Image Analysis at Cookpad
lunardog
1
2k
Kelner: serve your models
lunardog
1
430
Other Decks in Research
See All in Research
データセンター事業者を取り巻く近年の状況とその中での研究開発動向、テストベッドへの貢献の可能性
kikuzo
1
220
Using our influence and power for patient safety
helenbevan
0
360
2026-01-30-MandSL-textbook-jp-cos-lod
yegusa
1
1.4k
Data Visualization Tools in the Age of AI
flekschas
0
160
コーディングエージェントとABNを再考
hf149
2
730
Unified Audio Source Separation (Defense Slides)
kohei_1979
1
620
「AIとWhyを深堀る」をAIと深堀る
iflection
0
500
NLP colloquium: AI Safety Survey
kanekomasahiro
0
750
英語教育 “研究” のあり方:学術知とアウトリーチの緊張関係
terasawat
1
1k
Can We Teach Logical Reasoning to LLMs? – An Approach Using Synthetic Corpora (AAAI 2026 bridge keynote)
morishtr
1
260
計算情報学研究室(数理情報学第7研究室)2026
tomohirokoana
0
580
明日から使える!研究効率化ツール入門
matsui_528
13
7.4k
Featured
See All Featured
What does AI have to do with Human Rights?
axbom
PRO
1
2.2k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
47
8.2k
How to Ace a Technical Interview
jacobian
281
24k
Building Applications with DynamoDB
mza
96
7.1k
Designing for humans not robots
tammielis
254
26k
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
16
2k
How Fast Is Fast Enough? [PerfNow 2025]
tammyeverts
3
620
The Language of Interfaces
destraynor
162
27k
SEO Brein meetup: CTRL+C is not how to scale international SEO
lindahogenes
1
2.7k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
25
2k
Jess Joyce - The Pitfalls of Following Frameworks
techseoconnect
PRO
1
170
Speed Design
sergeychernyshev
33
1.9k
Transcript
%FFQ-FBSOJOH *NBHF.BOJQVMBUJPO BOJMMVTUSBUFEHVJEF .-,JUDIFO
"CPVUNF w -FT[FL3ZCJDLJ w HJUIVC!MVOBSEPH w CPSOJO1PMBOE w .-3FTFBSDIFSBU$PPLQBE w
*MJLFOBUUP
DBSFFST!DPPLQBEDPN 8BOUUPXPSLXJUIVT
$POWPMVUJPOBM "SJUINFUJD OCIKE
*NBHFTUPGFBUVSFT
$POWPMVUJPO http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html input output input output kernel
4USJEF http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px 2px 2px
1BEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px
4USJEF QBEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
5SBOTQPTFE http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html simulated here with padding also called “deconvolution” “fractional
stride”
%PXOTBNQMJOH features or small resolution image convolutional layer or layers
RGB image input output
6QTBNQMJOH upsampling CNN layer or layers RGB image features or
small resolution image input output
&ODPEFS%FDPEFS D E image in Decoder Encoder image out feature
space
'VMMZ$POOFDUFE $MBTTJpFS approve loan reject class data or features also
called “Dense” layer
$//$MBTTJpFS food person plant other AlexNet, LeNet, VGG…
'PPE/FU ™ food not food
@teenybiscuit
None
@teenybiscuit
@teenybiscuit
@teenybiscuit
@teenybiscuit
@teenybiscuit
(FOFSBUJWF "EWFSTBSJBM /FUXPSLT
Generator Discriminator https://speakerdeck.com/lunardog/deep-convolutional-voight-kampf-test “Couple of bots studying for the Turing
Test”
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434
Generator Discriminator G MPPLTMFHJU UPUBMMZTIPQQFE D
G SFBM GBLF D D(G(noise)) ˠ real (FOFSBUPSUSBJOJOH Discriminator acts
as the teacher
G SFBM GBLF D SFBM GBLF D D(G(noise)) ˠ fake
D(photo) ˠ real %JTDSJNJOBUPSUSBJOJOH Generator provides negative examples
None
https://www.youtube.com/watch?v=rs3aI7bACGc ©Yota Ishida
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434
$POEJUJPOBM ("/T
G NBMF GFNBMF DIJME FMEFSMZ G(noise | conditions) $POEJUJPOBM(FOFSBUPS
SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D $POEJUJPOBM%JTDSJNJOBUPS
SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D SJHIU XSPOH NBMF
GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D D
SJHIU XSPOH D $POEJUJPOBM("/ https://arxiv.org/abs/1411.1784 Conditional Generative Adversarial Nets Mehdi
Mirza, Simon Osindero (Submitted on 6 Nov 2014) Generator Discriminator NBMF GFNBMF DIJME FMEFSMZ G NBMF GFNBMF DIJME FMEFSMZ same condition
G NBMF GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME
FMEFSMZ D $POEJUJPOBM("/ Discriminator Generator
https://www.faceapp.com/ Disclaimer: FaceApp authors don’t disclose their method. This is
only my guess. It may have nothing to do with GANs. original
original https://www.faceapp.com/
https://www.faceapp.com/ original
"SUJTUJD4UZMF5SBOTGFS Improved!
https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://arxiv.org/abs/1603.08155 transformation network loss network Gram matrices in feature space
pre-trained content image style image
“Gram matrices in feature space” https://en.wikipedia.org/wiki/Gramian_matrix
https://www.youtube.com/watch?v=xVJwwWQlQ1o
$ZDMF("/
https://github.com/junyanz/CycleGAN
https://github.com/junyanz/CycleGAN
https://github.com/junyanz/CycleGAN
(FOFSBUPS transformation network https://arxiv.org/abs/1603.08155
GBLF IPSTF GBLF IPSTF … %JTDSJNJOBUPS fully convolutional judges patches
of the input image https://arxiv.org/abs/1603.08155
"EWFSTBSJBM-PTT X F G Y GBLF [FCSB GBLF [FCSB …
GBLF IPSTF GBLF IPSTF … X(F(horse)) ˠ classify as zebra Y(F(zebra)) ˠ classify as horse
$ZDMF-PTT G F G(F(image))ˠ the same image F G F(G(image))ˠ
the same image
https://www.youtube.com/watch?v=9reHvktowLY
5IF&OE