GAN example: Modifying random generator for different image shapes

I recognize that this question is less specific to PerceptiLabs, but since it has been a valuable entry point into ML for me, I’m hoping to get feedback within this community.

I have a collection of images that I’d like to use with the GAN example, but their shape is 28x28x3 as opposed to the 28x28x1 of MNIST. What is the best practice for modifying the generator so that the random components will work with this set of images?

Thank you!

Hello @markhirsch, what we basically do in the generator is generate an image of the required size from a random sample. In our GAN template, we start with a random tensor of size 100 and gradually upsample it through dense layers to a size of 784. Here 784 matches the flattened shape of the data we want to learn from (28 * 28 * 1), before reshaping.
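For example, a rough sketch of that idea in plain Keras (not the exact PerceptiLabs template; the intermediate layer sizes here are just placeholders) could look like:

```python
import tensorflow as tf
from tensorflow.keras import layers

# A 100-dimensional random noise vector is upsampled through dense layers
# to 784 values (28 * 28 * 1) and then reshaped into a 28x28x1 image.
generator = tf.keras.Sequential([
    tf.keras.Input(shape=(100,)),
    layers.Dense(256, activation="relu"),   # intermediate sizes are placeholders
    layers.Dense(512, activation="relu"),
    layers.Dense(784, activation="tanh"),   # 784 = 28 * 28 * 1
    layers.Reshape((28, 28, 1)),
])

noise = tf.random.normal((1, 100))
print(generator(noise).shape)  # (1, 28, 28, 1)
```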

Because you have images of shape 28 * 28 * 3, you need your generator to produce outputs of the same size, which are then reshaped with the Reshape layer. Since 28 * 28 * 3 = 2352, the generated output should have that many values. One way is to go in gradual steps from 100 to 2352, as in the sketch below. The 100 is also arbitrary: you can choose a random sample of any shape and size.
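The same sketch adapted to 28x28x3 only changes the final dense size and the reshape target (again, the intermediate sizes are placeholder choices, not the template's):

```python
import tensorflow as tf
from tensorflow.keras import layers

# Upsample the noise vector to 2352 values (28 * 28 * 3) and reshape
# into a 28x28 RGB image.
generator_rgb = tf.keras.Sequential([
    tf.keras.Input(shape=(100,)),            # noise size is arbitrary
    layers.Dense(512, activation="relu"),    # placeholder intermediate sizes
    layers.Dense(1024, activation="relu"),
    layers.Dense(2352, activation="tanh"),   # 2352 = 28 * 28 * 3
    layers.Reshape((28, 28, 3)),
])

noise = tf.random.normal((1, 100))
print(generator_rgb(noise).shape)  # (1, 28, 28, 3)
```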

Hope this helps you build the model successfully! Let us know if you have any more questions.

Regards,
Mukund


this is extremely helpful, thank you!!


Since you mention upsampling here, is there or will there be support for convolution transpose upsampling?

The Deconvolution component should do just that for you :slight_smile:
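In plain Keras terms that corresponds to a transposed convolution (Conv2DTranspose). A rough sketch of that style of upsampling (the filter counts and kernel sizes here are placeholders, not the component's defaults) might look like:

```python
import tensorflow as tf
from tensorflow.keras import layers

# Project the noise to a small 7x7 feature map, then double the spatial
# resolution twice with transposed convolutions to reach 28x28x3.
generator = tf.keras.Sequential([
    tf.keras.Input(shape=(100,)),
    layers.Dense(7 * 7 * 64, activation="relu"),
    layers.Reshape((7, 7, 64)),
    layers.Conv2DTranspose(32, kernel_size=4, strides=2, padding="same",
                           activation="relu"),   # 7x7 -> 14x14
    layers.Conv2DTranspose(3, kernel_size=4, strides=2, padding="same",
                           activation="tanh"),   # 14x14 -> 28x28, 3 channels
])

noise = tf.random.normal((1, 100))
print(generator(noise).shape)  # (1, 28, 28, 3)
```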

:man_facepalming: Ah, the “Deconvolution” cunningly labelled “Deconvolution” just to check whether I’m paying attention
