Translating image styles with CycleGAN from Monet to photos

April 17, 2025

Deep Learning Computer Vision


Image created by author using GPT-4

This project implements a CycleGAN (Cycle-Consistent Generative Adversarial Network) to transform photographs into paintings that mimic Claude Monet’s artistic style. The implementation follows the architecture described in the paper “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks.”

Dataset

The project uses two image sets:

300 Monet paintings (256x256 pixels)
7,038 photographs (256x256 pixels)

Both datasets are provided in TFRecord format for efficient processing.

Technical Implementation

Data Preprocessing

Images undergo random jittering (resizing to 286x286 and random cropping to 256x256)
Random horizontal flipping for data augmentation
Normalization to the range [-1, 1]
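
The three preprocessing steps above can be sketched in a few lines of NumPy. This is a minimal illustration rather than the project's actual pipeline: the helper names are hypothetical, and the nearest-neighbor resize stands in for whatever interpolation the real input pipeline uses.

```python
import numpy as np

def resize_nearest(image, height, width):
    """Nearest-neighbor resize of an (H, W, C) array (a simplification)."""
    rows = np.arange(height) * image.shape[0] // height
    cols = np.arange(width) * image.shape[1] // width
    return image[rows][:, cols]

def random_jitter(image, rng, resize_to=286, crop_to=256):
    """Upscale, take a random crop, and flip horizontally half the time."""
    image = resize_nearest(image, resize_to, resize_to)
    top = rng.integers(0, resize_to - crop_to + 1)
    left = rng.integers(0, resize_to - crop_to + 1)
    image = image[top:top + crop_to, left:left + crop_to]
    if rng.random() < 0.5:
        image = image[:, ::-1]  # mirror left-right
    return image

def normalize(image):
    """Map uint8 pixel values from [0, 255] to floats in [-1, 1]."""
    return image.astype(np.float32) / 127.5 - 1.0

rng = np.random.default_rng(0)
photo = rng.integers(0, 256, size=(256, 256, 3), dtype=np.uint8)  # stand-in image
augmented = normalize(random_jitter(photo, rng))
print(augmented.shape, augmented.min(), augmented.max())
```

Resizing slightly above the target size before cropping means each epoch sees a slightly different framing of the same image, which matters when one domain has only 300 examples.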

Model Architecture

Generator: Uses a modified U-Net architecture with:
- Downsampling and upsampling paths joined by skip connections
- Instance normalization for style consistency
- Dropout layers for improved generalization
Discriminator: Outputs a 32x32 patch map that classifies each region of the input as real or fake, rather than a single real/fake score for the whole image
CycleGAN: Combines two generators and two discriminators:
- Monet-to-photo generator and discriminator
- Photo-to-Monet generator and discriminator
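
Two of the ideas above are easy to check numerically. The sketch below shows instance normalization without the learnable scale and offset that a real layer usually adds. It also shows one hypothetical stride configuration (an assumption, not taken from the project's code) that turns a 256x256 input into a 32x32 patch map when every convolution uses "same" padding.

```python
import numpy as np

def instance_norm(features, eps=1e-5):
    """Normalize each channel of each image over its spatial dimensions.

    features has shape (batch, height, width, channels).
    """
    mean = features.mean(axis=(1, 2), keepdims=True)
    var = features.var(axis=(1, 2), keepdims=True)
    return (features - mean) / np.sqrt(var + eps)

def same_conv_output(size, stride):
    """Spatial output size of a convolution with 'same' padding."""
    return -(-size // stride)  # ceiling division

rng = np.random.default_rng(0)
batch = rng.normal(loc=3.0, scale=2.0, size=(2, 8, 8, 4))
normalized = instance_norm(batch)
print(np.abs(normalized.mean(axis=(1, 2))).max())  # close to 0 for every image and channel

size = 256
for stride in (2, 2, 2, 1, 1):  # hypothetical discriminator strides
    size = same_conv_output(size, stride)
print(size)  # 32
```

Unlike batch normalization, the statistics here are computed per image, so the contrast of one sample never leaks into another.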

Loss Functions

The model combines four loss components:

Generator loss: Encourages generators to produce convincing images
Discriminator loss: Helps discriminators distinguish real from generated images
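Cycle consistency loss: Encourages $F(G(X)) \approx X$ and $G(F(Y)) \approx Y$, where $G: X \to Y$ and $F: Y \to X$ are the two mapping functions (the generators)
Identity loss: Stabilizes training by encouraging each generator to leave an image unchanged when it already belongs to the target domain, which helps preserve content and color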
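
The four components can be written down compactly. The sketch below makes several assumptions that the description above does not confirm: the adversarial terms use binary cross-entropy on raw patch logits (the original paper uses a least-squares objective instead), the cycle weight `LAMBDA` is 10 (the paper's default), and the 0.5 factors follow common reference implementations.

```python
import numpy as np

LAMBDA = 10.0  # assumed cycle-loss weight, not stated in this post

def bce_with_logits(logits, target):
    """Mean binary cross-entropy between raw logits and a constant 0/1 target."""
    # Numerically stable form: max(x, 0) - x * t + log(1 + exp(-|x|))
    return np.mean(np.maximum(logits, 0) - logits * target
                   + np.log1p(np.exp(-np.abs(logits))))

def discriminator_loss(real_logits, fake_logits):
    """Real patches should score 1, generated patches 0."""
    return 0.5 * (bce_with_logits(real_logits, 1.0)
                  + bce_with_logits(fake_logits, 0.0))

def generator_loss(fake_logits):
    """The generator wants its outputs to be scored as real (1)."""
    return bce_with_logits(fake_logits, 1.0)

def cycle_loss(real_image, cycled_image):
    """L1 distance between x and F(G(x)), scaled by LAMBDA."""
    return LAMBDA * np.mean(np.abs(real_image - cycled_image))

def identity_loss(real_image, same_image):
    """L1 distance between y and G(y), scaled by half of LAMBDA."""
    return 0.5 * LAMBDA * np.mean(np.abs(real_image - same_image))

rng = np.random.default_rng(0)
photo = rng.uniform(-1, 1, size=(1, 256, 256, 3))
cycled = photo + 0.1                     # pretend reconstruction, off by 0.1 everywhere
patch_logits = np.zeros((1, 32, 32, 1))  # an undecided discriminator
print(cycle_loss(photo, cycled))         # approximately 1.0
print(generator_loss(patch_logits))      # ln 2, about 0.693
```

Each generator's total objective is then the sum of its adversarial, cycle, and identity terms, while each discriminator is trained on its own loss alone.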

Training

Adam optimizer with learning rate 2e-4 and beta_1=0.5
The model learns bidirectional mappings simultaneously
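
To make the optimizer settings concrete, here is a single Adam update written out in NumPy with the learning rate and beta_1 given above. The values of `beta_2` and `eps` are assumed common defaults, and the weights and gradient are made up for illustration. In practice a framework optimizer would be used, typically one instance per generator and discriminator.

```python
import numpy as np

def adam_step(param, grad, state, lr=2e-4, beta_1=0.5, beta_2=0.999, eps=1e-7):
    """Apply one Adam update and return the new parameter value."""
    state["t"] += 1
    state["m"] = beta_1 * state["m"] + (1 - beta_1) * grad
    state["v"] = beta_2 * state["v"] + (1 - beta_2) * grad ** 2
    m_hat = state["m"] / (1 - beta_1 ** state["t"])  # bias-corrected first moment
    v_hat = state["v"] / (1 - beta_2 ** state["t"])  # bias-corrected second moment
    return param - lr * m_hat / (np.sqrt(v_hat) + eps)

weights = np.array([1.0, -2.0])
state = {"t": 0, "m": np.zeros_like(weights), "v": np.zeros_like(weights)}
grad = np.array([0.3, -0.8])  # made-up gradient
weights = adam_step(weights, grad, state)
print(weights)  # each weight moves by about lr = 2e-4 against its gradient's sign
```

Lowering beta_1 from the usual 0.9 to 0.5 shortens the momentum memory, a setting widely used to keep adversarial training from oscillating.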

Results

The final model successfully transforms photographs into images with characteristics of Monet’s painting style, including:

Softer edges
Impressionistic brush stroke effects
Modified color palettes typical of Monet’s work

The project demonstrates how deep learning techniques like GANs can be used for creative applications in art style transfer, providing an automated way to reimagine ordinary photographs as if painted by a master impressionist.


Filed under: Deep Learning, Computer Vision