[파이썬] Keras 스타일 전송 알고리즘

07 Sep 2023

Keras

이 블로그 포스트에서는 Keras를 사용하여 스타일 전송 알고리즘을 구현하는 방법에 대해 알아보겠습니다. 스타일 전송은 주어진 이미지의 스타일을 다른 이미지에 적용하는 작업입니다. 이 알고리즘은 딥 러닝과 컴퓨터 비전 기술을 결합하여 흥미로운 시각 효과를 만들 수 있습니다.

스타일 전송 알고리즘이란?

스타일 전송 알고리즘은 두 가지 핵심 이미지를 사용하여 작동합니다. 첫 번째 이미지는 “스타일 이미지”로써 전달하고자 하는 스타일이 담겨 있습니다. 두 번째 이미지는 “콘텐츠 이미지”로써 스타일을 전송하고자 하는 대상입니다. 알고리즘은 콘텐츠 이미지의 내용을 유지한 채로 스타일 이미지의 스타일을 적용하여 새로운 이미지를 생성합니다.

필요한 패키지 설치

스타일 전송 알고리즘을 구현하기 위해 다음 패키지를 설치해야 합니다:

Keras: 딥 러닝 프레임워크로써 우리의 모델을 구축하기 위해 사용됩니다.
TensorFlow: Keras의 핵심 백엔드로써 딥 러닝 모델의 계산을 처리합니다.

pip install keras tensorflow

스타일 전송 알고리즘 구현

스타일 전송 알고리즘은 크게 세 가지 단계로 구성됩니다:

콘텐츠 이미지, 스타일 이미지, 그리고 생성할 이미지에 대한 입력 값을 준비합니다.
신경망 모델을 구축합니다. 이 모델은 콘텐츠 이미지, 스타일 이미지, 그리고 생성된 이미지의 손실을 계산하기 위해 사용됩니다.
이미지를 생성합니다. 초기화된 이미지에대해 경사하강법을 반복 적용하여 손실을 최소화하고 콘텐츠 이미지의 내용을 유지한 채로 스타일을 전달합니다.

다음은 해당 알고리즘을 구현하는 Python 코드의 예입니다:

# 필요한 패키지 임포트
from keras.applications import VGG19
from keras.preprocessing.image import load_img, img_to_array
from keras import backend as K

# 콘텐츠 이미지, 스타일 이미지, 생성할 이미지 경로
content_image_path = 'content.jpg'
style_image_path = 'style.jpg'
output_image_path = 'output.jpg'

# 이미지의 크기 정의
width, height = load_img(content_image_path).size
img_width = 400
img_height = int(width * img_width / height)

# 이미지 전처리 유틸 함수
def preprocess_image(image_path):
    img = load_img(image_path, target_size=(img_height, img_width))
    img = img_to_array(img)
    img = np.expand_dims(img, axis=0)
    img = vgg19.preprocess_input(img)
    return img

# 이미지의 역전 처리 유틸 함수
def deprocess_image(x):
    x = x.reshape((img_height, img_width, 3))
    x[:, :, 0] += 103.939
    x[:, :, 1] += 116.779
    x[:, :, 2] += 123.68
    x = x[:, :, ::-1]
    x = np.clip(x, 0, 255).astype('uint8')
    return x

# 콘텐츠 이미지, 스타일 이미지, 생성할 이미지 준비
content_image = K.variable(preprocess_image(content_image_path))
style_image = K.variable(preprocess_image(style_image_path))
generated_image = K.placeholder((1, img_height, img_width, 3))

# 입력 이미지 텐서를 단일 배치로 결합
input_tensor = K.concatenate([content_image, style_image, generated_image], axis=0)

# VGG19 모델 로드 (사전 훈련된 가중치와 함께)
model = VGG19(input_tensor=input_tensor, weights='imagenet', include_top=False)

# 손실 함수 정의
def content_loss(content, gen):
    return K.sum(K.square(gen - content))

def style_loss(style, gen):
    S = gram_matrix(style)
    G = gram_matrix(gen)
    channels = 3
    size = img_height * img_width
    return K.sum(K.square(S - G)) / (4. * (channels ** 2) * (size ** 2))

def total_variation_loss(x):
    a = K.square(x[:, :img_height-1, :img_width-1, :] - x[:, 1:, :img_width-1, :])
    b = K.square(x[:, :img_height-1, :img_width-1, :] - x[:, :img_height-1, 1:, :])
    return K.sum(K.pow(a + b, 1.25))

content_weight = 0.025
style_weight = 1.0
total_variation_weight = 1.0

# 전체 손실 정의
loss = K.variable(0.)
layer_features = outputs_dict['block5_conv2']
content_image_features = layer_features[0, :, :, :]
generated_image_features = layer_features[2, :, :, :]
loss += content_weight * content_loss(content_image_features, generated_image_features)

feature_layers = ['block1_conv1', 'block2_conv1', 'block3_conv1', 'block4_conv1', 'block5_conv1']
for layer_name in feature_layers:
    layer_features = outputs_dict[layer_name]
    style_features = layer_features[1, :, :, :]
    generated_image_features = layer_features[2, :, :, :]
    sl = style_loss(style_features, generated_image_features)
    loss += (style_weight / len(feature_layers)) * sl

loss += total_variation_weight * total_variation_loss(generated_image)

# 기울기 계산 및 경사하강법 단계 정의
grads = K.gradients(loss, generated_image)[0]
fetch_loss_and_grads = K.function([generated_image], [loss, grads])

# 경사하강법 반복
iterations = 20
for i in range(iterations):
    loss_value, gradient_values = fetch_loss_and_grads([generated_image_value])
    generated_image_value += step * gradient_values
    generated_image_value = np.clip(generated_image_value, 0, 255)

# 생성된 이미지 저장
img = deprocess_image(generated_image_value)
save_img(output_image_path, img)

이 코드를 실행하면 콘텐츠 이미지와 스타일 이미지를 사용하여 새로운 이미지를 생성하는 스타일 전송 알고리즘의 구현이 완료됩니다.

추가로, 콘텐츠 이미지, 스타일 이미지, 그리고 생성된 이미지의 경로를 적절히 변경하고 원하는 설정을 사용하여 알고리즘을 실행할 수 있습니다.