Real-Time Data-Efficient Portrait Stylization Via Geometric Alignment

14 Pages Posted: 27 Jan 2025

See all articles by Xinrui Wang

Xinrui Wang

affiliation not provided to SSRN

Zhuoru Li

Xiamen University

Xuanyu Yin

Meituan

Xiao Zhou

Hefei Normal University

Yusuke Iwasawa

affiliation not provided to SSRN

Yutaka Matsuo

affiliation not provided to SSRN

Jiaxian Guo

affiliation not provided to SSRN

Abstract

Portrait Stylization aims to imbue input face photos with vivid artistic effects drawn from style examples. Despite the availability of enormous training datasets and large network weights, existing methods struggle to maintain geometric consistency and achieve satisfactory stylization effects due to the disparity in facial feature distributions between facial photographs and stylized images, limiting the application on rare styles and mobile devices. To alleviate this, we propose to establish meaningful geometric correlations between portraits and style samples to simplify the stylization by aligning corresponding facial characteristics. Specifically, we integrate differentiable Thin-Plate-Spline (TPS) modules into an end-to-end Generative Adversarial Network (GAN) framework to improve the training efficiency and promote the consistency of facial identities. By leveraging inherent structural information of faces, e.g., facial landmarks, TPS module can establish geometric alignments between the two domains, at global and local scales, both in pixel and feature spaces, thereby overcoming the aforementioned challenges. Quantitative and qualitative comparisons on a range of portrait stylization tasks demonstrate that our models not only outperforms existing models in terms of fidelity and stylistic consistency, but also achieves remarkable improvements in 2× training data efficiency and 100× less computational complexity, allowing our small model to achieve real-time inference (30 FPS) at 512*512 resolution on mobile devices.

Keywords: Portrait Stylization, Style Transfer, Image-to-Image Translation, Generative Adversarial Networks

Suggested Citation

Wang, Xinrui and Li, Zhuoru and Yin, Xuanyu and Zhou, Xiao and Iwasawa, Yusuke and Matsuo, Yutaka and Guo, Jiaxian, Real-Time Data-Efficient Portrait Stylization Via Geometric Alignment. Available at SSRN: https://ssrn.com/abstract=5113384 or http://dx.doi.org/10.2139/ssrn.5113384

Xinrui Wang (Contact Author)

affiliation not provided to SSRN ( email )

Zhuoru Li

Xiamen University ( email )

Xiamen, 361005
China

Xuanyu Yin

Meituan ( email )

China

Xiao Zhou

Hefei Normal University ( email )

Hefei
China

Yusuke Iwasawa

affiliation not provided to SSRN ( email )

Yutaka Matsuo

affiliation not provided to SSRN ( email )

Jiaxian Guo

affiliation not provided to SSRN ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
49
Abstract Views
228
PlumX Metrics