Transforming the Latent Space of StyleGAN for Real Face Editing

Li, Heyi; Liu, Jinlong; Zhang, Xinyu; Bai, Yunzhi; Wang, Huayan; Mueller, Klaus

Computer Science > Computer Vision and Pattern Recognition

arXiv:2105.14230 (cs)

[Submitted on 29 May 2021 (v1), last revised 21 Jul 2022 (this version, v2)]

Title:Transforming the Latent Space of StyleGAN for Real Face Editing

Authors:Heyi Li, Jinlong Liu, Xinyu Zhang, Yunzhi Bai, Huayan Wang, Klaus Mueller

View PDF

Abstract:Despite recent advances in semantic manipulation using StyleGAN, semantic editing of real faces remains challenging. The gap between the $W$ space and the $W$+ space demands an undesirable trade-off between reconstruction quality and editing quality. To solve this problem, we propose to expand the latent space by replacing fully-connected layers in the StyleGAN's mapping network with attention-based transformers. This simple and effective technique integrates the aforementioned two spaces and transforms them into one new latent space called $W$++. Our modified StyleGAN maintains the state-of-the-art generation quality of the original StyleGAN with moderately better diversity. But more importantly, the proposed $W$++ space achieves superior performance in both reconstruction quality and editing quality. Despite these significant advantages, our $W$++ space supports existing inversion algorithms and editing methods with only negligible modifications thanks to its structural similarity with the $W/W$+ space. Extensive experiments on the FFHQ dataset prove that our proposed $W$++ space is evidently more preferable than the previous $W/W$+ space for real face editing. The code is publicly available for research purposes at this https URL.

Comments:	28 pages, 15 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2105.14230 [cs.CV]
	(or arXiv:2105.14230v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2105.14230

Submission history

From: Heyi Li [view email]
[v1] Sat, 29 May 2021 06:42:23 UTC (18,012 KB)
[v2] Thu, 21 Jul 2022 15:59:06 UTC (18,198 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Transforming the Latent Space of StyleGAN for Real Face Editing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Transforming the Latent Space of StyleGAN for Real Face Editing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators