半監督對抗式生成網絡實現多場域影像轉譯

多場域影像轉譯 (multi-domain image-to-image translation) 是將影像由一個場域(domain)轉譯到其他多個場域的研究。近年來，許多影像轉譯的研究已經能夠利用生成方式對抗網路(generative adversarial network)的方法，從具有場域標記的資料中，學習場域之間的關係，建立複雜的生成模型。然而，這類型的演算法的學習成效仰賴於大量的標記資料，所以建構這樣的模型需要花費很高的時間與成本。為了降低成本，本論文提出 SemiStarGAN，結合兩個半監督式學習技術: self ensembling 與 pseudo labeling，並提出名為 Y model 的新網絡參數共享方式, 將網絡中的判別器(discriminator) 與輔助分類器(auxiliary classifier) 的參數部分共享，以提升輔助分類器的泛化能力及穩定性。本論文設計了人臉特徵轉譯的實驗，比較 StarGAN 與 SemiStarGAN 在不同標記資料量下的生成表現。實驗結果證實了我們所提出來的方法，僅需較少的標記資料，即可達到與 StarGAN 同等的轉譯效果。

關鍵字

對抗式生成網絡；半監督式學習；多場域影像轉譯

並列摘要

Recent studies have shown significant advance for multi-domain image-to-image translation, and generative adversarial networks (GANs) are widely used to address this problem. However, existing methods all require a large number of domain-labeled images to train an effective image generator, but it may take time and effort to collect a large number of labeled data for real-world problems. In this thesis, we propose SemiStarGAN, a semi-supervised GAN network to tackle this issue. The proposed method utilizes unlabeled images by incorporating a novel discriminator/classifier network architecture Y model, and two existing semi-supervised learning techniques---pseudo labeling and self-ensembling. Experimental results on the CelebA dataset using domains of facial attributes show that the proposed method achieves comparable performance with state-of-the-art methods using considerably less labeled training images.

並列關鍵字

Generative Adversarial Network ； Semi-Supervised Learning ； Multi-Domain Image-to-Image Translation

參考文獻

[1] O. Chapelle, B. Schölkopf, and A. Zien. Semi-Supervised Learning. 2006.

Google Scholar

[2] Y. Choi, M. Choi, M. Kim, J.-W. Ha, S. Kim, and J. Choo. StarGAN: Unified gener- ative adversarial networks for multi-domain image-to-image translation. In CVPR, 2018.

Google Scholar

[3]Z.Dai,Z.Yang,F.Yang,W.W.Cohen,andR.Salakhutdinov.GoodSemi-supervised Learning that Requires a Bad GAN. In NIPS. 2017.

Google Scholar

[4] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.

Google Scholar

[5] L. A. Gatys, A. S. Ecker, and M. Bethge. Image style transfer using convolutional neural networks. In CVPR, June 2016.

Google Scholar

國際替代計量

半監督對抗式生成網絡實現多場域影像轉譯

全文下載

主題瀏覽