
Multi-Domain Image Translation with Semi-Supervised Generative Adversarial Networks

SemiStarGAN: Semi-Supervised Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

Advisor: 許永真

Abstract


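Multi-domain image-to-image translation is the task of translating an image from one domain into multiple other domains. In recent years, many image translation methods have used generative adversarial networks (GANs) to learn the relationships between domains from domain-labeled data and to build complex generative models. However, the effectiveness of these algorithms depends on large amounts of labeled data, so building such models is costly in both time and expense. To reduce this cost, this thesis proposes SemiStarGAN, which combines two semi-supervised learning techniques, self-ensembling and pseudo labeling, and introduces a new parameter-sharing scheme called the Y model, in which the discriminator and the auxiliary classifier partially share parameters to improve the generalization and stability of the auxiliary classifier. We design facial attribute translation experiments that compare the generation performance of StarGAN and SemiStarGAN under different amounts of labeled data. The results confirm that the proposed method achieves translation quality on par with StarGAN while requiring less labeled data.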
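The abstract names pseudo labeling without giving its details. As a rough illustration of the general idea only, the sketch below assigns a label to an unlabeled sample when a classifier's top predicted probability passes a confidence threshold. The function name `pseudo_label`, the threshold of 0.9, and the toy probabilities are assumptions made for this example and are not taken from the thesis.

```python
def pseudo_label(probs, threshold=0.9):
    """Pick confident predictions on unlabeled data and treat them as labels.

    probs: list of per-sample class-probability lists.
    threshold: hypothetical confidence cut-off (not a value from the thesis).
    Returns a list of (sample_index, predicted_class) pairs.
    """
    labeled = []
    for i, p in enumerate(probs):
        # Index of the most probable class for this sample.
        best = max(range(len(p)), key=p.__getitem__)
        # Keep the sample only if the classifier is confident enough.
        if p[best] >= threshold:
            labeled.append((i, best))
    return labeled


# Toy classifier outputs for three unlabeled images (made-up numbers).
unlabeled_probs = [
    [0.95, 0.03, 0.02],  # confident: class 0
    [0.40, 0.35, 0.25],  # not confident: skipped
    [0.05, 0.05, 0.90],  # confident: class 2
]
print(pseudo_label(unlabeled_probs))  # [(0, 0), (2, 2)]
```

In a semi-supervised setup, the selected pairs would typically be added to the labeled pool for the next training round, while the skipped samples stay unlabeled.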

Parallel Abstract


Recent studies have shown significant advances in multi-domain image-to-image translation, and generative adversarial networks (GANs) are widely used to address this problem. However, existing methods require a large number of domain-labeled images to train an effective image generator, and collecting that much labeled data for real-world problems takes considerable time and effort. In this thesis, we propose SemiStarGAN, a semi-supervised GAN that tackles this issue. The proposed method exploits unlabeled images by combining a novel discriminator/classifier network architecture, the Y model, with two existing semi-supervised learning techniques: pseudo labeling and self-ensembling. Experimental results on the CelebA dataset, using facial attributes as domains, show that the proposed method achieves performance comparable to state-of-the-art methods with considerably fewer labeled training images.
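Self-ensembling is likewise only named in the abstract. One common form, temporal ensembling, keeps an exponential moving average of each sample's past predictions and penalizes the current prediction for drifting away from it. The sketch below illustrates that form under stated assumptions: the class name `TemporalEnsemble`, the helper `consistency_loss`, and the decay `alpha` are hypothetical, and the thesis may use a different variant (for example, averaging weights rather than predictions).

```python
class TemporalEnsemble:
    """Exponential moving average of per-sample predictions (illustrative only)."""

    def __init__(self, num_samples, num_classes, alpha=0.6):
        self.alpha = alpha  # hypothetical decay rate, not from the thesis
        self.Z = [[0.0] * num_classes for _ in range(num_samples)]
        self.epoch = 0

    def update(self, predictions):
        """Fold this epoch's predictions into the average and return the targets."""
        self.epoch += 1
        # Bias correction: Z starts at zero, so early averages are scaled up.
        correction = 1.0 - self.alpha ** self.epoch
        targets = []
        for i, z in enumerate(predictions):
            self.Z[i] = [
                self.alpha * old + (1.0 - self.alpha) * new
                for old, new in zip(self.Z[i], z)
            ]
            targets.append([v / correction for v in self.Z[i]])
        return targets


def consistency_loss(pred, target):
    """Mean squared difference between a prediction and its ensemble target."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


# One unlabeled sample, two classes, predictions from two successive epochs.
ens = TemporalEnsemble(num_samples=1, num_classes=2, alpha=0.5)
first = ens.update([[1.0, 0.0]])
second = ens.update([[0.0, 1.0]])
print(first, second)
print(consistency_loss([0.0, 1.0], second[0]))
```

Because the target needs no ground-truth label, a loss of this kind can be computed on unlabeled images, which is what makes the technique useful when labeled data is scarce.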

