Deep latent generative models typically require a simple, tractable probability distribution as the prior over the latent variables. Recent studies, however, have found that the choice of prior can affect a generative model's capability. This work proposes a method for learning the latent prior from data, removing the need to manually specify a particular prior distribution, and applies it to the Prior Regularized Autoencoder. A code generator network is introduced to learn the prior, better capturing the characteristics of the data, and a training framework is proposed for jointly training the generative model and the prior. Experiments show that the proposed method effectively improves the quality of generated samples and performs well on representation learning and text-to-image translation tasks.
Most deep latent factor models adopt simple priors for simplicity, for tractability, or simply because no better prior is known. Recent studies show that the choice of prior can have a profound effect on the expressiveness of the model, especially when its generative network has limited capacity. In this paper, we propose to learn a proper prior from data for the Prior Regularized Autoencoder. We introduce the notion of a code generator that transforms a manually selected simple prior into one that better characterizes the data distribution. Experimental results show that the proposed model generates images of better quality and learns better disentangled representations than AAEs in both supervised and unsupervised settings. Lastly, we demonstrate its ability to perform cross-domain translation in a text-to-image synthesis task.
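To illustrate the core idea of a code generator, the sketch below is a minimal, hypothetical example (not the authors' exact architecture): a small MLP that warps samples drawn from a simple, manually chosen Gaussian prior into samples from a learned prior. In the actual model, the weights would be trained jointly with the autoencoder; here they are fixed random values for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical code-generator parameters: a 2-layer MLP mapping an
# 8-dim noise vector to an 8-dim latent code. In practice these weights
# would be learned jointly with the generative model.
W1 = rng.normal(scale=0.1, size=(8, 64))
b1 = np.zeros(64)
W2 = rng.normal(scale=0.1, size=(64, 8))
b2 = np.zeros(8)

def code_generator(z):
    """Transform samples from a simple prior into learned-prior samples."""
    h = np.maximum(z @ W1 + b1, 0.0)  # ReLU hidden layer
    return h @ W2 + b2                # transformed latent codes

z = rng.standard_normal((16, 8))  # simple manually selected prior N(0, I)
codes = code_generator(z)         # samples from the (learnable) prior
print(codes.shape)                # (16, 8)
```

The transformed codes would then serve as the prior samples that regularize the autoencoder's latent space, replacing draws from the fixed Gaussian.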