
A Dimensionality Reduction Layer by Projection in a Convolutional Neural Network

Advisor: 陳素雲

Abstract


This study proposes a dimensionality reduction method that replaces pooling in a convolutional neural network. A pooling layer follows a convolutional layer and serves to reduce dimensionality. Methods such as max pooling and average pooling are currently in wide use; the proposed method instead uses a truncated orthogonal matrix to transform the output of the convolutional layer into a matrix of smaller dimension. We treat this truncated orthogonal matrix as a trainable parameter of the neural network and derive the corresponding derivatives that appear in the backpropagation algorithm. In addition, we implement the proposed method as a computer program to verify its feasibility, and we compare it with the pooling methods under conditions kept as similar as possible. In the experiments, our method shows better performance than the pooling methods.
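As a concrete reading of the transformation described above (a sketch only: the two-sided form and the symbols H, W, h, w, U, V are assumptions, since the abstract states only that a truncated orthogonal matrix is applied to the convolutional output):

\[
Y \;=\; U^{\top} X V,
\qquad X \in \mathbb{R}^{H \times W},\quad
U \in \mathbb{R}^{H \times h},\quad
V \in \mathbb{R}^{W \times w},
\qquad U^{\top}U = I_h,\quad V^{\top}V = I_w,
\]

with h < H and w < W, so an H × W feature map X is reduced to an h × w map Y, and U and V enter training like any other network parameters.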

Parallel Abstract (English)


In this research, we propose a dimensionality reduction method that takes the place of pooling. A pooling layer is usually placed after a convolutional layer to summarize the output images of the convolutional layer. At present, max pooling and average pooling are the methods most widely used in CNNs. Our proposed method instead transforms an output image of a convolutional layer into a lower-dimensional image by multiplication with truncated orthogonal matrices. We regard the truncated orthogonal matrices as parameters of the neural network and derive the derivatives that appear in the backpropagation algorithm. Moreover, we verify the feasibility of the proposed method by implementing it as a computer program, and we compare its performance with that of the pooling methods under conditions kept as similar as possible. In the experiments, the proposed method achieves better performance than the pooling methods.
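Building on the projection written out above, the following is a minimal PyTorch sketch of such a layer. It is a sketch under stated assumptions, not the author's implementation: the per-channel application, the class name ProjectionReduction, and all shapes are illustrative; orthogonality is imposed only at initialization; and ordinary automatic differentiation stands in for the derivatives derived in the thesis.

import torch
import torch.nn as nn

class ProjectionReduction(nn.Module):
    """Reduce each (H, W) feature map to (h, w) via Y = U^T X V, where
    U (H x h) and V (W x w) have orthonormal columns and are learned as
    ordinary network parameters (illustrative sketch, not the thesis code)."""

    def __init__(self, in_hw, out_hw):
        super().__init__()
        (H, W), (h, w) = in_hw, out_hw
        self.U = nn.Parameter(torch.empty(H, h))
        self.V = nn.Parameter(torch.empty(W, w))
        # Orthonormal columns at initialization: U^T U = I_h, V^T V = I_w.
        nn.init.orthogonal_(self.U)
        nn.init.orthogonal_(self.V)

    def forward(self, x):
        # x: (batch, channels, H, W) -> (batch, channels, h, w)
        return torch.einsum('bchw,hp,wq->bcpq', x, self.U, self.V)

# Example: stand in for a 2x2 pooling step that halves a 28x28 feature map.
layer = ProjectionReduction((28, 28), (14, 14))
x = torch.randn(8, 16, 28, 28)   # a batch of 16-channel feature maps
y = layer(x)                     # shape: (8, 16, 14, 14)

If the orthogonality constraint is meant to hold throughout training rather than only at initialization, the two parameters could, for example, be registered through torch.nn.utils.parametrizations.orthogonal; how the constraint is maintained during training is not stated in the abstract.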
