A Face Recognition System Based on Transfer Learning with Deep Convolutional Neural Networks

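In recent years, machine learning and deep learning have attracted considerable attention, particularly for classification tasks that use deep learning, such as data mining and face and speech recognition. The performance gains are due mainly to complex algorithms and architectures, and partly to good data. The main motivation of this thesis is to apply a convolutional neural network (CNN) to face recognition. The aim is to use transfer learning, that is, training a pre-trained model on new data, to obtain correct predictions and accurate classification results. The training dataset contains three categories with 200 images each. It is used to further train the modified pre-trained model, which is then tested on dataset images under different scenarios and gives accurate predictions in each scenario.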

Keywords

CNN; Transfer learning; Face recognition; AlexNet

Parallel Abstract

Machine learning, and deep learning in particular, has gained a lot of attention in recent years, especially for classification tasks such as text mining and face and speech recognition. The performance increase is mostly due to complex algorithms and architectures, and partly due to the use of good data sets. The main motivation of this thesis is to train a convolutional neural network (CNN) based system for face recognition that gives correct predictions and appreciable accuracy. By way of transfer learning, a pre-trained model can be tailored to different applications with new data. The objective is to differentiate 3 labeled categories, each with 200 images in the training dataset. The training data is used to modify the pre-trained model, which then classifies test images in different scenarios, and the predictions achieve high accuracy in each individual case.
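The transfer-learning idea described above (keep the pre-trained layers, train only a new output layer on the new data) can be illustrated with a minimal standard-library Python sketch. This is not the thesis implementation: the keywords name AlexNet, whereas here a small fixed random layer stands in for the pre-trained network, synthetic clusters stand in for face images, and all names and sizes other than the 3 categories and 200 images per category are hypothetical.

```python
import math
import random

NUM_CLASSES = 3         # three labeled categories, as stated in the abstract
IMAGES_PER_CLASS = 200  # 200 training images per category, as stated in the abstract
INPUT_DIM = 16          # hypothetical input size for the toy data
FEATURE_DIM = 8         # hypothetical width of the frozen feature layer


def make_frozen_layer(rng):
    """Fixed random weights standing in for pre-trained layers (assumption)."""
    scale = 1.0 / math.sqrt(INPUT_DIM)
    return [[rng.gauss(0, scale) for _ in range(INPUT_DIM)] for _ in range(FEATURE_DIM)]


def make_data(rng, n_per_class, centers):
    """Synthetic stand-in for images: one Gaussian cluster per category."""
    data = [([c + rng.gauss(0, 0.5) for c in center], label)
            for label, center in enumerate(centers)
            for _ in range(n_per_class)]
    rng.shuffle(data)
    return data


def extract_features(x, frozen):
    """Frozen layer: linear map followed by ReLU; its weights are never updated."""
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in frozen]


def softmax(z):
    m = max(z)  # subtract the maximum for numerical stability
    e = [math.exp(v - m) for v in z]
    total = sum(e)
    return [v / total for v in e]


def scores(head, feats):
    inputs = feats + [1.0]  # the constant 1.0 feeds the bias weight
    return [sum(w * v for w, v in zip(row, inputs)) for row in head]


def train_head(features, labels, epochs=30, lr=0.05):
    """Train only the new 3-way softmax head with SGD on cross-entropy loss."""
    head = [[0.0] * (FEATURE_DIM + 1) for _ in range(NUM_CLASSES)]
    for _ in range(epochs):
        for feats, y in zip(features, labels):
            inputs = feats + [1.0]
            probs = softmax(scores(head, feats))
            for k in range(NUM_CLASSES):
                grad = probs[k] - (1.0 if k == y else 0.0)
                for j in range(FEATURE_DIM + 1):
                    head[k][j] -= lr * grad * inputs[j]
    return head


def predict(head, feats):
    s = scores(head, feats)
    return s.index(max(s))


def run_demo(seed=0):
    """Build toy data, train the new head, and return test accuracy."""
    rng = random.Random(seed)
    frozen = make_frozen_layer(rng)
    centers = [[rng.gauss(0, 2) for _ in range(INPUT_DIM)] for _ in range(NUM_CLASSES)]
    train = make_data(rng, IMAGES_PER_CLASS, centers)
    test = make_data(rng, 50, centers)
    head = train_head([extract_features(x, frozen) for x, _ in train],
                      [y for _, y in train])
    correct = sum(predict(head, extract_features(x, frozen)) == y for x, y in test)
    return correct / len(test)


if __name__ == "__main__":
    print(f"test accuracy: {run_demo():.2f}")
```

Because only the small head is trained, a few hundred examples per category can suffice. With a real pre-trained network, the frozen layer would be replaced by that network's feature layers.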

