基於卷積神經網路之即時人臉表情辨識

本文提出基於卷積神經網路(Convolution Neural Network, CNN)之即時人臉表情辨識系統，透過所提出之穩定度提升方法，以解決即時人臉表情辨識的不穩定問題。提高人臉表情辨識準確率的方式有許多種，例如：圖片預處理、辨識架構改變等無非都是要讓應用方面的效果更好。本文想解決攝影機在光照等影響下會造成不斷擷取畫面的某些時刻之圖片特徵改變，導致人臉表情在辨識中產生錯誤。由於攝影機的高速擷取影像，圖片與圖片之間時間間隔較小，因此，本文針對於改良LeNet卷積神經網路和Two Stream卷積神經網路架構辨識系統提出不同的方法，前者使用比重平均法，而後者使用統計法，使用提出之方法後對於即時人臉表情辨識整體穩定度及強健性均獲得提升。

關鍵字

深度學習；卷積神經網路；人臉表情辨識；影像處理

並列摘要

This thesis proposes a real-time facial expression recognition system based on Convolution Neural Network (CNN), solving the unstable problem of real-time facial expression recognition based on different convolutional neural network architectures according to different databases. There are many ways to improve the accuracy of facial expression recognition, such as image preprocessing, adjustment of network architecture, etc. The revamp of the training framework and image preprocessing allow better recognition results in applications. One existing problem is that when the camera captures images in high speed, changes in image characteristics may occur at certain moments due to the influence of light and other factors. Such changes inevitably result in incorrect recognition of the human facial expression. As an attempt to solve this problem, this thesis proposes several methods for improving the LeNet convolutional neural network and the Two Stream convolutional neural network architecture recognition system. The former uses the average weighting method, and the latter uses the statistical method. The overall robustness of real-time facial expression recognition is greatly improved by using the proposed method.

並列關鍵字

deep learning ； convolution neural network (CNN) ； facial expression recognition ； image processing

參考文獻

[1] J Searle, The Behavioral and Brain Sciences, in Minds Brains and Programs, vol. 3, 1980

Google Scholar

[2] https：//en.wikipedia.org/wiki/Turing_test

Google Scholar

[3] Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” in Proc. of the IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998.

Google Scholar

[4] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet Classification with Deep Convolutional Neural Networks,” in Proc. 25th Int. Conf. Neural Inf. Process. Syst., Lake Tahoe, Nevada, USA, Dec. 2012, pp. 1106-1114.

Google Scholar

[5] V. Nair and G. Hinton, “Rectified linear units improve restricted boltzmann machines,” in Proc. International Conference on Machine Learning, Haifa, Israel, June 2010, pp. 807-814.

Google Scholar

國際替代計量

基於卷積神經網路之即時人臉表情辨識

主題瀏覽