
基於卷積神經網路之即時人臉表情辨識

Real-Time Facial Expression Recognition Based on Convolution Neural Network

Advisors: Chen-Chien Hsu (許陳鑑), Wei-Yen Wang (王偉彥)

Abstract


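This thesis proposes a real-time facial expression recognition system based on a convolutional neural network (CNN) and uses the proposed stability-enhancement methods to address the instability of real-time facial expression recognition. There are many ways to improve recognition accuracy, such as image preprocessing and changes to the recognition architecture, all of which aim at better results in applications. The problem addressed here is that, under the influence of lighting and other factors, the image features of continuously captured camera frames change at certain moments, which causes recognition errors. Because the camera captures images at high speed, the time interval between consecutive images is small. This thesis therefore proposes a different method for each of two recognition systems, one built on an improved LeNet convolutional neural network and one built on a Two Stream convolutional neural network architecture: the former uses a weighted average method and the latter uses a statistical method. With the proposed methods, both the overall stability and the robustness of real-time facial expression recognition are improved.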

Parallel Abstract (English)


This thesis proposes a real-time facial expression recognition system based on a convolutional neural network (CNN). It addresses the instability of real-time facial expression recognition for different convolutional neural network architectures trained on different databases. There are many ways to improve the accuracy of facial expression recognition, such as image preprocessing and adjustment of the network architecture, all of which aim at better recognition results in applications. One remaining problem is that, when the camera captures images at high speed, the image features may change at certain moments because of lighting and other factors. Such changes lead to incorrect recognition of the facial expression. Because consecutive frames are separated by only a short time interval, this thesis proposes a different method for each of two recognition systems, one built on an improved LeNet convolutional neural network and one built on a Two Stream convolutional neural network architecture. The former uses a weighted average method, and the latter uses a statistical method. The overall stability and robustness of real-time facial expression recognition are greatly improved by the proposed methods.
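The abstract does not state the window length, the weights, or which statistic is used, so the following is only a minimal sketch of how per-frame predictions could be stabilized over a short window of consecutive frames. The class names `WeightedAverageSmoother` and `MajorityVoteSmoother`, the window size of 5, the linearly increasing weights, and the reading of the "statistical method" as a majority vote are all assumptions made for illustration, not the thesis's actual implementation.

```python
from collections import Counter, deque

import numpy as np


class WeightedAverageSmoother:
    """Blend class probabilities of recent frames (hypothetical helper)."""

    def __init__(self, window=5):  # window size is an assumed value
        self.history = deque(maxlen=window)

    def update(self, probs):
        """Add one frame's class probabilities and return the smoothed label."""
        self.history.append(np.asarray(probs, dtype=float))
        # Assumed weighting: linearly increasing, so the newest frame counts most.
        weights = np.arange(1, len(self.history) + 1, dtype=float)
        weights /= weights.sum()
        blended = np.average(np.stack(list(self.history)), axis=0, weights=weights)
        return int(np.argmax(blended))


class MajorityVoteSmoother:
    """Report the most frequent label among recent frames (hypothetical helper)."""

    def __init__(self, window=5):  # window size is an assumed value
        self.history = deque(maxlen=window)

    def update(self, label):
        """Add one frame's predicted label and return the windowed majority."""
        self.history.append(label)
        return Counter(self.history).most_common(1)[0][0]
```

In this sketch, a single frame whose prediction flips because of a lighting change is outvoted by its neighbors, which is the kind of stability the abstract describes. A longer window would suppress more glitches at the cost of reacting more slowly to a genuine change of expression.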

