Effects of Image Augmentation on Deep Learning Methods

Advisors: 蔣益庭, 張紘睿

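Abstract


Linearly mixing or combining multiple images is a frequently used image processing technique in computer vision. Mixup, a recently popularized linear operation, improves the performance of deep-learning-based models and also makes trained models more robust against adversarial attacks. Combined with its ease of use, this has drawn considerable attention to the method. However, the effects of such linear processing and the underlying mechanisms remain poorly understood.

In this study, we investigate the effect of linear operations on image classification tasks and suggest directions for future research. We apply several self-referential linear-mixing operations to images and use the processed images to evaluate the performance of deep-learning-based image classifiers under different mixing parameters. The contribution of this study is to establish a foundation for better understanding the underlying mechanisms of linear operations in computer vision.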

Parallel Abstract


Linearly mixing or combining multiple images is a frequently used image processing method in computer vision. Mixup, a kind of linear operation, has proven effective at improving the performance of deep-learning-based models and at increasing the robustness of trained models against adversarial attacks. However, the effects and the underlying mechanism of linear operations are poorly understood. In this study, we investigate the effect of linear operations on the task of image classification. We apply several self-referential linear-mixing operations to images and use the processed images to evaluate the performance of deep-learning-based image classifiers under different mixing parameters. The contribution of this study is to establish a foundation for better understanding the underlying mechanism of linear operations.
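
To make the notion of linear mixing concrete, the following is a minimal sketch in Python with NumPy, not the thesis's actual implementation. `mixup` forms the standard convex combination of two images and their one-hot labels with a mixing parameter `lam`. `self_mix_with_flip` is a hypothetical illustration of a self-referential operation, blending an image with its own horizontal flip. The abstract does not say which self-referential operations were studied, so that choice, the function names, the image size, and the Beta parameter are all assumptions.

```python
import numpy as np


def mixup(x1, y1, x2, y2, lam):
    """Convex combination of two images and their one-hot labels."""
    x = lam * x1 + (1.0 - lam) * x2
    y = lam * y1 + (1.0 - lam) * y2
    return x, y


def self_mix_with_flip(x, lam):
    """Hypothetical self-referential mix: blend an image (H, W, C)
    with its own horizontal flip. The flip is an assumed choice."""
    return lam * x + (1.0 - lam) * x[:, ::-1]


rng = np.random.default_rng(0)

# Random stand-ins for two 32x32 RGB images with values in [0, 1).
img_a = rng.random((32, 32, 3))
img_b = rng.random((32, 32, 3))
label_a = np.array([1.0, 0.0])  # one-hot label for class 0
label_b = np.array([0.0, 1.0])  # one-hot label for class 1

# Mixup draws the mixing parameter from Beta(alpha, alpha);
# alpha = 0.2 here is an assumed value for illustration.
lam = rng.beta(0.2, 0.2)

mixed_img, mixed_label = mixup(img_a, label_a, img_b, label_b, lam)
self_mixed = self_mix_with_flip(img_a, 0.5)
```

Sweeping `lam` over a grid such as 0.0 to 1.0 and classifying the resulting images would be one way to evaluate a classifier under different mixing parameters, in the spirit of the experiments described above.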
