線性混合或組合多個圖像是計算機視覺中經常使用的圖像處理方法。而Mixup是近期崛起的一種線性運算方法,它能夠提高基於深度學習的模型的性能之外也提高訓練好的模型面對對抗性攻擊的有效性防禦能力,加上Mixup的方便性使得此方法獲得許多關注。但是,我們仍對線性處理後的效果和電腦背後隱藏的基本機制了解甚少。 在這項研究中,我們研究了線性運算對圖像分類任務的影響以及提供未來可研究方向。我們主要將幾種自引用線性混合運算應用於圖像處理,並使用這些圖像評估在不同混合參數下基於深度學習的圖像分類器的性能,這項研究的貢獻在於建立一個基礎,以幫助人們可以更好地理解線性運算在計算機視覺中的潛在機制。
Linearly mixing or combining multiple images is a frequently used image processing methods in computer vision. Mixup, which is a kind of linear operations, shows its effectiveness on improving the performance of deep-learning-based models and increasing the robustness of trained models against adversarial attacks. However, the effect and the underlying mechanism of linear operations are little understood. In this study, we investigate the effect of linear operations on the task of image classification. We apply several self-referential linear-mixing operations to process images, and use these images to evaluate the performance of deep-learning-based image classifiers under different mixing parameters. The contribution of this study is on establishing a foundation to better understand the underlying mechanism of linear operations.