基於元學習的開集中文字元辨識

近來的基於深度神經網路的模型在中文手寫辨識已高於人類辨識率。然而訓練集與真實環境的特徵分佈和類別分佈存在差距，當這類模型面對與訓練集存在特徵差異的資料準確率會下降，且無法直接用於辨識未學過的類別。因此本研究的目的是提出一個能夠在不微調或不重新訓練的情況下，模型能夠辨識不在訓練集內的類別，且對特徵變化的敏感度下降。根據本研究的目的，我們透過訓練模型比較手寫字與印刷字相似性的方式，提出一個基於偽孿生網路架構的模型PSN-GC，透過給予新類別的印刷字範本，即可辨識不在訓練集中的類別。我們的方法相較過去研究提升了準確率，並降低記憶體用量與計算量。實驗使用多種測試集對PSN-GC做全面的評估，測試條件可被歸類為閉集與開集。為了更進一步測試PSN-GC的極限，我們也使用甲骨文作為訓練集與測試集，因甲骨文的筆畫變化較現代手寫中文更高。以上實驗顯示我們的模型略遜於專精於閉集條件，也就是對已知類別最佳化的方法；但是與開集方法相比，我們的模型得到更高的準確率，且對特徵敏感度較低。

關鍵字

深度學習；卷積神經網路；手寫中文辨識；甲骨文辨識

並列摘要

Recently, deep neural network-based models have achieved higher performance than humans in handwritten Chinese character recognition. However, the feature distribution and label distribution of real-world data are different from training sets. The recognition rates will drop when this type of model is evaluated on real-world data. Also, the models can not recognize unlearned categories without retraining or finetuning. Therefore, this study aims at proposing a model that can be applied to open-set and is less sensitive to feature changes. According to the purpose of this research, by training the model to compare the similarity between handwritten and printed characters, we propose a model PSN-GC based on the pseudo-Siamese network architecture. Our method improves accuracy and consumes less memory usage and computation than previous studies. The experiments use multiple testing sets to conduct a comprehensive evaluation of PSN-GC, including closed-set conditions and open-set conditions. In order to further test the limit of PSN-GC, we also use oracle bone inscriptions as the training set and testing set due to the stroke variation of oracle bone script higher than modern handwritten Chinese characters. Though our model is less accurate than the models optimized to learned categories under closed-set conditions, our model achieves higher accuracy and is less sensitive to feature changes under open-set conditions.

並列關鍵字

deep learning ； convolution neural network ； handwritten Chinese character recognition ； oracle bone inscription recognition

參考文獻

[1] N. Arica and F. T. Yarman-Vural, "An overview of character recognition focused on off-line handwriting," IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), vol. 31, no. 2, pp. 216-233, 2001-05-01 2001, doi: 10.1109/5326.941845.

Google Scholar

[2] X.-Y. Zhang, Y. Bengio, and C.-L. Liu, "Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark," Pattern Recognition, vol. 61, pp. 348-360, 2017-01-01 2017, doi: 10.1016/j.patcog.2016.08.005.

Google Scholar

[3] C.-L. Liu, F. Yin, D.-H. Wang, and Q.-F. Wang, "CASIA Online and Offline Chinese Handwriting Databases," in 2011 International Conference on Document Analysis and Recognition, 2011-09-01 2011: IEEE, doi: 10.1109/icdar.2011.17.

Google Scholar

[4] R. Dai, C. Liu, and B. Xiao, "Chinese character recognition: history, status and prospects," Frontiers of Computer Science in China, vol. 1, no. 2, pp. 126-136, 2007-05-01 2007, doi: 10.1007/s11704-007-0012-5.

Google Scholar

[5] F. Kimura, K. Takashina, S. Tsuruoka, and Y. Miyake, "Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-9, no. 1, pp. 149-153, 1987-01-01 1987, doi: 10.1109/tpami.1987.4767881.

Google Scholar

主題瀏覽