
Training Graph Neural Networks via Self-Supervised Learning: Experiments and Analysis

Advisor: 顏佐榕
Co-advisor: Chun-Yen Shen (沈俊嚴)

Abstract


As our understanding of machine learning has deepened, supervised learning, which trains models on large collections of labeled samples, has come to be applied across a wide variety of scenarios and tasks. However, for datasets in which only a fraction of the samples are labeled, how a machine can learn useful features from the data within limited time and resources has become a research problem worth studying. Self-supervised learning offers a possible solution. Unlike supervised learning, self-supervised learning requires little preliminary labeling work: given only a small number of labeled samples, the model can learn on its own and generate labels, achieving and sometimes even surpassing the results of supervised learning. To date, research on and applications of self-supervised learning have centered on computer vision and natural language processing, while its behavior on graph-structured data is still at an early, exploratory stage. In this thesis, we examine self-supervised learning models for graph-structured data. By experimenting with different methods and hyperparameters, we offer possible explanations for the results, including: deeper encoder architectures yield better results; increasing the hidden dimension on small and medium-sized datasets brings only limited gains in predictive performance; and different data augmentation methods and model types produce different effects on chemistry and bioinformatics datasets.
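The abstract does not name the specific self-supervised framework used, so the following is a minimal, hypothetical sketch of one common approach for graphs: GraphCL-style contrastive pretraining with PyTorch Geometric. Two stochastically augmented views of each graph are encoded by a GIN encoder, and a contrastive (NT-Xent) loss pulls the two views of the same graph together. All names, augmentations, and hyperparameters below are illustrative assumptions, not the thesis's code.

```python
# Hypothetical sketch of GraphCL-style contrastive pretraining for a GNN
# encoder (PyTorch Geometric). Every name and value here is an assumption
# for illustration; the thesis does not publish its implementation.
import torch
import torch.nn.functional as F
from torch_geometric.nn import GINConv, global_mean_pool
from torch_geometric.utils import dropout_edge

class GINEncoder(torch.nn.Module):
    def __init__(self, in_dim: int, hidden_dim: int, num_layers: int):
        super().__init__()
        self.convs = torch.nn.ModuleList()
        for i in range(num_layers):
            mlp = torch.nn.Sequential(
                torch.nn.Linear(in_dim if i == 0 else hidden_dim, hidden_dim),
                torch.nn.ReLU(),
                torch.nn.Linear(hidden_dim, hidden_dim),
            )
            self.convs.append(GINConv(mlp))

    def forward(self, x, edge_index, batch):
        for conv in self.convs:
            x = F.relu(conv(x, edge_index))
        return global_mean_pool(x, batch)  # one embedding per graph

def nt_xent(z1, z2, tau: float = 0.5):
    # Simplified NT-Xent: the two views of the same graph are positives
    # (the diagonal); all other pairs in the batch serve as negatives.
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / tau
    labels = torch.arange(z1.size(0), device=z1.device)
    return F.cross_entropy(logits, labels)

def pretrain_step(encoder, optimizer, data):
    # Two stochastic views via edge dropping; the thesis compares several
    # augmentation strategies, of which this is just one example.
    e1, _ = dropout_edge(data.edge_index, p=0.2)
    e2, _ = dropout_edge(data.edge_index, p=0.2)
    loss = nt_xent(encoder(data.x, e1, data.batch),
                   encoder(data.x, e2, data.batch))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```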

Parallel Abstract


Supervised learning is a popular model training method. However, its success relies on huge amounts of labeled data. Recent advances in self-supervised learning have provided researchers with a means to train models on data in which only a few labeled observations are required. Self-supervised learning is efficient because it can perform model training without requiring a large amount of preprocessed data. State-of-the-art self-supervised models can match, or even exceed, the performance of supervised models. Most studies on self-supervised learning have been conducted in the fields of computer vision and natural language processing; self-supervised learning on graph data, by comparison, is still nascent. In this thesis, we explored self-supervised learning for training graph neural networks (GNNs). We conducted experiments by training GNN models on four molecular and bioinformatics datasets under different experimental settings, and we provide possible explanations for the results. We found that models with a deeper encoder structure obtain superior results, whereas increasing the hidden dimension size on small or medium-sized datasets yields only marginal improvement. In addition, different data augmentation methods and different types of models yield different results on molecular and bioinformatics datasets.
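To make the experimental settings concrete, here is a small, hypothetical sketch of the kind of hyperparameter grid the abstract describes, in which encoder depth and hidden dimension are varied independently on each dataset. The dataset names and value ranges are assumptions for illustration; the abstract only states that four molecular and bioinformatics benchmarks were used.

```python
# Illustrative experiment grid: encoder depth and hidden dimension are
# swept per dataset. Dataset names and ranges are assumptions; the
# abstract does not list the exact benchmarks or values.
from itertools import product

DATASETS = ["MUTAG", "PROTEINS", "NCI1", "DD"]  # assumed molecular/bio sets
DEPTHS = [3, 5, 8, 12]                          # number of GNN layers
HIDDEN_DIMS = [32, 64, 128]                     # embedding width

configs = [
    {"dataset": d, "num_layers": k, "hidden_dim": h}
    for d, k, h in product(DATASETS, DEPTHS, HIDDEN_DIMS)
]
print(len(configs), "runs")  # 4 x 4 x 3 = 48 pretraining + evaluation runs
```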

