基於訊息回應配對相似度估計的聊天記錄解構

一般而言，為建立Retrieval-based聊天機器人，我們可以從聊天紀錄中來建立所需的問答配對（Question-Answer Pair），然而問答配對並非完全連續地呈現在聊天紀錄中，不同內容的問答配對可能互相穿插，而從互相穿插的訊息中分離出不同子題的會話任務稱為對話解構（conversation disentanglement）。現有的對話解構研究大多透過計算兩個訊息的相似度來解決問題，過去研究將問題定義為判斷兩則Reddit訊息是否屬於相同主題的對話，但其所提出的模型對於未見過訊息的效能很差。實務上我們發現即使是使用者，在沒有上下文的情況下，要單從兩個給定訊息，判定其是否屬於相同會話是非常困難的。但若我們的目標是預測兩則訊息是否為回覆關係，則使用者判斷的一致性效能相對的較高。因此在本篇論文中，我們使用IRC與Reddit資料集進行實驗，並使用聊天記錄進行對話解構。利用Reddit回覆標記合成的資料集提供大量訓練資料建立模型，最後透過BERT模型在新定義的回覆關係預測上獲得良好的效能。

關鍵字

對話解構；回覆關係預測； BERT模型應用

並列摘要

To build a Retrieval-based dialog system, we can exploit conversation log to extract question-answer pairs. However, the question-answer pairs are hidden in the conversation log, interleaving each other. The conversation task that separates different sub-topics from the interspersed messages is called conversation disentanglement. In this paper, we examined the task of judging whether two Reddit messages belong to the same topic dialogue and found that the performance is worse if training and testing data are splitted by time. In practice, it is also a very hard task even for human beings as there are only two messages and no context. However, if our goal is to predict whether a message is a reply to the other, the problem becomes much easier to judge. By changing the way of data preparation, we are able to achieve better performance through DA-LSTM (Dual Attention LSTM) and BERT-based models in the newly defined Reply prediction task.

並列關鍵字

Chatlog Disentanglement ； Reply Relation Prediction ； BERT Neural Model

參考文獻

Allan, J. (2002). Introduction to Topic Detection and Tracking. In: Allan J. (eds) Topic Detection and Tracking. The Information Retrieval Series, vol 12. Springer, Boston, MA. doi: 10.1007/978-1-4615-0933-2_1

Aoki, P. M., Szymanski, M. H., Plurkowski, L., Thornton, J. D., Woodruff, A., & Yi, W. (2006). Where’s the party in multiparty?: Analyzing the structure of small-group sociable talk. In Proceedings of CSCW’06, 393-402. doi: 10.1145/1180875.1180934

Elsner, M. & Charniak, E. (2010). Disentangling chat. Computational Linguistics, 36(3), 389-409. doi: 10.1162/coli_a_00003

Jiang, J.-Y., Chen, F., Chen, Y.-Y., & Wang, W. (2018). Learning to Disentangle Interleaved Conversational Threads with a Siamese Hierarchical Network and Similarity Ranking. In Proceedings of NAACL ‘18. doi: 10.18653/v1/N18-1164

Pennington, J., Socher, R., & Manning, C. D. (2014). Glove: Global vectors for word representation. In Proceedings of EMNLP ’14, 1532-1543. doi: 10.3115/v1/D14-1162

國際替代計量

基於訊息回應配對相似度估計的聊天記錄解構

全文下載

主題瀏覽