
Question Generation from Text Using Inference and Transformers

Advisor: 蘇豐文

Abstract


Question generation is a field of research that has grown in popularity over the years, as educators seek ways to make test generation easier with machine learning and artificial intelligence. In this thesis we study how automated end-to-end question generation with transformers can produce an understandable question by making an inference over sample paragraphs. The model is trained end to end: it attends to the context paragraph, builds an understanding of its sentences, and generates a question whose answer does not appear directly in the paragraph. We propose an inference approach that finds hidden sentences through discourse analysis and paraphrasing techniques built on fine-tuned transformers; the hidden or newly formed sentences are then fed into a model that generates questions from them. The Stanford parser is also used to obtain a clearer view of the parts of speech, focusing in particular on verbs, pronouns, and other key entities in each sentence. Experiments were conducted on context paragraphs from the SQuAD 1.1 dataset, where we transformed the original input paragraphs using the inference rules. After transforming all sentences, we fed the shortened paragraphs to the transformer model to generate questions that require a deeper level of understanding. The analysis shows that the model can generate deeper questions than the simpler ones of previous work, in which the answer can be found verbatim in the input paragraph. Paraphrasing and parsing the sentences to construct an inference over the paragraph appear to help the model generate at this deeper level.
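
To make the pipeline described above concrete, the sketch below (a minimal illustration, not the thesis's actual implementation) shows the two stages the abstract mentions: parsing each sentence for verbs, pronouns, and named entities with a Stanford-style parser, and generating a question from a transformed paragraph with a sequence-to-sequence transformer. The stanza library, the t5-base checkpoint, the "generate question:" prefix, and the helper names are assumptions made for illustration; the thesis's fine-tuned model and inference rules would take their place.

# Illustrative sketch only: the checkpoint, prompt prefix, and helper names
# below are assumptions, not the thesis's implementation.
import stanza
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Stanford NLP's neural pipeline, used here for POS tags and named entities.
stanza.download("en")
parser = stanza.Pipeline("en", processors="tokenize,pos,ner")

# Placeholder seq2seq transformer; in practice it would be fine-tuned on
# SQuAD 1.1-style (paragraph, question) pairs before asking useful questions.
tokenizer = AutoTokenizer.from_pretrained("t5-base")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-base")

def key_parts(paragraph):
    """Return the verbs, pronouns, and named entities of each sentence."""
    doc = parser(paragraph)
    parts = []
    for sent in doc.sentences:
        parts.append({
            "sentence": sent.text,
            "verbs": [w.text for w in sent.words if w.upos == "VERB"],
            "pronouns": [w.text for w in sent.words if w.upos == "PRON"],
            "entities": [e.text for e in sent.ents],
        })
    return parts

def generate_question(transformed_paragraph):
    """Generate one question about a (possibly inferred) paragraph."""
    inputs = tokenizer("generate question: " + transformed_paragraph,
                       return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, max_length=64, num_beams=4)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)

if __name__ == "__main__":
    context = ("Oxygen is released by plants during photosynthesis. "
               "It is then breathed in by animals.")
    print(key_parts(context))          # inputs to the inference/paraphrase step
    print(generate_question(context))  # question from the transformed paragraph

In this sketch the output of key_parts would feed the discourse-analysis and paraphrasing step that produces the hidden or shortened sentences, and those sentences, rather than the raw context, would then be passed to generate_question.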

