醫療編碼指為醫學敘述加上編碼,用以表示醫療診斷和處置。它的好處在於將自由形式的文字標準化,可應用於:健康追蹤、醫療決策、統計分析、保險費用估價等。其中,最常用的國際編碼以國際疾病統計分類(ICD)為大宗。現今在醫院內,有疾病分類師為病歷標上國際疾病統計分類;然而,國際疾病統計分類的類別眾多,即使是專業人員也不一定能正確標記。隨著電子病歷的普及和預測模型的發展,自動化國際疾病統計分類成為一個長遠的研究目標。 在這篇論文中,我們提出了一個方法結合醫學知識圖譜以幫助國際疾病統計分類。我們的核心想法是:病歷中重要的醫學文字應對分類結果造成較大的影響。我們提取病歷中的醫學概念,透過計算概念和國際疾病統計分類的相似度,藉此提高重要醫學文字在注意力機制中的權重。在知識圖譜的幫助下,實驗顯示我們提出的方法能幫助先前的模型更好地預測國際疾病統計分類。
Clinical coding refers to translate medical narratives to code representation for indicating medical diagnosis and procedures. Its advantage lies in standardizing free-text, which can be applied to health tracking, medical decision-making, statistical analysis, insurance pricing, etc. Among them, International Statistical Classification of Diseases and Related Health Problems (ICD) is the most commonly used. Nowadays, clinical coders label ICD in hospitals. However, there are so many categories in the ICD ontology that even professionals might not be able to label them correctly. With the development of electronic medical records and prediction models, automatic ICD coding has become a long-term research goal. In this paper, we propose a method to combine medical knowledge graph to improve ICD coding models. Our main idea is that important medical words in clinical texts should contribute more to the prediction results. We extract the medical concepts from the clinical texts. Then, we calculate the concept similarity between these concepts and ICD to increase the weights of important medical words in the attention mechanism. Experiments show that our approach helps the previous model better predict the ICD.