整合資料探勘方法應用於肝病輔助診斷

肝病，是台灣地區最主要的本土病。根據衛生署 2007年統計資料顯示，估計台灣每年約有一萬人以上死於肝炎、肝硬化、肝癌。因為肝病在初期症狀並不明顯，要等到病情相當嚴重時才會出現症狀，所以目前學者研究以建立一個輔助肝病診斷機制為主要議題之一。故本研究實際蒐集病患資料，使用多種資料探勘方法並整合專家們的意見，並運用基因演算法 (genetic algorithm, GA) 尋找最佳組合解的能力，建立一套最佳化的整合型肝病輔助診斷模式 (integrated liver diagnosis model, ILDM)。並且使用多元適應性雲形迴歸 (multivariate adaptive regression splines, MARS) 獲得較重要的肝病診斷變數，期望能建立更有效率的診斷模式。研究結果顯示，整合資料探勘方法所建構的診斷模型表現優於單一方法，且 GA 能減少建構所有診斷模型所要耗費的與成本，快速找到最佳的診斷模型。另外，經由 MARS 所建立的診斷模型，表現優於未篩選變數的模型。及所建立的肝病輔助診斷模式，可以降低因誤判所造成的延誤就醫的可能性，並且節省醫療成本，減少不必要的檢查。

關鍵字

肝病診斷；資料探勘；基因演算法

並列摘要

Liver disease is the most common local disease in Taiwan. According to the statistics from Department of Health in 2007, around ten thousand people die from liver cirrhosis, liver cancer and other liver diseases because the symptoms of liver disease are not obvious in the initial stage, and the condition is usually too serious to be treated when related symptoms make themselves felt. Developing an assisted liver disease diagnosis model has therefore become a major issue attracting growing attention from scholars and researchers. This study accordingly aims at constructing an optimal integrated liver disease diagnosis model (ILDM) by collecting patient data, using data mining techniques, integrating expert opinions, and utilizing genetic algorithm that is capable of finding best combination of diagnosis models. Moreover, MARS (multivariate adaptive regression splines) is adopted to obtain significant diagnosis variables, helping to construct a more efficient diagnosis system. As the results reveal, the integrated data mining techniques of diagnosis model outperforms the single data mining techniques of diagnosis model. Using GA helps reduce the time and cost spent on model construction and speed up the identification of the best combination of ILDM. In addition, the diagnosis model established by MARS outperforms the diagnosis model with no screening variables. ILDM can be expected to decrease the possibility of delays in medical treatment caused by wrong diagnosis and save medical costs by eliminating unnecessary inspections.

並列關鍵字

liver disease diagnosis ； data mining ； genetic algorithm

參考文獻

[18] 葉怡成、杜榮原，「以因子實驗法發掘支援向量機的重要變數與建構最小成本之診斷模型」，智慧科技與應用統計學報，第五卷，第二期，2007，第1-22頁。

[12] 林豐澤，「演化式計算上篇：基因演算法以及三種應用實例」，智慧科技與應用統計學報，第三卷，第一期，2005，第29-56頁。

[26] 蔡蕙如、柯明中、張偉斌、劉德明，「應用類神經網路與分類迴歸樹於肝癌分類模式」，北市醫學雜誌，第四卷，第八期，2007，第658-667頁。

[13] 張偉斌、吳振龍、紀櫻珍、黃育文與劉德明，「案例推理法增進乳癌診斷率」，北市醫學雜誌，第三卷，第十一期，2006，第78-84頁。

[9] 何子銘、盧瑜芬、許家瑋、白健佑、白璐、周雨青、孫建安、Thomas Wetter、林金定、楊燦、朱基銘，「運用三種資料探勘方法預測子宮頸癌存活情形之比較」，台灣家醫誌，第十六卷，第三期，2006，第192-203頁。

被引用紀錄

郭振達（2010）。多元適應性雲形迴歸於高價值顧客商品偏好之研究〔碩士論文，淡江大學〕。華藝線上圖書館。https://doi.org/10.6846/TKU.2010.00046

林義祥（2011）。運用健檢資料建構大腸癌預測模型〔碩士論文，朝陽科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0078-1511201110382712

蔣馥帆（2014）。應用資料探勘技術建構大腸直腸癌第二期病患存活之預測模式〔碩士論文，國立中正大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0033-2110201613593789

國際替代計量

整合資料探勘方法應用於肝病輔助診斷

未授權

主題瀏覽