Constructing a Breast Cancer Prognosis Model Using Data Mining Methods

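In cancer epidemiology, data mining techniques can be used to predict patients' prognoses and to identify possible cancer-causing factors. Applying these techniques to diseases common in Taiwan would help improve treatment and prevention and thereby reduce medical costs. Breast cancer is the most common cancer among women worldwide; its mortality rate has been rising in recent years, and it is now the fourth leading cause of cancer death among women in Taiwan. This study uses data from a regional hospital in central Taiwan as its sample. After data collection and consolidation of the study variables, training and test sets were constructed with 5-fold cross-validation, and breast cancer prognosis models were built with four data mining methods: artificial neural network, decision tree, Bayesian classifier, and support vector machine. The four models were evaluated and compared using accuracy, sensitivity, specificity, and the area under the receiver operating characteristic (ROC) curve. The results show that the artificial neural network and the Bayesian classifier performed best, with accuracies of 95.93% and 94.41% and areas under the ROC curve of 0.894 and 0.911, respectively. The four models can be used to predict whether a breast cancer patient's final outcome is survival or death, providing survival predictions in clinical practice that may serve as a reference for physicians in treatment and prognosis assessment.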
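The 5-fold cross-validation step mentioned above can be sketched as follows. This is a minimal illustration only, not the authors' code: the function name `k_fold_indices`, the fixed seed, and the sample count of 23 are assumptions made for the example, and the abstract does not say whether the folds were stratified by outcome.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_idx, test_idx) pairs for k-fold cross-validation.

    Hypothetical helper: shuffles the sample indices once, deals them
    into k folds, and uses each fold exactly once as the test set.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal indices round-robin so fold sizes differ by at most one.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train_idx, test_idx


# Usage: every sample lands in the test set of exactly one fold.
for fold_no, (train_idx, test_idx) in enumerate(k_fold_indices(23, k=5), start=1):
    print(fold_no, len(train_idx), len(test_idx))
```

Each of the four classifiers would be trained on `train_idx` and scored on `test_idx` in every round, with the per-fold scores averaged afterwards.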

Keywords

Decision tree; artificial neural network; data mining; Bayesian classifier; support vector machine

Parallel Abstract

In cancer epidemiology, data mining can be used to predict patients' prognoses and to identify cancer-causing factors. Applying this technology to research on locally common diseases would not only improve treatment and prevention but also reduce medical costs. Breast cancer is the most common cancer among women worldwide, and its death rate has risen gradually in recent years; it now ranks fourth among causes of cancer death for women in Taiwan. This study used data from a regional hospital in central Taiwan as its sample. After collecting the data and consolidating the study variables, we used 5-fold cross-validation to build the training and test sets. We then constructed breast cancer prognosis models with an artificial neural network, a decision tree, a Bayes classifier, and a support vector machine (SVM), and assessed and compared the four models using accuracy, sensitivity, specificity, and the area under the ROC curve (AUC). The results show that the artificial neural network and the Bayes classifier performed better than the other methods, with accuracies of 95.93% and 94.41% and AUCs of 0.894 and 0.911, respectively. The four models can predict whether a breast cancer patient will survive. Clinically, they can provide survival predictions for breast cancer patients and give physicians suggestions on treatment and prevention.
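The four evaluation measures named in the abstract can be computed as in the sketch below. The labels and scores are made-up toy values, not study data; the function names, the 0.5 decision threshold, and the choice of label 1 as the positive class are assumptions for illustration. AUC is computed here as the probability that a randomly chosen positive case scores higher than a randomly chosen negative one.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, tn, fn) for binary labels."""
    tp = fp = tn = fn = 0
    for t, p in zip(y_true, y_pred):
        if p == positive:
            if t == positive:
                tp += 1
            else:
                fp += 1
        elif t == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def accuracy(tp, fp, tn, fn):
    return (tp + tn) / (tp + fp + tn + fn)


def sensitivity(tp, fn):
    # True positive rate: share of actual positives that were detected.
    return tp / (tp + fn)


def specificity(tn, fp):
    # True negative rate: share of actual negatives that were detected.
    return tn / (tn + fp)


def auc(y_true, scores, positive=1):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example (hypothetical values).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.7, 0.2]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
print(accuracy(tp, fp, tn, fn), sensitivity(tp, fn), specificity(tn, fp))
print(auc(y_true, scores))
```

Accuracy, sensitivity, and specificity depend on the chosen threshold, while AUC summarizes ranking quality across all thresholds. This may be why the abstract's best model by accuracy is not its best model by AUC.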

Parallel Keywords

SVM; Bayes classifier; decision tree; artificial neural network; data mining
