透過您的圖書館登入
IP:3.141.202.187
  • 學位論文

以資料探勘與模糊邏輯技術建置乳癌疾病診斷系統

A Breast Cancer Diagnosis System Using Data Mining and Fuzzy Logic Techniques

指導教授 : 劉振隆

摘要


現今,人們的生活型態以及飲食習慣大幅改變,因而引起一些疾病,導致身體亮紅燈。也因為這些病因,人們開始注重身體的健康保健,來降低各種疾病發生的機率。本研究是以資料探勘分類技術中的決策樹理論,應用於乳癌之預測。經分析後可驗證哪些屬性變數會引起乳癌。本研究採用UCI資料庫之Breast Cancer Wisconsin (Original)總共有699筆樣本資料,刪除16筆遺漏值,共採用683筆資料。因此,本研究採用資料探勘Weka軟體中的方法,包含J48、NB Tree、Naïve Bayes、Bayes Net、Multi-Layer Perceptron等演算法,分別比較所檢測出來的準確度結果,並且利用混淆矩陣做分類分析的比較。另外,運用統計分析工具來檢測屬性變數重要程度之結果,來探討乳癌相關的病因。進一步地,使用由Java語言構成且執行效率良好的JFuzzyLogic模糊邏輯推論系統開發工具,運用規則決策分析相關之理論,針對乳癌來發展一套風險評估系統。藉由相關知識參考並使用C++語言撰寫規則方法,結合開發工具中所提供的推論機制應用,進而提供使用者一個決策及評估參考的依據。本研究結果可以輔助民眾做簡易的乳癌預測,並亦可提供給醫師做診斷時之分析參考。

並列摘要


People nowadays have changed a lot in terms of life style and dietary habits, which tend to cause health problems and make the body sick. Not until this happens are people really willing to start paying attention to health issues more, trying to reduce the chance of falling ill. In this study, we apply decision tree theory of data mining in the prediction of breast cancer. After the analysis, we can verify which sorts of qualitative variables may mean cancer potential. This research has collected 699 data from Breast Cancer Wisconsin (Original) UCI; however, the exact number in use, except for the 16 data missed, is 683. The algorithms of J48, NB tree, Naive Bayes, Bayes Net, and Multi-Layer Perceptron implemented in the Weka software are also adopted to compare respectively and help get the accuracy of the results. Confusion matrix is used in categorization analysis. In addition, statistical analysis is applied in judging the importance of qualitative variables and finding the cause of breast cancer. Moreover, this study uses JFuzzLogic package, which programmed using Java language, and some theories related to regular decision analysis to develop a risk assessment system solely for detecting breast cancer. Through the reference of related knowledge and the use of C++, the study combine the different inference mechanisms used in this study and make them a basis for the user in both decision making and assessing. The results showed in this thesis can help people to make their own breast-cancer checks primarily and also can be a useful reference for a doctor when he/she diagnoses whether a patient with breast cancer or not.

參考文獻


[17] 陳金泉,應用資料探勘找出金融海嘯下台灣股市的避險操作方式,大同大學資訊經營研究所碩士論文,2010。
[19] 張士瑋,應用資料探勘技術於成人健康檢查之慢性病預防,朝陽科技大學資訊工程系碩士論文,2014。
[21] 張昭威,運用資料探勘方法建構乳癌預後模式,朝陽科技大學工業工程與管理系碩士班碩士論文,2010。
[22] 張雅婷,以資料探勘技術建立輔助乳癌診斷模型,國立台北科技大學商業自動化與管理研究所碩士論文,2008。
[26] Cheng S. H., Tsou M. H., Liu M. C., ‘‘Unique Features of Breast Cancer in Taiwan.’’, Breast Cancer Res Treat, Vol. 63, 2000, pp.213-223.

被引用紀錄


黃冠豪(2017)。應用資料探勘方法探索學系對新生選讀該系關聯規則之研究〔碩士論文,義守大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0074-1907201714134100

延伸閱讀