透過您的圖書館登入
IP:3.22.51.241
  • 學位論文

運用健檢資料建構大腸癌預測模型

Apply the Health Examination Data to Construct Colorectal Cancer Prediction Models

指導教授 : 鄭純媛
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


近年來,社會的快速變遷,民眾為求便利易攝取一些高脂肪及低纖維的食物,攝取過多會造成大腸黏膜有刺激作用,這些刺激作用容易使大腸消化系統阻塞。根據行政院衛生署統計結果,顯示國人得到結腸、直腸癌(又稱大腸癌)的人數有逐年增加的趨勢,從1996年的2642人至2009年的4531人,因此,大腸癌對國人的影響已經不容忽視了。 本研究運用資料探勘的技術,以某醫學中心的全身健康檢查資料為樣本,探討全身健康檢查資料與大腸癌疾病的關聯性,並建構大腸癌預測模型。建構預測模型分成兩個階段(一)運用健康檢查資料分別使用區別分析及Logistic迴歸分析,從健檢資料中篩選出大腸癌重要危險因子。(二)將階段(一)所獲得的重要危險因子作為自變項,分別運用類神經網路及支援向量機建構罹患大腸癌的預測模型。 研究結果顯示,以相關Logistic迴歸分析結合支援向量機所建構的模型預測大腸癌罹患較準確,平均準確度為88.60%,敏感度及特異度分別為87.32%及75.76%,然而,以區別分析結合支援向量機所建構的模型較不受樣本資料中正常與異常比率懸殊影響,平均準確度為77.45%,敏感度及特異度分別為76.53%及76.50%。 關鍵字:全身健康檢查、資料探勘、區別分析、Logistic迴歸、類神經網路、支援向量機、大腸癌

並列摘要


In recent years, with the rapid change of the society, for the sake of convenience and easy, people started to take high fat and low fiber food. However, excessive intake can cause colon mucosa and have stimulating effect, which will stimulate the digestive system and tends to block the large intestine. According to DOH statistics results, showed that number of people getting colon cancer (also known as colorectal cancer) tend to increasing over the years. From 2462 of year 1996 to 4531 of year 2009.Therefore, the impact of colorectal cancer is in negligible. This study uses Data mining techniques, taking a medical center’s general health check information for sample. Our goal is to explore the correlations between physical examination data and disease associated with colorectal cancer. Also, we build a colorectal cancer predictive model. The construction of predictive model is divided into two stages, (1) by using difference and Logistic regression analysis methods; we sift out the important risk factors for the colon cancer from the health check data. (2) we set the important risk factors acquired from stage one as independent variables, and apply the neural networks and support vector machine to construct the colorectal cancer prediction model. The results show that, the model built with correlation Logistic Regression combined with Support Vector Machines prediction is more accurate, the average mean accuracy is 88.60%, and sensitivity and specificity were 87.32% and 75.76%. However, the model built with Discriminant Analysis combined with the best Support Vector Machines is less affected by the ratio of the normal and abnormal data in the sample, the mean accuracy of 77.45%, sensitivity and specificity were76.53% and 76.50%. Keywords:Physical examination, Discriminant Analysis, Logistic Regression, Artificial Neural Networks, Support Vector Machines, Colorectal cancer

參考文獻


3. 王秀伯 (2005)。啤酒啤酒與高濃度酒精飲料增加大腸癌的機會,醫學
8. 李語嫣 (2009)。運用資料探勘技術由健康檢查與生活習慣資料建立疾病預測模型-以糖尿病為例,國立成功大學,醫學資訊研究所碩士論文,台南市。
24. 廖建彰、王心怡、林瑞雄、謝長堯、宋鴻樟 (2005)。台灣地區男性大腸癌與攝護腺篩檢狀況,台灣衛誌,24卷(3)。
26. 劉易承、宋鴻樟、謝玲玲、唐瑞平、葉志清 (2008)。大腸直腸癌之風險預測模式與風險指標,台灣衛誌,27卷(1)
30. 糠榮誠,2008年4月,腸保健康談大腸直腸癌系統疾病,人醫心傳。

被引用紀錄


張巧蓉(2013)。大腸腺瘤息肉之分類規則研究〔碩士論文,中山醫學大學〕。華藝線上圖書館。https://doi.org/10.6834/CSMU.2013.00059
葉泰均(2013)。運用類神經網路探討大腸異常之相關健檢項目與預測模型〔碩士論文,朝陽科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0078-2712201314041915
陳志達(2013)。運用多變量分析探討大腸異常之相關健檢項目〔碩士論文,朝陽科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0078-2712201314041878
黃淑婷(2013)。應用類神經網路技術探討科技接受模式下 護理人員數位學習之使用意願〔碩士論文,朝陽科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0078-2611201410165953
葉皇志(2014)。應用支援向量資料描述(SVDD)建構大腸異常預測模型〔碩士論文,朝陽科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0078-0905201416542667

延伸閱讀