透過您的圖書館登入
IP:3.17.167.114
  • 學位論文

利用影響效果探勘協助文獻探勘

Impact Mining for Supporting Literature-based Discovery

指導教授 : 魏志平

摘要


近幾年科學文獻快速的成長,使得想要了解醫學資訊的人很難在大量的文獻中找到他們所想要資訊或者是潛藏的醫學上的關係,而Swanson於1986年提出了文獻探勘的方法來協助研究者找出潛在的醫學上的關係,之後其他的研究者也提出許多方法想要改善Swanson所提出的方法。 透過文獻探勘能夠縮短尋找潛在醫學上的關係的時間,但是無法提供更多的資訊,例如魚油能夠降低血漿濃度這種上升、下降影響的關係。因此,本研究基於文獻探勘的概念,提出影響效果探勘的方法,能夠找出醫學上關係上升、下降影響來幫助使用者或者是專家們在面臨大量資料時,透過這個方法所提供的上升、下降的圖形與排名更容易分析。我們會先透過文獻探勘找出相關的醫學概念,接著利用我們所提出的方法來找出醫學交互關係影響,最後再利用排序的方法來對產生的結果建立排名。 我們建立兩個實驗情境來評估影響效果探勘方法的結果,「疾病─化合物與藥情境」。在疾病─化合物與藥的情境中,通常較會注意哪種藥可以治療哪一種病,我們提出的方法能夠有效地將能治療疾病的藥排序在前面,並且能夠將正確的藥放在較高的排名。而在「藥─化合物與藥情境」之下,儘管實驗結果沒有疾病─化合物與藥的情境明顯,但是仍然可以看出我們所提出的方法能夠有效地提供較佳的結果給研究者。而對於研究者,通常較會注意前面一百名或是三百名的結果,而本研究所提出的方法能提供一個更好的結果讓想要了解醫學資訊不論是有醫學背景或是無醫學背景的人更容易了解。

並列摘要


Scientific literature has growth rapidly in the past century, and a great deal of knowledge can support medical researchers to keep up with up-to-date information. This large volume of data is difficult to discover hidden relationships. To overcome this problem, Swanson proposed a method called literature-based discovery in 1986 to support researchers an effective way to uncovering new, potentially meaningful relationships. After Swanson proposed this method, other researchers also try to improve the result from literature-base discovery or develop new method to improve. Researchers could employ literature-based discovery to support them reduce the time of discovering hidden relationships. But literature-based discovery method could not provide more information such as fish oil and blood viscosity is a suppressing relationship because fish oil can decrease blood viscosity. The kind of relationship we defined as impact relationship. Therefore, this study proposed a LBD (Impact) technique which is based on the concept of literature-based discovery and this technique can extract impact relationship to support researchers easier to analyze large volume of data. First, we apply literature-based discovery to retrieve related medical concepts. Subsequently, we use our proposed technique to extract impact relationship then order medical concepts in an appropriate way. We construct two scenarios to evaluate our proposed LBD (Impact) technique, disease-chemicals and drugs scenario and drug-chemicals and drugs scenario. In disease-chemicals and drugs scenario, researchers usually focus on which drug can cure disease. And our proposed technique can rank drugs that can cure disease at higher rank. In the other scenario, drug-chemicals and drugs scenario, although the experiment result is not better than disease-chemicals and drugs scenario, we still can provide a better result to researchers. For researchers, they usually pay more attention on top 100 or 300. In this study, our proposed technique can provide a better result for researchers.

參考文獻


[2] Baker, N.C., Hemminger, B.M., Mining Connections between Chemicals, Proteins, and Diseases Extracted from MEDLINE Annotations, Journal of Biomedical Informatics 43:510–519, 2010.
[3] Frijters R., et al, Literature Mining for the Discovery of Hidden Connections between Drugs, Genes and Diseases, PLoS Computational Biology, 6(9), 2010.
[4] Gordon, M.D., Dumais, S., Using Latent Semantic Indexing for Literature Based Discovery, Journal of the American Society for Information Science, 49(8) 674–685, 1998.
[5] Gordon, M.D., Lindsay, R.K., Toward Discovery Support Systems: A Replication, Re-examination, and Extension of Swanson's Work on Literature-based Discovery of A Connection between Raynaud's and Fish Oil, Journal of the American Society for Information Science, 47 (2) 116–128, 1996.
[6] Gordon, M.D., Lindsay, R.K., Literature-Based Discovery by Lexical Statistics, Journal of the American Society for Information Science, 50(7) 574–587, 1999.

延伸閱讀