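For any organism, the blueprint of its functional characteristics is determined by the DNA sequences of the genes on its chromosomes. The proteins produced from DNA through transcription and translation are the main factors governing how an organism functions, and within protein sequences certain distinctive functional blocks of sequence appear, called domains. In this paper, we provide a simple browser interface for searching by protein name or protein sequence (three species are currently available: HUMAN, MOUSE, and RAT). After a single protein is selected, the user can retrieve information about that protein and about its corresponding gene. In addition, five domain prediction analyses can be run on the protein's sequence at the same time: SMART (Appendix A), ProDom (Appendix B), NCBI (Appendix C), Pfam (Appendix D), and Sanger (Appendix E). The results of the five analyses are displayed side by side in graphical form, so that biologists can compare them directly and quickly to understand the function the protein performs. Finally, starting from a particular domain on the protein, the system finds how that domain is distributed across the chromosomes of different species, for example the number of occurrences and their positions. We hope this gives biologists a simpler and faster way to understand how protein evolution and family expansion are related across species.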
In all life forms, biological functions are encoded by DNA sequences, which in higher organisms are carried on chromosomes. However, only a small, selected fraction of the DNA is actively transcribed and translated into proteins. Some regions of protein sequences are highly conserved and carry specific biological functions; these regions are called domains. In this paper, we provide a new browser interface and a database of human, mouse, and rat protein sequences for comparison. When queried with a protein sequence, the system displays domain information for the selected protein together with its corresponding genetic location. We use five major bioinformatics methods to predict protein domains: SMART, ProDom, NCBI, Pfam, and Sanger. Finally, we juxtapose the domain prediction results and combine them with information on the chromosomes of different species. This lets end users easily visualize the relationships between chromosomes and the related proteins.
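The two display steps described above, juxtaposing predictions from several tools and summarizing where a domain occurs on the chromosomes of each species, could be sketched roughly as follows. This is only an illustrative sketch, not the system's actual implementation: the `DomainHit` record, the `juxtapose` and `count_by_chromosome` functions, and the text-based tracks are all hypothetical names and simplifications introduced here, and the hit coordinates are assumed to have already been obtained from each prediction tool.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One predicted domain (hypothetical record; 1-based inclusive coordinates)."""
    tool: str      # e.g. "SMART", "Pfam"
    domain: str    # domain name as reported by the tool
    start: int
    end: int


def juxtapose(hits, seq_len, width=60):
    """Return one fixed-width text track per tool so predictions can be compared.

    Each track is a string of '-' (no domain) and '#' (predicted domain),
    scaled from sequence coordinates to `width` characters.
    """
    tracks = {}
    for h in hits:
        track = tracks.setdefault(h.tool, ["-"] * width)
        lo = (h.start - 1) * width // seq_len
        hi = max(lo + 1, h.end * width // seq_len)  # draw at least one cell
        for i in range(lo, min(hi, width)):
            track[i] = "#"
    return {tool: "".join(cells) for tool, cells in tracks.items()}


def count_by_chromosome(locations):
    """Count domain occurrences per (species, chromosome).

    `locations` is an iterable of (species, chromosome, position) tuples,
    assumed to come from a prior search of each genome for the domain.
    """
    return Counter((species, chrom) for species, chrom, _pos in locations)


if __name__ == "__main__":
    hits = [
        DomainHit("SMART", "SH3", 1, 50),
        DomainHit("Pfam", "SH3_1", 51, 100),
    ]
    for tool, track in juxtapose(hits, seq_len=100, width=10).items():
        print(f"{tool:8s} {track}")

    locations = [("HUMAN", "chr1", 100), ("HUMAN", "chr1", 500), ("MOUSE", "chr4", 42)]
    print(count_by_chromosome(locations))
```

Stacking one track per tool on a shared coordinate scale makes agreement and disagreement between the prediction methods visible at a glance, which is the point of the side-by-side display; a real interface would draw these tracks graphically rather than as text.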