Bioinformatics applies computer science, information technology, and statistics to the analysis of biological data. Sequence alignment and protein structure analysis are among its most fundamental and important tasks. Determining the structure and function of an unknown gene sequence requires sequence alignment, and biological research such as drug development begins with structural analysis of known proteins. MATLAB provides powerful computing functions and analysis tools, and its M-files allow program code to be gathered in a single file, which improves execution efficiency. In this research, we use M-files with the MATLAB graphical user interface (GUI) to build a bioinformatics system for sequence alignment. The system provides Global Alignment, Local Alignment, and Sequence Logo functions. On the back end, a gene sequence database is built and connected through the MATLAB Database Toolbox, so that users can query gene sequences, run sequence alignments, and perform protein structure analysis through the system interface. Results are presented graphically wherever possible, making it easy for users to observe the similarities and differences between sequences.
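To make the difference between global and local alignment concrete, the following is a minimal sketch of the standard dynamic-programming recurrences (Needleman-Wunsch for global, Smith-Waterman for local). It is written in Python purely as an illustration and is not the MATLAB code of the system described here. The function name `alignment_score` and the scoring values (match, mismatch, linear gap penalty) are assumptions chosen for the example.

```python
def alignment_score(a, b, match=1, mismatch=-1, gap=-2, local=False):
    """Return the optimal alignment score of strings a and b.

    local=False: global alignment (Needleman-Wunsch recurrence).
    local=True:  local alignment (Smith-Waterman recurrence).
    The scoring values are illustrative assumptions (linear gap penalty).
    """
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * cols for _ in range(rows)]

    if not local:
        # Global alignment charges for leading gaps in either sequence.
        for i in range(1, rows):
            score[i][0] = i * gap
        for j in range(1, cols):
            score[0][j] = j * gap

    best = 0  # highest cell seen; only used for local alignment
    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            cell = max(
                score[i - 1][j - 1] + pair,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,       # gap in b
                score[i][j - 1] + gap,       # gap in a
            )
            if local:
                # Local alignment may restart anywhere, so scores never drop below 0.
                cell = max(cell, 0)
            score[i][j] = cell
            best = max(best, cell)

    # Global: score of aligning both full sequences. Local: best-scoring substring pair.
    return best if local else score[-1][-1]


if __name__ == "__main__":
    print(alignment_score("ACGT", "AGT"))                   # global score
    print(alignment_score("ACGT", "TTACGTTT", local=True))  # local score
```

The global variant scores the two sequences end to end, so every unmatched residue costs a gap penalty. The local variant reports only the best-matching region, which is why it suits finding a short conserved segment inside a longer sequence.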