透過您的圖書館登入
IP:3.238.233.189
  • 學位論文

滯留時間校準、去噪與化合物辨識在代謝體學上之應用

Alignment, Denoising, and Identification in Metabolomics

指導教授 : 曾宇鳳

摘要


本篇論文呈現兩套我們研發的演算法,用來解決偵測小分子訊號時面臨的計算問題,它們是由代謝體學和毒物篩檢的應用發展而來。 在本篇論文的第一個部分,我們發展一套用於紫外光檢測器之層析圖譜校正工具-Chromaligner,它可以校正多種層析法的滯留時間(retention time),且執行速度比傳統correlation optimized warping (COW)演算法快。代謝體學(metabolomics)分析上常用之層析法包含毛細管電泳及高效液相層析儀等。毛細管電泳因其分析所需之樣品量及有機溶媒消耗量較少,是一種綠色環保層析分析法,且分析物分離效率高,但缺點為滯留時間再現性較差。高效液相層析儀有系統穩定性佳、分析結果再現性高之優點,但在梯度沖提時易發生滯留時間偏移。倘若樣本內包含多元化合物,層析圖譜之滯留時間偏移將導致化合物之辨識錯誤率提高,因此需要發展滯留時間校正工具。現有的校正工具大多為質譜儀設計,無法校正紫外光檢測器之圖譜資料。然而質譜儀費用昂貴,許多實驗單位仍以紫外光檢測器作為主要分析方式。Chromaligner為紫外光檢測器之資料提供滯留時間校正之解決方案。Chromaligner藉設定約束條件之層析圖譜校準法來解決波峰偏移問題。使用者收集完層析圖譜並定義一組作為約束條件之波峰後,Chromaligner使用這些定義好之波峰配合COW演算法來校正圖譜。Chromaligner之執行速度比傳統COW演算法快總波峰數之平方倍,可大量加速圖譜校正。Chromaligner同時可藉由已知成分之波峰校正出最佳結果,以利化學劑量學之分析。Chromaligner已建置在網路上,提供使用者免費使用線上校正層析圖譜滯留時間,網址如下: http://cmdd.csie.ntu.edu.tw/~chromaligner 在本篇論文的第二個部分,我們發展一個全新的去噪(denoising)和波峰擷取(peak picking)演算法-TIPick,它能夠自動在複雜的液相層析飛行時間式質譜儀(Liquid Chromatography - Time of Flight)樣本中,精確、靈敏地偵測出標的化合物。在毒物篩檢(toxicological screening)和代謝體學的研究中,液相層析飛行時間式質譜儀是一套重要的分析技術。TIPick的優點在於不假設化合物峰形之情況下能準確擷取到各種波峰,例如拖尾峰(tailing peaks)、低強度的波峰(low-intensity peaks)與分裂峰(split peaks)。分裂峰通常是由偵測器飽和(instrumental saturation)或去背演算法造成。TIPick包含了去背(background subtraction)與波峰擷取(peak picking)這兩個主要步驟。TIPick藉由去背消除已存在空白注射(blank injections)之化合物訊號,並使用圖譜波峰(chromatographic peak)之時間長度與強度(intensity)特性,去除雜訊並擷取波峰。另外,TIPick利用重複分析之實驗資料(duplicates)增強訊噪比(signal-to-noise ratio),有效提高波峰偵測之能力。TIPick會對指定之荷質比(target mass-to-charge ratio)自動產生萃取離子層析圖(extracted ion chromatogram),協助研究人員從複雜的混合物中測定特定化合物訊號。TIPick會自動產生樣本之總峰表(total peak table),作為分析依據或為後續提供可能的客制化分析。TIPick成功建立了國立台灣大學代謝體核心實驗室標準品之資料庫,並應用在毒物篩檢上,為複雜的液相層析飛行時間式質譜儀圖譜分析提供一個高靈敏度之標的化合物偵測演算法。

並列摘要


This dissertation presents two developed algorithms for solving computational problems of detecting small molecules in the field of metabolomics analysis and toxicological screening. In the first part of this dissertation, we present the tool - Chromaligner, which is a tool for chromatogram alignment to align retention time for chromatographic methods coupled to spectrophotometers such as high performance liquid chromatography and capillary electrophoresis for metabolomics works. Chromaligner resolves peak shifts by a constrained chromatogram alignment. For a collection of chromatograms and a set of defined peaks, Chromaligner aligns the chromatograms on defined peaks using correlation optimized warping (COW). Chromaligner is faster than the original COW algorithm by k2 times, where k is the number of defined peaks in a chromatogram. It also provides alignments based on known component peaks to reach the best results for further chemometric analysis. In the second part of this dissertation, we present the tool - TIPick. Liquid Chromatography - Time of Flight mass spectrometry has become an important technique in toxicological screening and metabolomic analysis. We hereby provide an effective algorithm, TIPick, for target analysis to accurately and sensitively detect target compounds in complex samples. TIPick comprises two major steps: background subtraction and peak picking. By subtracting the blank chromatogram, TIPick is able to eliminate chemical signals appearing in blank injections. TIPick utilizes the length and intensity of chromatographic peaks to perform peak enhancement and peak picking; thus, it is able to detect low-intensity or split peaks that may arise from either instrumental saturation or a mathematical background subtraction algorithm. TIPick is able to detect all peaks even the tailing or fronting peaks without pre-assuming peak shape. Furthermore, TIPick uses duplicate injections to enhance the signals of peaks, which improves peak detection power. TIPick can generate the extracted ion chromatograms for target m/z and automatically provide the total peak table of the sample. TIPick has successfully constructed the NTU MetaCore standard library and facilitated toxicological screening.

參考文獻


[1] V. Cianchino, C. Ortega, G. Acosta, L. D. Martinez, and M. R. Gomez, "Fingerprint analysis and synthetic adulterant search in Hedera helix formulations by capillary electrophoresis," Pharmazie, vol. 62, pp. 262-5, Apr 2007.
[2] A. Nordstrom, G. O'Maille, C. Qin, and G. Siuzdak, "Nonlinear data alignment for UPLC-MS and HPLC-MS based metabolomics: quantitative analysis of endogenous and exogenous metabolites in human serum," Anal Chem, vol. 78, pp. 3289-95, May 15 2006.
[3] C. Christin, A. K. Smilde, H. C. J. Hoefsloot, F. Suits, R. Bischoff, and P. L. Horvatovich, "Optimized Time Alignment Algorithm for LC-MS Data: Correlation Optimized Warping Using Component Detection Algorithm-Selected Mass Chromatograms," Analytical Chemistry, vol. 80, pp. 7012-7021, 2008.
[4] N. Hoffmann and J. Stoye, "ChromA: signal-based retention time alignment for chromatography-mass spectrometry data," Bioinformatics, vol. 25, pp. 2080-1, Aug 15 2009.
[5] M. E. Monroe, N. Tolic, N. Jaitly, J. L. Shaw, J. N. Adkins, and R. D. Smith, "VIPER: an advanced software package to support high-throughput LC-MS peptide identification," Bioinformatics, vol. 23, pp. 2021-3, Aug 1 2007.

延伸閱讀