透過您的圖書館登入
IP:3.149.238.159
  • 學位論文

以離散餘弦轉換進行快速內涵式影像檢索

Fast Content-Based Image Retrieval in Discrete Cosine Transform Domain

指導教授 : 黃有評
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


由於數位影像數量的快速增長、儲存裝置價格的急速滑落以及網際網路爆發性的成長,在過去數十年間已有許多學者深入研究內涵式的影像檢索 (Content-Based Image Retrieval,CBIR)。雖然在許多文獻中已經提出了一些以顏色、紋路和形狀為基礎的影像特徵,但如何選擇一組好的特徵集合來進行影像的分類與檢索,始終是一個嚴峻的挑戰。在本論文中,我們先探討一些著名的內涵式影像檢索系統,再深入討論檢索策略中一些重要的主題;其中有效的索引和快速的檢索是我們核心的目標,也成為我們在選擇特徵集合時最重要的評量標準。 我們的研究主要是使用離散餘弦轉換 (Discrete Cosine Transform,DCT)來進行快速的影像索引和檢索。我們先說明如何使用DCT有效地代表影像,然後再提出一個以DCT為基礎的兩階段方法來進一步提升檢索的速度。由於文字可以被視為一個灰階的影像,這個兩階段檢索的觀念也被成功地應用於中文文字的辨識。此外,我們使用了一組權重值來表達檢索影像中不同特徵間的相對重要性,其在進行反覆精鍊檢索的過程中,扮演了重要的角色。我們也針對這種彈性的檢索方式深入研究,提出了一種稱為模糊語意資訊檢索的方法,並實現於鳥類檢索系統中。最後,我們將檢視目前的研究成果,並據以提出未來的研究重點,以做為本論文的結論。

並列摘要


As vastly increasing amount of digital images, rapidly declining cost of storage, and explosive growth of the Internet, content-based image retrieval (CBIR) has been intensively studied in the last decades. Though a number of image features based on color, texture, and shape attributes in various domains have been reported in the literature, it is still a rigorous challenge to select a good feature set for image classification. In this thesis, some famous CBIR systems are reviewed, and related issues in the retrieval strategy are addressed. The effective indexing and efficient retrieval are identified as a problem, which serves as the most important criterion in choosing the feature set. Our work mainly focuses on the use of discrete cosine transform (DCT) as a contribution to fast indexing and retrieval in a CBIR system. We will first show the effective representation of images in DCT domain. Then, to further improve the retrieval speed, a two-stage approach based on DCT is proposed. As the character can be regarded as a gray image, the concept of the two-stage approach is also successfully applied to the recognition of Chinese characters. In addition, a set of weights are used to characterize the relative importance of the features in a query image, which plays an important role in the multiple passes of refining the retrieval. An intensive study of such flexible retrieval, called the fuzzy semantic information retrieval model, is realized in a bird searching system. Finally, the prospects of further work based on the findings of the study are given as a conclusion.

參考文獻


[1] T. Tsai, Y.-P. Huang, and T.-W. Chiang, “Fast Image Retrieval Using Low Frequency DCT Coefficients,” Proc. of the 10th Conf. on Artificial Intelligence and Applications-International Track, Kaoshung, Taiwan, Dec. 2005.
[3] T. Tsai, Y.-P. Huang, and T.-W. Chiang, “Content-Based Image Retrieval Using Gray Relational Analysis,” Proc. of the Conf. on Gray System and Its Applications, Pingtung, Taiwan, pp.227-233, Dec. 2005.
[4] T. Tsai, Y.-P. Huang, and T.-W. Chiang, “Image Retrieval Based on Dominant Texture Features,” Proc. of IEEE Int. Symposium on Industrial Electronics, Montreal, Quebec, Canada, July 2006. (Accepted)
[6] T. Tsai, Y.-P. Huang, and T.-W. Chiang, “A Two-Stage Content-Based Image Retrieval Approach in DCT Domain,” submitted to Int. Journal of Pattern Recognition and Artificial Intelligence, May 2006.
[7] T.-W. Chiang, T. Tsai, and Y.-P. Huang, “Wavelet-Based Multiresolution Matching for Content-Based Image Retrieval,” 第十一屆資訊管理暨實務 研討會, Taipei, Taiwan, pp.1963-1975, Dec. 2005.

延伸閱讀