透過您的圖書館登入
IP:3.14.70.203
  • 學位論文

利用語音進行照片中人物影像的自動化標註及檢索

Automatic Facial Image Annotation and Retrieval by Integrating Voice Label and Visual Appearance

指導教授 : 徐宏民

摘要


無資料

關鍵字

照片標註 語音檢索

並列摘要


Annotation is important for managing and retrieving a large amount of photos, but it is generally labor-intensive and time-consuming. However, speaking while taking photos is straightforward and effortless, and using voice for annotation is faster than typing words. To best reduce the manual cost of annotating photos, we propose a novel framework which utilizes the scarce spoken annotations recorded while capturing as voice labels and automatically label every facial image in the photo collection. To accomplish this goal, we employ a probabilistic graphical model which integrates voice labels and visual appearances for inference. Combined with group prior estimation and gender attribute association, we can achieve an outstanding performance on the proposed synthesized group photo collections.

並列關鍵字

Photo Annotation Speech Retrieval

參考文獻


[19] N. Kumar, A. C. Berg, P. N. Belhumeur, and S. K. Nayar. Attribute and simile classifiers for face verification. In Computer Vision, 2009 IEEE 12th International Conference on, pages 365–372. IEEE, 2009.
[14] T. J. Hazen, B. Sherry, and M. Adler. Speech-based annotation and retrieval of digital photographs. In INTERSPEECH, volume 7, pages 2165–2168, 2007.
[5] M. Brenner and E. Izquierdo. Recognizing people by face and body in photo col- lections. In 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), pages 1–7. IEEE, 2013.
[6] C.-C. Chang and C.-J. Lin. Libsvm: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST), 2(3):27, 2011.
[7] D. Chen, X. Cao, F. Wen, and J. Sun. Blessing of dimensionality: High-dimensional feature and its efficient compression for face verification. In Computer Vision and

延伸閱讀