Please use this identifier to cite or link to this item: https://ah.lib.nccu.edu.tw/handle/140.119/64373
題名: 延展式區域三元化圖形特徵描述子之比例式降維法
Commensurate Dimensionality Reduction in Extended Local Ternary Patterns
作者: 余浩瑋
貢獻者: 廖文宏
余浩瑋
關鍵詞: 特徵描述
區域二元化圖形
延展式區域三元化圖形
比例式降維法
模糊理論
紋理影像辨識
feature descriptor
local binary pattern
extend local ternary pattern
commensurate dimensionality reduction
fuzzy logic
texture classification
日期: 2012
上傳時間: 3-Mar-2014
摘要: 區域二/三元化樣式與其各種變型被廣泛應用於物件辨識中的特徵描述,然而現有的區域特徵描述方式,普遍存在適用時機的問題,也就是針對不同類型的圖像資料庫,必須選用符合該圖片性質的描述法,方能達到較佳的辨識效果,舉例而言,處理材質影像時多使用uniform pattern,而進行人臉偵測或表情辨識時則多採用一般型的區域二/三元化樣式。\n\n本論文的目標是建構一個通用型的區域三元化樣式,使其一體適用於各類圖型辨識的任務,我們以延展式區域三元化樣式(Extended local ternary patterns, ELTP)為基礎,探討各種降維演算法的結合機制,並提出可行的樣式定義方法,我們針對ETLP 中的uniform pattern定義重新思考,藉由大規模實驗與統計,探討各類uniform pattern 的從屬關係與出現比例,並依據比例原則,在降維階段分配適當之維度,稱之為比例式降維法。\n\n本論文針對比例式降維後的ELTP之抗噪性、描述力與通用性進行深度的分析與廣泛的實驗,以驗證此類圖像描述方法之效能。此外,由於特徵樣式的編碼在某些情況下,容易受到雜訊影響而產生變化,因此我們亦提出了結合模糊理論的方式加以改善。以上改良方式之效果,都在各式實驗中獲得驗證。
Local binary/ternary pattern and its derivatives have been widely employed to represent low-level features in many pattern recognition tasks. However, existing local descriptors fail to achieve universal applicability in the sense that specific types of local binary patterns are better suited for certain collections of images. For example, uniform local binary patterns are preferred when dealing with textures, while regular local binary/ternary patterns are adopted for face detection and facial expression recognition.\n\nThis thesis proposes a universally applicable local descriptor based on the extended local ternary pattern (ELTP) to address the above concern. We exploit the feasibility of combining dimensionality reduction techniques to derive a novel local descriptor that is suitable for all kinds of object recognition applications. Specifically, we investigate all possible definitions of uniform patterns under ternary encoding scheme and study their properties. This enables us to devise a dimensionality assignment algorithm in which the allocated dimension is proportional to the appearance rate of the corresponding pattern group.\n\nThe newly defined extend local ternary pattern using commensurate dimensionality reduction (ELTP-CDR) technique has been extensively tested to analyze its universality, discriminability, and noise sensitivity. To further enhance the robustness of this class of local descriptor, we also incorporate fuzzy logic to derive fuzzy representations of the original ELTP. The efficacy of the proposed methods has been validated through several experiments targeted at texture recognition tasks.
參考文獻: [1] T. Ojala, M. Pietikainen, and T. Maenpaa, “Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 971-987, July 2002.\n[2] D. Lowe, “Distinctive Image Features from Scale-Invariant Keypoints,” Int`l J. Computer Vision, vol. 2, no. 60, pp. 91-110, 2004.\n[3] H. Bay, A. Ess, T. Tuytelaars, L. V. Gool, “SURF: Speeded Up Robust Features”, Computer Vision and Image Understanding (CVIU), Vol. 110, No. 3, pp. 346--359, 2008.\n[4] A. Alahi, R. Ortiz, P. Vandergheynst, “FREAK: Fast Retina Keypoint”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012.\n[5] S. Leutenegger, M. Chli, R. Y. Siegwart, “BRISK: Binary Robust Invariant Scalable Keypoints”, IEEE International Conference on Computer Vision (ICCV), 2011.\n[6] W. H. Liao, “Region Description Using Extended Local Ternary Patterns”, Proceedings of the 20th International Conference on Pattern Recognition, pp. 1003-1006, 2010.\n[7] X. Tan and B. Triggs. “Enhanced local texture feature sets for face recognition under difficult lighting conditions”. In Analysis and Modeling of Faces and Gestures, volume 4778 of LNCS, pages 168–182. Springer, 2007.\n[8] A. Shobeirinejad and Y. S. Gao, “Gender Classification Using Interlaced Derivative Patterns“, Proceedings of the 20th International Conference on Pattern Recognition, pp. 1509-1512, 2010.\n[9] T. Ojala, M. Pietikainen and T. Maenpaa, ”Multi-resolution Gray-scale and Rotation Invariant Texture Classification with Local Binary Patterns”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24(7), pp.971-987. July 2002.\n[10] M. Heikkilä, M. Pietikäinen and C. Schmid, “Description of Interest Regions with Center-Symmetric Local Binary Patterns”, Computer Vision, Graphics and Image Processing, Lecture Notes in Computer Science, 2006.\n[11] T. Ahonen, M. Pietikainen, “Soft Histograms for Local Binary Patterns”, in Proc. Fin. Signal Process. Symp., Oulu, Finland, 2007.\n[12] J. C. Dunn, “A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters”, Journal of Cybernetics 3: 32-57, 1973\n[13] R. Krishnapuram, A. Joshi, L. Yi, “A Fuzzy Relative of the k-Medoids Algorithm with Application to Web Document and Snippet Clustering”, Snippet Clustering, in Proc. IEEE Intl. Conf. Fuzzy Systems - FUZZIEEE99, Korea, 1999.\n[14] T. Ahonen, A. Hadid, and M. Pietikainen, “Face Description with Local Binary Patterns: Application to Face Recognition”, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, no. 12, pp.2037-2041, Dec. 2006.\n[15] N. P. Doshi, G. Schaefer, “A Comprehensive Benchmark of Local Binary Pattern Algorithms for Texture Retrieval”, International Conference on Pattern Recognition (ICPR), 2012.\n[16] Everingham, M., Van~Gool, L., Williams, C. K. I., Winn, J., Zisserman, A., “The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results”, http://www.pascal-network.org/challenges/VOC/voc2012/workshop/index.html.
描述: 碩士
國立政治大學
資訊科學學系
98753030
101
資料來源: http://thesis.lib.nccu.edu.tw/record/#G0098753030
資料類型: thesis
Appears in Collections:學位論文

Files in This Item:
File SizeFormat
303001.pdf5.93 MBAdobe PDF2View/Open
Show full item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.