圖像資料結構化與分類的探討

莊于萱; Zhuang, Yu-Xuan

Please use this identifier to cite or link to this item: https://ah.lib.nccu.edu.tw/handle/140.119/136834

題名:	圖像資料結構化與分類的探討 A Study of Image Structurization and Classification
作者:	莊于萱 Zhuang, Yu-Xuan
貢獻者:	余清祥<br>陳麗霞莊于萱 Zhuang, Yu-Xuan
關鍵詞:	圖像辨識資料結構化圖像風格分割圖像解析度 Image Recognition Data Structurization Image Style Splitting method Resolution
日期:	2021
上傳時間:	2-Sep-2021
摘要:	資料以各種形態存在於我們生活中，寫過的每一篇文章，甚至拍過的每一張照片，透過適當的數位化皆可由量化分析挖掘出其中的重要訊息。過去資料分析大多侷限在數字格式，隨著電腦相關技術的發展，資訊解讀擴展至文字、圖像、音樂等各種類型的資料，我們的生活因為資訊傳遞快速、即時判讀而更加便利，影像辨識、自動駕駛等應用就是眾所周知的應用。資料格式多元、傳遞交換便捷，都是大數據時代的特點，使得資訊安全及品質愈形重要，如何解讀龐雜的大數據，更是政府及個人必備的關鍵能力。不具固定格式資料稱為非結構資料，而解讀這類型資料的首要挑戰為數位化格式，但轉檔方式與研究目標、資料屬性關係密切，很難訂出一個絕對標準。\n以圖像辨識為例，資料應轉換成三原色（紅綠藍，RGB：Red、Green、Blue）或是圖像形狀及大小，至今仍無定論；即便是以顏色紀錄，是否也需考量色彩飽和度、亮度等資訊？有鑑於圖像資料尚無統一的格式化，本文以視覺感受的方式定義變數，比較冷暖色、RGB、灰階、邊緣檢測、分割圖像等方法，協助分類不同風格的圖像。由於圖像辨識結果多半與其屬性有關（Data Dependent），本文分析三種類型的圖像資料：臺灣報紙頭版、美國Vogue雜誌封面、十九世紀油畫（現實派、印象派），其內容包含文字、照片（及圖片）、繪畫，再結合統計分析、機器學習模型，藉由電腦模擬與交叉驗證，探討不同的圖像格式方法、模型及其參數的分類準確性。研究結果顯示，分類準確性隨著解析度上升而提高，100100即有不錯的效果；而圖像格式化中以分割圖像法最佳，用於不同圖像資料及分析模型的分類準確性較高。 Data exists in various forms, including articles we write and photos we take. We can dig out important information through appropriate digitization and quantitative analysis. Data analysis was usually limited to data with digital format and now has expanded to all kinds of data, such as text, images, and music. Rapid data transmission and real-time analysis brings convenience to our lives, and image recognition and autonomous driving are two famous applications. The ability of analyzing messy and complicated data is a key ability in Big Data era. However, more than 90% of data are unstructured data and it needs to convert them into digital format before plugging into data analysis. But the conversion method is closely related to the research objectives and data attributes. Taking image recognition as an example, there is no consensus if the image data should be converted into three primary colors (red, green and blue, RGB) or their shape and size should also be considered.\nThis study aims to explore different structurization methods for image data and evaluate which method, after plugging into classification models, has the highest accuracy in classifying images . We consider three types of image data: Taiwanese newspapers, Vogue magazine, and nineteenth century oil paintings, since the classification results are often data-dependent. We will apply statistical and machine learning models to explore classification accuracy of different image format methods. The analysis results show that the classification accuracy increases when the resolution becomes higher, and 100100 resolution can provide sufficiently satisfactory results. We found that the splitting method has highest accuracy in image classification, for three types of image data and different classification models.
參考文獻:	一、中文文獻\n孔萬增、朱善安（2007）。「基於切割子模塊的單樣本人臉識別」，《光電工程》，34（8），頁110-114。\n任大勇、賈振紅、楊傑（2019）。「結合位圖切割和區域合併的彩色圖像分割」，《計算機工程與應用》，55（2），頁162-167。\n柯裕嘉（2011）。「報紙消費者對頭版新聞形式與內容喜好度研究」，國立台灣師範大學圖文傳播學系碩士論文，頁1-120。\n胡毅（2015）。「米勒《拾穗》賞析」，《時代文學（下半月）》，第7期，頁75-75。\n施振祥（2010）。「基於顏色和邊緣信息分佈的圖像檢索」，《計算機科學》，37（2），頁256-260。\n辜衛東、李兵（2018）。「基於隨機區域合併的自動彩色圖像分割算法」，《計算機科學》，45（9），頁279-282。\n黃衍翠（2010）。「從《日出·印象》談印象派油畫之美」，《時代文學（上半月）》，第3期，頁229-231。\n楊賢藝（2006）。「論印象派繪畫的藝術特色」，《藝術教育》，第4期，頁94-95。\n劉振源（1900）。「印象派繪畫」，藝術圖書出版社，頁1-252。\n龔如森（2016）。「西班牙藝術夜空裡的星光-寫實主義的委拉斯蓋茲與浪漫主義的哥雅」，中國文化大學藝術學院美術學系學系碩士論文，頁1-136。\n\n二、英文文獻\nBarni, M., Pelagotti, A., and Piva, A.（2005）“Image processing for the analysis and conservation of paintings: opportunities and challenges.” IEEE Signal Process Magazine, 22, 141–144.\nCanny, J.（1986）“A computational approach to edge detection.” IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-8, 679–698.\nCheng, Y.C. and Chen, S.Y.（2003）“Image classification using color, texture and regions. ” Image and Vision Computing, 21（9）, 759–776.\nChen, W., Shi, Y.Q., and Xuan, G.（2007）“Identifying computer graphics using HSV color model and statistical moments of characteristic functions.” IEEE International Conference on Multimedia and Expo, 1123–1126.\nChristiana, W. （2014）“Female image in Vogue magazine: A pictorial analysis of facial and body language.” Robots Reading Vogue.\nJunhua, C. and Jing, L. （2012）“Research on color image classification based on HSV color space.” Second International Conference on Instrumentation, Measurement, Computer, Communication and Control, 944–947.\nKabade, A.L. and Sangam, D.V.（2016） “Canny edge detection algorithm.” International Journal of Advanced Research in Electronics and Communication Engineering, 5（5）, 1292–1295.\nKaur, B. and Garg, A. （2011）“Mathematical morphological edge detection for remote sensing images.” International Conference on Electronic Computer Technology, 5, 324–327.\nLee, S.G. and Cha, E.Y. （2016）“Style classification and visualization of art painting’s genre using self-organizing maps.” Human-centric Computing and Information Sciences, 6, No.7.\nRabby, M.K.M., Chowdhury, B., and Kim, J.H.（2018）“A modified canny edge detection algorithm for fruit detection and classification.” 10th International Conference on Electrical and Computer Engineering, 237–240.\nPal, N.R. and Pal, S.K. （1993）“A review in image segmentation techniques.” Pattern Recognition, 26（9）, 1277–1294.\nPeter, L. （2013）“Vogue Cover Averages.” Robots Reading Vogue.\nRong, W., Li, Z., Zhang, W. and Sun, L. （2014） “An improved canny edge detection algorithm.” Proceeding IEEE International Conference on Mechatronics and Automation, 2（2）, 577–582.\nPhoebe, P. （1997） “Impressionism.” Thames and Hudson, 1–187.\nRohit, G. （2019）“LightGBM - Another gradient boosting algorithm.” , Retrieved March 28, 2019, from: https://rohitgr7.github.io/lightgbm-another-gradient-boosting/\nSüsstrunk, S., Buckley, R., and Swen, S. （1999）“Standard RGB color spaces.” Color and Imaging Conference, 127–134.\nSeetharaman, K. （2019） “Melanoma Image Classification Based on Color, Shape, and Texture Features Using Multivariate Statistical Tests.” Journal of Computational and Theoretical Nanoscience, 16（4）, 1717–1724.\nSong, M. and Civco, D. （2004） “Road extraction using SVM and image segmentation.” Photogrammetric Engineering and Remote Sensing, 70（12）, 1365–1371.\nSalman, A., Semwal, A., Bhatt, U., and Thakkar, V. M. （2017） “Leaf classification and identification using Canny Edge Detector and SVM classifier.” Proceedings of the 2017 International Conference on Inventive Systems and Control（ICISC）, Coimbatore, India, 19–20 January 2017, 1–4.\nXin, M. and Wang, Y. （2019）“Research on image classification model based on deep convolution neural network.” EURASIP Journal on Image and Video Processing, 40.\nDenny, B. （2015）“Understanding Convolutional Neural Networks for NLP.” Retrieved November 7, 2015, from: http://www.wildml.com/2015/11/understanding-convolutional-neural-networks-for-nlp/
描述:	碩士國立政治大學統計學系 108354028
資料來源:	http://thesis.lib.nccu.edu.tw/record/#G0108354028
資料類型:	thesis
Appears in Collections:	學位論文

Files in This Item:

File	Description	Size	Format
402801.pdf		2.92 MB	Adobe PDF2	View/Open

Show full item record

Google Scholar^TM

Check

Files in This Item:

Google Scholar^TM

Altmetric

Altmetric

Files in This Item:

Google ScholarTM

Altmetric

Altmetric

Google Scholar^TM