Publications-Theses

Title基於注意力與多模式分析之 數位相片管理系統設計與實作
Design and implementation of a multi-modal attention-based photo manager
Creator孫新民
Contributor廖文宏
孫新民
Key Words電腦視覺
多模式
影像處理
人工智慧
Date2004
Date Issued19-Sep-2009 12:09:23 (UTC+8)
Summary本論文敘述對於智慧型個人數位相片管理瀏覽平台之研究、設計與實作過程。系統設計上基於整合多重證據架構,採用影像內容與使用者瀏覽行為之分析作為自動分類,判斷影像重要性與推薦程度的依據。影像自動分類方面,包括外部給予的標準資訊-EXIF資訊與分析影像內容,以其中人物存在數量與面積比例為依據的影像分類。而在影像的推薦方面,則採用影像品質之分析-包括對焦品質分析、曝光品質分析-與分析使用者瀏覽相片時的行為-包括停留時間與專注程度的整合為分析重要程度依據;最後則採用多模式(Multi-Modal)架構整合不同的評估結果並作為推薦的結論。
In this thesis, we present the design and implementation of an intelligent personal digital photo browsing platform. The proposed system relies on multiple evidences inferred from image content as well as user behavior. Specifically, external EXIF data and face detection results are utilized to coarsely classify the digital images. Measures of image quality, including clarity and contrast, are calculated to further refine the search result. Moreover, we use web cameras to record and analyze the viewing behavior of the user and attempt to correlate the interest of the viewer to the effective viewing time. Finally, a multi-modal system is put in place to integrate the clues acquired from different modules.
參考文獻 【1】 Richard Shim.,「影像左右快閃記憶卡命運」,CNET新聞專區,2004年,http://taiwan.cnet.com/news/ce/0,2000062982,20087086,00.htm
【2】 Kerry Rodden, Kenneth R. Wood. 2003. How Do People Manage Their Digital Photographs? CHI 2003: NEW HORIZONS. Volume No. 5, Issue No. 1
【3】 Hyunmo Kang, Ben Shneiderman.2002. Visualization Methods for Personal Photo Collections:Browsing and Searching in the PhotoFinder. Department of Computer Science, Human-Computer Interaction Laboratory
【4】 Adobe Systems Incorporated, http://www.pacific.adobe.com/products/photoshopalbum/overview.html
【5】 Ullas Gargi, Yining Deng, Daniel R. Tretter. 2002. Managing and Searching Personal Photo Collections. HP Laboratories Palo Alto
【6】 Lynette Hirschman.1999. Intelligent Human-Computer Interfaces. The Edge Volume 3, Number 4
【7】 P. Maes, T. Darrell, B. Blumberg, A. Pentland. 1995. The ALIVE system: full-body interaction with autonomous agents. Computer Animation`95 .
【8】 許聞廉、陳克健,「自然智慧型輸入系統的語意分析─脈絡會意法」,1993年,Proceedings of the 6th International Symposium on Cognitive Aspects of the Chinese Language, (1993), 527-540.
【9】 Japan Electronics and Information Technology Industries Association . Exchangeable image file format for digital still cameras : Exif Version 2.2
【10】 TsuruZohTachibanaya..Description of Exif file format. 2001. http://park2.wakwak.com/~tsuruzoh/Computer/Digicams/exif-e.html#AboutExif
【11】 Stuart Russell ,Peter Norvig. 2002. Artificial Intelligence: A Modern Approach Second Edition. Prentice Hall.
【12】 Sanjay Kr. Singh, D. S. Chauhan, Mayank Vatsa, Richa Singh. 2003. A Robust Skin Color Based Face Detection Algorithm. Tamkang Journal of Science and Engineering, Vol. 6, No. 4, pp. 227-234
【13】 Y. Gong and M. Sakauchi, "Detection of regions matching specified chromatic features", Computer Vision and Image Understanding, vol. 61, no. 2, 1995, pp 263 - 269
【14】 Goldennumer.Net, “The human face is based entirely on Phi”, http://www.goldennumber.net/face.htm
【15】 Zhou Wang, Alan C. Bovik, 2002 “WHY IS IMAGE QUALITY ASSESSMENT SO DIFFICULT?”, IEEE International Conference on Acoustics, Speech, & Signal Processing
【16】 Zhou Wang, Alan C. Bovik. 2002. A Universal Image Quality Index. IEEE Signal Processing Letters, vol. 9, no. 3, pp. 81-84
【17】 Norbert Wiener. 1942. Extrapolation, Interpolation, and Smoothing of Stationary Time Series. MIT Express
【18】 Claude E. Shannon. 1948 . A Mathematical Theory of Communication. Bell System Technical Journal, vol. 27, pp. 379-423 and 623-656
【19】 Jiawei Han, Micheline Kamber. 2001. Data Mining: Concepts and Techniques
【20】 Gordon S. Linoff, Michael J. A. Berry, Michael J. A. Berry . 2001. Mining the Web: Transforming Customer Data.
【21】 Paul Viola, Michael Jones. 2001. Rapid Object Detection using a Boosted Cascade of Simple Features. Proceedings IEEE Conf. on Computer Vision and Pattern Recognition
【22】 E.S. Bigun, J.Bigün, B. Duc, S. Fischer. 1997. Expert conciliation for multi modal person authentication systems by Bayesian statistics, Audio and Video based Person Authentication - AVBPA97
【23】 P. Verlinde, G. Chollet, and M. Acheroy. 2000. Multi-modal identity verification using expert fusion. Information Fusion, 1:17--33
【24】 Conrad Sanderson, 2002, “Information fusion and person verification using speech & face information”, IDIAP–RR 02-33
【25】 Arun Ross, Anil Jain, Jian-Zhong Qian. 2001. Information Fusion in Biometrics. Lecture Notes in Computer Science
【26】 Metropolis,N., A. Rosenbluth, M. Rosenbluth, A. Teller, E. Teller, 1953,"Equation of State Calculations by Fast Computing Machines", J. Chem. Phys.,21, 6, 1087-1092,
Description碩士
國立政治大學
資訊科學學系
90753012
93
資料來源 http://thesis.lib.nccu.edu.tw/record/#G0090753012
Typethesis
dc.contributor.advisor 廖文宏zh_TW
dc.contributor.author (Authors) 孫新民zh_TW
dc.creator (作者) 孫新民zh_TW
dc.date (日期) 2004en_US
dc.date.accessioned 19-Sep-2009 12:09:23 (UTC+8)-
dc.date.available 19-Sep-2009 12:09:23 (UTC+8)-
dc.date.issued (上傳時間) 19-Sep-2009 12:09:23 (UTC+8)-
dc.identifier (Other Identifiers) G0090753012en_US
dc.identifier.uri (URI) https://nccur.lib.nccu.edu.tw/handle/140.119/37101-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 資訊科學學系zh_TW
dc.description (描述) 90753012zh_TW
dc.description (描述) 93zh_TW
dc.description.abstract (摘要) 本論文敘述對於智慧型個人數位相片管理瀏覽平台之研究、設計與實作過程。系統設計上基於整合多重證據架構,採用影像內容與使用者瀏覽行為之分析作為自動分類,判斷影像重要性與推薦程度的依據。影像自動分類方面,包括外部給予的標準資訊-EXIF資訊與分析影像內容,以其中人物存在數量與面積比例為依據的影像分類。而在影像的推薦方面,則採用影像品質之分析-包括對焦品質分析、曝光品質分析-與分析使用者瀏覽相片時的行為-包括停留時間與專注程度的整合為分析重要程度依據;最後則採用多模式(Multi-Modal)架構整合不同的評估結果並作為推薦的結論。zh_TW
dc.description.abstract (摘要) In this thesis, we present the design and implementation of an intelligent personal digital photo browsing platform. The proposed system relies on multiple evidences inferred from image content as well as user behavior. Specifically, external EXIF data and face detection results are utilized to coarsely classify the digital images. Measures of image quality, including clarity and contrast, are calculated to further refine the search result. Moreover, we use web cameras to record and analyze the viewing behavior of the user and attempt to correlate the interest of the viewer to the effective viewing time. Finally, a multi-modal system is put in place to integrate the clues acquired from different modules.en_US
dc.description.tableofcontents 第一章 緒論 1
1.1數位影像普及化所造成的管理問題與目前的解決方案 1
1.2智慧型人機介面 3
1.3智慧型數位影像管理系統平台 5
第二章 影像內容資訊分析 9
2.1影像內容資訊分析概觀 9
2.2數位相機與拍攝參數資訊-EXIF簡介 10
2.3以人物為基礎的數位相片分類 13
第三章 數位影像品質參數分析 25
3.1數位影像品質分析概觀 25
3.2偵測成像結果品質之評估演算法 27
3.3偵測影像對比程度對於數位相片品質影響之評估演算法 33
第四章 使用者行為參數分析 37
4.1使用者行為分析概論 37
4.2使用者行為與專注程度分析 39
第五章 多模式與資訊融合 51
5.1多模式(Multi-Modal)與資訊融合(Information Fusion)概論 51
5.2決策核心設計 55
5.3系統整合 60
第六章 系統實作 62
6.1系統實作 62
6.2資料庫設計與實作 64
第七章 結論與未來發展 67
7.1 結論 67
7.2 未來發展 69
參考文獻 71
zh_TW
dc.format.extent 98777 bytes-
dc.format.extent 106434 bytes-
dc.format.extent 104322 bytes-
dc.format.extent 61791 bytes-
dc.format.extent 121485 bytes-
dc.format.extent 121150 bytes-
dc.format.extent 98223 bytes-
dc.format.extent 96164 bytes-
dc.format.extent 1327653 bytes-
dc.format.extent 1346117 bytes-
dc.format.extent 1712700 bytes-
dc.format.extent 416345 bytes-
dc.format.extent 308738 bytes-
dc.format.extent 1389671 bytes-
dc.format.extent 828192 bytes-
dc.format.extent 163279 bytes-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.format.mimetype application/pdf-
dc.language.iso en_US-
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#G0090753012en_US
dc.subject (關鍵詞) 電腦視覺zh_TW
dc.subject (關鍵詞) 多模式zh_TW
dc.subject (關鍵詞) 影像處理zh_TW
dc.subject (關鍵詞) 人工智慧zh_TW
dc.title (題名) 基於注意力與多模式分析之 數位相片管理系統設計與實作zh_TW
dc.title (題名) Design and implementation of a multi-modal attention-based photo manageren_US
dc.type (資料類型) thesisen
dc.relation.reference (參考文獻) 【1】 Richard Shim.,「影像左右快閃記憶卡命運」,CNET新聞專區,2004年,http://taiwan.cnet.com/news/ce/0,2000062982,20087086,00.htmzh_TW
dc.relation.reference (參考文獻) 【2】 Kerry Rodden, Kenneth R. Wood. 2003. How Do People Manage Their Digital Photographs? CHI 2003: NEW HORIZONS. Volume No. 5, Issue No. 1zh_TW
dc.relation.reference (參考文獻) 【3】 Hyunmo Kang, Ben Shneiderman.2002. Visualization Methods for Personal Photo Collections:Browsing and Searching in the PhotoFinder. Department of Computer Science, Human-Computer Interaction Laboratoryzh_TW
dc.relation.reference (參考文獻) 【4】 Adobe Systems Incorporated, http://www.pacific.adobe.com/products/photoshopalbum/overview.htmlzh_TW
dc.relation.reference (參考文獻) 【5】 Ullas Gargi, Yining Deng, Daniel R. Tretter. 2002. Managing and Searching Personal Photo Collections. HP Laboratories Palo Altozh_TW
dc.relation.reference (參考文獻) 【6】 Lynette Hirschman.1999. Intelligent Human-Computer Interfaces. The Edge Volume 3, Number 4zh_TW
dc.relation.reference (參考文獻) 【7】 P. Maes, T. Darrell, B. Blumberg, A. Pentland. 1995. The ALIVE system: full-body interaction with autonomous agents. Computer Animation`95 .zh_TW
dc.relation.reference (參考文獻) 【8】 許聞廉、陳克健,「自然智慧型輸入系統的語意分析─脈絡會意法」,1993年,Proceedings of the 6th International Symposium on Cognitive Aspects of the Chinese Language, (1993), 527-540.zh_TW
dc.relation.reference (參考文獻) 【9】 Japan Electronics and Information Technology Industries Association . Exchangeable image file format for digital still cameras : Exif Version 2.2zh_TW
dc.relation.reference (參考文獻) 【10】 TsuruZohTachibanaya..Description of Exif file format. 2001. http://park2.wakwak.com/~tsuruzoh/Computer/Digicams/exif-e.html#AboutExifzh_TW
dc.relation.reference (參考文獻) 【11】 Stuart Russell ,Peter Norvig. 2002. Artificial Intelligence: A Modern Approach Second Edition. Prentice Hall.zh_TW
dc.relation.reference (參考文獻) 【12】 Sanjay Kr. Singh, D. S. Chauhan, Mayank Vatsa, Richa Singh. 2003. A Robust Skin Color Based Face Detection Algorithm. Tamkang Journal of Science and Engineering, Vol. 6, No. 4, pp. 227-234zh_TW
dc.relation.reference (參考文獻) 【13】 Y. Gong and M. Sakauchi, "Detection of regions matching specified chromatic features", Computer Vision and Image Understanding, vol. 61, no. 2, 1995, pp 263 - 269zh_TW
dc.relation.reference (參考文獻) 【14】 Goldennumer.Net, “The human face is based entirely on Phi”, http://www.goldennumber.net/face.htmzh_TW
dc.relation.reference (參考文獻) 【15】 Zhou Wang, Alan C. Bovik, 2002 “WHY IS IMAGE QUALITY ASSESSMENT SO DIFFICULT?”, IEEE International Conference on Acoustics, Speech, & Signal Processingzh_TW
dc.relation.reference (參考文獻) 【16】 Zhou Wang, Alan C. Bovik. 2002. A Universal Image Quality Index. IEEE Signal Processing Letters, vol. 9, no. 3, pp. 81-84zh_TW
dc.relation.reference (參考文獻) 【17】 Norbert Wiener. 1942. Extrapolation, Interpolation, and Smoothing of Stationary Time Series. MIT Expresszh_TW
dc.relation.reference (參考文獻) 【18】 Claude E. Shannon. 1948 . A Mathematical Theory of Communication. Bell System Technical Journal, vol. 27, pp. 379-423 and 623-656zh_TW
dc.relation.reference (參考文獻) 【19】 Jiawei Han, Micheline Kamber. 2001. Data Mining: Concepts and Techniqueszh_TW
dc.relation.reference (參考文獻) 【20】 Gordon S. Linoff, Michael J. A. Berry, Michael J. A. Berry . 2001. Mining the Web: Transforming Customer Data.zh_TW
dc.relation.reference (參考文獻) 【21】 Paul Viola, Michael Jones. 2001. Rapid Object Detection using a Boosted Cascade of Simple Features. Proceedings IEEE Conf. on Computer Vision and Pattern Recognitionzh_TW
dc.relation.reference (參考文獻) 【22】 E.S. Bigun, J.Bigün, B. Duc, S. Fischer. 1997. Expert conciliation for multi modal person authentication systems by Bayesian statistics, Audio and Video based Person Authentication - AVBPA97zh_TW
dc.relation.reference (參考文獻) 【23】 P. Verlinde, G. Chollet, and M. Acheroy. 2000. Multi-modal identity verification using expert fusion. Information Fusion, 1:17--33zh_TW
dc.relation.reference (參考文獻) 【24】 Conrad Sanderson, 2002, “Information fusion and person verification using speech & face information”, IDIAP–RR 02-33zh_TW
dc.relation.reference (參考文獻) 【25】 Arun Ross, Anil Jain, Jian-Zhong Qian. 2001. Information Fusion in Biometrics. Lecture Notes in Computer Sciencezh_TW
dc.relation.reference (參考文獻) 【26】 Metropolis,N., A. Rosenbluth, M. Rosenbluth, A. Teller, E. Teller, 1953,"Equation of State Calculations by Fast Computing Machines", J. Chem. Phys.,21, 6, 1087-1092,zh_TW