基於注意力與多模式分析之數位相片管理系統設計與實作 | Publication

Publications-Theses

Article View/Open

pdf(727)pdf(789)pdf(748)pdf(655)pdf(839)pdf(843)pdf(756)pdf(759)pdf(958)pdf(2400)pdf(3326)pdf(1475)pdf(1162)pdf(937)pdf(914)pdf(764)

Publication Export

Google Scholar^TM

題名	基於注意力與多模式分析之數位相片管理系統設計與實作 Design and implementation of a multi-modal attention-based photo manager
作者	孫新民
貢獻者	廖文宏孫新民
關鍵詞	電腦視覺多模式影像處理人工智慧
日期	2004
上傳時間	19-Sep-2009 12:09:23 (UTC+8)
摘要	本論文敘述對於智慧型個人數位相片管理瀏覽平台之研究、設計與實作過程。系統設計上基於整合多重證據架構，採用影像內容與使用者瀏覽行為之分析作為自動分類，判斷影像重要性與推薦程度的依據。影像自動分類方面，包括外部給予的標準資訊-EXIF資訊與分析影像內容，以其中人物存在數量與面積比例為依據的影像分類。而在影像的推薦方面，則採用影像品質之分析-包括對焦品質分析、曝光品質分析-與分析使用者瀏覽相片時的行為-包括停留時間與專注程度的整合為分析重要程度依據；最後則採用多模式(Multi-Modal)架構整合不同的評估結果並作為推薦的結論。 In this thesis, we present the design and implementation of an intelligent personal digital photo browsing platform. The proposed system relies on multiple evidences inferred from image content as well as user behavior. Specifically, external EXIF data and face detection results are utilized to coarsely classify the digital images. Measures of image quality, including clarity and contrast, are calculated to further refine the search result. Moreover, we use web cameras to record and analyze the viewing behavior of the user and attempt to correlate the interest of the viewer to the effective viewing time. Finally, a multi-modal system is put in place to integrate the clues acquired from different modules.
參考文獻	【1】 Richard Shim.，「影像左右快閃記憶卡命運」，CNET新聞專區，2004年，http://taiwan.cnet.com/news/ce/0,2000062982,20087086,00.htm 【2】 Kerry Rodden, Kenneth R. Wood. 2003. How Do People Manage Their Digital Photographs? CHI 2003: NEW HORIZONS. Volume No. 5, Issue No. 1 【3】 Hyunmo Kang, Ben Shneiderman.2002. Visualization Methods for Personal Photo Collections:Browsing and Searching in the PhotoFinder. Department of Computer Science, Human-Computer Interaction Laboratory 【4】 Adobe Systems Incorporated, http://www.pacific.adobe.com/products/photoshopalbum/overview.html 【5】 Ullas Gargi, Yining Deng, Daniel R. Tretter. 2002. Managing and Searching Personal Photo Collections. HP Laboratories Palo Alto 【6】 Lynette Hirschman.1999. Intelligent Human-Computer Interfaces. The Edge Volume 3, Number 4 【7】 P. Maes, T. Darrell, B. Blumberg, A. Pentland. 1995. The ALIVE system: full-body interaction with autonomous agents. Computer Animation`95 . 【8】許聞廉、陳克健，「自然智慧型輸入系統的語意分析─脈絡會意法」,1993年,Proceedings of the 6th International Symposium on Cognitive Aspects of the Chinese Language, (1993), 527-540. 【9】 Japan Electronics and Information Technology Industries Association . Exchangeable image file format for digital still cameras : Exif Version 2.2 【10】 TsuruZohTachibanaya..Description of Exif file format. 2001. http://park2.wakwak.com/~tsuruzoh/Computer/Digicams/exif-e.html#AboutExif 【11】 Stuart Russell ,Peter Norvig. 2002. Artificial Intelligence: A Modern Approach Second Edition. Prentice Hall. 【12】 Sanjay Kr. Singh, D. S. Chauhan, Mayank Vatsa, Richa Singh. 2003. A Robust Skin Color Based Face Detection Algorithm. Tamkang Journal of Science and Engineering, Vol. 6, No. 4, pp. 227-234 【13】 Y. Gong and M. Sakauchi, "Detection of regions matching specified chromatic features", Computer Vision and Image Understanding, vol. 61, no. 2, 1995, pp 263 - 269 【14】 Goldennumer.Net, “The human face is based entirely on Phi”, http://www.goldennumber.net/face.htm 【15】 Zhou Wang, Alan C. Bovik, 2002 “WHY IS IMAGE QUALITY ASSESSMENT SO DIFFICULT?”, IEEE International Conference on Acoustics, Speech, & Signal Processing 【16】 Zhou Wang, Alan C. Bovik. 2002. A Universal Image Quality Index. IEEE Signal Processing Letters, vol. 9, no. 3, pp. 81-84 【17】 Norbert Wiener. 1942. Extrapolation, Interpolation, and Smoothing of Stationary Time Series. MIT Express 【18】 Claude E. Shannon. 1948 . A Mathematical Theory of Communication. Bell System Technical Journal, vol. 27, pp. 379-423 and 623-656 【19】 Jiawei Han, Micheline Kamber. 2001. Data Mining: Concepts and Techniques 【20】 Gordon S. Linoff, Michael J. A. Berry, Michael J. A. Berry . 2001. Mining the Web: Transforming Customer Data. 【21】 Paul Viola, Michael Jones. 2001. Rapid Object Detection using a Boosted Cascade of Simple Features. Proceedings IEEE Conf. on Computer Vision and Pattern Recognition 【22】 E.S. Bigun, J.Bigün, B. Duc, S. Fischer. 1997. Expert conciliation for multi modal person authentication systems by Bayesian statistics, Audio and Video based Person Authentication - AVBPA97 【23】 P. Verlinde, G. Chollet, and M. Acheroy. 2000. Multi-modal identity verification using expert fusion. Information Fusion, 1:17--33 【24】 Conrad Sanderson, 2002, “Information fusion and person verification using speech & face information”, IDIAP–RR 02-33 【25】 Arun Ross, Anil Jain, Jian-Zhong Qian. 2001. Information Fusion in Biometrics. Lecture Notes in Computer Science 【26】 Metropolis,N., A. Rosenbluth, M. Rosenbluth, A. Teller, E. Teller, 1953,"Equation of State Calculations by Fast Computing Machines", J. Chem. Phys.,21, 6, 1087-1092,
描述	碩士國立政治大學資訊科學學系 90753012 93
資料來源	http://thesis.lib.nccu.edu.tw/record/#G0090753012
資料類型	thesis

dc.contributor.advisor	廖文宏	zh_TW
dc.contributor.author (Authors)	孫新民	zh_TW
dc.creator (作者)	孫新民	zh_TW
dc.date (日期)	2004	en_US
dc.date.accessioned	19-Sep-2009 12:09:23 (UTC+8)	-
dc.date.available	19-Sep-2009 12:09:23 (UTC+8)	-
dc.date.issued (上傳時間)	19-Sep-2009 12:09:23 (UTC+8)	-
dc.identifier (Other Identifiers)	G0090753012	en_US
dc.identifier.uri (URI)	https://nccur.lib.nccu.edu.tw/handle/140.119/37101	-
dc.description (描述)	碩士	zh_TW
dc.description (描述)	國立政治大學	zh_TW
dc.description (描述)	資訊科學學系	zh_TW
dc.description (描述)	90753012	zh_TW
dc.description (描述)	93	zh_TW
dc.description.abstract (摘要)	本論文敘述對於智慧型個人數位相片管理瀏覽平台之研究、設計與實作過程。系統設計上基於整合多重證據架構，採用影像內容與使用者瀏覽行為之分析作為自動分類，判斷影像重要性與推薦程度的依據。影像自動分類方面，包括外部給予的標準資訊-EXIF資訊與分析影像內容，以其中人物存在數量與面積比例為依據的影像分類。而在影像的推薦方面，則採用影像品質之分析-包括對焦品質分析、曝光品質分析-與分析使用者瀏覽相片時的行為-包括停留時間與專注程度的整合為分析重要程度依據；最後則採用多模式(Multi-Modal)架構整合不同的評估結果並作為推薦的結論。	zh_TW
dc.description.abstract (摘要)	In this thesis, we present the design and implementation of an intelligent personal digital photo browsing platform. The proposed system relies on multiple evidences inferred from image content as well as user behavior. Specifically, external EXIF data and face detection results are utilized to coarsely classify the digital images. Measures of image quality, including clarity and contrast, are calculated to further refine the search result. Moreover, we use web cameras to record and analyze the viewing behavior of the user and attempt to correlate the interest of the viewer to the effective viewing time. Finally, a multi-modal system is put in place to integrate the clues acquired from different modules.	en_US
dc.description.tableofcontents	第一章緒論 1 1.1數位影像普及化所造成的管理問題與目前的解決方案 1 1.2智慧型人機介面 3 1.3智慧型數位影像管理系統平台 5 第二章影像內容資訊分析 9 2.1影像內容資訊分析概觀 9 2.2數位相機與拍攝參數資訊-EXIF簡介 10 2.3以人物為基礎的數位相片分類 13 第三章數位影像品質參數分析 25 3.1數位影像品質分析概觀 25 3.2偵測成像結果品質之評估演算法 27 3.3偵測影像對比程度對於數位相片品質影響之評估演算法 33 第四章使用者行為參數分析 37 4.1使用者行為分析概論 37 4.2使用者行為與專注程度分析 39 第五章多模式與資訊融合 51 5.1多模式(Multi-Modal)與資訊融合(Information Fusion)概論 51 5.2決策核心設計 55 5.3系統整合 60 第六章系統實作 62 6.1系統實作 62 6.2資料庫設計與實作 64 第七章結論與未來發展 67 7.1 結論 67 7.2 未來發展 69 參考文獻 71	zh_TW
dc.format.extent	98777 bytes	-
dc.format.extent	106434 bytes	-
dc.format.extent	104322 bytes	-
dc.format.extent	61791 bytes	-
dc.format.extent	121485 bytes	-
dc.format.extent	121150 bytes	-
dc.format.extent	98223 bytes	-
dc.format.extent	96164 bytes	-
dc.format.extent	1327653 bytes	-
dc.format.extent	1346117 bytes	-
dc.format.extent	1712700 bytes	-
dc.format.extent	416345 bytes	-
dc.format.extent	308738 bytes	-
dc.format.extent	1389671 bytes	-
dc.format.extent	828192 bytes	-
dc.format.extent	163279 bytes	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.language.iso	en_US	-
dc.source.uri (資料來源)	http://thesis.lib.nccu.edu.tw/record/#G0090753012	en_US
dc.subject (關鍵詞)	電腦視覺	zh_TW
dc.subject (關鍵詞)	多模式	zh_TW
dc.subject (關鍵詞)	影像處理	zh_TW
dc.subject (關鍵詞)	人工智慧	zh_TW
dc.title (題名)	基於注意力與多模式分析之數位相片管理系統設計與實作	zh_TW
dc.title (題名)	Design and implementation of a multi-modal attention-based photo manager	en_US
dc.type (資料類型)	thesis	en
dc.relation.reference (參考文獻)	【1】 Richard Shim.，「影像左右快閃記憶卡命運」，CNET新聞專區，2004年，http://taiwan.cnet.com/news/ce/0,2000062982,20087086,00.htm	zh_TW
dc.relation.reference (參考文獻)	【2】 Kerry Rodden, Kenneth R. Wood. 2003. How Do People Manage Their Digital Photographs? CHI 2003: NEW HORIZONS. Volume No. 5, Issue No. 1	zh_TW
dc.relation.reference (參考文獻)	【3】 Hyunmo Kang, Ben Shneiderman.2002. Visualization Methods for Personal Photo Collections:Browsing and Searching in the PhotoFinder. Department of Computer Science, Human-Computer Interaction Laboratory	zh_TW
dc.relation.reference (參考文獻)	【4】 Adobe Systems Incorporated, http://www.pacific.adobe.com/products/photoshopalbum/overview.html	zh_TW
dc.relation.reference (參考文獻)	【5】 Ullas Gargi, Yining Deng, Daniel R. Tretter. 2002. Managing and Searching Personal Photo Collections. HP Laboratories Palo Alto	zh_TW
dc.relation.reference (參考文獻)	【6】 Lynette Hirschman.1999. Intelligent Human-Computer Interfaces. The Edge Volume 3, Number 4	zh_TW
dc.relation.reference (參考文獻)	【7】 P. Maes, T. Darrell, B. Blumberg, A. Pentland. 1995. The ALIVE system: full-body interaction with autonomous agents. Computer Animation`95 .	zh_TW
dc.relation.reference (參考文獻)	【8】許聞廉、陳克健，「自然智慧型輸入系統的語意分析─脈絡會意法」,1993年,Proceedings of the 6th International Symposium on Cognitive Aspects of the Chinese Language, (1993), 527-540.	zh_TW
dc.relation.reference (參考文獻)	【9】 Japan Electronics and Information Technology Industries Association . Exchangeable image file format for digital still cameras : Exif Version 2.2	zh_TW
dc.relation.reference (參考文獻)	【10】 TsuruZohTachibanaya..Description of Exif file format. 2001. http://park2.wakwak.com/~tsuruzoh/Computer/Digicams/exif-e.html#AboutExif	zh_TW
dc.relation.reference (參考文獻)	【11】 Stuart Russell ,Peter Norvig. 2002. Artificial Intelligence: A Modern Approach Second Edition. Prentice Hall.	zh_TW
dc.relation.reference (參考文獻)	【12】 Sanjay Kr. Singh, D. S. Chauhan, Mayank Vatsa, Richa Singh. 2003. A Robust Skin Color Based Face Detection Algorithm. Tamkang Journal of Science and Engineering, Vol. 6, No. 4, pp. 227-234	zh_TW
dc.relation.reference (參考文獻)	【13】 Y. Gong and M. Sakauchi, "Detection of regions matching specified chromatic features", Computer Vision and Image Understanding, vol. 61, no. 2, 1995, pp 263 - 269	zh_TW
dc.relation.reference (參考文獻)	【14】 Goldennumer.Net, “The human face is based entirely on Phi”, http://www.goldennumber.net/face.htm	zh_TW
dc.relation.reference (參考文獻)	【15】 Zhou Wang, Alan C. Bovik, 2002 “WHY IS IMAGE QUALITY ASSESSMENT SO DIFFICULT?”, IEEE International Conference on Acoustics, Speech, & Signal Processing	zh_TW
dc.relation.reference (參考文獻)	【16】 Zhou Wang, Alan C. Bovik. 2002. A Universal Image Quality Index. IEEE Signal Processing Letters, vol. 9, no. 3, pp. 81-84	zh_TW
dc.relation.reference (參考文獻)	【17】 Norbert Wiener. 1942. Extrapolation, Interpolation, and Smoothing of Stationary Time Series. MIT Express	zh_TW
dc.relation.reference (參考文獻)	【18】 Claude E. Shannon. 1948 . A Mathematical Theory of Communication. Bell System Technical Journal, vol. 27, pp. 379-423 and 623-656	zh_TW
dc.relation.reference (參考文獻)	【19】 Jiawei Han, Micheline Kamber. 2001. Data Mining: Concepts and Techniques	zh_TW
dc.relation.reference (參考文獻)	【20】 Gordon S. Linoff, Michael J. A. Berry, Michael J. A. Berry . 2001. Mining the Web: Transforming Customer Data.	zh_TW
dc.relation.reference (參考文獻)	【21】 Paul Viola, Michael Jones. 2001. Rapid Object Detection using a Boosted Cascade of Simple Features. Proceedings IEEE Conf. on Computer Vision and Pattern Recognition	zh_TW
dc.relation.reference (參考文獻)	【22】 E.S. Bigun, J.Bigün, B. Duc, S. Fischer. 1997. Expert conciliation for multi modal person authentication systems by Bayesian statistics, Audio and Video based Person Authentication - AVBPA97	zh_TW
dc.relation.reference (參考文獻)	【23】 P. Verlinde, G. Chollet, and M. Acheroy. 2000. Multi-modal identity verification using expert fusion. Information Fusion, 1:17--33	zh_TW
dc.relation.reference (參考文獻)	【24】 Conrad Sanderson, 2002, “Information fusion and person verification using speech & face information”, IDIAP–RR 02-33	zh_TW
dc.relation.reference (參考文獻)	【25】 Arun Ross, Anil Jain, Jian-Zhong Qian. 2001. Information Fusion in Biometrics. Lecture Notes in Computer Science	zh_TW
dc.relation.reference (參考文獻)	【26】 Metropolis,N., A. Rosenbluth, M. Rosenbluth, A. Teller, E. Teller, 1953,"Equation of State Calculations by Fast Computing Machines", J. Chem. Phys.,21, 6, 1087-1092,	zh_TW

Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

Google Scholar^TM