影片動跡剪輯 | Publication

Publications-Theses

Article View/Open

pdf(833)pdf(1148)pdf(855)pdf(701)pdf(1110)pdf(934)pdf(1202)pdf(936)pdf(820)pdf(900)pdf(671)

Publication Export

Google Scholar^TM

Title	影片動跡剪輯
Creator	王智潁 Wang, Chih-Ying
Contributor	何瑁鎧 Hor, Maw-Kae 王智潁 Wang, Chih-Ying
Key Words	視訊處理影片剪輯全域動作計算 video process video editing global motion estimation
Date	2004
Date Issued	17-Sep-2009 14:06:44 (UTC+8)
Summary	「動跡剪輯」是將多個不同內容的影片片段，根據影片中特定物體移動的關係，剪接成新的影片，使得產生的新影片能維持動作連貫及流暢的特性。本論文提出一套方法能夠自動找尋不同影片間相似的剪輯點，作為「動跡剪輯」的參考。此方法之重點在於建立影片的時空資訊，作為找尋剪輯點的依據。建立影片時空資訊的過程中，我們先將影片依偵測出的鏡頭轉換點分割成不同的影片片段，再將影片片段中前景物件的位置、大小與動作等資訊分離而成影片物件平面，並結合影片片段中的背景動作資訊與影片物件平面資訊，成為該影片片段之時空資訊，從而進行剪輯點之找尋與比對，擇其最佳點進行剪輯。運用影片時空資訊於找尋影片間之剪輯點時，是以影片物件平面作為搜尋單位，此方式有助於提升結果的正確性，同時也提供了搜尋時的靈活度。 With the rapid increasing of the multimedia applications in modern commercial and movie business, it becomes more desirable to have efficient video editing tools. However, conventional video editing requires too many manual interventions that reduce productivities as well as opportunities in better performance. In this thesis, we propose a MOtion-based Video Editing (MOVE) mechanism that can automatically select the most similar or suitable transition points from a given set of raw videos. A given video can be divided into a set of video clips using a shot detection algorithm. For each video clip, we provide an algorithm that can separate the global motions as well as the local motions using the principles of video object plane and accumulated difference. We introduce the concept of spatio-temporal information, a condensed information that associated with a video clip. We can use this information in finding a good video editing point. Since the spatio-temporal information is a concise representation of a video clip, searching in this domain will reduce the complexity of the problem and achieve better performance. We implemented our mechanism with successful experiments.
參考文獻	[1] Dufaux, F. and Konrad, J., “Efficient, robust, and fast global motion estimation for video coding,” IEEE Trans. Image Processing, Vol. 9, No. 3, , pp. 497 – 501, Mar. 2000. [2] Wang Y. A. and Edward H. A., “Spatio-temporal segmentation of video data,” in Proc. SPIE, Image and Video Processing II, Vol. 2182, Feb. 1994. [3] Daniel D., David D., “Video retrieval using spatio-temporal descriptors,” in Proc. ACM Conf. on Multimedia, pp. 508 – 517, Nov. 2003. [4] Del B., A., Pala, P., Tanganelli, L., “Video retrieval based on dynamics of color flows,” in Proc. Int. Conf. Pattern Recognition, Vol. 1, pp. 851 – 854, Sept. 2000. [5] Nummiaro K., Koller-Meier E. and Van G. L., “Color features for tracking non-rigid objects,” Special Issue on Visual Surveillance, Chinese Journal of Automation, Vol. 29, No. 3, pp 345-355, May 2003. [6] Forsyth D. A. and Ponce J., “Computer vision: A modern approach,” Pearson Education, part I-part IV, 2003. [7] Dufaux, F. and Moscheni, F., “Motion estimation techniques for digital TV: A review and a new contribution,” in Proc. IEEE, vol.83, pp.858-879, Jun. 1995. [8] Tse Y. T. and Baker R. L., “Global zoom/pan estimation and compensation for video compression,” in IEEE Proc. ICASSP’91, vol. IV, pp.2725-2728, May 1991. [9] Moscheni, F., Dufaux, F. and Kunt, M., “A new two-stage motion estimation based on a background/foreground segmentation,” in IEEE Proc. ICASSP’95, pp. 2261-2264, Detroit, MI, May 1995. [10] Ebrahimi T., “MPEG-4 video verification model: A video encoding/decoding algorithm based on content representation,” Signal Proc. Image Communication 9, No. 4, pp. 367-384, May 1997. [11] Sikora T., “The MPEG-4 video standard verification model,” IEEE Trans. CSVT, Vol.7, No.1, Feb. 1997. [12] Gonzalez R. C. and Woods R. E., “Digital image processing,” Prentice Hall, chapter7, pp.349-403, 2001. [13] Grigoriu, L., “Spatio-temporal compression of the motion field in video coding,” IEEE Workshop on Multimedia Signal Proc., pp. 129 – 134, Oct. 2001. [14] Lai-Man P. and Wing-Chung M., “A novel four-step search algorithm for fast block motion estimation,” IEEE Trans. CSVT, Vol. 6, No. 3, pp. 313-317, Jun. 1996. [15] Richard S., “Image Mosaicing for tele-reality applications,” Cambridge Research Laboratory Technical Report Series, May 1994. [16] Deng Y. and Manjunath B. S., “Content-based search of video using color, texture and motion,” in Proc. IEEE Int. Conf. Image Processing, volume 2, pp. 534–537, 1997. [17] Belongie S., Carson C., Greenspan H. and Malik J., “Color and texture-based image segmentation using em and its application to content based image retrieval,” in Proc. of Int. Conf. on Computer Vision, pp. 675-682, 1998. [18] Greenspan H. and Goldberger J., “Probabilistic space-time video modeling via piecewise GMM,” IEEE Pattern Analysis and Machine Intelligence, vol. 26, no. 3, 2004. [19] Memin E. and Perez P., “Dense estimation and object-based segmentation of the optical flow with robust techniques,” IEEE Trans. Image Processing, Vol. 7, No. 5, pp. 703 – 719, May 1998. [20] Eric B. and St´ephane M., “Nonlinear temporal modeling for motion-based video overviewing,” in Proc. European Conf. on Content-based Multimedia Indexing, Sept., 2003. [21] Ianeva T., Vries A. P. de and Westerveld. T., “A dynamic probabilistic retrieval model,” IEEE Int. Conf. on Multimedia and Expo (ICME), 2004. [22] Cheung, S.-C.S. and Zakhor A., “Video similarity detection with video signature clustering,” in Proc. of International Conf. on Image Processing, Vol. 2, pp. 649 – 652, Oct. 2001. [23] Bregler C., “Learning and recognizing human dynamics in video sequences,” in IEEE CVPR, June 1997.
Description	國立政治大學資訊科學學系 91753016 93
資料來源	http://thesis.lib.nccu.edu.tw/record/#G0917530161
Type	thesis

dc.contributor.advisor	何瑁鎧	zh_TW
dc.contributor.advisor	Hor, Maw-Kae	en_US
dc.contributor.author (Authors)	王智潁	zh_TW
dc.contributor.author (Authors)	Wang, Chih-Ying	en_US
dc.creator (作者)	王智潁	zh_TW
dc.creator (作者)	Wang, Chih-Ying	en_US
dc.date (日期)	2004	en_US
dc.date.accessioned	17-Sep-2009 14:06:44 (UTC+8)	-
dc.date.available	17-Sep-2009 14:06:44 (UTC+8)	-
dc.date.issued (上傳時間)	17-Sep-2009 14:06:44 (UTC+8)	-
dc.identifier (Other Identifiers)	G0917530161	en_US
dc.identifier.uri (URI)	https://nccur.lib.nccu.edu.tw/handle/140.119/32711	-
dc.description (描述)	國立政治大學	zh_TW
dc.description (描述)	資訊科學學系	zh_TW
dc.description (描述)	91753016	zh_TW
dc.description (描述)	93	zh_TW
dc.description.abstract (摘要)	「動跡剪輯」是將多個不同內容的影片片段，根據影片中特定物體移動的關係，剪接成新的影片，使得產生的新影片能維持動作連貫及流暢的特性。本論文提出一套方法能夠自動找尋不同影片間相似的剪輯點，作為「動跡剪輯」的參考。此方法之重點在於建立影片的時空資訊，作為找尋剪輯點的依據。建立影片時空資訊的過程中，我們先將影片依偵測出的鏡頭轉換點分割成不同的影片片段，再將影片片段中前景物件的位置、大小與動作等資訊分離而成影片物件平面，並結合影片片段中的背景動作資訊與影片物件平面資訊，成為該影片片段之時空資訊，從而進行剪輯點之找尋與比對，擇其最佳點進行剪輯。運用影片時空資訊於找尋影片間之剪輯點時，是以影片物件平面作為搜尋單位，此方式有助於提升結果的正確性，同時也提供了搜尋時的靈活度。	zh_TW
dc.description.abstract (摘要)	With the rapid increasing of the multimedia applications in modern commercial and movie business, it becomes more desirable to have efficient video editing tools. However, conventional video editing requires too many manual interventions that reduce productivities as well as opportunities in better performance. In this thesis, we propose a MOtion-based Video Editing (MOVE) mechanism that can automatically select the most similar or suitable transition points from a given set of raw videos. A given video can be divided into a set of video clips using a shot detection algorithm. For each video clip, we provide an algorithm that can separate the global motions as well as the local motions using the principles of video object plane and accumulated difference. We introduce the concept of spatio-temporal information, a condensed information that associated with a video clip. We can use this information in finding a good video editing point. Since the spatio-temporal information is a concise representation of a video clip, searching in this domain will reduce the complexity of the problem and achieve better performance. We implemented our mechanism with successful experiments.	en_US
dc.description.tableofcontents	第一章緒論 1.1 簡介 1.2 問題描述 1.3 研究動機 1.4 本論文的貢獻 1.5 章節架構第二章相關研究第三章影片時空資訊的取得 3.1 影片物件平面 3.2 時空資訊 3.3 動作偵測 3.3.1 動作模型 3.3.2 梯度下降法 3.3.3 全域動作偵測 3.3.4 區域動作偵測 3.4 影片物件平面建立 3.4.1 前景與背景分離 3.4.2 像素分群 3.4.3 物件平面追蹤 3.5 時空資訊與動跡剪輯第四章動跡剪輯系統架構 4.1 鏡頭轉換偵測 4.2 時空資訊建立 4.3 初步影片比較 4.4 影片配對評分 4.5 複雜度分析第五章實驗 5.1 實驗一：找尋影片間的動跡剪輯點 5.2 實驗二：全域動作偵測與鏡頭轉換偵測第六章總結 6.1 結論 6.2 未來展望參考文獻	zh_TW
dc.format.extent	49485 bytes	-
dc.format.extent	77270 bytes	-
dc.format.extent	67590 bytes	-
dc.format.extent	75121 bytes	-
dc.format.extent	212198 bytes	-
dc.format.extent	106151 bytes	-
dc.format.extent	530768 bytes	-
dc.format.extent	276728 bytes	-
dc.format.extent	785658 bytes	-
dc.format.extent	114683 bytes	-
dc.format.extent	52168 bytes	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.language.iso	en_US	-
dc.source.uri (資料來源)	http://thesis.lib.nccu.edu.tw/record/#G0917530161	en_US
dc.subject (關鍵詞)	視訊處理	zh_TW
dc.subject (關鍵詞)	影片剪輯	zh_TW
dc.subject (關鍵詞)	全域動作計算	zh_TW
dc.subject (關鍵詞)	video process	en_US
dc.subject (關鍵詞)	video editing	en_US
dc.subject (關鍵詞)	global motion estimation	en_US
dc.title (題名)	影片動跡剪輯	zh_TW
dc.type (資料類型)	thesis	en
dc.relation.reference (參考文獻)	[1] Dufaux, F. and Konrad, J., “Efficient, robust, and fast global motion estimation for video coding,” IEEE Trans. Image Processing, Vol. 9, No. 3, , pp. 497 – 501, Mar. 2000.	zh_TW
dc.relation.reference (參考文獻)	[2] Wang Y. A. and Edward H. A., “Spatio-temporal segmentation of video data,” in Proc. SPIE, Image and Video Processing II, Vol. 2182, Feb. 1994.	zh_TW
dc.relation.reference (參考文獻)	[3] Daniel D., David D., “Video retrieval using spatio-temporal descriptors,” in Proc. ACM Conf. on Multimedia, pp. 508 – 517, Nov. 2003.	zh_TW
dc.relation.reference (參考文獻)	[4] Del B., A., Pala, P., Tanganelli, L., “Video retrieval based on dynamics of color flows,” in Proc. Int. Conf. Pattern Recognition, Vol. 1, pp. 851 – 854, Sept. 2000.	zh_TW
dc.relation.reference (參考文獻)	[5] Nummiaro K., Koller-Meier E. and Van G. L., “Color features for tracking non-rigid objects,” Special Issue on Visual Surveillance, Chinese Journal of Automation, Vol. 29, No. 3, pp 345-355, May 2003.	zh_TW
dc.relation.reference (參考文獻)	[6] Forsyth D. A. and Ponce J., “Computer vision: A modern approach,” Pearson Education, part I-part IV, 2003.	zh_TW
dc.relation.reference (參考文獻)	[7] Dufaux, F. and Moscheni, F., “Motion estimation techniques for digital TV: A review and a new contribution,” in Proc. IEEE, vol.83, pp.858-879, Jun. 1995.	zh_TW
dc.relation.reference (參考文獻)	[8] Tse Y. T. and Baker R. L., “Global zoom/pan estimation and compensation for video compression,” in IEEE Proc. ICASSP’91, vol. IV, pp.2725-2728, May 1991.	zh_TW
dc.relation.reference (參考文獻)	[9] Moscheni, F., Dufaux, F. and Kunt, M., “A new two-stage motion estimation based on a background/foreground segmentation,” in IEEE Proc. ICASSP’95, pp. 2261-2264, Detroit, MI, May 1995.	zh_TW
dc.relation.reference (參考文獻)	[10] Ebrahimi T., “MPEG-4 video verification model: A video encoding/decoding algorithm based on content representation,” Signal Proc. Image Communication 9, No. 4, pp. 367-384, May 1997.	zh_TW
dc.relation.reference (參考文獻)	[11] Sikora T., “The MPEG-4 video standard verification model,” IEEE Trans. CSVT, Vol.7, No.1, Feb. 1997.	zh_TW
dc.relation.reference (參考文獻)	[12] Gonzalez R. C. and Woods R. E., “Digital image processing,” Prentice Hall, chapter7, pp.349-403, 2001.	zh_TW
dc.relation.reference (參考文獻)	[13] Grigoriu, L., “Spatio-temporal compression of the motion field in video coding,” IEEE Workshop on Multimedia Signal Proc., pp. 129 – 134, Oct. 2001.	zh_TW
dc.relation.reference (參考文獻)	[14] Lai-Man P. and Wing-Chung M., “A novel four-step search algorithm for fast block motion estimation,” IEEE Trans. CSVT, Vol. 6, No. 3, pp. 313-317, Jun. 1996.	zh_TW
dc.relation.reference (參考文獻)	[15] Richard S., “Image Mosaicing for tele-reality applications,” Cambridge Research Laboratory Technical Report Series, May 1994.	zh_TW
dc.relation.reference (參考文獻)	[16] Deng Y. and Manjunath B. S., “Content-based search of video using color, texture and motion,” in Proc. IEEE Int. Conf. Image Processing, volume 2, pp. 534–537, 1997.	zh_TW
dc.relation.reference (參考文獻)	[17] Belongie S., Carson C., Greenspan H. and Malik J., “Color and texture-based image segmentation using em and its application to content based image retrieval,” in Proc. of Int. Conf. on Computer Vision, pp. 675-682, 1998.	zh_TW
dc.relation.reference (參考文獻)	[18] Greenspan H. and Goldberger J., “Probabilistic space-time video modeling via piecewise GMM,” IEEE Pattern Analysis and Machine Intelligence, vol. 26, no. 3, 2004.	zh_TW
dc.relation.reference (參考文獻)	[19] Memin E. and Perez P., “Dense estimation and object-based segmentation of the optical flow with robust techniques,” IEEE Trans. Image Processing, Vol. 7, No. 5, pp. 703 – 719, May 1998.	zh_TW
dc.relation.reference (參考文獻)	[20] Eric B. and St´ephane M., “Nonlinear temporal modeling for motion-based video overviewing,” in Proc. European Conf. on Content-based Multimedia Indexing, Sept., 2003.	zh_TW
dc.relation.reference (參考文獻)	[21] Ianeva T., Vries A. P. de and Westerveld. T., “A dynamic probabilistic retrieval model,” IEEE Int. Conf. on Multimedia and Expo (ICME), 2004.	zh_TW
dc.relation.reference (參考文獻)	[22] Cheung, S.-C.S. and Zakhor A., “Video similarity detection with video signature clustering,” in Proc. of International Conf. on Image Processing, Vol. 2, pp. 649 – 652, Oct. 2001.	zh_TW
dc.relation.reference (參考文獻)	[23] Bregler C., “Learning and recognizing human dynamics in video sequences,” in IEEE CVPR, June 1997.	zh_TW

Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

Google Scholar^TM