Please use this identifier to cite or link to this item:
https://ah.lib.nccu.edu.tw/handle/140.119/135531
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Computer Science | |
dc.creator | 黃瀚萱 | |
dc.creator | Huang, Hen-Hsen | |
dc.creator | Lin, Wei-Rou | |
dc.creator | Chen, Hsin-Hsi | |
dc.date | 2020-06 | |
dc.date.accessioned | 2021-06-04T06:45:27Z | - |
dc.date.available | 2021-06-04T06:45:27Z | - |
dc.date.issued | 2021-06-04T06:45:27Z | - |
dc.identifier.uri | http://nccur.lib.nccu.edu.tw/handle/140.119/135531 | - |
dc.description.abstract | This paper introduces visual story ordering, a challenging task in which images and text are ordered in a visual story jointly. We propose a neural network model based on the reader-processor-writer architecture with a self-attention mechanism. A novel bidirectional decoder is further proposed with bidirectional beam search. Experimental results show the effectiveness of the approach. The information gained from multimodal learning is presented and discussed. We also find that the proposed embedding narrows the distance between images and their corresponding story sentences, even though we do not align the two modalities explicitly. As it addresses a general issue in generative models, the proposed bidirectional inference mechanism applies to a variety of applications. | |
dc.format.extent | 1744655 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.relation | Proceedings of the 2020 International Conference on Multimedia Retrieval (ICMR ’20), Association for Computing Machinery, pp. 326-330 | |
dc.subject | Multimodal modeling; temporal information ordering; sentence ordering; visual-semantic representation | |
dc.title | Visual Story Ordering with a Bidirectional Writer | |
dc.type | conference | |
dc.identifier.doi | 10.1145/3372278.3390735 | |
dc.doi.uri | https://doi.org/10.1145/3372278.3390735 | |
item.fulltext | With Fulltext | - |
item.openairetype | conference | - |
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | - |
item.cerifentitytype | Publications | - |
item.grantfulltext | restricted | - |
Appears in Collections: | Conference Papers |