學術產出-學位論文

文章檢視/開啟

書目匯出

Google ScholarTM

政大圖書館

引文資訊

TAIR相關學術產出

題名 透過眼動選擇大型語言模型建議達成符合使用者意圖之人工智慧協作擴增實境筆記系統
Co-Piloted AR Note-Taking via Gaze Selection of LLM Suggestions to Match Users’ Intentions
作者 裘世綱
Chiu, Shih Kang
貢獻者 蔡欣叡
Tsai, Hsin-Ruey
裘世綱
Chiu, Shih Kang
關鍵詞 擴增實境
人工智慧
大型語言模型
眼動追蹤
Augmented Reality
Artificial Intelligence
Large Language Model
Eye Tracking
日期 2024
上傳時間 4-九月-2024 15:00:19 (UTC+8)
摘要 記錄筆記在聽演講和與人討論時至關重要,不僅用於幫助我們紀錄摘要和總結重點,還會在提問環節中用於提醒想提出的問題或提醒在討論時的發言。然而,以手機打字的筆記方式可能會分散使用者注意力並增加使用者的心智負擔。儘管大型語言模型(LLM)能夠被用於自動生成演講或討論的摘要和重點,但如果直接由人工智慧(AI)生成,沒有使用者參與或互動,得到的結果未必能符合使用者的意圖。因此,我們提出GazeNoter,一個人工智慧協作擴增實境(AR)系統,讓使用者透過AR頭戴裝置的眼動追蹤,快速選擇LLM生成的建議,以達成實時筆記。GazeNoter利用AR頭戴裝置作為媒介,讓使用者能夠快速給予LLM回饋以產出更符合其意圖之筆記,形成一個使用者參與式的AI系統,且不僅可產出演講與討論的內容以內的筆記,甚至能將筆記延伸至內容之外。我們進行了兩項使用者研究,分別驗證GazeNoter在聽演講的靜態情境和行走會議的移動情境下進行即時筆記的可用性。
Note-taking is critical during speeches and discussions, serving not only for later summarization and organization but also for real-time question and opinion reminding in question-and-answer sessions or timely contributions in discussions. Manually typing on smartphones for note-taking could be distracting and increase cognitive load for users. While large language models (LLMs) are used to automatically generate summaries and highlights, the con- tent generated by artificial intelligence (AI) may not match users’intentions without user input or interaction. Therefore, we propose an AI-copiloted augmented reality (AR) system, GazeNoter, to allow users to swiftly select diverse LLM-generated suggestions via gaze on an AR headset for real-time note-taking. GazeNoter leverages an AR headset as a medium for users to swiftly adjust the LLM output to match their intentions, forming a user-in-the-loop AI system for both within-context and beyond-context notes. We conducted two user studies to verify the usability of GazeNoter in attending speeches in a static sitting condition and walking meetings and discussions in a mobile walking condition, respectively.
參考文獻 [1] Sunggeun Ahn, Stephanie Santosa, Mark Parent, Daniel Wigdor, Tovi Gross- man, and Marcello Giordano. 2021. StickyPie: A Gaze-Based, Scale-Invariant Marking Menu Optimized for AR/VR. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 739, 16 pages. https://doi.org/10.1145/3411764.3445297 [2] Sunggeun Ahn, Jeongmin Son, Sangyoon Lee, and Geehyuk Lee. 2020. Verge-It: Gaze Interaction for a Binocular Head-Worn Display Using Modulated Dis- parity Vergence Eye Movement. In Extended Abstracts of the 2020 CHI Con- ference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI EA ’20). Association for Computing Machinery, New York, NY, USA, 1–7. https://doi.org/10.1145/3334480.3382908 [3] Satanjeev Banerjee and Alexander I. Rudnicky. 2006. SmartNotes: Implicit Labeling of Meeting Data through User Note-Taking and Browsing. In Proceedings of the 2006 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Companion Volume: Demonstrations (New York, New York) (NAACL-Demonstrations ’06). Association for Computational Linguistics, USA, 261–264. https://doi.org/10.3115/1225785.1225788 [4] Satanjeev Banerjee and Alexander I. Rudnicky. 2007. Segmenting Meetings into Agenda Items by Extracting Implicit Supervision from Human Note-Taking. In Proceedings of the 12th International Conference on Intelligent User Interfaces (Honolulu, Hawaii, USA) (IUI ’07). Association for Computing Machinery, New York, NY, USA, 151–159. https://doi.org/10.1145/1216295.1216325 [5] Stephen Brade, Bryan Wang, Mauricio Sousa, Sageev Oore, and Tovi Grossman. 2023. Promptify: Text-to-Image Generation through Interactive Prompt Explo- ration with Large Language Models. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (<conf-loc>, <city>San Francisco</city>, <state>CA</state>, <country>USA</country>, </conf-loc>) (UIST ’23). Association for Computing Machinery, New York, NY, USA, Article 96, 14 pages. https://doi.org/10.1145/3586183.3606725 [6] Runze Cai, Nuwan Nanayakkarawasam Peru Kandage Janaka, Shengdong Zhao, and Minghui Sun. 2023. ParaGlassMenu: Towards Social-Friendly Subtle In- teractions in Conversations. In Proceedings of the 2023 CHI Conference on Hu- man Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Associa- tion for Computing Machinery, New York, NY, USA, Article 721, 21 pages. https://doi.org/10.1145/3544548.3581065 [7] Yining Cao, Hariharan Subramonyam, and Eytan Adar. 2022. VideoSticker: A Tool for Active Viewing and Visual Note-Taking from Videos. In 27th International Conference on Intelligent User Interfaces (Helsinki, Finland) (IUI ’22). Association for Computing Machinery, New York, NY, USA, 672–690. https://doi.org/10. 1145/3490099.3511132 [8] Senthil Chandrasegaran, Chris Bryan, Hidekazu Shidara, Tung-Yen Chuang, and Kwan-Liu Ma. 2019. TalkTraces: Real-Time Capture and Visualization of Verbal Content in Meetings. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–14. https://doi.org/10.1145/3290605.3300807 [9] Si Chen, Dennis Wang, and Yun Huang. 2021. Exploring the Complementary Features of Audio and Text Notes for Video-Based Learning in Mobile Settings. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI EA ’21). Association for Computing Machinery, New York, NY, USA, Article 310, 7 pages. https://doi.org/10.1145/3411763.3451801 [10] Myungguen Choi, Daisuke Sakamoto, and Tetsuo Ono. 2022. Kuiper Belt: Utilizing the “Out-of-Natural Angle” Region in the Eye-Gaze Interaction for Virtual Reality. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 357, 17 pages. https://doi.org/10.1145/3491102.3517725 [11] John Joon Young Chung, Wooseok Kim, Kang Min Yoo, Hwaran Lee, Eytan Adar, and Minsuk Chang. 2022. TaleBrush: Sketching Stories with Generative Pretrained Language Models. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>New Orleans</city>, <state>LA</state>, <country>USA</country>, </conf-loc>) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 209, 19 pages. https://doi.org/10.1145/3491102.3501819 [12] Ida Damen, Anika Kok, Bas Vink, Hans Brombacher, Steven Vos, and Carine Lallemand. 2020. The Hub: Facilitating Walking Meetings through a Network of Interactive Devices. In Companion Publication of the 2020 ACM Designing Interactive Systems Conference (Eindhoven, Netherlands) (DIS’ 20 Companion). Association for Computing Machinery, New York, NY, USA, 19–24. https://doi.org/10.1145/3393914.3395876 [13] Ida Damen, Carine Lallemand, Rens Brankaert, Aarnout Brombacher, Pieter van Wesemael, and Steven Vos. 2020. Understanding Walking Meetings: Drivers and Barriers. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI ’20). Association for Computing Machinery, New York, NY, USA, https://doi.org/10.1145/3313831.3376141 [14] Hai Dang, Karim Benharrak, Florian Lehmann, and Daniel Buschek. 2022. Be- yond Text Generation: Supporting Writers with Continuous Automatic Text Summaries. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 98, 13 pages. https: //doi.org/10.1145/3526113.3545672 [15] Morten Lund Dybdal, Javier San Agustin, and John Paulin Hansen. 2012. Gaze Input for Mobile Devices by Dwell and Gestures. In Proceedings of the Symposium on Eye Tracking Research and Applications (Santa Barbara, California) (ETRA ’12). Association for Computing Machinery, New York, NY, USA, 225–228. https: //doi.org/10.1145/2168556.2168601 [16] Marc Exposito, Vicky Zeamer, and Pattie Maes. 2017. Unobtrusive Note Taking: Enriching Digital Interpersonal Interactions Using Gestures. In Companion of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (Portland, Oregon, USA) (CSCW ’17 Companion). Association for Computing Machinery, New York, NY, USA, 167–170. https://doi.org/10.1145/ 3022198.3026319 [17] Jingchao Fang, Yanhao Wang, Chi-Lan Yang, Ching Liu, and Hao-Chuan Wang. 2022. Understanding the Effects of Structured Note-Taking Systems for Video- Based Learners in Individual and Social Learning Contexts. Proc. ACM Hum Comput. Interact. 6, GROUP, Article 21 (jan 2022), 21 pages. [18] Jingchao Fang, Yanhao Wang, Chi-Lan Yang, and Hao-Chuan Wang. 2021. NoteCoStruct: Powering Online Learners with Socially Scaffolded Note Taking and Sharing. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI EA ’21). Association for Computing Machinery, New York, NY, USA, Article 223, 5 pages. https://doi.org/10.1145/3411763.3451694 [19] Tan Gemicioglu, R. Michael Winters, Yu-Te Wang, Thomas M. Gable, Ann Paradiso, and Ivan J. Tashev. 2023. Gaze & Tongue: A Subtle, Hands-Free Interaction for Head-Worn Devices. In Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI EA ’23). Association for Computing Machinery, New York, NY, USA, Article 456, 4 pages. https://doi.org/10.1145/3544549.3583930 [20] Luke Haliburton, Natalia Bartłomiejczyk, Albrecht Schmidt, Paweł W. Woźniak, and Jasmin Niess. 2023. The Walking Talking Stick: Understanding Automated Note-Taking in Walking Meetings. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 431, 16 pages. https: //doi.org/10.1145/3544548.3580986 [21] Ari Hautasaari and Naomi Yamashita. 2014. Catching up in Audio Conferences: Highlighting Keywords in ASR Transcripts for Non-Native Speakers. In Proceedings of the 5th ACM International Conference on Collaboration across Boundaries: Culture, Distance & Technology (Kyoto, Japan) (CABS ’14). Association for Computing Machinery, New York, NY, USA, 107–110. https: //doi.org/10.1145/2631488.2634064 [22] Nuwan Janaka, Chloe Haigh, Hyeongcheol Kim, Shan Zhang, and Shengdong Zhao. 2022. Paracentral and Near-Peripheral Visualizations: Towards Attention Maintaining Secondary Information Presentation on OHMDs during in-Person Social Interactions. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 551, 14 pages. https://doi.org/10.1145/3491102.3502127 [23] Peiling Jiang, Jude Rayan, Steven P. Dow, and Haijun Xia. 2023. Graphologue: Exploring Large Language Model Responses with Interactive Diagrams. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (<conf-loc>, <city>San Francisco</city>, <state>CA</state>, <country>USA</country>, </conf-loc>) (UIST ’23). Association for Computing Machinery, New York, NY, USA, Article 3, 20 pages. https://doi.org/10.1145/3586183.3606737 [24] Haojian Jin, Yale Song, and Koji Yatani. 2017. ElasticPlay: Interactive Video Summarization with Dynamic Time Budgets. In Proceedings of the 25th ACM International Conference on Multimedia (Mountain View, California, USA) (MM’17). Association for Computing Machinery, New York, NY, USA, 1164–1172. https://doi.org/10.1145/3123266.3123393 [25] Vaiva Kalnikaitundefined, Patrick Ehlen, and Steve Whittaker. 2012. Markup as You Talk: Establishing Effective Memory Cues While Still Contributing to a Meeting. In Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work (Seattle, Washington, USA) (CSCW ’12). Association for Computing Machinery, New York, NY, USA, 349–358. https://doi.org/10.1145/2145204.2145260 [26] Matthew Kam, Jingtao Wang, Alastair Iles, Eric Tse, Jane Chiu, Daniel Glaser, Orna Tarshish, and John Canny. 2005. Livenotes: A System for Cooperative and Augmented Note-Taking in Lectures. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Portland, Oregon, USA) (CHI ’05). Association for Computing Machinery, New York, NY, USA, 531–540. https: //doi.org/10.1145/1054972.1055046 [27] Bridgette Kaminski, Rainer Wasinger, Kimberley Norris, Chris Zehntner, Shuxiang Xu, Winyu Chinthammit, and Henry Duh. 2016. Learning through Shared Note-Taking Visualisations in the Classroom. In Proceedings of the 28th Australian Conference on Computer-Human Interaction (Launceston, Tasmania, Australia) (OzCHI ’16). Association for Computing Machinery, New York, NY, USA, 576–580. https://doi.org/10.1145/3010915.3010970 [28] Anam Ahmad Khan. 2019. Gaze Assisted Voice Note Taking System. In Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers (London, United Kingdom) (UbiComp/ISWC ’19 Adjunct). Association for Computing Machinery, New York, NY, USA, 367–371. https: //doi.org/10.1145/3341162.3349308 [29] Anam Ahmad Khan, Sadia Nawaz, Joshua Newn, Ryan M. Kelly, Jason M. Lodge, James Bailey, and Eduardo Velloso. 2022. To Type or to Speak? The Effect of Input Modality on Text Understanding during Note-Taking. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 164, 15 pages. https://doi.org/10.1145/3491102.3501974 [30] Kenneth A. Kiewra. 1989. A review of note-taking: The encoding-storage paradigm and beyond. Educational Psychology Review 1 (1989), 147–172. https: //api.semanticscholar.org/CorpusID:144302749 [31] Taejun Kim, Auejin Ham, Sunggeun Ahn, and Geehyuk Lee. 2022. Lattice Menu: A Low-Error Gaze-Based Marking Menu Utilizing Target-Assisted Gaze Gestures on a Lattice of Visual Anchors. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 277, 12 pages. https://doi.org/10.1145/3491102.3501977 [32] Tae Soo Kim, DaEun Choi, Yoonseo Choi, and Juho Kim. 2022. Stylette: Styling the Web with Natural Language. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>New Orleans</city>, <state>LA</state>, <country>USA</country>, </conf-loc>) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 5, 17 pages. https://doi.org/10.1145/3491102.3501931 [33] Dominik Kirst and Andreas Bulling. 2016. On the Verge: Voluntary Convergences for Accurate and Precise Timing of Gaze Input. In Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing Systems (San Jose, California, USA) (CHI EA ’16). Association for Computing Machinery, New York, NY, USA, 1519–1525. https://doi.org/10.1145/2851581.2892307 [34] Shinya Kudo, Hiroyuki Okabe, Taku Hachisu, Michi Sato, Shogo Fukushima, and Hiroyuki Kajimoto. 2013. Input Method Using Divergence Eye Movement. In CHI ’13 Extended Abstracts on Human Factors in Computing Systems (Paris, France) (CHI EA ’13). Association for Computing Machinery, New York, NY, USA, 1335–1340. https://doi.org/10.1145/2468356.2468594 [35] Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs’Ka, Xiang ’Anthony’ Chen, and Caiming Xiong. 2023. Designing and Evaluating Interfaces That Highlight News Coverage Diversity Using Discord Questions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 104, 21 pages. https://doi.org/10.1145/3544548.3581569 [36] Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs’Ka, Xiang ’Anthony’ Chen, and Caiming Xiong. 2023. Designing and Evaluating Interfaces That Highlight News Coverage Diversity Using Discord Questions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 104, 21 pages. https://doi.org/10.1145/3544548.3581569 [37] Hyungmin Lee, Chen-Chun Hsia, Aleksandr Tsoy, Sungmin Choi, Hanchao Hou, and Shiguang Ni. 2023. VisionARy: Exploratory Research on Contextual Language Learning Using AR Glasses with ChatGPT. In Proceedings of the 15th Biannual Conference of the Italian SIGCHI Chapter (<confloc>, <city>Torino</city>, <country>Italy</country>, </conf-loc>) (CHItaly ’23). Association for Computing Machinery, New York, NY, USA, Article 22, 6 pages. https://doi.org/10.1145/3605390.3605400 [38] Daniel Li, Thomas Chen, Albert Tung, and Lydia B Chilton. 2021. Hierarchical Summarization for Longform Spoken Dialog. In The 34th Annual ACM Symposium on User Interface Software and Technology (Virtual Event, USA) (UIST ’21). Association for Computing Machinery, New York, NY, USA, 582597. https://doi.org/10.1145/3472749.3474771 [39] Daniel Li, Thomas Chen, Alec Zadikian, Albert Tung, and Lydia B Chilton. = 2023. Improving Automatic Summarization for Browsing Longform Spoken Dialog. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 106, 20 pages. https://doi.org/10.1145/3544548.3581339 [40] Jian Liao, Adnan Karim, Shivesh Singh Jadon, Rubaiat Habib Kazi, and Ryo Suzuki. 2022. RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 17, 12 pages. https: //doi.org/10.1145/3526113.3545702 [41] Ching (Jean) Liu, Chi-Lan Yang, Joseph Jay Williams, and Hao-Chuan Wang. 2019. NoteStruct: Scaffolding Note-Taking While Learning from Online Videos. In Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI EA ’19). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3290607.3312878 [42] Xingyu "Bruce" Liu, Vladimir Kirilyuk, Xiuxiu Yuan, Alex Olwal, Peggy Chi, Xiang "Anthony" Chen, and Ruofei Du. 2023. Visual Captions: Augmenting Verbal Communication with On-the-Fly Visuals. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 108, 20 pages. https://doi.org/10.1145/3544548.3581566 [43] Xinyi Lu, Simin Fan, Jessica Houghton, Lu Wang, and Xu Wang. 2023. ReadingQuizMaker: A Human-NLP Collaborative System that Supports Instructors to Design High-Quality Reading Quiz Questions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>Hamburg</city>, <country>Germany</country>, </conf-loc>) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 454, 18 pages. https://doi.org/10.1145/3544548.3580957 [44] Sara Mandic, Rhys Tracy, and Misha Sra. 2023. ARFit: Pose-Based Exercise Feedback with Mobile AR. In Proceedings of the 2023 ACM Symposium on Spatial User Interaction (<conf-loc>, <city>Sydney</city>, <state>NSW</state>, <country>Australia</country>, </conf-loc>) (SUI ’23). Association for Computing Machinery, New York, NY, USA, Article 45, 3 pages. [45] Piotr Mirowski, Kory W. Mathewson, Jaylen Pittman, and Richard Evans. 2023. Co-Writing Screenplays and Theatre Scripts with Language Models: Evaluation by Industry Professionals. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>Hamburg</city>, <country>Germany</country>, </conf-loc>) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 355, 34 pages. https://doi.org/10.1145/3544548.3581225 [46] Mogeeb A. A. Mosleh, Mohd Sapiyan Baba, Sorayya Malek, and Musaed A. Alhussein. 2016. Challenges of Digital Note Taking. In Advanced Computer and Communication Engineering Technology, Hamzah Asyrani Sulaiman, Mohd Azlishah Othman, Mohd Fairuz Iskandar Othman, Yahaya Abd Rahim, and Naim Che Pee (Eds.). Springer International Publishing, Cham, 211–231. [47] Cuong Nguyen and Feng Liu. 2016. Gaze-Based Notetaking for Learning from Lecture Videos. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (San Jose, California, USA) (CHI ’16). Association for Computing Machinery, New York, NY, USA, 2093–2097. https://doi.org/10.1145/2858036.2858137 [48] Srishti Palani, Zijian Ding, Austin Nguyen, Andrew Chuang, Stephen MacNeil, and Steven P. Dow. 2021. CoNotate: Suggesting Queries Based on Notes Promotes Knowledge Discovery. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 726, 14 pages. https://doi.org/10.1145/3411764.3445618 [49] Yi-Hao Peng, Ming-Wei Hsi, Paul Taele, Ting-Yu Lin, Po-En Lai, Leon Hsu, Tzu-chuan Chen, Te-Yen Wu, Yu-An Chen, Hsien-Hui Tang, and Mike Y. Chen. 2018. SpeechBubbles: Enhancing Captioning Experiences for Deaf and Hard-of-Hearing People in Group Conversations. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (Montreal QC, Canada) (CHI ’18). Association for Computing Machinery, New York, NY, USA, 1–10. https://doi.org/10.1145/3173574.3173867 [50] Ken Pfeuffer, Benedikt Mayer, Diako Mardanbegi, and Hans Gellersen. 2017. Gaze + Pinch Interaction in Virtual Reality. In Proceedings of the 5th Symposium on Spatial User Interaction (Brighton, United Kingdom) (SUI ’17). Association for Computing Machinery, New York, NY, USA, 99–108. https://doi.org/10.1145/3131277.3132180 [51] Jimin Pi and Bertram E. Shi. 2017. Probabilistic adjustment of dwell time for eye typing. In 2017 10th International Conference on Human System Interactions (HSI). 251–257. https://doi.org/10.1109/HSI.2017.8005041 [52] Annie Piolat, Thierry Olive, and Ronald Kellogg. 2005. Cognitive effort during note taking. Applied Cognitive Psychology 19 (04 2005), 291–312. https://doi.org/10.1002/acp.1086 [53] Christopher Plaue, Sal LaMarca, and Shelby H. Funk. 2012. Group Note-Taking in a Large Lecture Class. In Proceedings of the 43rd ACM Technical Symposium on Computer Science Education (Raleigh, North Carolina, USA) (SIGCSE ’12). Association for Computing Machinery, New York, NY, USA, 227–232. https://doi.org/10.1145/2157136.2157203 [54] Shwetha Rajaram and Michael Nebeling. 2022. Paper Trail: An Immersive Au- thoring System for Augmented Reality Instructional Experiences. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>New Orleans</city>, <state>LA</state>, <country>USA</country>, </conf-loc>) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 382, 16 pages. https://doi.org/10.1145/3491102.3517486 [55] Radiah Rivu, Yasmeen Abdrabou, Ken Pfeuffer, Augusto Esteves, Stefanie Meitner, and Florian Alt. 2020. StARe: Gaze-Assisted Face-to-Face Communication in Augmented Reality. In ACM Symposium on Eye Tracking Research and Applications (Stuttgart, Germany) (ETRA ’20 Adjunct). Association for Computing Machinery, New York, NY, USA, Article 14, 5 pages. https://doi.org/10.1145/3379157.3388930 [56] Nirmal Roy, Manuel Valle Torre, Ujwal Gadiraju, David Maxwell, and Claudia Hauff. 2021. Note the Highlight: Incorporating Active Reading Tools in a Search as Learning Environment. In Proceedings of the 2021 Conference on Human Information Interaction and Retrieval (Canberra ACT, Australia) (CHIIR’21). Association for Computing Machinery, New York, NY, USA, 229–238. https://doi.org/10.1145/3406522.3446025 [57] Perry Samson and Charles Bassam. 2018. LessonWare: Mining Student Notes to Provide Personalized Feedback. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (Ann Arbor, MI, USA) (SIGIR’18). Association for Computing Machinery, New York, NY, USA, 1363–1364. https://doi.org/10.1145/3209978.3210210 [58] Yang Shi, Chris Bryan, Sridatt Bhamidipati, Ying Zhao, Yaoxue Zhang, and Kwan-Liu Ma. 2018. MeetingVis: Visual Narratives to Assist in Recalling Meeting Context and Content. IEEE Transactions on Visualization and Computer Graphics 24, 6 (June 2018), 1918–1929. https://doi.org/10.1109/TVCG.2018.2816203 [59] Ludwig Sidenmark, Christopher Clarke, Xuesong Zhang, Jenny Phu, and Hans Gellersen. 2020. Outline Pursuits: Gaze-Assisted Selection of Occluded Objects in Virtual Reality. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI ’20). Association for Computing Machinery, New York, NY, USA, 1–13. https://doi.org/10.1145/3313831.3376438 [60] Ludwig Sidenmark and Hans Gellersen. 2019. Eye&Head: Synergetic Eye and Head Movement for Gaze Pointing and Selection. In Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology (New Orleans, LA, USA) (UIST ’19). Association for Computing Machinery, New York, NY, USA, 1161–1174. https://doi.org/10.1145/3332165.3347921 [61] Ludwig Sidenmark, Dominic Potts, Bill Bapisch, and Hans Gellersen. 2021. RadiEye: Hands-Free Radial Interfaces for 3D Interaction Using Gaze-Activated Head-Crossing. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 740, 11 pages. https://doi.org/10.1145/3411764.3445697 [62] Franck Silvestre, Philippe Vidal, and Julien Broisin. 2014. Tsaap-Notes – An Open Micro-blogging Tool for Collaborative Notetaking during Face-to-Face Lectures. In 2014 IEEE 14th International Conference on Advanced Learning Technologies.39–43. https://doi.org/10.1109/ICALT.2014.22 [63] Seoyun Son, Junyoug Choi, Sunjae Lee, Jean Y Song, and Insik Shin. 2023. It is Okay to Be Distracted: How Real-Time Transcriptions Facilitate Online Meeting with Distraction. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>Hamburg</city>, <country>Germany</country>, </conf-loc>) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 64, 19 pages. https://doi.org/10.1145/3544548.3580742 [64] Ranjitha Jaddigadde Srinivasa, Samuel Dodson, Kyoungwon Seo, Dongwook Yoon, and Sidney Fels. 2021. NoteLink: A Point-and-Shoot Linking Interface between Students#x0027; Handwritten Notebooks and Instructional Videos. In 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL). 140–149. https: //doi.org/10.1109/JCDL52503.2021.00026 [65] Sangho Suh, Bryan Min, Srishti Palani, and Haijun Xia. 2023. Sensecape: Enabling Multilevel Exploration and Sensemaking with Large Language Models. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology(San Francisco, CA, USA,) (UIST ’23). Association for Computing Machinery, New York, NY, USA, Article 1, 18 pages. https://doi.org/10.1145/3586183.3606756 [66] Shan-Yuan Teng, Pengyu Li, Romain Nith, Joshua Fonseca, and Pedro Lopes. 2021. Touch&Fold: A Foldable Haptic Actuator for Rendering Touch in Mixed Reality (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 736, 14 pages. https://doi.org/10.1145/3411764.3445099 [67] L. Thaler, A.C. Schütz, M.A. Goodale, and K.R. Gegenfurtner. 2013. What is the best fixation target? The effect of target shape on stability of fixational eye movements. Vision Research 76 (2013), 31–42. https://doi.org/10.1016/j.visres.2012.10.012 [68] Hsin-Ruey Tsai, Chieh Tsai, Yu-So Liao, Yi-Ting Chiang, and Zhong-Yi Zhang. 2022. FingerX: Rendering Haptic Shapes of Virtual Objects Augmented by Real Objects using Extendable and Withdrawable Supports on Fingers. In CHI Conference on Human Factors in Computing Systems. 1–14. [69] Hsin-Ruey Tsai, Cheng-Yuan Wu, Lee-Ting Huang, and Yi-Ping Hung. 2016. ThumbRing: Private Interactions Using One-Handed Thumb Motion Input on Finger Segments. In Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct (Florence, Italy) (MobileHCI ’16). Association for Computing Machinery, New York, NY, USA, 791–798. https://doi.org/10.1145/2957265.2961859 [70] Bryan Wang, Zeyu Jin, and Gautham Mysore. 2022. Record Once, Post Every- where: Automatic Shortening of Audio Stories for Social Media. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 14, 11 pages. https://doi.org/10.1145/3526113.3545680 [71] Yushi Wei, Rongkai Shi, Difeng Yu, Yihong Wang, Yue Li, Lingyun Yu, and Hai-Ning Liang. 2023. Predicting Gaze-Based Target Selection in Augmented Reality Headsets Based on Eye and Head Endpoint Distributions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 283, 14 pages. https://doi.org/10.1145/3544548.3581042 [72] Tongshuang Wu, Michael Terry, and Carrie Jun Cai. 2022. AI Chains: Trans- parent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>New Orleans</city>, <state>LA</state>, <country>USA</country>, </conf-loc>) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 385, 22 pages. https://doi.org/10.1145/3491102.3517582 [73] Haijun Xia. 2020. Crosspower: Bridging Graphics and Linguistics. In Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology (Virtual Event, USA) (UIST ’20). Association for Computing Machinery, New York, NY, USA, 722–734. https://doi.org/10.1145/3379337.3415845 [74] Chengpei Xu, Wenjing Jia, Ruomei Wang, Xiangjian He, Baoquan Zhao, and Yuanfang Zhang. 2023. Semantic Navigation of PowerPoint-Based Lecture Video for AutoNote Generation. IEEE Transactions on Learning Technologies 16, 1 (Feb 2023), 1–17. https://doi.org/10.1109/TLT.2022.3216535 [75] Saelyne Yang, Jisu Yim, Juho Kim, and Hijung Valentina Shin. 2022. CatchLive: Real-Time Summarization of Live Streams with Stream Content and Interaction Data. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 500, 20 pages. https://doi.org/10.1145/3491102.3517461 [76] Xin Yi, Leping Qiu, Wenjing Tang, Yehan Fan, Hewu Li, and Yuanchun Shi. 2022. DEEP: 3D Gaze Pointing in Virtual Reality Leveraging Eyelid Movement. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 3, 14 pages. https://doi.org/10.1145/3526113.3545673 [77] Xiaoyu Zhang, Jianping Li, Po-Wei Chi, Senthil Chandrasegaran, and Kwan- Liu Ma. 2023. ConceptEVA: Concept-Based Interactive Exploration and Cus- tomization of Document Summaries. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>Hamburg</city>, <country>Germany</country>, </conf-loc>) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 204, 16 pages. https://doi.org/10. 1145/3544548.3581260 [78] Zhongyi Zhou and Koji Yatani. 2022. Gesture-Aware Interactive Machine Teaching with In-Situ Object Annotations. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 27, 14 pages. https://doi.org/10.1145/3526113.3545648
描述 碩士
國立政治大學
資訊科學系
111753142
資料來源 http://thesis.lib.nccu.edu.tw/record/#G0111753142
資料類型 thesis
dc.contributor.advisor 蔡欣叡zh_TW
dc.contributor.advisor Tsai, Hsin-Rueyen_US
dc.contributor.author (作者) 裘世綱zh_TW
dc.contributor.author (作者) Chiu, Shih Kangen_US
dc.creator (作者) 裘世綱zh_TW
dc.creator (作者) Chiu, Shih Kangen_US
dc.date (日期) 2024en_US
dc.date.accessioned 4-九月-2024 15:00:19 (UTC+8)-
dc.date.available 4-九月-2024 15:00:19 (UTC+8)-
dc.date.issued (上傳時間) 4-九月-2024 15:00:19 (UTC+8)-
dc.identifier (其他 識別碼) G0111753142en_US
dc.identifier.uri (URI) https://nccur.lib.nccu.edu.tw/handle/140.119/153381-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 資訊科學系zh_TW
dc.description (描述) 111753142zh_TW
dc.description.abstract (摘要) 記錄筆記在聽演講和與人討論時至關重要,不僅用於幫助我們紀錄摘要和總結重點,還會在提問環節中用於提醒想提出的問題或提醒在討論時的發言。然而,以手機打字的筆記方式可能會分散使用者注意力並增加使用者的心智負擔。儘管大型語言模型(LLM)能夠被用於自動生成演講或討論的摘要和重點,但如果直接由人工智慧(AI)生成,沒有使用者參與或互動,得到的結果未必能符合使用者的意圖。因此,我們提出GazeNoter,一個人工智慧協作擴增實境(AR)系統,讓使用者透過AR頭戴裝置的眼動追蹤,快速選擇LLM生成的建議,以達成實時筆記。GazeNoter利用AR頭戴裝置作為媒介,讓使用者能夠快速給予LLM回饋以產出更符合其意圖之筆記,形成一個使用者參與式的AI系統,且不僅可產出演講與討論的內容以內的筆記,甚至能將筆記延伸至內容之外。我們進行了兩項使用者研究,分別驗證GazeNoter在聽演講的靜態情境和行走會議的移動情境下進行即時筆記的可用性。zh_TW
dc.description.abstract (摘要) Note-taking is critical during speeches and discussions, serving not only for later summarization and organization but also for real-time question and opinion reminding in question-and-answer sessions or timely contributions in discussions. Manually typing on smartphones for note-taking could be distracting and increase cognitive load for users. While large language models (LLMs) are used to automatically generate summaries and highlights, the con- tent generated by artificial intelligence (AI) may not match users’intentions without user input or interaction. Therefore, we propose an AI-copiloted augmented reality (AR) system, GazeNoter, to allow users to swiftly select diverse LLM-generated suggestions via gaze on an AR headset for real-time note-taking. GazeNoter leverages an AR headset as a medium for users to swiftly adjust the LLM output to match their intentions, forming a user-in-the-loop AI system for both within-context and beyond-context notes. We conducted two user studies to verify the usability of GazeNoter in attending speeches in a static sitting condition and walking meetings and discussions in a mobile walking condition, respectively.en_US
dc.description.tableofcontents CHAPTER 1 INTRODUCTION 1 CHAPTER 2 RELATED WORK 5 2.1 AUGMENTING AR WITH NATURAL LANGUAGE 5 2.2 NOTE-TAKING APPROACHES 6 2.3 USER-IN-THE-LOOP NLP SYSTEMS 7 2.4 GAZE SELECTION AND LAYOUT ON HEADSETS 9 CHAPTER 3 GAZENOTER 11 3.1 DESIGN CONSIDERATIONS 12 3.2 GAZENOTER FEATURES AND FLOW 16 3.3 AR GAZE SELECTION 20 3.4 NOTE-TAKING SYSTEM IMPLEMENTATION 23 CHAPTER 4 USER STUDY 1: FORMAL SPEECH 26 4.1 PARTICIPANTS AND APPARATUS 26 4.2 TASK AND PROCEDURE 27 4.3 RESULTS AND DISCUSSION 30 CHAPTER 5 USER STUDY 2: WALKING MEETING 38 5.1 SETUP, TASK AND PROCEDURE 39 5.2 RESULTS AND DISCUSSION 40 CHAPTER 6 LIMITATIONS AND FUTURE WORK 47 CHAPTER 7 CONCLUSION 48 REFERENCES 49 APPENDIX 67zh_TW
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#G0111753142en_US
dc.subject (關鍵詞) 擴增實境zh_TW
dc.subject (關鍵詞) 人工智慧zh_TW
dc.subject (關鍵詞) 大型語言模型zh_TW
dc.subject (關鍵詞) 眼動追蹤zh_TW
dc.subject (關鍵詞) Augmented Realityen_US
dc.subject (關鍵詞) Artificial Intelligenceen_US
dc.subject (關鍵詞) Large Language Modelen_US
dc.subject (關鍵詞) Eye Trackingen_US
dc.title (題名) 透過眼動選擇大型語言模型建議達成符合使用者意圖之人工智慧協作擴增實境筆記系統zh_TW
dc.title (題名) Co-Piloted AR Note-Taking via Gaze Selection of LLM Suggestions to Match Users’ Intentionsen_US
dc.type (資料類型) thesisen_US
dc.relation.reference (參考文獻) [1] Sunggeun Ahn, Stephanie Santosa, Mark Parent, Daniel Wigdor, Tovi Gross- man, and Marcello Giordano. 2021. StickyPie: A Gaze-Based, Scale-Invariant Marking Menu Optimized for AR/VR. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 739, 16 pages. https://doi.org/10.1145/3411764.3445297 [2] Sunggeun Ahn, Jeongmin Son, Sangyoon Lee, and Geehyuk Lee. 2020. Verge-It: Gaze Interaction for a Binocular Head-Worn Display Using Modulated Dis- parity Vergence Eye Movement. In Extended Abstracts of the 2020 CHI Con- ference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI EA ’20). Association for Computing Machinery, New York, NY, USA, 1–7. https://doi.org/10.1145/3334480.3382908 [3] Satanjeev Banerjee and Alexander I. Rudnicky. 2006. SmartNotes: Implicit Labeling of Meeting Data through User Note-Taking and Browsing. In Proceedings of the 2006 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Companion Volume: Demonstrations (New York, New York) (NAACL-Demonstrations ’06). Association for Computational Linguistics, USA, 261–264. https://doi.org/10.3115/1225785.1225788 [4] Satanjeev Banerjee and Alexander I. Rudnicky. 2007. Segmenting Meetings into Agenda Items by Extracting Implicit Supervision from Human Note-Taking. In Proceedings of the 12th International Conference on Intelligent User Interfaces (Honolulu, Hawaii, USA) (IUI ’07). Association for Computing Machinery, New York, NY, USA, 151–159. https://doi.org/10.1145/1216295.1216325 [5] Stephen Brade, Bryan Wang, Mauricio Sousa, Sageev Oore, and Tovi Grossman. 2023. Promptify: Text-to-Image Generation through Interactive Prompt Explo- ration with Large Language Models. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (<conf-loc>, <city>San Francisco</city>, <state>CA</state>, <country>USA</country>, </conf-loc>) (UIST ’23). Association for Computing Machinery, New York, NY, USA, Article 96, 14 pages. https://doi.org/10.1145/3586183.3606725 [6] Runze Cai, Nuwan Nanayakkarawasam Peru Kandage Janaka, Shengdong Zhao, and Minghui Sun. 2023. ParaGlassMenu: Towards Social-Friendly Subtle In- teractions in Conversations. In Proceedings of the 2023 CHI Conference on Hu- man Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Associa- tion for Computing Machinery, New York, NY, USA, Article 721, 21 pages. https://doi.org/10.1145/3544548.3581065 [7] Yining Cao, Hariharan Subramonyam, and Eytan Adar. 2022. VideoSticker: A Tool for Active Viewing and Visual Note-Taking from Videos. In 27th International Conference on Intelligent User Interfaces (Helsinki, Finland) (IUI ’22). Association for Computing Machinery, New York, NY, USA, 672–690. https://doi.org/10. 1145/3490099.3511132 [8] Senthil Chandrasegaran, Chris Bryan, Hidekazu Shidara, Tung-Yen Chuang, and Kwan-Liu Ma. 2019. TalkTraces: Real-Time Capture and Visualization of Verbal Content in Meetings. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–14. https://doi.org/10.1145/3290605.3300807 [9] Si Chen, Dennis Wang, and Yun Huang. 2021. Exploring the Complementary Features of Audio and Text Notes for Video-Based Learning in Mobile Settings. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI EA ’21). Association for Computing Machinery, New York, NY, USA, Article 310, 7 pages. https://doi.org/10.1145/3411763.3451801 [10] Myungguen Choi, Daisuke Sakamoto, and Tetsuo Ono. 2022. Kuiper Belt: Utilizing the “Out-of-Natural Angle” Region in the Eye-Gaze Interaction for Virtual Reality. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 357, 17 pages. https://doi.org/10.1145/3491102.3517725 [11] John Joon Young Chung, Wooseok Kim, Kang Min Yoo, Hwaran Lee, Eytan Adar, and Minsuk Chang. 2022. TaleBrush: Sketching Stories with Generative Pretrained Language Models. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>New Orleans</city>, <state>LA</state>, <country>USA</country>, </conf-loc>) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 209, 19 pages. https://doi.org/10.1145/3491102.3501819 [12] Ida Damen, Anika Kok, Bas Vink, Hans Brombacher, Steven Vos, and Carine Lallemand. 2020. The Hub: Facilitating Walking Meetings through a Network of Interactive Devices. In Companion Publication of the 2020 ACM Designing Interactive Systems Conference (Eindhoven, Netherlands) (DIS’ 20 Companion). Association for Computing Machinery, New York, NY, USA, 19–24. https://doi.org/10.1145/3393914.3395876 [13] Ida Damen, Carine Lallemand, Rens Brankaert, Aarnout Brombacher, Pieter van Wesemael, and Steven Vos. 2020. Understanding Walking Meetings: Drivers and Barriers. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI ’20). Association for Computing Machinery, New York, NY, USA, https://doi.org/10.1145/3313831.3376141 [14] Hai Dang, Karim Benharrak, Florian Lehmann, and Daniel Buschek. 2022. Be- yond Text Generation: Supporting Writers with Continuous Automatic Text Summaries. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 98, 13 pages. https: //doi.org/10.1145/3526113.3545672 [15] Morten Lund Dybdal, Javier San Agustin, and John Paulin Hansen. 2012. Gaze Input for Mobile Devices by Dwell and Gestures. In Proceedings of the Symposium on Eye Tracking Research and Applications (Santa Barbara, California) (ETRA ’12). Association for Computing Machinery, New York, NY, USA, 225–228. https: //doi.org/10.1145/2168556.2168601 [16] Marc Exposito, Vicky Zeamer, and Pattie Maes. 2017. Unobtrusive Note Taking: Enriching Digital Interpersonal Interactions Using Gestures. In Companion of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (Portland, Oregon, USA) (CSCW ’17 Companion). Association for Computing Machinery, New York, NY, USA, 167–170. https://doi.org/10.1145/ 3022198.3026319 [17] Jingchao Fang, Yanhao Wang, Chi-Lan Yang, Ching Liu, and Hao-Chuan Wang. 2022. Understanding the Effects of Structured Note-Taking Systems for Video- Based Learners in Individual and Social Learning Contexts. Proc. ACM Hum Comput. Interact. 6, GROUP, Article 21 (jan 2022), 21 pages. [18] Jingchao Fang, Yanhao Wang, Chi-Lan Yang, and Hao-Chuan Wang. 2021. NoteCoStruct: Powering Online Learners with Socially Scaffolded Note Taking and Sharing. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI EA ’21). Association for Computing Machinery, New York, NY, USA, Article 223, 5 pages. https://doi.org/10.1145/3411763.3451694 [19] Tan Gemicioglu, R. Michael Winters, Yu-Te Wang, Thomas M. Gable, Ann Paradiso, and Ivan J. Tashev. 2023. Gaze & Tongue: A Subtle, Hands-Free Interaction for Head-Worn Devices. In Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI EA ’23). Association for Computing Machinery, New York, NY, USA, Article 456, 4 pages. https://doi.org/10.1145/3544549.3583930 [20] Luke Haliburton, Natalia Bartłomiejczyk, Albrecht Schmidt, Paweł W. Woźniak, and Jasmin Niess. 2023. The Walking Talking Stick: Understanding Automated Note-Taking in Walking Meetings. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 431, 16 pages. https: //doi.org/10.1145/3544548.3580986 [21] Ari Hautasaari and Naomi Yamashita. 2014. Catching up in Audio Conferences: Highlighting Keywords in ASR Transcripts for Non-Native Speakers. In Proceedings of the 5th ACM International Conference on Collaboration across Boundaries: Culture, Distance & Technology (Kyoto, Japan) (CABS ’14). Association for Computing Machinery, New York, NY, USA, 107–110. https: //doi.org/10.1145/2631488.2634064 [22] Nuwan Janaka, Chloe Haigh, Hyeongcheol Kim, Shan Zhang, and Shengdong Zhao. 2022. Paracentral and Near-Peripheral Visualizations: Towards Attention Maintaining Secondary Information Presentation on OHMDs during in-Person Social Interactions. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 551, 14 pages. https://doi.org/10.1145/3491102.3502127 [23] Peiling Jiang, Jude Rayan, Steven P. Dow, and Haijun Xia. 2023. Graphologue: Exploring Large Language Model Responses with Interactive Diagrams. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (<conf-loc>, <city>San Francisco</city>, <state>CA</state>, <country>USA</country>, </conf-loc>) (UIST ’23). Association for Computing Machinery, New York, NY, USA, Article 3, 20 pages. https://doi.org/10.1145/3586183.3606737 [24] Haojian Jin, Yale Song, and Koji Yatani. 2017. ElasticPlay: Interactive Video Summarization with Dynamic Time Budgets. In Proceedings of the 25th ACM International Conference on Multimedia (Mountain View, California, USA) (MM’17). Association for Computing Machinery, New York, NY, USA, 1164–1172. https://doi.org/10.1145/3123266.3123393 [25] Vaiva Kalnikaitundefined, Patrick Ehlen, and Steve Whittaker. 2012. Markup as You Talk: Establishing Effective Memory Cues While Still Contributing to a Meeting. In Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work (Seattle, Washington, USA) (CSCW ’12). Association for Computing Machinery, New York, NY, USA, 349–358. https://doi.org/10.1145/2145204.2145260 [26] Matthew Kam, Jingtao Wang, Alastair Iles, Eric Tse, Jane Chiu, Daniel Glaser, Orna Tarshish, and John Canny. 2005. Livenotes: A System for Cooperative and Augmented Note-Taking in Lectures. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Portland, Oregon, USA) (CHI ’05). Association for Computing Machinery, New York, NY, USA, 531–540. https: //doi.org/10.1145/1054972.1055046 [27] Bridgette Kaminski, Rainer Wasinger, Kimberley Norris, Chris Zehntner, Shuxiang Xu, Winyu Chinthammit, and Henry Duh. 2016. Learning through Shared Note-Taking Visualisations in the Classroom. In Proceedings of the 28th Australian Conference on Computer-Human Interaction (Launceston, Tasmania, Australia) (OzCHI ’16). Association for Computing Machinery, New York, NY, USA, 576–580. https://doi.org/10.1145/3010915.3010970 [28] Anam Ahmad Khan. 2019. Gaze Assisted Voice Note Taking System. In Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers (London, United Kingdom) (UbiComp/ISWC ’19 Adjunct). Association for Computing Machinery, New York, NY, USA, 367–371. https: //doi.org/10.1145/3341162.3349308 [29] Anam Ahmad Khan, Sadia Nawaz, Joshua Newn, Ryan M. Kelly, Jason M. Lodge, James Bailey, and Eduardo Velloso. 2022. To Type or to Speak? The Effect of Input Modality on Text Understanding during Note-Taking. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 164, 15 pages. https://doi.org/10.1145/3491102.3501974 [30] Kenneth A. Kiewra. 1989. A review of note-taking: The encoding-storage paradigm and beyond. Educational Psychology Review 1 (1989), 147–172. https: //api.semanticscholar.org/CorpusID:144302749 [31] Taejun Kim, Auejin Ham, Sunggeun Ahn, and Geehyuk Lee. 2022. Lattice Menu: A Low-Error Gaze-Based Marking Menu Utilizing Target-Assisted Gaze Gestures on a Lattice of Visual Anchors. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 277, 12 pages. https://doi.org/10.1145/3491102.3501977 [32] Tae Soo Kim, DaEun Choi, Yoonseo Choi, and Juho Kim. 2022. Stylette: Styling the Web with Natural Language. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>New Orleans</city>, <state>LA</state>, <country>USA</country>, </conf-loc>) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 5, 17 pages. https://doi.org/10.1145/3491102.3501931 [33] Dominik Kirst and Andreas Bulling. 2016. On the Verge: Voluntary Convergences for Accurate and Precise Timing of Gaze Input. In Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing Systems (San Jose, California, USA) (CHI EA ’16). Association for Computing Machinery, New York, NY, USA, 1519–1525. https://doi.org/10.1145/2851581.2892307 [34] Shinya Kudo, Hiroyuki Okabe, Taku Hachisu, Michi Sato, Shogo Fukushima, and Hiroyuki Kajimoto. 2013. Input Method Using Divergence Eye Movement. In CHI ’13 Extended Abstracts on Human Factors in Computing Systems (Paris, France) (CHI EA ’13). Association for Computing Machinery, New York, NY, USA, 1335–1340. https://doi.org/10.1145/2468356.2468594 [35] Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs’Ka, Xiang ’Anthony’ Chen, and Caiming Xiong. 2023. Designing and Evaluating Interfaces That Highlight News Coverage Diversity Using Discord Questions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 104, 21 pages. https://doi.org/10.1145/3544548.3581569 [36] Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs’Ka, Xiang ’Anthony’ Chen, and Caiming Xiong. 2023. Designing and Evaluating Interfaces That Highlight News Coverage Diversity Using Discord Questions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 104, 21 pages. https://doi.org/10.1145/3544548.3581569 [37] Hyungmin Lee, Chen-Chun Hsia, Aleksandr Tsoy, Sungmin Choi, Hanchao Hou, and Shiguang Ni. 2023. VisionARy: Exploratory Research on Contextual Language Learning Using AR Glasses with ChatGPT. In Proceedings of the 15th Biannual Conference of the Italian SIGCHI Chapter (<confloc>, <city>Torino</city>, <country>Italy</country>, </conf-loc>) (CHItaly ’23). Association for Computing Machinery, New York, NY, USA, Article 22, 6 pages. https://doi.org/10.1145/3605390.3605400 [38] Daniel Li, Thomas Chen, Albert Tung, and Lydia B Chilton. 2021. Hierarchical Summarization for Longform Spoken Dialog. In The 34th Annual ACM Symposium on User Interface Software and Technology (Virtual Event, USA) (UIST ’21). Association for Computing Machinery, New York, NY, USA, 582597. https://doi.org/10.1145/3472749.3474771 [39] Daniel Li, Thomas Chen, Alec Zadikian, Albert Tung, and Lydia B Chilton. = 2023. Improving Automatic Summarization for Browsing Longform Spoken Dialog. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 106, 20 pages. https://doi.org/10.1145/3544548.3581339 [40] Jian Liao, Adnan Karim, Shivesh Singh Jadon, Rubaiat Habib Kazi, and Ryo Suzuki. 2022. RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 17, 12 pages. https: //doi.org/10.1145/3526113.3545702 [41] Ching (Jean) Liu, Chi-Lan Yang, Joseph Jay Williams, and Hao-Chuan Wang. 2019. NoteStruct: Scaffolding Note-Taking While Learning from Online Videos. In Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI EA ’19). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3290607.3312878 [42] Xingyu "Bruce" Liu, Vladimir Kirilyuk, Xiuxiu Yuan, Alex Olwal, Peggy Chi, Xiang "Anthony" Chen, and Ruofei Du. 2023. Visual Captions: Augmenting Verbal Communication with On-the-Fly Visuals. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 108, 20 pages. https://doi.org/10.1145/3544548.3581566 [43] Xinyi Lu, Simin Fan, Jessica Houghton, Lu Wang, and Xu Wang. 2023. ReadingQuizMaker: A Human-NLP Collaborative System that Supports Instructors to Design High-Quality Reading Quiz Questions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>Hamburg</city>, <country>Germany</country>, </conf-loc>) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 454, 18 pages. https://doi.org/10.1145/3544548.3580957 [44] Sara Mandic, Rhys Tracy, and Misha Sra. 2023. ARFit: Pose-Based Exercise Feedback with Mobile AR. In Proceedings of the 2023 ACM Symposium on Spatial User Interaction (<conf-loc>, <city>Sydney</city>, <state>NSW</state>, <country>Australia</country>, </conf-loc>) (SUI ’23). Association for Computing Machinery, New York, NY, USA, Article 45, 3 pages. [45] Piotr Mirowski, Kory W. Mathewson, Jaylen Pittman, and Richard Evans. 2023. Co-Writing Screenplays and Theatre Scripts with Language Models: Evaluation by Industry Professionals. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>Hamburg</city>, <country>Germany</country>, </conf-loc>) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 355, 34 pages. https://doi.org/10.1145/3544548.3581225 [46] Mogeeb A. A. Mosleh, Mohd Sapiyan Baba, Sorayya Malek, and Musaed A. Alhussein. 2016. Challenges of Digital Note Taking. In Advanced Computer and Communication Engineering Technology, Hamzah Asyrani Sulaiman, Mohd Azlishah Othman, Mohd Fairuz Iskandar Othman, Yahaya Abd Rahim, and Naim Che Pee (Eds.). Springer International Publishing, Cham, 211–231. [47] Cuong Nguyen and Feng Liu. 2016. Gaze-Based Notetaking for Learning from Lecture Videos. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (San Jose, California, USA) (CHI ’16). Association for Computing Machinery, New York, NY, USA, 2093–2097. https://doi.org/10.1145/2858036.2858137 [48] Srishti Palani, Zijian Ding, Austin Nguyen, Andrew Chuang, Stephen MacNeil, and Steven P. Dow. 2021. CoNotate: Suggesting Queries Based on Notes Promotes Knowledge Discovery. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 726, 14 pages. https://doi.org/10.1145/3411764.3445618 [49] Yi-Hao Peng, Ming-Wei Hsi, Paul Taele, Ting-Yu Lin, Po-En Lai, Leon Hsu, Tzu-chuan Chen, Te-Yen Wu, Yu-An Chen, Hsien-Hui Tang, and Mike Y. Chen. 2018. SpeechBubbles: Enhancing Captioning Experiences for Deaf and Hard-of-Hearing People in Group Conversations. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (Montreal QC, Canada) (CHI ’18). Association for Computing Machinery, New York, NY, USA, 1–10. https://doi.org/10.1145/3173574.3173867 [50] Ken Pfeuffer, Benedikt Mayer, Diako Mardanbegi, and Hans Gellersen. 2017. Gaze + Pinch Interaction in Virtual Reality. In Proceedings of the 5th Symposium on Spatial User Interaction (Brighton, United Kingdom) (SUI ’17). Association for Computing Machinery, New York, NY, USA, 99–108. https://doi.org/10.1145/3131277.3132180 [51] Jimin Pi and Bertram E. Shi. 2017. Probabilistic adjustment of dwell time for eye typing. In 2017 10th International Conference on Human System Interactions (HSI). 251–257. https://doi.org/10.1109/HSI.2017.8005041 [52] Annie Piolat, Thierry Olive, and Ronald Kellogg. 2005. Cognitive effort during note taking. Applied Cognitive Psychology 19 (04 2005), 291–312. https://doi.org/10.1002/acp.1086 [53] Christopher Plaue, Sal LaMarca, and Shelby H. Funk. 2012. Group Note-Taking in a Large Lecture Class. In Proceedings of the 43rd ACM Technical Symposium on Computer Science Education (Raleigh, North Carolina, USA) (SIGCSE ’12). Association for Computing Machinery, New York, NY, USA, 227–232. https://doi.org/10.1145/2157136.2157203 [54] Shwetha Rajaram and Michael Nebeling. 2022. Paper Trail: An Immersive Au- thoring System for Augmented Reality Instructional Experiences. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>New Orleans</city>, <state>LA</state>, <country>USA</country>, </conf-loc>) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 382, 16 pages. https://doi.org/10.1145/3491102.3517486 [55] Radiah Rivu, Yasmeen Abdrabou, Ken Pfeuffer, Augusto Esteves, Stefanie Meitner, and Florian Alt. 2020. StARe: Gaze-Assisted Face-to-Face Communication in Augmented Reality. In ACM Symposium on Eye Tracking Research and Applications (Stuttgart, Germany) (ETRA ’20 Adjunct). Association for Computing Machinery, New York, NY, USA, Article 14, 5 pages. https://doi.org/10.1145/3379157.3388930 [56] Nirmal Roy, Manuel Valle Torre, Ujwal Gadiraju, David Maxwell, and Claudia Hauff. 2021. Note the Highlight: Incorporating Active Reading Tools in a Search as Learning Environment. In Proceedings of the 2021 Conference on Human Information Interaction and Retrieval (Canberra ACT, Australia) (CHIIR’21). Association for Computing Machinery, New York, NY, USA, 229–238. https://doi.org/10.1145/3406522.3446025 [57] Perry Samson and Charles Bassam. 2018. LessonWare: Mining Student Notes to Provide Personalized Feedback. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (Ann Arbor, MI, USA) (SIGIR’18). Association for Computing Machinery, New York, NY, USA, 1363–1364. https://doi.org/10.1145/3209978.3210210 [58] Yang Shi, Chris Bryan, Sridatt Bhamidipati, Ying Zhao, Yaoxue Zhang, and Kwan-Liu Ma. 2018. MeetingVis: Visual Narratives to Assist in Recalling Meeting Context and Content. IEEE Transactions on Visualization and Computer Graphics 24, 6 (June 2018), 1918–1929. https://doi.org/10.1109/TVCG.2018.2816203 [59] Ludwig Sidenmark, Christopher Clarke, Xuesong Zhang, Jenny Phu, and Hans Gellersen. 2020. Outline Pursuits: Gaze-Assisted Selection of Occluded Objects in Virtual Reality. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI ’20). Association for Computing Machinery, New York, NY, USA, 1–13. https://doi.org/10.1145/3313831.3376438 [60] Ludwig Sidenmark and Hans Gellersen. 2019. Eye&Head: Synergetic Eye and Head Movement for Gaze Pointing and Selection. In Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology (New Orleans, LA, USA) (UIST ’19). Association for Computing Machinery, New York, NY, USA, 1161–1174. https://doi.org/10.1145/3332165.3347921 [61] Ludwig Sidenmark, Dominic Potts, Bill Bapisch, and Hans Gellersen. 2021. RadiEye: Hands-Free Radial Interfaces for 3D Interaction Using Gaze-Activated Head-Crossing. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 740, 11 pages. https://doi.org/10.1145/3411764.3445697 [62] Franck Silvestre, Philippe Vidal, and Julien Broisin. 2014. Tsaap-Notes – An Open Micro-blogging Tool for Collaborative Notetaking during Face-to-Face Lectures. In 2014 IEEE 14th International Conference on Advanced Learning Technologies.39–43. https://doi.org/10.1109/ICALT.2014.22 [63] Seoyun Son, Junyoug Choi, Sunjae Lee, Jean Y Song, and Insik Shin. 2023. It is Okay to Be Distracted: How Real-Time Transcriptions Facilitate Online Meeting with Distraction. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>Hamburg</city>, <country>Germany</country>, </conf-loc>) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 64, 19 pages. https://doi.org/10.1145/3544548.3580742 [64] Ranjitha Jaddigadde Srinivasa, Samuel Dodson, Kyoungwon Seo, Dongwook Yoon, and Sidney Fels. 2021. NoteLink: A Point-and-Shoot Linking Interface between Students#x0027; Handwritten Notebooks and Instructional Videos. In 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL). 140–149. https: //doi.org/10.1109/JCDL52503.2021.00026 [65] Sangho Suh, Bryan Min, Srishti Palani, and Haijun Xia. 2023. Sensecape: Enabling Multilevel Exploration and Sensemaking with Large Language Models. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology(San Francisco, CA, USA,) (UIST ’23). Association for Computing Machinery, New York, NY, USA, Article 1, 18 pages. https://doi.org/10.1145/3586183.3606756 [66] Shan-Yuan Teng, Pengyu Li, Romain Nith, Joshua Fonseca, and Pedro Lopes. 2021. Touch&Fold: A Foldable Haptic Actuator for Rendering Touch in Mixed Reality (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 736, 14 pages. https://doi.org/10.1145/3411764.3445099 [67] L. Thaler, A.C. Schütz, M.A. Goodale, and K.R. Gegenfurtner. 2013. What is the best fixation target? The effect of target shape on stability of fixational eye movements. Vision Research 76 (2013), 31–42. https://doi.org/10.1016/j.visres.2012.10.012 [68] Hsin-Ruey Tsai, Chieh Tsai, Yu-So Liao, Yi-Ting Chiang, and Zhong-Yi Zhang. 2022. FingerX: Rendering Haptic Shapes of Virtual Objects Augmented by Real Objects using Extendable and Withdrawable Supports on Fingers. In CHI Conference on Human Factors in Computing Systems. 1–14. [69] Hsin-Ruey Tsai, Cheng-Yuan Wu, Lee-Ting Huang, and Yi-Ping Hung. 2016. ThumbRing: Private Interactions Using One-Handed Thumb Motion Input on Finger Segments. In Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct (Florence, Italy) (MobileHCI ’16). Association for Computing Machinery, New York, NY, USA, 791–798. https://doi.org/10.1145/2957265.2961859 [70] Bryan Wang, Zeyu Jin, and Gautham Mysore. 2022. Record Once, Post Every- where: Automatic Shortening of Audio Stories for Social Media. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 14, 11 pages. https://doi.org/10.1145/3526113.3545680 [71] Yushi Wei, Rongkai Shi, Difeng Yu, Yihong Wang, Yue Li, Lingyun Yu, and Hai-Ning Liang. 2023. Predicting Gaze-Based Target Selection in Augmented Reality Headsets Based on Eye and Head Endpoint Distributions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 283, 14 pages. https://doi.org/10.1145/3544548.3581042 [72] Tongshuang Wu, Michael Terry, and Carrie Jun Cai. 2022. AI Chains: Trans- parent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>New Orleans</city>, <state>LA</state>, <country>USA</country>, </conf-loc>) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 385, 22 pages. https://doi.org/10.1145/3491102.3517582 [73] Haijun Xia. 2020. Crosspower: Bridging Graphics and Linguistics. In Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology (Virtual Event, USA) (UIST ’20). Association for Computing Machinery, New York, NY, USA, 722–734. https://doi.org/10.1145/3379337.3415845 [74] Chengpei Xu, Wenjing Jia, Ruomei Wang, Xiangjian He, Baoquan Zhao, and Yuanfang Zhang. 2023. Semantic Navigation of PowerPoint-Based Lecture Video for AutoNote Generation. IEEE Transactions on Learning Technologies 16, 1 (Feb 2023), 1–17. https://doi.org/10.1109/TLT.2022.3216535 [75] Saelyne Yang, Jisu Yim, Juho Kim, and Hijung Valentina Shin. 2022. CatchLive: Real-Time Summarization of Live Streams with Stream Content and Interaction Data. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 500, 20 pages. https://doi.org/10.1145/3491102.3517461 [76] Xin Yi, Leping Qiu, Wenjing Tang, Yehan Fan, Hewu Li, and Yuanchun Shi. 2022. DEEP: 3D Gaze Pointing in Virtual Reality Leveraging Eyelid Movement. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 3, 14 pages. https://doi.org/10.1145/3526113.3545673 [77] Xiaoyu Zhang, Jianping Li, Po-Wei Chi, Senthil Chandrasegaran, and Kwan- Liu Ma. 2023. ConceptEVA: Concept-Based Interactive Exploration and Cus- tomization of Document Summaries. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (<conf-loc>, <city>Hamburg</city>, <country>Germany</country>, </conf-loc>) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 204, 16 pages. https://doi.org/10. 1145/3544548.3581260 [78] Zhongyi Zhou and Koji Yatani. 2022. Gesture-Aware Interactive Machine Teaching with In-Situ Object Annotations. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 27, 14 pages. https://doi.org/10.1145/3526113.3545648zh_TW