Publications-Proceedings

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 Computational approaches to quantitative analysis of pause duration in Taiwan Mandarin
作者 萬依萍
Wan, I-Ping;Lai, Yu-Ju;Yu, Pu
貢獻者 語言所
日期 2025-11
上傳時間 11-Feb-2026 09:24:44 (UTC+8)
摘要 This study presents a quantitative analysis of pause-duration patterns in a Mandarin spoken corpus to establish a baseline for prosodic and cognitive assessment. Drawing on cross-linguistic research, the distribution of pause patterns is viewed as reflecting multiple underlying factors. Longer pauses aligned with prosodic and syntactic boundaries indicate more deliberative and planned discourse rather than spontaneous speech. Such settings place higher demands on cognitive and articulatory planning, producing extended thinking time as speakers handle complex topics and specialized terminology. The spoken corpus was automatically processed and annotated using an in-house alignment and pause-tagging pipeline. Outlier detection with a 3.0×IQR threshold retained 35,474 tokens and removed extreme values exceeding 1,016 ms. Short and medium pauses remained stable across mean, median, and variability measures, while long pauses showed a moderate reduction (16,436 to 15,420 tokens), with mean duration decreasing from 535 to 426 ms and standard deviation sharply reduced from 786 to 169 ms, while the median stayed around 370–380 ms. These findings demonstrate that automatic cleaning primarily removed aberrant values while preserving linguistically meaningful long pauses. This baseline from non-impaired adult speakers underscores the need for corpus-specific frameworks and offers a reference point for cross-linguistic research on speech planning.
關聯 The 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025), The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
資料類型 conference
dc.contributor 語言所
dc.creator (作者) 萬依萍
dc.creator (作者) Wan, I-Ping;Lai, Yu-Ju;Yu, Pu
dc.date (日期) 2025-11
dc.date.accessioned 11-Feb-2026 09:24:44 (UTC+8)-
dc.date.available 11-Feb-2026 09:24:44 (UTC+8)-
dc.date.issued (上傳時間) 11-Feb-2026 09:24:44 (UTC+8)-
dc.identifier.uri (URI) https://ah.lib.nccu.edu.tw/item?item_id=181242-
dc.description.abstract (摘要) This study presents a quantitative analysis of pause-duration patterns in a Mandarin spoken corpus to establish a baseline for prosodic and cognitive assessment. Drawing on cross-linguistic research, the distribution of pause patterns is viewed as reflecting multiple underlying factors. Longer pauses aligned with prosodic and syntactic boundaries indicate more deliberative and planned discourse rather than spontaneous speech. Such settings place higher demands on cognitive and articulatory planning, producing extended thinking time as speakers handle complex topics and specialized terminology. The spoken corpus was automatically processed and annotated using an in-house alignment and pause-tagging pipeline. Outlier detection with a 3.0×IQR threshold retained 35,474 tokens and removed extreme values exceeding 1,016 ms. Short and medium pauses remained stable across mean, median, and variability measures, while long pauses showed a moderate reduction (16,436 to 15,420 tokens), with mean duration decreasing from 535 to 426 ms and standard deviation sharply reduced from 786 to 169 ms, while the median stayed around 370–380 ms. These findings demonstrate that automatic cleaning primarily removed aberrant values while preserving linguistically meaningful long pauses. This baseline from non-impaired adult speakers underscores the need for corpus-specific frameworks and offers a reference point for cross-linguistic research on speech planning.
dc.format.extent 110 bytes-
dc.format.mimetype text/html-
dc.relation (關聯) The 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025), The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
dc.title (題名) Computational approaches to quantitative analysis of pause duration in Taiwan Mandarin
dc.type (資料類型) conference