Please use this identifier to cite or link to this item: https://ah.nccu.edu.tw/handle/140.119/53544


Title: The NCCU Corpus of Spoken Chinese: Mandarin, Hakka, and Southern Min
Authors: Chui, Kawai
Lai, Huei-ling
Contributors: 政大英文系
Keywords: Spoken Chinese;Mandarin;Hakka;Southern Min
Date: 2008-01
Issue Date: 2012-09-10 10:46:51 (UTC+8)
Abstract: In Taiwan, most people speak Mandarin, Southern Min, or Hakka. Not only are the three Chinese dialects undergoing linguistic changes, but the population of Southern Min and Hakka is also diminishing. The NCCU Corpus of Spoken Chinese is thus a project of language documentation whereby open online access to Mandarin, Hakka, and Southern Min data is provided for non-profit-making research.As a language documentation project, the NCCU spoken corpus focuses on collecting and archiving spoken forms of various types. It consists of three sub-corpora, namely the Corpus of Spoken Mandarin, the Corpus of Spoken Hakka, and the Corpus of Spoken Southern Min. The three corpora share a common scheme for the collection of spoken data, mostly in the form of spontaneous face-to-face conversations. The infrastructure of the corpus is designed in a simple yet user-friendly way, so that data can be processed efficiently in the database, and users can browse the spoken data directly from the web. We hope that our work can encourage more people to engage in building up spoken corpora from different perspectives and for different purposes.
Relation: Taiwan Journal of Linguistics, 6, 119-144
Data Type: article
DOI 連結: http://dx.doi.org/10.3115/992424.992450
Appears in Collections:[英國語文學系] 期刊論文

Files in This Item:

File Description SizeFormat
6.2-5 Chui and Lai.pdf742KbAdobe PDF1255View/Open


All items in 學術集成 are protected by copyright, with all rights reserved.


社群 sharing