The NCCU Corpus of Spoken Chinese: Mandarin, Hakka, and Southern Min | NCCU Academic Hub

學術產出-Periodical Articles

Article View/Open

pdf(1304)

Publication Export

Google Scholar^TM

政大圖書館

學術資源探索系統

Citation Infomation

Simple Record
Full Record

題名	The NCCU Corpus of Spoken Chinese: Mandarin, Hakka, and Southern Min
作者	Chui, Kawai Lai, Huei-ling
貢獻者	政大英文系
關鍵詞	Spoken Chinese; Mandarin; Hakka; Southern Min
日期	2008-01
上傳時間	10-Sep-2012 10:46:51 (UTC+8)
摘要	In Taiwan, most people speak Mandarin, Southern Min, or Hakka. Not only are the three Chinese dialects undergoing linguistic changes, but the population of Southern Min and Hakka is also diminishing. The NCCU Corpus of Spoken Chinese is thus a project of language documentation whereby open online access to Mandarin, Hakka, and Southern Min data is provided for non-profit-making research.As a language documentation project, the NCCU spoken corpus focuses on collecting and archiving spoken forms of various types. It consists of three sub-corpora, namely the Corpus of Spoken Mandarin, the Corpus of Spoken Hakka, and the Corpus of Spoken Southern Min. The three corpora share a common scheme for the collection of spoken data, mostly in the form of spontaneous face-to-face conversations. The infrastructure of the corpus is designed in a simple yet user-friendly way, so that data can be processed efficiently in the database, and users can browse the spoken data directly from the web. We hope that our work can encourage more people to engage in building up spoken corpora from different perspectives and for different purposes.
關聯	Taiwan Journal of Linguistics, 6, 119-144
資料類型	article
DOI	http://dx.doi.org/10.3115/992424.992450

dc.contributor	政大英文系	en
dc.creator (作者)	Chui, Kawai	en
dc.creator (作者)	Lai, Huei-ling	en
dc.date (日期)	2008-01	-
dc.date.accessioned	10-Sep-2012 10:46:51 (UTC+8)	-
dc.date.available	10-Sep-2012 10:46:51 (UTC+8)	-
dc.date.issued (上傳時間)	10-Sep-2012 10:46:51 (UTC+8)	-
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/53544	-
dc.description.abstract (摘要)	In Taiwan, most people speak Mandarin, Southern Min, or Hakka. Not only are the three Chinese dialects undergoing linguistic changes, but the population of Southern Min and Hakka is also diminishing. The NCCU Corpus of Spoken Chinese is thus a project of language documentation whereby open online access to Mandarin, Hakka, and Southern Min data is provided for non-profit-making research.As a language documentation project, the NCCU spoken corpus focuses on collecting and archiving spoken forms of various types. It consists of three sub-corpora, namely the Corpus of Spoken Mandarin, the Corpus of Spoken Hakka, and the Corpus of Spoken Southern Min. The three corpora share a common scheme for the collection of spoken data, mostly in the form of spontaneous face-to-face conversations. The infrastructure of the corpus is designed in a simple yet user-friendly way, so that data can be processed efficiently in the database, and users can browse the spoken data directly from the web. We hope that our work can encourage more people to engage in building up spoken corpora from different perspectives and for different purposes.	en
dc.format.extent	760249 bytes	-
dc.format.mimetype	application/pdf	-
dc.language	zh_TW	en
dc.language.iso	en_US	-
dc.relation (關聯)	Taiwan Journal of Linguistics, 6, 119-144	-
dc.subject (關鍵詞)	Spoken Chinese; Mandarin; Hakka; Southern Min	en
dc.title (題名)	The NCCU Corpus of Spoken Chinese: Mandarin, Hakka, and Southern Min	en
dc.type (資料類型)	article	en
dc.identifier.doi (DOI)	10.3115/992424.992450	en_US
dc.doi.uri (DOI)	http://dx.doi.org/10.3115/992424.992450	en_US