Publications-Periodical Articles

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 Fuzzy Canonical Discriminant Analysis
其他題名 模糊典型判別分析
作者 鄭宇庭
Cheng, Yu-Ting
貢獻者 統計系
關鍵詞 判別分析 ; 模糊 ; 典型 ; 分類問題 ; Fuzzy canonical discriminant analysis ; Fisher canonical discrimination ; Classification
日期 2011-02
上傳時間 18-Dec-2014 10:05:34 (UTC+8)
摘要 The main purpose of discriminant analysis is to apply a set of known observations to classify the observation of unknown groups into pre-defined groups. In traditional discriminant analysis, the classification of the data is limited to either belong or not belong to a specific set. As such some of the information contained in the data might have been ignored. In this paper, we propose fuzzy canonical discriminant analysis, a new classification method, to classify groups of known observations and determine the membership function of each set. This membership function is then taken to apply on the unknown observations. The fuzzy canonical discriminant analysis takes in data matrices with unknown observations which are weighted by membership degrees. To find out the correlation between parameters, this paper maximizes the ratio of the weighted sum of square between the ”between groups” and the ”within groups” by Lagrange Multiplier method. The initial value is given and then iterative algorithm is applied to calculate the estimation of the parameters.We compare fuzzy discriminant analysis with canonical discrimination, based on the example from three species of Iris. We found that it improves the accuracy of discriminant analysis when the sample size is small.
判別分析主要在於利用已知群組之樣本點,對未知樣本點做群組歸屬判斷。傳統判別分析對於已知樣本點,只能限制其完全屬於或完全不屬於某一群組,因此常會失去一些原本資料所給的訊息。本文嘗試利用模糊數學的多值邏輯理論,對於群組界限不是很明確的樣本點,給定其屬於各群組的隸屬度,以此做為已知資料,對未知樣本點做群組歸屬判斷,而判定結果也以樣本點屬於各群組的隸屬度表示。模糊判別分析方法主要是將未知資料點納入資料矩陣之中,以樣本點屬於各群的隸屬度為權數大小,並以拉氏乘數(Lagrange multiplier)法,找出使群間加權離均平方和與群內加權離均平方和比值為最大的各參數相對關係。給定一初始值之後,採用遞迴(iterative)運算的方式,求出各欲估參數的值。將本文推導所得之模糊判別分析方法,應用於鳶尾花品種判別分析。以此結果與傳統判別分析方法做比較,發現在建立模式的已知樣本數較少時,採用模糊判別分析方法可以改善部份誤判情形。
關聯 Journal of Data Analysis, 6(1), 115-133
資料類型 article
dc.contributor 統計系en_US
dc.creator (作者) 鄭宇庭zh_TW
dc.creator (作者) Cheng, Yu-Tingen_US
dc.date (日期) 2011-02en_US
dc.date.accessioned 18-Dec-2014 10:05:34 (UTC+8)-
dc.date.available 18-Dec-2014 10:05:34 (UTC+8)-
dc.date.issued (上傳時間) 18-Dec-2014 10:05:34 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/72151-
dc.description.abstract (摘要) The main purpose of discriminant analysis is to apply a set of known observations to classify the observation of unknown groups into pre-defined groups. In traditional discriminant analysis, the classification of the data is limited to either belong or not belong to a specific set. As such some of the information contained in the data might have been ignored. In this paper, we propose fuzzy canonical discriminant analysis, a new classification method, to classify groups of known observations and determine the membership function of each set. This membership function is then taken to apply on the unknown observations. The fuzzy canonical discriminant analysis takes in data matrices with unknown observations which are weighted by membership degrees. To find out the correlation between parameters, this paper maximizes the ratio of the weighted sum of square between the ”between groups” and the ”within groups” by Lagrange Multiplier method. The initial value is given and then iterative algorithm is applied to calculate the estimation of the parameters.We compare fuzzy discriminant analysis with canonical discrimination, based on the example from three species of Iris. We found that it improves the accuracy of discriminant analysis when the sample size is small.en_US
dc.description.abstract (摘要) 判別分析主要在於利用已知群組之樣本點,對未知樣本點做群組歸屬判斷。傳統判別分析對於已知樣本點,只能限制其完全屬於或完全不屬於某一群組,因此常會失去一些原本資料所給的訊息。本文嘗試利用模糊數學的多值邏輯理論,對於群組界限不是很明確的樣本點,給定其屬於各群組的隸屬度,以此做為已知資料,對未知樣本點做群組歸屬判斷,而判定結果也以樣本點屬於各群組的隸屬度表示。模糊判別分析方法主要是將未知資料點納入資料矩陣之中,以樣本點屬於各群的隸屬度為權數大小,並以拉氏乘數(Lagrange multiplier)法,找出使群間加權離均平方和與群內加權離均平方和比值為最大的各參數相對關係。給定一初始值之後,採用遞迴(iterative)運算的方式,求出各欲估參數的值。將本文推導所得之模糊判別分析方法,應用於鳶尾花品種判別分析。以此結果與傳統判別分析方法做比較,發現在建立模式的已知樣本數較少時,採用模糊判別分析方法可以改善部份誤判情形。en_US
dc.format.extent 1091383 bytes-
dc.format.mimetype application/pdf-
dc.language.iso en_US-
dc.relation (關聯) Journal of Data Analysis, 6(1), 115-133en_US
dc.subject (關鍵詞) 判別分析 ; 模糊 ; 典型 ; 分類問題 ; Fuzzy canonical discriminant analysis ; Fisher canonical discrimination ; Classificationen_US
dc.title (題名) Fuzzy Canonical Discriminant Analysisen_US
dc.title.alternative (其他題名) 模糊典型判別分析en_US
dc.type (資料類型) articleen