學術產出-Periodical Articles

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

題名 Multiscale major factor selections for complex system data with structural dependency and heterogeneity
作者 周珮婷
Chou, Elizabeth P.;Fushing, Hsieh;Chen, Ting-Li
貢獻者 統計系
關鍵詞 Broken symmetry; Conditional entropy; Contingency table; Major factor selection; Multiclass Classification; Pitching dynamics
日期 2023-11
上傳時間 5-Mar-2024 16:16:32 (UTC+8)
摘要 The unknown multiscale structure hidden in large complex systems is explored bottom-up through discovered heterogeneity under structural dependency embedded within structured data sets. Via two real complex systems, we demonstrate computed hierarchical structures with broken symmetry constituting data’s information content. Through graphic displays, such information content indirectly, but efficiently resolves system-related scientific issues that are difficult to resolve directly. All bottom-up explorations and computations are based on conditional entropy and mutual information evaluated upon contingency table platforms after categorizing all quantitative features. Categorical Exploratory Data Analysis (CEDA) first extracts global major factors that share significant mutual information with the targeted response (Re) variable against many covariate (Co) features under the presence of structural dependency. Then each global major factor is taken as one perspective of heterogeneity to subdivide the entire data set according to its categories into sub-collections. This simple “de-associating” protocol significantly reduces structural dependency among the rest of the features such that another run of major factor selection performed on the sub-collection scale can precisely identify which feature sets could provide extra information beyond the global major factor. Finally, informative patterns collected from multiple perspectives of heterogeneity are displayed to explicitly resolve issues of prediction, classification, and detecting minute dynamic changes.
關聯 Physica A: Statistical Mechanics and its Applications, Vol.630, 129227
資料類型 article
DOI https://doi.org/10.1016/j.physa.2023.129227
dc.contributor 統計系
dc.creator (作者) 周珮婷
dc.creator (作者) Chou, Elizabeth P.;Fushing, Hsieh;Chen, Ting-Li
dc.date (日期) 2023-11
dc.date.accessioned 5-Mar-2024 16:16:32 (UTC+8)-
dc.date.available 5-Mar-2024 16:16:32 (UTC+8)-
dc.date.issued (上傳時間) 5-Mar-2024 16:16:32 (UTC+8)-
dc.identifier.uri (URI) https://nccur.lib.nccu.edu.tw/handle/140.119/150386-
dc.description.abstract (摘要) The unknown multiscale structure hidden in large complex systems is explored bottom-up through discovered heterogeneity under structural dependency embedded within structured data sets. Via two real complex systems, we demonstrate computed hierarchical structures with broken symmetry constituting data’s information content. Through graphic displays, such information content indirectly, but efficiently resolves system-related scientific issues that are difficult to resolve directly. All bottom-up explorations and computations are based on conditional entropy and mutual information evaluated upon contingency table platforms after categorizing all quantitative features. Categorical Exploratory Data Analysis (CEDA) first extracts global major factors that share significant mutual information with the targeted response (Re) variable against many covariate (Co) features under the presence of structural dependency. Then each global major factor is taken as one perspective of heterogeneity to subdivide the entire data set according to its categories into sub-collections. This simple “de-associating” protocol significantly reduces structural dependency among the rest of the features such that another run of major factor selection performed on the sub-collection scale can precisely identify which feature sets could provide extra information beyond the global major factor. Finally, informative patterns collected from multiple perspectives of heterogeneity are displayed to explicitly resolve issues of prediction, classification, and detecting minute dynamic changes.
dc.format.extent 107 bytes-
dc.format.mimetype text/html-
dc.relation (關聯) Physica A: Statistical Mechanics and its Applications, Vol.630, 129227
dc.subject (關鍵詞) Broken symmetry; Conditional entropy; Contingency table; Major factor selection; Multiclass Classification; Pitching dynamics
dc.title (題名) Multiscale major factor selections for complex system data with structural dependency and heterogeneity
dc.type (資料類型) article
dc.identifier.doi (DOI) 10.1016/j.physa.2023.129227
dc.doi.uri (DOI) https://doi.org/10.1016/j.physa.2023.129227