Publications-Periodical Articles

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 Optimizing entity join queries when data transmission cost dominates
作者 陳良弼
Tsai,Pauray S.M.;Chen,Arbee L.P.
貢獻者 資科系
關鍵詞 Entity join; Extended semijoin; Inconsistent data; Local processing; Multidatabase; Query optimization; Query transformation; Selectivity
日期 1997-05
上傳時間 21-Aug-2014 15:01:48 (UTC+8)
摘要 Heterogeneities exist in a multidatabase environment. For example, a real world entity may be differently represented in relations of different databases. In particular, keys of these relations may be incompatible. In this paper, we consider processing entity join queries when data transmission cost dominates. An entity join operation ‘integrates’ tuples representing the same entities from different relations in which inconsistent data may exist. A natural way to process the entity join is to transmit both relations to a site, resolve the possible conflicts between corresponding attributes and process the join, which is very costly. In this paper, an approach is proposed to correctly transform a global query into local subqueries to preprocess entity join queries in multiple sites with an attempt to lower the cost of data transmission. Besides, an extension of the traditional semijoin, named extended semijoin, is proposed to further reduce the cost of data transmission for entity join query processing.
關聯 Data & Knowledge Engineering (EI), North-Holland,283-308
資料類型 article
dc.contributor 資科系en_US
dc.creator (作者) 陳良弼zh_TW
dc.creator (作者) Tsai,Pauray S.M.;Chen,Arbee L.P.en_US
dc.date (日期) 1997-05en_US
dc.date.accessioned 21-Aug-2014 15:01:48 (UTC+8)-
dc.date.available 21-Aug-2014 15:01:48 (UTC+8)-
dc.date.issued (上傳時間) 21-Aug-2014 15:01:48 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/69146-
dc.description.abstract (摘要) Heterogeneities exist in a multidatabase environment. For example, a real world entity may be differently represented in relations of different databases. In particular, keys of these relations may be incompatible. In this paper, we consider processing entity join queries when data transmission cost dominates. An entity join operation ‘integrates’ tuples representing the same entities from different relations in which inconsistent data may exist. A natural way to process the entity join is to transmit both relations to a site, resolve the possible conflicts between corresponding attributes and process the join, which is very costly. In this paper, an approach is proposed to correctly transform a global query into local subqueries to preprocess entity join queries in multiple sites with an attempt to lower the cost of data transmission. Besides, an extension of the traditional semijoin, named extended semijoin, is proposed to further reduce the cost of data transmission for entity join query processing.en_US
dc.format.extent 1312394 bytes-
dc.format.mimetype application/pdf-
dc.language.iso en_US-
dc.relation (關聯) Data & Knowledge Engineering (EI), North-Holland,283-308en_US
dc.subject (關鍵詞) Entity join; Extended semijoin; Inconsistent data; Local processing; Multidatabase; Query optimization; Query transformation; Selectivityen_US
dc.title (題名) Optimizing entity join queries when data transmission cost dominatesen_US
dc.type (資料類型) articleen