Publications-Theses
Article View/Open
Publication Export
-
Google ScholarTM
NCCU Library
Citation Infomation
Related Publications in TAIR
題名 開放型XML資料庫績效評估工作量之模型
An Open Workload Model for XML Database Benchmark作者 尤靖雅 貢獻者 諶家蘭
尤靖雅關鍵詞 XML資料庫
績效評估
工作量模型
資料模型
查詢模型
控制模型
XML database
performance evaluation
workload model
data model
query model
control model
XML
benchmark日期 2002 上傳時間 10-May-2016 16:15:34 (UTC+8) 摘要 XML (eXtensible Markup Language)是今日新興在網路上所使用的延伸性的標記語言。它具有豐富的語意表達及與展現方式獨立的資料獨立性。由於這些特性,使得XML成為新的資料交換標準並且其應用在資料庫中產生了許多新的研究議題在資料儲存和查詢處理上。在本篇研究中,將研究XML資料庫中的績效評估的議題並且發展一個可適用於不同應用領域及各種平台上的使用者導向且開放型工作量模式以評估XML資料庫績效。此XML開放型的工作量模型包含三個子模型─XML資料模型、查詢模型以及控制模型。XML資料模型將模式化XML文件中階層式的結構概念;查詢模型包含了一連串查詢模組以供測試XML資料庫的處理查詢能力以及一個開放型的查詢輸入介面以供使用者依照需求設定所需的測試查詢;控制模型中定義了一連串變數以供設定績效評估系統中的執行環境。我們發展此系統化且具開放型的工作量方法可以幫助各種不同應用領域的使用者預測及展現XML資料庫系統的績效。
XML (eXtensible Markup Language) is the emerging data format for data processing on the Internet. XML provides a rich data semantics and data independence from the presentation. Thanks to these features, XML becomes a new data exchange standard and leads new storage and query processing issues on database research communities. In this paper, the performance evaluation issues on XML databases have been studied and a generic and requirement-driven XML workload model that is applicable to any application scenario or movable on various platforms is developed. There are three sub-models in this generic workload model, the XML data model, the query model, and the control model. The XML data model formulates the generic hierarchy structure of XML documents and supports a flexible document structure of the test database. The XML query model contains a flexible classical query module selector and an open query input to define the requirement-driven test query model to challenge the XML query processing ability of the XML database. The control model defines variables that are used to set up the implementation of a benchmark. This open, flexible, and systematic workload method permits users in various application domains to predicate or profile the performance of the XML database systems.參考文獻 1. Bohme, T. and E. Rahm (2001), “Xmach-1: Benchmark for XML Data Management,” In Proceedings of the German Database Conference. 2. Carey, M.J., D.J. DeWitt and J.F. Naughton (1993), “The 007 Benchmark,” In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, pp12-21. 3. Carey, M.J., D.J. DeWitt, J.F. Naughton, M. Asgarian, P. Brown, J. Gehrke, and D. Shah (1997), “The BUCKY Object-Relational Benchmark,” In Proceedings ACM SIGMOD International Conference on Management of Data, pp.135-146. 4. DeWitt, David, Jeffrey Naughton, Jayavel Shanmugasundaram, Kristin Tufte, Gang He and Chun Zhang (1999), “Relational Databases for Querying XML Documents: Limitations and Opportunities,” In Proceedings of the 25th VLDB Conference, pp.25-32. 5. Fankhauser P., M. Marchiori, and J. Robie (2000), “XML Query Requirements,” http://www.w3.org/TR/xmlquery-req 6. Florescu, D. and D. Kossmann (1999), “A Performance Evaluation of Alternative Mapping Schemes for Storing XML Data in a Relational Database,” Technical Report No.3680, INRIA. 7. Florescu, D. and D. Kossmann (1999), “Storing and Querying XML Data Using an RDBMS,” IEEE Data Engineering Bulletin, 22(3), pp.27-34. 8. Gray, Jim (1993), The Benchmark Handbook, Morgan Kaufmann, San Mateo, CA, 9. Li, Y. Guang, Stephane Bressan, Gillian Dobbie, Zoe Lacroix, Mong Li Lee, Ullas Nambiar, and Bimlesh Wadhwa (2001), “X007: Applying 007 Benchmark to XML Query Processing Tools,” In Proceedings of the 10th International Conference on Information and Knowledge Management, pp.27-36. 10. Poess, M. and C. Floyd (2000), “New TPC Benchmarks for Decision Support and Web Commerce,” ACM SIGMOD RECORD, 29(4), pp.64-71. 11. Schmidt, A. R., F. Waas, M.L. Kersten, D. Florescu, I. Manolescu, M.J. Carey and R. Busse (2001), “The XML Benchmark Project,” CWI Technical reports. 12. Schmidt, A., F. Waas, M.L. Kersten, D. Florescu, I. Manolescu, M.J. Carey and R. Busse (2001) “Why and How to Benchmark XML Database,” SIGMOD Record, 30(3) pp.41-50. 13. Schmidt, A., M. Kersten, D. Florescu, M. Carey, I. Manolescu, and F. Waas (2000), “The XML Store Benchmark Project,” http://www.xmlperformance.org 14. Seng, J. L. (1995), “Benchmarking Relational Database Systems — Overview,” The 6th International Conference on Information Management, pp.55-60. 15. Seng, J. L. (1996), “Workload characterization using requirements analysis,” The International Conference on Information Systems and Management of Data, pp.71-77. 16. Tian, F., D.J. DeWitt, J. Chen and C. Zhang (2000), “The design and performance evaluation of alternative XML storage strategies,” unpublished manuscript, University of Wisconsin, Madison, pp.1-9. 17. Zhao, B. Y. (2000), “The Xset XML search engine and Xbench XML query benchmark,” Technical Report No. UCB/CSD-00-1112, Computer Science Division, University of California at Berkeley. 中文參考文獻 1. 林嬿芳(2001),「網際網路超媒體資料庫工作量模型產生之研究」,國立政治大學資訊管理學研究所。 2. 翁嘉正(2000),「設計與製作XML文件資料庫」,大同大學資訊工程研究所。 3. 張子文(2001),「XML-Based HTML Wrapper建置之研究在網路銀行個人帳戶資訊彙整服務上之應用」,國立政治大學資訊管理學研究所。 4. 菲立浦斯Phillips Lee Anne(2001),「XML全方位完全剖析」,第三波電腦圖書資料公司,台北市。 5. 榮欽科技主筆室(2001),「最新XML入門與應用」,松崗電腦圖書資料公司,台北市。 6. 韓先良(1999),「分散式關聯資料庫系統績效評估工作量模式之研究」,國立政治大學資訊管理學研究所。 描述 碩士
國立政治大學
資訊管理學系
89356040資料來源 http://thesis.lib.nccu.edu.tw/record/#A2010000457 資料類型 thesis dc.contributor.advisor 諶家蘭 zh_TW dc.contributor.author (Authors) 尤靖雅 zh_TW dc.creator (作者) 尤靖雅 zh_TW dc.date (日期) 2002 en_US dc.date.accessioned 10-May-2016 16:15:34 (UTC+8) - dc.date.available 10-May-2016 16:15:34 (UTC+8) - dc.date.issued (上傳時間) 10-May-2016 16:15:34 (UTC+8) - dc.identifier (Other Identifiers) A2010000457 en_US dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/95972 - dc.description (描述) 碩士 zh_TW dc.description (描述) 國立政治大學 zh_TW dc.description (描述) 資訊管理學系 zh_TW dc.description (描述) 89356040 zh_TW dc.description.abstract (摘要) XML (eXtensible Markup Language)是今日新興在網路上所使用的延伸性的標記語言。它具有豐富的語意表達及與展現方式獨立的資料獨立性。由於這些特性,使得XML成為新的資料交換標準並且其應用在資料庫中產生了許多新的研究議題在資料儲存和查詢處理上。在本篇研究中,將研究XML資料庫中的績效評估的議題並且發展一個可適用於不同應用領域及各種平台上的使用者導向且開放型工作量模式以評估XML資料庫績效。此XML開放型的工作量模型包含三個子模型─XML資料模型、查詢模型以及控制模型。XML資料模型將模式化XML文件中階層式的結構概念;查詢模型包含了一連串查詢模組以供測試XML資料庫的處理查詢能力以及一個開放型的查詢輸入介面以供使用者依照需求設定所需的測試查詢;控制模型中定義了一連串變數以供設定績效評估系統中的執行環境。我們發展此系統化且具開放型的工作量方法可以幫助各種不同應用領域的使用者預測及展現XML資料庫系統的績效。 zh_TW dc.description.abstract (摘要) XML (eXtensible Markup Language) is the emerging data format for data processing on the Internet. XML provides a rich data semantics and data independence from the presentation. Thanks to these features, XML becomes a new data exchange standard and leads new storage and query processing issues on database research communities. In this paper, the performance evaluation issues on XML databases have been studied and a generic and requirement-driven XML workload model that is applicable to any application scenario or movable on various platforms is developed. There are three sub-models in this generic workload model, the XML data model, the query model, and the control model. The XML data model formulates the generic hierarchy structure of XML documents and supports a flexible document structure of the test database. The XML query model contains a flexible classical query module selector and an open query input to define the requirement-driven test query model to challenge the XML query processing ability of the XML database. The control model defines variables that are used to set up the implementation of a benchmark. This open, flexible, and systematic workload method permits users in various application domains to predicate or profile the performance of the XML database systems. en_US dc.description.tableofcontents 謝辭 Abstract 摘要 Table of Contents-----I List of Tables-----III List of Figures-----IV Chapter 1 Introduction-----1 1.1 Research Motivation-----1 1.2 Research Issues-----2 1.3 Research Structure-----3 Chapter 2 Literature Review-----5 2 Benchmark and Workload Model-----5 2.1 Benchmark-----5 2.2 Workload-----6 2.3 Characteristics of XML Documents-----6 2.4 XML Database Benchmark-----7 2.4.1 XML Data Model-----8 2.4.2 XML Query Model-----17 2.5 Comparison and Contrast of XML Benchmarks-----28 Chapter 3 An Open Workload Model for XML Database Benchmark-----30 3.1 Research Approach-----30 3.2 Research Model-----30 3.3 Generic XML Data Model-----31 3.4 Generic XML Query Model-----34 3.4.1 Exact Match-----35 3.4.2 Ordered Access-----36 3.4.3 Casting-----36 3.4.4 Regular PathExpressions-----37 3.4.5 Chasing References-----37 3.4.6 Construction of Complex Results-----37 3.4.7 Joins on values-----38 3.4.8 Reconstruction-----38 3.4.9 Full Text-----39 3.4.10 Path Traversals-----39 3.4.11 Missing Elements-----39 3.4.12 Function Application-----39 3.4.13 Sorting-----40 3.4.14 Aggregation-----40 3.4.15 Structural Recursion-----41 3.4.16 Updates Operations-----41 3.5 Open Data Model and Open Query Model-----41 3.6 Control Model-----42 Chapter 4 A Prototype System Implementation-----44 4.1 The Prototype System Platform and Architecture-----44 4.2 The Architecture of the Prototype System-----45 4.2.1 The XML Data Loader-----47 4.2.2 The Query Generator-----48 4.2.3 The Control Model Generator-----50 4.2.4 Results Collector-----52 Chapter 5 Discussion-----54 5.1 A More Open and A More Generic Approach for XML Benchmark-----54 5.2 A Model Solution for Both Administrators and Application Programmers-----55 5.3 Research Limitations-----55 Chapter 6 Conclusion-----57 6.1 Summary-----57 6.2 Research Results-----58 6.3 Future Research Directions-----59 References-----60 中文參考文獻-----62 List of Tables Table 2.1-----19 Table 2.2-----20 Table 2.3-----21 Table 2.4-----22 Table 2.5-----24 Table 2.6-----25 Table 2.7-----27 Table 4.1-----42 Table 6.1-----56 List of Figures Figure 1.1-----4 Figure 2.1-----12 Figure 2.2-----12 Figure 2.3-----13 Figure 2.4-----14 Figure 2.5-----15 Figure 2.6-----16 Figure 3.1-----29 Figure 3.2-----31 Figure 3.3-----32 Figure 4.1-----43 Figure 4.2-----44 Figure 4.3-----45 Figure 4.4-----46 Figure 4.5-----47 Figure 4.6-----47 Figure 4.7-----48 Figure 4.8-----49 Figure 4.9-----49 Figure 4.10-----50 Figure 4.11-----51 zh_TW dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#A2010000457 en_US dc.subject (關鍵詞) XML資料庫 zh_TW dc.subject (關鍵詞) 績效評估 zh_TW dc.subject (關鍵詞) 工作量模型 zh_TW dc.subject (關鍵詞) 資料模型 zh_TW dc.subject (關鍵詞) 查詢模型 zh_TW dc.subject (關鍵詞) 控制模型 zh_TW dc.subject (關鍵詞) XML database en_US dc.subject (關鍵詞) performance evaluation en_US dc.subject (關鍵詞) workload model en_US dc.subject (關鍵詞) data model en_US dc.subject (關鍵詞) query model en_US dc.subject (關鍵詞) control model en_US dc.subject (關鍵詞) XML en_US dc.subject (關鍵詞) benchmark en_US dc.title (題名) 開放型XML資料庫績效評估工作量之模型 zh_TW dc.title (題名) An Open Workload Model for XML Database Benchmark en_US dc.type (資料類型) thesis en_US dc.relation.reference (參考文獻) 1. Bohme, T. and E. Rahm (2001), “Xmach-1: Benchmark for XML Data Management,” In Proceedings of the German Database Conference. 2. Carey, M.J., D.J. DeWitt and J.F. Naughton (1993), “The 007 Benchmark,” In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, pp12-21. 3. Carey, M.J., D.J. DeWitt, J.F. Naughton, M. Asgarian, P. Brown, J. Gehrke, and D. Shah (1997), “The BUCKY Object-Relational Benchmark,” In Proceedings ACM SIGMOD International Conference on Management of Data, pp.135-146. 4. DeWitt, David, Jeffrey Naughton, Jayavel Shanmugasundaram, Kristin Tufte, Gang He and Chun Zhang (1999), “Relational Databases for Querying XML Documents: Limitations and Opportunities,” In Proceedings of the 25th VLDB Conference, pp.25-32. 5. Fankhauser P., M. Marchiori, and J. Robie (2000), “XML Query Requirements,” http://www.w3.org/TR/xmlquery-req 6. Florescu, D. and D. Kossmann (1999), “A Performance Evaluation of Alternative Mapping Schemes for Storing XML Data in a Relational Database,” Technical Report No.3680, INRIA. 7. Florescu, D. and D. Kossmann (1999), “Storing and Querying XML Data Using an RDBMS,” IEEE Data Engineering Bulletin, 22(3), pp.27-34. 8. Gray, Jim (1993), The Benchmark Handbook, Morgan Kaufmann, San Mateo, CA, 9. Li, Y. Guang, Stephane Bressan, Gillian Dobbie, Zoe Lacroix, Mong Li Lee, Ullas Nambiar, and Bimlesh Wadhwa (2001), “X007: Applying 007 Benchmark to XML Query Processing Tools,” In Proceedings of the 10th International Conference on Information and Knowledge Management, pp.27-36. 10. Poess, M. and C. Floyd (2000), “New TPC Benchmarks for Decision Support and Web Commerce,” ACM SIGMOD RECORD, 29(4), pp.64-71. 11. Schmidt, A. R., F. Waas, M.L. Kersten, D. Florescu, I. Manolescu, M.J. Carey and R. Busse (2001), “The XML Benchmark Project,” CWI Technical reports. 12. Schmidt, A., F. Waas, M.L. Kersten, D. Florescu, I. Manolescu, M.J. Carey and R. Busse (2001) “Why and How to Benchmark XML Database,” SIGMOD Record, 30(3) pp.41-50. 13. Schmidt, A., M. Kersten, D. Florescu, M. Carey, I. Manolescu, and F. Waas (2000), “The XML Store Benchmark Project,” http://www.xmlperformance.org 14. Seng, J. L. (1995), “Benchmarking Relational Database Systems — Overview,” The 6th International Conference on Information Management, pp.55-60. 15. Seng, J. L. (1996), “Workload characterization using requirements analysis,” The International Conference on Information Systems and Management of Data, pp.71-77. 16. Tian, F., D.J. DeWitt, J. Chen and C. Zhang (2000), “The design and performance evaluation of alternative XML storage strategies,” unpublished manuscript, University of Wisconsin, Madison, pp.1-9. 17. Zhao, B. Y. (2000), “The Xset XML search engine and Xbench XML query benchmark,” Technical Report No. UCB/CSD-00-1112, Computer Science Division, University of California at Berkeley. 中文參考文獻 1. 林嬿芳(2001),「網際網路超媒體資料庫工作量模型產生之研究」,國立政治大學資訊管理學研究所。 2. 翁嘉正(2000),「設計與製作XML文件資料庫」,大同大學資訊工程研究所。 3. 張子文(2001),「XML-Based HTML Wrapper建置之研究在網路銀行個人帳戶資訊彙整服務上之應用」,國立政治大學資訊管理學研究所。 4. 菲立浦斯Phillips Lee Anne(2001),「XML全方位完全剖析」,第三波電腦圖書資料公司,台北市。 5. 榮欽科技主筆室(2001),「最新XML入門與應用」,松崗電腦圖書資料公司,台北市。 6. 韓先良(1999),「分散式關聯資料庫系統績效評估工作量模式之研究」,國立政治大學資訊管理學研究所。 zh_TW