Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 開放型XML資料庫績效評估工作量之模型
An Open Workload Model for XML Database Benchmark
作者 尤靖雅
貢獻者 諶家蘭
尤靖雅
關鍵詞 XML資料庫
績效評估
工作量模型
資料模型
查詢模型
控制模型
XML database
performance evaluation
workload model
data model
query model
control model
XML
benchmark
日期 2002
上傳時間 10-May-2016 16:15:34 (UTC+8)
摘要   XML (eXtensible Markup Language)是今日新興在網路上所使用的延伸性的標記語言。它具有豐富的語意表達及與展現方式獨立的資料獨立性。由於這些特性,使得XML成為新的資料交換標準並且其應用在資料庫中產生了許多新的研究議題在資料儲存和查詢處理上。在本篇研究中,將研究XML資料庫中的績效評估的議題並且發展一個可適用於不同應用領域及各種平台上的使用者導向且開放型工作量模式以評估XML資料庫績效。此XML開放型的工作量模型包含三個子模型─XML資料模型、查詢模型以及控制模型。XML資料模型將模式化XML文件中階層式的結構概念;查詢模型包含了一連串查詢模組以供測試XML資料庫的處理查詢能力以及一個開放型的查詢輸入介面以供使用者依照需求設定所需的測試查詢;控制模型中定義了一連串變數以供設定績效評估系統中的執行環境。我們發展此系統化且具開放型的工作量方法可以幫助各種不同應用領域的使用者預測及展現XML資料庫系統的績效。
  XML (eXtensible Markup Language) is the emerging data format for data processing on the Internet. XML provides a rich data semantics and data independence from the presentation. Thanks to these features, XML becomes a new data exchange standard and leads new storage and query processing issues on database research communities. In this paper, the performance evaluation issues on XML databases have been studied and a generic and requirement-driven XML workload model that is applicable to any application scenario or movable on various platforms is developed. There are three sub-models in this generic workload model, the XML data model, the query model, and the control model. The XML data model formulates the generic hierarchy structure of XML documents and supports a flexible document structure of the test database. The XML query model contains a flexible classical query module selector and an open query input to define the requirement-driven test query model to challenge the XML query processing ability of the XML database. The control model defines variables that are used to set up the implementation of a benchmark. This open, flexible, and systematic workload method permits users in various application domains to predicate or profile the performance of the XML database systems.
參考文獻 1. Bohme, T. and E. Rahm (2001), “Xmach-1: Benchmark for XML Data Management,” In Proceedings of the German Database Conference.
     2. Carey, M.J., D.J. DeWitt and J.F. Naughton (1993), “The 007 Benchmark,” In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, pp12-21.
     3. Carey, M.J., D.J. DeWitt, J.F. Naughton, M. Asgarian, P. Brown, J. Gehrke, and D. Shah (1997), “The BUCKY Object-Relational Benchmark,” In Proceedings ACM SIGMOD International Conference on Management of Data, pp.135-146.
     4. DeWitt, David, Jeffrey Naughton, Jayavel Shanmugasundaram, Kristin Tufte, Gang He and Chun Zhang (1999), “Relational Databases for Querying XML Documents: Limitations and Opportunities,” In Proceedings of the 25th VLDB Conference, pp.25-32.
     5. Fankhauser P., M. Marchiori, and J. Robie (2000), “XML Query Requirements,” http://www.w3.org/TR/xmlquery-req
     6. Florescu, D. and D. Kossmann (1999), “A Performance Evaluation of Alternative Mapping Schemes for Storing XML Data in a Relational Database,” Technical Report No.3680, INRIA.
     7. Florescu, D. and D. Kossmann (1999), “Storing and Querying XML Data Using an RDBMS,” IEEE Data Engineering Bulletin, 22(3), pp.27-34.
     8. Gray, Jim (1993), The Benchmark Handbook, Morgan Kaufmann, San Mateo, CA,
     9. Li, Y. Guang, Stephane Bressan, Gillian Dobbie, Zoe Lacroix, Mong Li Lee, Ullas Nambiar, and Bimlesh Wadhwa (2001), “X007: Applying 007 Benchmark to XML Query Processing Tools,” In Proceedings of the 10th International Conference on Information and Knowledge Management, pp.27-36.
     10. Poess, M. and C. Floyd (2000), “New TPC Benchmarks for Decision Support and Web Commerce,” ACM SIGMOD RECORD, 29(4), pp.64-71.
     11. Schmidt, A. R., F. Waas, M.L. Kersten, D. Florescu, I. Manolescu, M.J. Carey and R. Busse (2001), “The XML Benchmark Project,” CWI Technical reports.
     12. Schmidt, A., F. Waas, M.L. Kersten, D. Florescu, I. Manolescu, M.J. Carey and R. Busse (2001) “Why and How to Benchmark XML Database,” SIGMOD Record, 30(3) pp.41-50.
     13. Schmidt, A., M. Kersten, D. Florescu, M. Carey, I. Manolescu, and F. Waas (2000), “The XML Store Benchmark Project,” http://www.xmlperformance.org
     14. Seng, J. L. (1995), “Benchmarking Relational Database Systems — Overview,” The 6th International Conference on Information Management, pp.55-60.
     15. Seng, J. L. (1996), “Workload characterization using requirements analysis,” The International Conference on Information Systems and Management of Data, pp.71-77.
     16. Tian, F., D.J. DeWitt, J. Chen and C. Zhang (2000), “The design and performance evaluation of alternative XML storage strategies,” unpublished manuscript, University of Wisconsin, Madison, pp.1-9.
     17. Zhao, B. Y. (2000), “The Xset XML search engine and Xbench XML query benchmark,” Technical Report No. UCB/CSD-00-1112, Computer Science Division, University of California at Berkeley.
     中文參考文獻
     1. 林嬿芳(2001),「網際網路超媒體資料庫工作量模型產生之研究」,國立政治大學資訊管理學研究所。
     2. 翁嘉正(2000),「設計與製作XML文件資料庫」,大同大學資訊工程研究所。
     3. 張子文(2001),「XML-Based HTML Wrapper建置之研究在網路銀行個人帳戶資訊彙整服務上之應用」,國立政治大學資訊管理學研究所。
     4. 菲立浦斯Phillips Lee Anne(2001),「XML全方位完全剖析」,第三波電腦圖書資料公司,台北市。
     5. 榮欽科技主筆室(2001),「最新XML入門與應用」,松崗電腦圖書資料公司,台北市。
     6. 韓先良(1999),「分散式關聯資料庫系統績效評估工作量模式之研究」,國立政治大學資訊管理學研究所。
描述 碩士
國立政治大學
資訊管理學系
89356040
資料來源 http://thesis.lib.nccu.edu.tw/record/#A2010000457
資料類型 thesis
dc.contributor.advisor 諶家蘭zh_TW
dc.contributor.author (Authors) 尤靖雅zh_TW
dc.creator (作者) 尤靖雅zh_TW
dc.date (日期) 2002en_US
dc.date.accessioned 10-May-2016 16:15:34 (UTC+8)-
dc.date.available 10-May-2016 16:15:34 (UTC+8)-
dc.date.issued (上傳時間) 10-May-2016 16:15:34 (UTC+8)-
dc.identifier (Other Identifiers) A2010000457en_US
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/95972-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 資訊管理學系zh_TW
dc.description (描述) 89356040zh_TW
dc.description.abstract (摘要)   XML (eXtensible Markup Language)是今日新興在網路上所使用的延伸性的標記語言。它具有豐富的語意表達及與展現方式獨立的資料獨立性。由於這些特性,使得XML成為新的資料交換標準並且其應用在資料庫中產生了許多新的研究議題在資料儲存和查詢處理上。在本篇研究中,將研究XML資料庫中的績效評估的議題並且發展一個可適用於不同應用領域及各種平台上的使用者導向且開放型工作量模式以評估XML資料庫績效。此XML開放型的工作量模型包含三個子模型─XML資料模型、查詢模型以及控制模型。XML資料模型將模式化XML文件中階層式的結構概念;查詢模型包含了一連串查詢模組以供測試XML資料庫的處理查詢能力以及一個開放型的查詢輸入介面以供使用者依照需求設定所需的測試查詢;控制模型中定義了一連串變數以供設定績效評估系統中的執行環境。我們發展此系統化且具開放型的工作量方法可以幫助各種不同應用領域的使用者預測及展現XML資料庫系統的績效。zh_TW
dc.description.abstract (摘要)   XML (eXtensible Markup Language) is the emerging data format for data processing on the Internet. XML provides a rich data semantics and data independence from the presentation. Thanks to these features, XML becomes a new data exchange standard and leads new storage and query processing issues on database research communities. In this paper, the performance evaluation issues on XML databases have been studied and a generic and requirement-driven XML workload model that is applicable to any application scenario or movable on various platforms is developed. There are three sub-models in this generic workload model, the XML data model, the query model, and the control model. The XML data model formulates the generic hierarchy structure of XML documents and supports a flexible document structure of the test database. The XML query model contains a flexible classical query module selector and an open query input to define the requirement-driven test query model to challenge the XML query processing ability of the XML database. The control model defines variables that are used to set up the implementation of a benchmark. This open, flexible, and systematic workload method permits users in various application domains to predicate or profile the performance of the XML database systems.en_US
dc.description.tableofcontents 謝辭
     Abstract
     摘要
     Table of Contents-----I
     List of Tables-----III
     List of Figures-----IV
     Chapter 1 Introduction-----1
       1.1 Research Motivation-----1
       1.2 Research Issues-----2
       1.3 Research Structure-----3
     Chapter 2 Literature Review-----5
       2 Benchmark and Workload Model-----5
       2.1 Benchmark-----5
       2.2 Workload-----6
       2.3 Characteristics of XML Documents-----6
       2.4 XML Database Benchmark-----7
         2.4.1 XML Data Model-----8
         2.4.2 XML Query Model-----17
       2.5 Comparison and Contrast of XML Benchmarks-----28
     Chapter 3 An Open Workload Model for XML Database Benchmark-----30
       3.1 Research Approach-----30
       3.2 Research Model-----30
       3.3 Generic XML Data Model-----31
       3.4 Generic XML Query Model-----34
         3.4.1 Exact Match-----35
         3.4.2 Ordered Access-----36
         3.4.3 Casting-----36
         3.4.4 Regular PathExpressions-----37
         3.4.5 Chasing References-----37
         3.4.6 Construction of Complex Results-----37
         3.4.7 Joins on values-----38
         3.4.8 Reconstruction-----38
         3.4.9 Full Text-----39
         3.4.10 Path Traversals-----39
         3.4.11 Missing Elements-----39
         3.4.12 Function Application-----39
         3.4.13 Sorting-----40
         3.4.14 Aggregation-----40
         3.4.15 Structural Recursion-----41
         3.4.16 Updates Operations-----41
       3.5 Open Data Model and Open Query Model-----41
       3.6 Control Model-----42
     Chapter 4 A Prototype System Implementation-----44
       4.1 The Prototype System Platform and Architecture-----44
       4.2 The Architecture of the Prototype System-----45
       4.2.1 The XML Data Loader-----47
       4.2.2 The Query Generator-----48
       4.2.3 The Control Model Generator-----50
       4.2.4 Results Collector-----52
     Chapter 5 Discussion-----54
       5.1 A More Open and A More Generic Approach for XML Benchmark-----54
       5.2 A Model Solution for Both Administrators and Application Programmers-----55
       5.3 Research Limitations-----55
     Chapter 6 Conclusion-----57
       6.1 Summary-----57
       6.2 Research Results-----58
       6.3 Future Research Directions-----59
     References-----60
     中文參考文獻-----62
     
     List of Tables
     Table 2.1-----19
     Table 2.2-----20
     Table 2.3-----21
     Table 2.4-----22
     Table 2.5-----24
     Table 2.6-----25
     Table 2.7-----27
     Table 4.1-----42
     Table 6.1-----56
     
     List of Figures
     Figure 1.1-----4
     Figure 2.1-----12
     Figure 2.2-----12
     Figure 2.3-----13
     Figure 2.4-----14
     Figure 2.5-----15
     Figure 2.6-----16
     Figure 3.1-----29
     Figure 3.2-----31
     Figure 3.3-----32
     Figure 4.1-----43
     Figure 4.2-----44
     Figure 4.3-----45
     Figure 4.4-----46
     Figure 4.5-----47
     Figure 4.6-----47
     Figure 4.7-----48
     Figure 4.8-----49
     Figure 4.9-----49
     Figure 4.10-----50
     Figure 4.11-----51
zh_TW
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#A2010000457en_US
dc.subject (關鍵詞) XML資料庫zh_TW
dc.subject (關鍵詞) 績效評估zh_TW
dc.subject (關鍵詞) 工作量模型zh_TW
dc.subject (關鍵詞) 資料模型zh_TW
dc.subject (關鍵詞) 查詢模型zh_TW
dc.subject (關鍵詞) 控制模型zh_TW
dc.subject (關鍵詞) XML databaseen_US
dc.subject (關鍵詞) performance evaluationen_US
dc.subject (關鍵詞) workload modelen_US
dc.subject (關鍵詞) data modelen_US
dc.subject (關鍵詞) query modelen_US
dc.subject (關鍵詞) control modelen_US
dc.subject (關鍵詞) XMLen_US
dc.subject (關鍵詞) benchmarken_US
dc.title (題名) 開放型XML資料庫績效評估工作量之模型zh_TW
dc.title (題名) An Open Workload Model for XML Database Benchmarken_US
dc.type (資料類型) thesisen_US
dc.relation.reference (參考文獻) 1. Bohme, T. and E. Rahm (2001), “Xmach-1: Benchmark for XML Data Management,” In Proceedings of the German Database Conference.
     2. Carey, M.J., D.J. DeWitt and J.F. Naughton (1993), “The 007 Benchmark,” In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, pp12-21.
     3. Carey, M.J., D.J. DeWitt, J.F. Naughton, M. Asgarian, P. Brown, J. Gehrke, and D. Shah (1997), “The BUCKY Object-Relational Benchmark,” In Proceedings ACM SIGMOD International Conference on Management of Data, pp.135-146.
     4. DeWitt, David, Jeffrey Naughton, Jayavel Shanmugasundaram, Kristin Tufte, Gang He and Chun Zhang (1999), “Relational Databases for Querying XML Documents: Limitations and Opportunities,” In Proceedings of the 25th VLDB Conference, pp.25-32.
     5. Fankhauser P., M. Marchiori, and J. Robie (2000), “XML Query Requirements,” http://www.w3.org/TR/xmlquery-req
     6. Florescu, D. and D. Kossmann (1999), “A Performance Evaluation of Alternative Mapping Schemes for Storing XML Data in a Relational Database,” Technical Report No.3680, INRIA.
     7. Florescu, D. and D. Kossmann (1999), “Storing and Querying XML Data Using an RDBMS,” IEEE Data Engineering Bulletin, 22(3), pp.27-34.
     8. Gray, Jim (1993), The Benchmark Handbook, Morgan Kaufmann, San Mateo, CA,
     9. Li, Y. Guang, Stephane Bressan, Gillian Dobbie, Zoe Lacroix, Mong Li Lee, Ullas Nambiar, and Bimlesh Wadhwa (2001), “X007: Applying 007 Benchmark to XML Query Processing Tools,” In Proceedings of the 10th International Conference on Information and Knowledge Management, pp.27-36.
     10. Poess, M. and C. Floyd (2000), “New TPC Benchmarks for Decision Support and Web Commerce,” ACM SIGMOD RECORD, 29(4), pp.64-71.
     11. Schmidt, A. R., F. Waas, M.L. Kersten, D. Florescu, I. Manolescu, M.J. Carey and R. Busse (2001), “The XML Benchmark Project,” CWI Technical reports.
     12. Schmidt, A., F. Waas, M.L. Kersten, D. Florescu, I. Manolescu, M.J. Carey and R. Busse (2001) “Why and How to Benchmark XML Database,” SIGMOD Record, 30(3) pp.41-50.
     13. Schmidt, A., M. Kersten, D. Florescu, M. Carey, I. Manolescu, and F. Waas (2000), “The XML Store Benchmark Project,” http://www.xmlperformance.org
     14. Seng, J. L. (1995), “Benchmarking Relational Database Systems — Overview,” The 6th International Conference on Information Management, pp.55-60.
     15. Seng, J. L. (1996), “Workload characterization using requirements analysis,” The International Conference on Information Systems and Management of Data, pp.71-77.
     16. Tian, F., D.J. DeWitt, J. Chen and C. Zhang (2000), “The design and performance evaluation of alternative XML storage strategies,” unpublished manuscript, University of Wisconsin, Madison, pp.1-9.
     17. Zhao, B. Y. (2000), “The Xset XML search engine and Xbench XML query benchmark,” Technical Report No. UCB/CSD-00-1112, Computer Science Division, University of California at Berkeley.
     中文參考文獻
     1. 林嬿芳(2001),「網際網路超媒體資料庫工作量模型產生之研究」,國立政治大學資訊管理學研究所。
     2. 翁嘉正(2000),「設計與製作XML文件資料庫」,大同大學資訊工程研究所。
     3. 張子文(2001),「XML-Based HTML Wrapper建置之研究在網路銀行個人帳戶資訊彙整服務上之應用」,國立政治大學資訊管理學研究所。
     4. 菲立浦斯Phillips Lee Anne(2001),「XML全方位完全剖析」,第三波電腦圖書資料公司,台北市。
     5. 榮欽科技主筆室(2001),「最新XML入門與應用」,松崗電腦圖書資料公司,台北市。
     6. 韓先良(1999),「分散式關聯資料庫系統績效評估工作量模式之研究」,國立政治大學資訊管理學研究所。
zh_TW