바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

객체-관계형 데이터베이스에 의한 XML문헌의 검색성능 평가

Retrieval Performance of XML Documents Using Object-Relational Databases

정보관리학회지 / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2004, v.21 no.2, pp.189-210
https://doi.org/10.3743/KOSIM.2004.21.2.189
김희섭 (경북대학교)
  • 다운로드 수
  • 조회수

초록

본 연구의 목적은 객체-관계형 데이터베이스 접근에 의한 XML 문헌의 검색 성능을 평가하는 것이다. 본 논문에서는 INEX(Initiative for the Evaluation of XML retrieval)에서의 XML 문헌의 색인 및 검색 방법에 대하여, 그리고 실험 방법론들에 대하여 기술하고 있다. 대부분의 전통적인 정보검색 성능평가 실험에서와 같이 본 연구에서 사용된 테스트 콜렉션(test collection)은 문헌(즉, XML 문헌), 토픽, ad hoc 검색, 적합성 판단, 평가로 이루어졌다. 그리고 ORDBMS 기술들을 기반으로 개발된 전용 XML 데이터베이스의 일종인 EXIMATM Supply을 사용하여 INEX에서 제공한 대규모 XML 문헌들을 저장하고 검색하였다. 본 논문에서는 실험에서 사용한 시스템에 대한 개략적인 기능들과 색인 및 검색 과정 그리고 INEX 2002에서의 성능평가 결과에 대하여, 앞으로 개선되어야 할 기능에 대하여 논하고 있다.

keywords
XML documents, EXIMA supply, object-relational DBMS, IR performance evaluation, INEX, XML 문헌, 객체-관계형 DBMSs, 정보 검색, 성능 평가1. Introduction, XML documents, EXIMA supply, object-relational DBMS, IR performance evaluation, INEX, XML 문헌, 객체-관계형 DBMSs, 정보 검색, 성능 평가1. Introduction

Abstract

The purpose of this study is to evaluate the performance of XML retrieval based on ORDBMSs(Object-Relational Database Management Systems) approach. This paper describes indexing and retrieval methods for XML documents and the methodologies of experiments at INEX(Initiative for the Evaluation of XML retrieval). Like any other traditional information retrieval experiment, the test collection was consists of documents, topics/queries, task, relevance assessments and evaluation. EXIMATM Supply, a kind of native XML DB based on ORDBMS technologies, is used for this experiment. Although this approach has many benefits, for example, no delay in storing and searching XML documents, but it showed relatively disappointed retrieval performance at INEX 2002. This result may caused since the given topics had to be decomposed and modified to be processed by the XPath processor, and during this modification the original meaning of topics can be changed inevitably and some important information may pass over.

keywords
XML documents, EXIMA supply, object-relational DBMS, IR performance evaluation, INEX, XML 문헌, 객체-관계형 DBMSs, 정보 검색, 성능 평가1. Introduction, XML documents, EXIMA supply, object-relational DBMS, IR performance evaluation, INEX, XML 문헌, 객체-관계형 DBMSs, 정보 검색, 성능 평가1. Introduction

참고문헌

1.

(2003). Querying Structured Text in an XML Database. , 4-15.

2.

(2002). Second Edition of the XML and Information Ret- rieval Workshop. 36(2), 53-57.

3.

(1988). The Design and Implementation of O2 of the Second International Workshop on Object-oriented Database. , -.

4.

(2003). Searching XML Documents via XML Fragments. , 151-158.

5.

(2001). XML and Infor- mation Retrieval: a SIGIR 2000 Workshop. 30(1), 62-65.

6.

(2000). XML: Current Developments and Future Chall- enges for the Database Com- munity. , 3-17.

7.

(2000). XML and DB2. , 569-573.

8.

(2001). Expressive Retrieval from XML documents. , 163-171.

9.

(2001). Accessing and Transforming Dynamic Content based on XML: Alternative Techniques and a Practical Implementation. , -.

10.

(1999). XML-QL: A Query Language for XML. , -.

11.

(2001). Query Engines for Web-accessible XML Data. , 251-260.

12.

(1998). Catching the Boat with Strudel: Experiences with a Web-Site Management System. 27(2), 414-425.

13.

(1999). Storing and Querying XML Data using an RDBMS. 22(3), 27-34.

14.

(2002). INEX: Initiative for the Evaluation of XML Ret- rieval. , -.

15.

(2001). XIRQL: A Query Language for Information Retrieval in XML Documents. , 172-180.

16.

(2001). Mapping XML Documents to the Object-Relational Form. 3, 1757-1761.

17.

INEX homepage. , -.

18.

load area. , -.

19.

(1999). An Effective Mechanism for Index Update in Structured Documents. , 383-390.

20.

(2004). A Report on the First Year of the Initiative for the Evaluation of XML Retrieval: INEX'02. 55(6), 551-556.

21.

(2001). A Perfor- mance Evaluation of Storing XML Data in Relational Database Management Systems. , 31-38.

22.

(2002). Structured Infor- mation Retrieval in XML documents. , 663-667.

23.

(1996). Index Structures for Structured Documents. , 91-99.

24.

(2001). Answering XML Queries Over Heterogene- ous Data Sources. , 241-250.

25.

(1997). Lore: A Database Management System for Semi-structured Data. 26(3), 54-66.

26.

(2000). Querying XML documents. 19(1), 24-26.

27.

(1997). Proximal Nodes: A Model to Query Document Databases by Content and Structure. 15(4), 400-435.

28.

(2001). Efficient Relational Storage and Retrieval of XML Documents. 1997, 137-150.

29.

(1999). Relational Databases for Querying XML Documents: Limitations and Opportunities. , 302-314.

30.

(2001). XML Indexing and Retrieval with a Hybrid Storage Model. 3(2), 252-261.

31.

(2002). The Design and Performance Evaluation of Alter- native XML Storage Strategies. 31(1), 5-10.

32.

(2001). Bridging XML-Schema and Relational Databases: A System for Generating and Manipulating Relational Databases using valid XML Documents. , 105-114.

33.

(2003). XVerter: Querying XML Data with OR-DBMS. , 37-44.

34.

W3C Document Object Model. , -.

35.

W3C XPath. , -.

36.

(2001). On Supporting Containment Queries in Relational Database Manage- ment Systems. , 425-436.

37.

(1996). Self-Indexing Inverted Files for Fast Text Retrieval. 14(4), 349-379.

38.

The New XML Type Datatype. , -.

정보관리학회지