
An Opinionated Document Retrieval System based on Hybrid Method

Original Korean title: 혼합 방식에 기반한 의견 문서 검색 시스템

Journal of the Korean Society for Information Management (정보관리학회지), ISSN (print) 1013-0799; (online) 2586-2073
2008, v.25 no.4, pp.115-129
https://doi.org/10.3743/KOSIM.2008.25.4.115
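Seung-Wook Lee (이승욱), Graduate School of Information and Communication, Korea University
Young-In Song (송영인), Graduate School of Information and Communication, Korea University
Hae-Chang Rim (임해창), Korea University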

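Abstract (translated from the Korean original)

As the Web has become widespread and open, it is turning from a mere space for acquiring information into a forum for expressing and exchanging opinions, and the need for technology that automatically retrieves people's opinions on a particular topic expressed on the Web is steadily growing. This opinionated document retrieval problem is difficult to solve with general information retrieval methods that consider only the relevance between a user query and documents; it requires a more advanced system that can analyze whether a document contains opinions. This paper proposes a system that can perform opinionated document retrieval effectively within the architecture of an existing retrieval system. For opinion retrieval, we propose a new hybrid method for in-document opinion analysis that combines the existing dictionary-based and machine-learning-based approaches, and we show through experiments that it improves retrieval performance.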

Keywords
information retrieval, opinion retrieval, hybrid method

Abstract

As the Web has grown and become popular, it has changed from a space for seeking information into a place where people express, share, and debate their opinions. Accordingly, the need to search for opinions expressed on the Web is also increasing. However, it is difficult to meet this need with a classical information retrieval system that considers only the relevance between the user's query and documents. Instead, a more advanced system that captures subjective information in documents is required. The proposed system retrieves opinionated documents effectively while building on an existing information retrieval system. This paper proposes a hybrid method that combines a dictionary-based opinion analysis technique with a machine-learning-based one. Experimental results show that the proposed method improves retrieval performance.
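
The abstract does not say how the dictionary-based and machine-learning-based analyses are combined, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes a small hypothetical opinion lexicon (`OPINION_LEXICON`), a toy Naive Bayes classifier standing in for the machine-learning component, and linear interpolation with hypothetical weights `alpha` (lexicon vs. classifier) and `beta` (topical relevance vs. opinion score) to re-rank results returned by an existing retrieval system.

```python
import math
from collections import Counter

# Hypothetical opinion lexicon (assumption; the paper's dictionary is not given here).
OPINION_LEXICON = {"love", "hate", "great", "terrible", "awful", "best"}


def tokenize(text):
    """Lowercase whitespace tokenizer (a deliberate simplification)."""
    return text.lower().split()


def lexicon_score(tokens):
    """Dictionary-based signal: fraction of tokens found in the lexicon."""
    if not tokens:
        return 0.0
    return sum(t in OPINION_LEXICON for t in tokens) / len(tokens)


class NaiveBayesOpinion:
    """Toy multinomial Naive Bayes with add-one smoothing.

    Stand-in for any machine-learning opinion classifier (assumption).
    Labels: 1 = opinionated, 0 = not opinionated; both must occur in training.
    """

    def fit(self, docs, labels):
        self.counts = {0: Counter(), 1: Counter()}
        self.priors = Counter(labels)
        for doc, label in zip(docs, labels):
            self.counts[label].update(tokenize(doc))
        self.vocab = set(self.counts[0]) | set(self.counts[1])
        return self

    def prob_opinionated(self, tokens):
        """Posterior probability of the opinionated class; unseen tokens are skipped."""
        n_docs = sum(self.priors.values())
        log_p = {}
        for label in (0, 1):
            total = sum(self.counts[label].values()) + len(self.vocab)
            lp = math.log(self.priors[label] / n_docs)
            for t in tokens:
                if t in self.vocab:
                    lp += math.log((self.counts[label][t] + 1) / total)
            log_p[label] = lp
        m = max(log_p.values())  # subtract the max for numerical stability
        e0, e1 = math.exp(log_p[0] - m), math.exp(log_p[1] - m)
        return e1 / (e0 + e1)


def hybrid_opinion_score(text, clf, alpha=0.5):
    """Interpolate the dictionary-based and classifier-based opinion signals."""
    tokens = tokenize(text)
    return alpha * lexicon_score(tokens) + (1 - alpha) * clf.prob_opinionated(tokens)


def rerank(results, clf, beta=0.5, alpha=0.5):
    """Re-rank (doc_text, relevance) pairs; relevance is assumed to lie in [0, 1]."""
    scored = [
        (doc, (1 - beta) * rel + beta * hybrid_opinion_score(doc, clf, alpha))
        for doc, rel in results
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    train_docs = [
        "i love this phone it is great",
        "i hate this terrible camera",
        "the phone has a camera",
        "the store opens at nine",
    ]
    clf = NaiveBayesOpinion().fit(train_docs, [1, 1, 0, 0])
    retrieved = [("the camera has a store", 0.6), ("i love this great camera", 0.6)]
    for doc, score in rerank(retrieved, clf):
        print(f"{score:.3f}  {doc}")
```

Keeping the relevance score from the existing retrieval system and adding the opinion score as a separate re-ranking term mirrors the abstract's point that opinion retrieval can be layered on top of a conventional system; the specific weights and the interpolation form are choices made for this sketch only.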


