정보관리학회지, 한국정보관리학회

1

정은경(이화여자대학교) 2009, Vol.26, No.3, pp.261-278 https://doi.org/10.3743/KOSIM.2009.26.3.261

초록보기

초록

기계학습 기반 문서범주화 기법에 있어서 최적의 자질을 구성하는 것이 성능향상에 있어서 중요하다. 본 연구는 학술지 수록 논문의 필수적 구성요소인 저자 제공 키워드와 논문제목을 대상으로 자질확장에 관한 실험을 수행하였다. 자질확장은 기본적으로 선정된 자질에 기반하여 WordNet과 같은 의미기반 사전 도구를 활용하는 것이 일반적이다. 본 연구는 키워드와 논문제목을 대상으로 WordNet 동의어 관계 용어를 활용하여 자질확장을 수행하였으며, 실험 결과 문서범주화 성능이 자질확장을 적용하지 않은 결과와 비교하여 월등히 향상됨을 보여주었다. 이러한 성능향상에 긍정적인 영향을 미치는 요소로 파악된 것은 정제된 자질 기반 및 분류어 기준의 동의어 자질확장이다. 이때 용어의 중의성 해소 적용과 비적용 모두 성능향상에 영향을 미친 것으로 파악되었다. 본 연구의 결과로 키워드와 논문제목을 활용한 분류어 기준 동의어 자질 확장은 문서 범주화 성능향상에 긍정적인 요소라는 것을 제시하였다.

Abstract

Identifying optimal feature sets in Text Categorization(TC) is crucial in terms of improving the effectiveness. In this study, experiments on feature expansion were conducted using author provided keyword sets and article titles from typical scientific journal articles. The tool used for expanding feature sets is WordNet, a lexical database for English words. Given a data set and a lexical tool, this study presented that feature expansion with synonymous relationship was significantly effective on improving the results of TC. The experiment results pointed out that when expanding feature sets with synonyms using on classifier names, the effectiveness of TC was considerably improved regardless of word sense disambiguation.

2

사전 정보를 이용한 단어 중의성 해소 모형에 관한 실험적 연구

이용구(계명대학교) ; 정영미(연세대학교) 2007, Vol.24, No.1, pp.321-342 https://doi.org/10.3743/KOSIM.2007.24.1.321

초록보기

초록

이 연구에서는 수작업 태깅없이 기계가독형 사전을 이용하여 자동으로 의미를 태깅한 후 학습데이터로 구축한 분류기에 대해 의미를 분류하는 단어 중의성 해소 모형을 제시하였다. 자동 태깅을 위해 사전 추출 정보 기반 방법과 연어 공기 기반 방법을 적용하였다. 실험 결과, 자동 태깅에서는 복수 자질 축소를 적용한 사전 추출 정보 기반 방법이 70.06%의 태깅 정확도를 보여 연어 공기 기반 방법의 56.33% 보다 24.37% 향상된 성능을 가져왔다. 사전 추출 정보 기반 방법을 이용한 분류기의 분류 정학도는 68.11%로서 연어 공기 기반 방법의 62.09% 보다 9.7% 향상된 성능을 보였다. 또한 두 자동 태깅 방법을 결합한 결과 태깅 정확도는 76.09%, 분류 정확도는 76.16%로 나타났다.

Abstract

This study presents an effective word sense disambiguation model that does not require manual sense tagging process by automatically tagging the right sense using a machine-readable dictionary, and attempts to classify the senses of those words using a classifier built from the training data. The automatic tagging technique was implemnted by the dictionary information-based and the collocation co-occurrence-based methods. The dictionary information-based method that applied multiple feature selection showed the tagging accuracy of 70.06%, and the collocation co-occurrence-based method 56.33%. The sense classifier using the dictionary information-based tagging method showed the classification accuracy of 68.11%, and that using the collocation co-occurrence-based tagging method 62.09%. The combined tagging method applying data fusion technique achieved a greater performance of 76.09% resulting in the classification accuracy of 76.16%.

3

주제별 분산 지식베이스에 의한 개념기반 정보검색시스템의 성능향상에 관한 연구

노영희(이화여자대학교) 2002, Vol.19, No.1, pp.47-69 https://doi.org/10.3743/KOSIM.2002.19.1.047

초록보기

초록

개념기반 정보검색기법은 불리언 검색기법의 문제점을 해소했다고 평가받고 있는 단순 매칭함수 기법이나 P-norm 검색기법보다 높은 성능을 보여주고 있다. 그러나 개념화장에 필수적인 의미망 지식베이스를 구축하는데 시간이 너무 오래 걸리는 단점이 있다. 본 연구에서는 이러한 문제를 해결하기 위해 주제범주별로 지식베이스를 분산 구축함으로써 지식베이스 구축에 소요되는 시간을 단축하면서도 검색성능이 떨어지지 않도록 하는 방안을 모색하고자 하였다.

Abstract

The concept based retrieval model has shown a higher performance than those of the simple matching function method or the P-norm retrieval method introduced to compensate the demerits of the Boolean retrieval model. However. it takes too long to create a semantic-net knowledge base, which is essential in concept exploration. In order to solve such demerits. a method was sought out by creating a distributed knowledge base by subjects to reduce construction time without hindering the performance of retrieval.

4

연구문헌의 지식구조를 반영하는 의미기반의 지식조직체계에 관한 연구

고영만(성균관대학교) ; 송인석(한국과학기술정보연구원) 2011, Vol.28, No.1, pp.145-170 https://doi.org/10.3743/KOSIM.2011.28.1.145

초록보기

초록

본 연구는 연구문헌의 지식구조를 반영하는 의미기반 지식조직체계의 실험적 모형을 제시하는 것을 목적으로 한다. 이를 위해 한국연구재단의 기초학문자료센터에 대한 사례분석을 하였다. 기초학문자료센터 연구성과물 DB와 학술용어 DR의 개념클래스 및 인스턴스를 대상으로 연구문헌의 지식구조를 파악하였으며, 기초학문자료센터 시스템의 학술적 이해형성 기능을 분석하였다. 또한 연구문헌의 지식구조와 색인어의 관계를 분석하였다. 이러한 분석을 통해 지식구조와 색인어의 관계구조, 26개의 연구문헌 지식구조 공리 및 11개의 의미관계 추론규칙으로 구성되는 온톨로지 모형, 즉 연구문헌의 지식구조와 그 의미관계에 의한 실험적 지식조직체계 모형을 제시하였다.

Abstract

The purpose of this paper is to suggest a pilot model of knowledge organizing system which reflects the knowledge structure of research papers, using a case analysis on the “Korean Research Memory” of the National Research Foundation of Korea. In this paper, knowledge structure of the research papers in humanities and social science is described and the function of the “Korean Research Memory” for scholarly sense-making is analysed. In order to suggest the pilot model of the knowledge organizing system, the study also analysed the relation between indexed keyword and knowledge structure of research papers in the Korean Research Memory. As a result, this paper suggests 24 axioms and 11 inference rules for an ontology based on semantic relation of the knowledge structure.

5

연구 논문의 의미 구조 기반 메타데이터 항목의 자동 식별 처리를 위한 문장 구조 분석

송민선(대림대학교) 2018, Vol.35, No.3, pp.101-121 https://doi.org/10.3743/KOSIM.2018.35.3.101

초록보기

초록

This study proposes the analysis method in sentence semantics that can be automatically identified and processed as appropriate items in the system according to the composition of the sentences contained in the data corresponding to the logical semantic structure metadata of the research papers. In order to achieve the purpose, the structure of sentences corresponding to ‘Research Objectives’ and ‘Research Outcomes’ among the semantic structure metadata was analyzed based on the number of words, the link word types, the role of many-appeared words in sentences, and the end types of a word. As a result of this study, the number of words in the sentences was 38 in ‘Research Objectives’ and 212 in ‘Research Outcomes’. The link word types in ‘Research Objectives’ were occurred in the order such as Causality, Sequence, Equivalence, In-other-word/Summary relation, and the link word types in ‘Research Outcomes’ were appeared in the order such as Causality, Equivalence, Sequence, In-other-word/Summary relation. Analysis target words like ‘역할(Role)’, ‘요인(Factor)’ and ‘관계(Relation)’ played a similar role in both purpose and result part, but the role of ‘연구(Study)’ was little different. Finally, the verb endings in sentences were appeared many times such as ‘∼고자’, ‘∼였다’ in ‘Research Objectives’, and ‘∼었다’, ‘∼있다’, ‘∼였다’ in ‘Research Outcomes’. This study is significant as a fundamental research that can be utilized to automatically identify and input the metadata element reflecting the common logical semantics of research papers in order to support researchers’ scholarly sensemaking.

Abstract

6

차세대 검색서비스의 속성에 관한 연구

이수상(부산대학교) ; 이순영(부산대학교) 2009, Vol.26, No.4, pp.93-112 https://doi.org/10.3743/KOSIM.2009.26.4.093

초록보기

초록

최근 정보검색 환경은 검색 2.0으로 대표되는 차세대 검색서비스에 대한 논의들이 활발해지고 있다. 따라서 이 연구에서는 정보검색의 발전과 진화에 대한 다양한 논의들을 토대로 정보검색의 발전 과정을 구분하였다. 그리고 현재 거론되고 있는 차세대 검색서비스의 등장 배경, 주요 개념, 그리고 관련 사례와 속성을 파악하였으며, 이러한 속성과 사례에 대한 데이터를 통해 차세대 검색서비스를 설명하는 핵심적인 키워드를 확인하기 위한 군집 분석을 수행하였다. 군집 분석의 결과 차세대 검색서비스를 대표하는 주요 키워드는 소셜 검색, 지능형 의미 검색, 그리고 관계기반 검색 등으로 나타났다.

Abstract

Recently in the area of the information environment, there are lively discussions about search 2.0 which is representative of the next generation search services. In this study, we divide information search model into matching and linking models according the developmental stages. Therefore, on the one hand, we analyze the background, main concepts, related attributes and cases of the next generation search services and the other, we identify the representative keywords by the group analysis of various attributes and cases of it. The result shows that the main keywords such as social search, artificial intelligence and semantic search, and relation/network based search are representative of the search 2.0.

7

검색엔진의 정확률 향상을 위한 질의어 의미와 사용자 반응 정보의 이용

윤성희(상명대학교) 2009, Vol.26, No.4, pp.81-92 https://doi.org/10.3743/KOSIM.2009.26.4.081

초록보기

초록

본 논문은 정보검색 시스템의 사용자 질의어와 색인에 기반한 검색 과정에서 나타나는 중의성 해소를 위해 질의어 의미정보와 사용자 피드백을 사용하여 검색 성능을 향상시키는 방법을 소개한다. 의미 정보를 이용하여 질의어의 중의성을 해소하는 검색 과정은 검색 결과로서 의미적으로 무관한 많은 문서들을 배제할 수 있다. 이를 위해 검색의 색인이 되는 명사 중심의 의미범주를 기반으로 의미정보 지식베이스를 구축하고, 검색 문서들을 색인어와 해당 의미범주로 분류한다. 검색 과정에서는 사용자의 질의 의미 선택과 정답 문서에 대한 참조 행위를 웹 페이지의 순위 결정에 반영하여 검색 성능을 향상시킬 수 있다.

Abstract

This paper proposes a technique for improving performance using word senses and user feedback in web information retrieval, compared with the retrieval based on ambiguous user query and index. Disambiguation using query word senses can eliminating the irrelevant pages from the search result. According to semantic categories of nouns which are used as index for retrieval, we build the word sense knowledge-base and categorize the web pages. It can improve the precision of retrieval system with user feedback deciding the query sense and information seeking behavior to pages.

8

온톨로지 기반 한의학 처방 지식관리시스템 설계에 관한 연구

이현실(원광대학교) ; 이두영(중앙대학교) 2003, Vol.20, No.1, pp.341-371 https://doi.org/10.3743/KOSIM.2003.20.1.341

초록보기

초록

본연구는 한의학 처방 지식관리시스템 설계에 요구되는 사항들이 온톨로지의 추상적 개념구조를 기반으로 용어의 개념, 속성, 관계의 명확한 정의를 통해 더욱 합리적이고 효과적으로 실현된다는 것을 전제로 하였다. 이에 따라 실세계 개념 모델링 방식으로 한의학 처방지식 온톨로지를 개발하여 peotege-2000을 기반으로 온톨로지 시스템을 구축하였고, 시스템을 응용할 수 있는 마크업 언어의 설계와 편집기를 만들어 지식의 추론이 가능한 한의학 처방 지식관리시스템을 구현하였다. 본 연구에서 구현한 시스템은 XML 기반의 RDF와 온톨로지 기술에 기반을 두고 있으므로 차세대 인터넷 기술인 의미웹광의 연동이 가능하다.

Abstract

9

RDF/OWL의 객체속성을 이용한 관계온톨로지 시스템 구축과 활용에 관한 연구

강현민(행정안전부 국가기록원) 2010, Vol.27, No.4, pp.219-237 https://doi.org/10.3743/KOSIM.2010.27.4.219

초록보기

초록

FRBR, FRAD 개념모형과 RDA 목록규칙에는 서지개체와 접근제어개체 간 다양한 수준에서 발생하는 복합적이고 다원적인 관계유형들이 규정되어 있다. 본 연구에서는 이러한 관계유형을 술어논리에 기반하여 온톨로지 환경에서 개체 클래스의 인스턴스와 인스턴스 간 관계를 RDF/OWL의 객체속성(Object Property)을 서지세계의 개체 간 관계기술과 접근을 위한 새로운 제어기제이자 통합적 연결장치로서 그 적용과 활용 가능성을 시도하였다. 이를 위해 관계온톨로지 시스템을 구축하고 SPARQL 질의결과를 온톨로지 시각화도구를 통해 제시하였다. 이로써 온톨로지 기반의 ‘관계기술목록’이라는 새로운 목록업무 영역의 확장을 통해, 목록기능의 ‘다 대 다 집중’이라는 의미 확장, ‘개체단위 기반의 의미적 집중’, RDF/OWL 객체속성의 계층관계 상속을 이용한 ‘관계 추론’ 등을 연구결과로 제시하였다.

Abstract

This study proposes a ‘Bibliographic Universe Relationship Vocabulary’(burv) using the RDF/OWL Object Property under the SPO predicate logic according to the relationship type among all entities of bibliographic universe and implemented a ‘relationship ontology system’ to establish a new cataloging business domain called ‘Relationship Description Cataloging’ based on the ontology.

10

검색 성능 향상을 위한 약품 온톨로지 기반 연관 피드백

임수연(경북대학교) 2005, Vol.22, No.2, pp.41-56 https://doi.org/10.3743/KOSIM.2005.22.2.041

초록보기

초록

기계가 정보의 의미를 이해하고 처리할 수 있도록 기존의 웹을 확장하는 것을 목적으로 하는 시멘틱 웹은 온톨로지를 이용하여 지식을 공유하게 된다. 본 논문에서는 정교한 질의의 처리를 위하여 온톨로지 내에 존재하는 의미 관계들을 질의의 확장을 위한 연관피드백 정보로 이용하는 방안을 제안한다. 실험은 도메인 온톨로지인 Medicine 온톨로지를 대상으로 하였으며, 출현 용어들의 빈도정보만을 이용한 키워드기반 문서검색과 제안한 온톨로지기반 문서검색의 성능을 비교하였다. 이 때, 두 시스템의 정확률과 재현율을 성능 평가의 기준으로 삼았다. 그 결과, 검색 엔진은 온톨로지에 정의된 개념들과 규칙들을 활용하면서 검색의 정확률을 향상시키는데 도움이 되었고 검색 성능을 향상시키기 위한 추론의 기반으로도 사용될 수 있었다.

Abstract

For the purpose of extending the Web that is able to understand and process information by machine, Semantic Web shared knowledge in the ontology form. For exquisite query processing, this paper proposes a method to use semantic relations in the ontology as relevance feedback information to query expansion. We made experiment on pharmacy domain. And in order to verify the effectiveness of the semantic relation in the ontology, we compared a keyword based document retrieval system that gives weights by using the frequency information compared with an ontology based document retrieval system that uses relevant information existed in the ontology to a relevant feedback. From the evaluation of the retrieval performance, we knew that search engine used the concepts and relations in ontology for improving precision effectively. Also it used them for the basis of the inference for improvement the retrieval performance.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지