정보관리학회지, 한국정보관리학회

1

심경(Systems R&D Center, Iris.Net) ; 정영미(연세대학교) 2006, Vol.23, No.2, pp.265-285 https://doi.org/10.3743/KOSIM.2006.23.2.265

초록보기

초록

문헌범주화에서는 학습문헌집합에 부여된 주제범주의 정확성이 일정 수준을 가진다고 가정한다. 그러나, 이는 실제 문헌집단에 대한 지식이 없이 이루어진 가정이다. 본 연구는 실제 문헌집단에서 기 부여된 주제범주의 정확성의 수준을 알아보고, 학습문헌집합에 기 부여된 주제범주의 정확도와 문헌범주화 성능과의 관계를 확인하려고 시도하였다. 특히, 학습문헌집합에 부여된 주제범주의 질을 수작업 재색인을 통하여 향상시킴으로써 어느 정도까지 범주화 성능을 향상시킬 수 있는가를 파악하고자 하였다. 이를 위하여 과학기술분야의 1,150 초록 레코드 1,150건을 전문가 집단을 활용하여 재색인한 후, 15개의 중복문헌을 제거하고 907개의 학습문헌집합과 227개의 실험문헌집합으로 나누었다. 이들을 초기문헌집단, Recat-1, Recat-2의 재 색인 이전과 이후 문헌집단의 범주화 성능을 kNN 분류기를 이용하여 비교하였다. 초기문헌집단의 범주부여 평균 정확성은 16%였으며, 이 문헌집단의 범주화 성능은 F1값으로 17%였다. 반면, 주제범주의 정확성을 향상시킨 Recat-1 집단은 F1값 61%로 초기문헌집단의 성능을 3.6배나 향상시켰다.

Abstract

In text categorization a certain level of correctness of labels assigned to training documents is assumed without solid knowledge on that of real-world collections. Our research attempts to explore the quality of pre-assigned subject categories in a real-world collection, and to identify the relationship between the quality of category assignment in training set and text categorization performance. Particularly, we are interested in to what extent the performance can be improved by enhancing the quality (i.e., correctness) of category assignment in training documents. A collection of 1,150 abstracts in computer science is re-classified by an expert group, and divided into 907 training documents and 227 test documents (15 duplicates are removed). The performances of before and after re-classification groups, called Initial set and Recat-1/Recat-2 sets respectively, are compared using a kNN classifier. The average correctness of subject categories in the Initial set is 16%, and the categorization performance with the Initial set shows 17% in F1 value. On the other hand, the Recat-1 set scores F1 value of 61%, which is 3.6 times higher than that of the Initial set.

2

의학분야 학술잡지 선택에 영향을 미치는 요인 연구

김기영(Rutgers University) 2006, Vol.23, No.2, pp.245-263 https://doi.org/10.3743/KOSIM.2006.23.2.245

초록보기

초록

학술잡지 구입 예산의 구입비용의 상승에 따른 압력으로 지난 수십년간 학술잡지의 선택에 영향을 미치는 요인들에 대한 연구가 활발히 진행되어 왔지만, 학술잡지의 선택에 대한 만족할만한 이론적 틀이 제시되지 못하였다. 이에 따라 본 연구에서는 의학도서관에서 의학분야의 학술잡지의 선택에 영향을 미치는 요인들을 확인하여 이러한 이론적 틀을 제시할 수 있는 근거를 마련코자 한다. 본 연구는 상관관계 분석과 로지스틱회귀분석을 통해 학술잡지선택의 분산을 설명하고, 나아가 예측하는 통계적 모델들을 여러 변수조합을 이용해 제시한다. 또한 이러한 모델의 실제적 적용과 향후 연구방향을 논의한다.

Abstract

Since the beginning of discussions on serial collection management, as budgets have waxed and waned over the ensuing decades, a number of key variables affecting selection/deselection have emerged but without the framework of a coherent and accepted theoretical model. This study is an effort to identify variables which affect the serial collection decision with special attention to selection/deselection in the context of an academic health science library. Based on results from correlation analyses and logistic regression analyses, the serial collection decision can be explained and predicted using various combinations of a reduced set of objective variables. Applications of the results to libraries are discussed, and further research is proposed.

3

사회적 네비게이션 기반 사회적 검색

안재욱(University of Pittsburgh) ; Peter Brusilovsky(University of Pittsburgh) ; Rosta Farzan(University of Pittsburgh) 2006, Vol.23, No.2, pp.147-165 https://doi.org/10.3743/KOSIM.2006.23.2.147

초록보기

초록

웹기반 교육 자료들이 폭발적으로 증가함에 따라 적합한 자료들에 보다 효과적으로 접근할 수 있는 방법이 요구되고 있다. 이러한 새로운 방법들 중의 하나로 사회적 네비게이션(social navigation) 기반의 사회적 검색(social searching)이 정보 검색 분야에서 제시되었는데, 이는 동료 이용자들로부터 제공된 정보를 바탕으로 검색 결과의 향상을 추구하는 기법이다. 본 연구에서는 개인화와 사회적 네비게이션에 근거한 웹 기반 사회적 검색 시스템을 구축하였으며 이용자 연구를 통해 이용자에게 적합하고 필수적인 정보를 제공할 수 있는 방법이라는 것을 검증하려 하였다.

Abstract

The explosive growth of Web-based educational resources requires a new approach for accessing relevant information effectively. Social searching in the context of social navigation is one of several answers to this problem, in the domain of information retrieval. It provides users with not merely a traditional ranked list, but also with visual hints which can guide users to information provided by their colleagues. A personalized and context-dependent social searching system has been implemented on a platform called KnowledgeSea II, an open-corpus Web-based educational support system with multiple access methods. Validity tests were run on a variety of aspects and results have shown that this is an effective way to help users access relevant, essential information.

4

질의응답문서 검색에서 문서구조를 이용한 질의재생성에 관한 연구

최상희(대구가톨릭대학교) ; 서은경(한성대학교) 2006, Vol.23, No.2, pp.229-243 https://doi.org/10.3743/KOSIM.2006.23.2.229

초록보기

초록

질의응답문서는 이용자가 입력한 질의, 질의설명, 답을 아는 다른 이용자가 제시한 응답으로 구성된 구조화된 문서로서, 최근 웹 문서처럼 검색이 일반적으로 일어나고 있는 정보원이다. 이 연구에서는 질의응답문서의 구조적 특성을 기반으로 질의를 재생성하여 질의응답문서의 검색효율을 향상시키고자 하였다. 질의재생성 실험에서 성능이 비교된 문서구조는 질의와 응답내용이다. 질의를 기반으로 질의를 재생성하는 방식에서는 질의응답검색 시스템에 입력되어 있는 유사질의를 활용하여 클러스터링하는 기법이 적용되었다. 응답정보를 기반으로 질의를 재생성하는 방식에서는 가장 유사한 기존 질의에 대해 응답된 내용에서 단락검색으로 적합한 문장들을 선정하여 활용하는 기법이 적용되었다. 실험 결과 응답정보를 활용하여 질의를 재생성하는 방식이 정확률은 유지하면서 더 다양한 검색결과를 제공하는 것으로 나타났다.

Abstract

This study aims to suggest an effective way to enhance question-answer(QA) document retrieval performance by reconstructing queries based on the structural features in the QA documents. QA documents are a structured document which consists of three components: question from a questioner, short description on the question, answers chosen by the questioner. The study proposes the methods to reconstruct a new query using by two major structural parts, question and answer, and examines which component of a QA document could contribute to improve query performance. The major finding in this study is that to use answer document set is the most effective for reconstructing a new query. That is, queries reconstructed based on terms appeared on the answer document set provide the most relevant search results with reducing redundancy of retrieved documents.

5

엘리먼트 기반 XML 문서검색의 성능에 관한 실험적 연구

윤소영(국사편찬위원회) ; 문성빈(연세대학교) 2006, Vol.23, No.1, pp.201-219 https://doi.org/10.3743/KOSIM.2006.23.1.201

초록보기

초록

이 연구에서는 가장 적합한 엘리먼트 기반 XML 문서검색 기법을 제시하기 위해 언어모델 검색 접근법으로 다이버전스 기법, 보정 기법 그리고 계층적 언어모델의 검색성능을 평가하는 실험을 수행하였다. 실험 결과, 가장 효율적인 검색 접근법으로 문서의 구조정보를 적용한 계층적 언어모델 검색을 제안하였다. 특히, 계층적 언어모델은 실제 검색에서 중요성을 가지는 검색순위 상위에서 뛰어난 성능을 보였다.

Abstract

This experimental study suggests an element-based XML document retrieval method that reveals highly relevant elements. The models investigated here for comparison are divergence and smoothing method, and hierarchical language model. In conclusion, the hierarchical language model proved to be most effective in element-based XML document retrieval with regard to the improved exhaustivity and harmed specificity.

6

웹 포털 이용자 로그 데이터에 기반한 개인화 검색 서비스 모형의 설계 및 평가

이소영(다음커뮤니케이션) ; 정영미(연세대학교) 2006, Vol.23, No.4, pp.179-196 https://doi.org/10.3743/KOSIM.2006.23.4.179

초록보기

초록

이 연구에서는 한국형 포털에 적합한 커뮤니티 기반 개인화 검색 서비스 모형을 제안하였다. 개인화 검색 서비스 모형은 이용자의 관심 주제를 파악하는 과정과 이를 반영한 검색 결과 재순위화 및 관련 주제 카테고리와 질의어 추천 과정으로 구성된다. 개인화 검색 모형의 유용성을 검증하기 위한 실험에서는 포털 사이트 다음에서 12일간 수집한 이용자 로그 데이터를 사용하였다. 실험 결과 개별 이용자의 주제 카테고리 선정에 사용한 카페 활동성 분석과 신지식 활동성 분석 데이터는 매우 유용한 것으로 나타났으며, 개인화 검색 결과와 추천 서비스에 대한 만족도도 비교적 높게 나타났다.

Abstract

This study proposes an expanded model of personalized search service based on community activities on a Korean Web portal. The model is composed of defining subject categories of users, providing personalized search results, and recommending additional subject categories and queries. Several experiments were performed to verify the feasibility and effectiveness of the proposed model. It was found that users’ activities on community services provide valuable data for identifying their interests, and the personalized search service increases users’ satisfaction.

7

전략적 계획을 기반으로 한 BSC 모형 개발 -Rod Library 사례를 중심으로-

조윤희(University of Northern Iowa) 2006, Vol.23, No.1, pp.159-179 https://doi.org/10.3743/KOSIM.2006.23.1.159

초록보기

초록

북아이오와주립 대학도서관은 우수한 교양 커리큘럼을 지원하는 개별화 학습 환경과 다양한 지적, 문화적 커뮤니티를 지원하는 것을 사명으로 하고 있다. 1987년부터 전략적 계획을 수립하기 시작하여 현재 5년 단위로 전략적 계획을 수립하고 있으며, 최근 전략이 어느 정도 달성되고 있는가에 대한 성과측정의 필요성이 제기되었다. 이에 본 연구는 북아이오와주립 대학도서관의 최근 전략적 계획 2004-2009의 내용을 분석하여 전략을 균형성과표의 네 관점으로 전환하는 BSC 모형과 전략지도를 개발하였다. 이와 함께 각 관점별 전략적 목적을 측정하는 핵심성과측정지표와 이를 이끄는 동인들의 인과관계 모형을 개발하여 제시하였다.

Abstract

The University of Northern Iowa Rod Library has mission statement that a personalized learning environment founded on the strong liberal arts curriculum and to supporting an intellectually and culturally diverse community. The Rod Library has been developing the strategic plans by 5 years since 1987. Recently, the strategy has been faced need to measure of performance how much does make it up. This study developed the BSC model and the strategy map that analyze the strategic plan 2004-2009 of the University of Northern Iowa Rod library and transfer the strategy into the four perspectives of BSC. In addition, this study presented the success performance indicators measuring the strategic goals of each perspectives and the cause-effect model driving the lead indicators of performance.

8

기계학습을 통한 디스크립터 자동부여에 관한 연구

김판준(신라대학교) 2006, Vol.23, No.1, pp.279-299 https://doi.org/10.3743/KOSIM.2006.23.1.279

초록보기

초록

학술지 논문에 디스크립터를 자동부여하기 위하여 기계학습 기반의 접근법을 적용하였다. 정보학 분야의 핵심 학술지를 선정하여 지난 11년간 수록된 논문들을 대상으로 문헌집단을 구성하였고, 자질 선정과 학습집합의 크기에 따른 성능을 살펴보았다. 자질 선정에서는 카이제곱 통계량(CHI)과 고빈도 선호 자질 선정 기준들(COS, GSS, JAC)을 사용하여 자질을 축소한 다음, 지지벡터기계(SVM)로 학습한 결과가 가장 좋은 성능을 보였다. 학습집합의 크기에서는 지지벡터기계(SVM)와 투표형 퍼셉트론(VPT)의 경우에는 상당한 영향을 받지만 나이브 베이즈(NB)의 경우에는 거의 영향을 받지 않는 것으로 나타났다.

Abstract

This study utilizes various approaches of machine learning in the process of automatically assigning descriptors to journal articles. After selecting core journals in the field of information science and organizing test collection from the articles of the past 11 years, the effectiveness of feature selection and the size of training set was examined. In the regard of feature selection, after reducing the feature set by χ2 statistics(CHI) and criteria which prefer high-frequency features(COS, GSS, JAC), the trained Support Vector Machines(SVM) performs the best. With respective to the size of the training set, it significantly influences the performance of Support Vector Machines(SVM) and Voted Perceptron(VTP). but it scarcely affects that of Naive Bayes(NB).

9

지식정보 공유를 위한 전자원문서비스의 주요 이슈와 사례 분석

유수연(Korea Institute of Science and Technology Information) ; 최희윤(한국과학기술정보연구원) 2006, Vol.23, No.2, pp.81-96 https://doi.org/10.3743/KOSIM.2006.23.2.081

초록보기

초록

웹기반 학술정보 커뮤니케이션이 보편화되고 정보공급자 및 이용자와의 직접적인 커뮤니케이션이 확산되는 등 원문서비스 환경의 변화는 원문서비스 기관에 적지 않은 영향을 미치고 있다. 특히 웹을 통하여 이용자에게 원문을 제공하는 전자원문서비스의 등장은 전자형태 정보의 신속하고 용이한 복제 및 배포로 인하여 그 운영에 있어서 저작권과의 마찰을 피할 수 없다. 이 연구에서는 원문서비스 환경의 주요 변화와 동향을 검토하고, 해외 전자원문서비스 사례를 파악함으로써 국내 웹기반 원문서비스인 e-DDS가 국내 저작권법에서 이슈가 되는 부분 및 향후 해결해 나가야 할 부분들을 검토하고자 한다.

Abstract

Changes in document delivery service environment such as spread of web-based research information communication and direct communication between users and information providers have considerable effects on document delivery service institutes. Swift advances in information technology have allowed users to receive information on their desktops via web. Web-based document delivery makes the massive scale of reproduction and distribution possible so it need to protect the copyright holders' rights. This study identifies the current trends and issues of document delivery service environment and reviews electronic document delivery services of foreign countries. Also this study introduces the domestic electronic document delivery service, e-DDS, and evaluates the copyright issues for the service.

10

기록 관리 메타데이터의 개념 모델링

이현실(원광대학교) ; 한성국(원광대학교) 2006, Vol.23, No.3, pp.23-48 https://doi.org/10.3743/KOSIM.2006.23.3.023

초록보기

초록

기록 관리 메타데이터 스키마는 기록물 자체에 내재한 정보 요소뿐만 아니라, 기록 업무에 따른 기록물의 생명 주기 관리 등에 필요한 관리 요소를 표현할 수 있는 강고한 구조를 가져야 한다. 이를 위해서 메타데이터 스키마에서는 기록 도메인의 정보 모델과, 기록 관리 업무 및 응용에서 요구되는 의미 상세화와 데이터 요소 특수화 등을 지원하는 메타데이터 프레임워크가 요구된다. 본 연구에서는 메타데이터 스키마의 주요 원리와 특성을 분석하여, 기록 관리 메타데이터 스키마를 체계적이고 효과적으로 개발하기 위한 접근 방식을 제시한다. 이를 위해 ISO 15489와 23081에 제시된 기록 관리 지침과 메타데이터 운용에 근거한 기록 관리 정보 모델을 개발하고 핵심 데이터 요소를 제시하였으며, 기록 관리 프레임워크를 구현하는 방법을 보였다.

Abstract

Record management metadata schema should have robust structure to represent not only elements innate in records itself but also management elements for the life cycle of records according to business activities. To realize these requirement, Information model for record domain is needed and also Metadata framework supporting semantic refinement and data element specialization required in record management business or applications are required. This study analyse main principles and characteristics of metadata scheme, and then suggested a novel method to develope schema systematically and effectively. This study propose information model and set of core data elements of records management based on ISO 15489 and 230381, and show how to implement the record management framework.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지