정보관리학회지, 한국정보관리학회

11

최상희(대구가톨릭대학교) ; 서은경(한성대학교) 2006, Vol.23, No.2, pp.229-243 https://doi.org/10.3743/KOSIM.2006.23.2.229

초록보기

초록

질의응답문서는 이용자가 입력한 질의, 질의설명, 답을 아는 다른 이용자가 제시한 응답으로 구성된 구조화된 문서로서, 최근 웹 문서처럼 검색이 일반적으로 일어나고 있는 정보원이다. 이 연구에서는 질의응답문서의 구조적 특성을 기반으로 질의를 재생성하여 질의응답문서의 검색효율을 향상시키고자 하였다. 질의재생성 실험에서 성능이 비교된 문서구조는 질의와 응답내용이다. 질의를 기반으로 질의를 재생성하는 방식에서는 질의응답검색 시스템에 입력되어 있는 유사질의를 활용하여 클러스터링하는 기법이 적용되었다. 응답정보를 기반으로 질의를 재생성하는 방식에서는 가장 유사한 기존 질의에 대해 응답된 내용에서 단락검색으로 적합한 문장들을 선정하여 활용하는 기법이 적용되었다. 실험 결과 응답정보를 활용하여 질의를 재생성하는 방식이 정확률은 유지하면서 더 다양한 검색결과를 제공하는 것으로 나타났다.

Abstract

This study aims to suggest an effective way to enhance question-answer(QA) document retrieval performance by reconstructing queries based on the structural features in the QA documents. QA documents are a structured document which consists of three components: question from a questioner, short description on the question, answers chosen by the questioner. The study proposes the methods to reconstruct a new query using by two major structural parts, question and answer, and examines which component of a QA document could contribute to improve query performance. The major finding in this study is that to use answer document set is the most effective for reconstructing a new query. That is, queries reconstructed based on terms appeared on the answer document set provide the most relevant search results with reducing redundancy of retrieved documents.

12

온라인 이용자 피드백을 사용한 정보필터링 시스템의 수정질의 최적화에 관한 연구

최광(연세대학교) ; 정영미(연세대학교) 2003, Vol.20, No.4, pp.23-48 https://doi.org/10.3743/KOSIM.2003.20.4.023

초록보기

초록

이 연구의 목적은 대량의 최신정보를 제공하는 정보필터링 시스템에서 이용자 피드백에 의해 수정질의를 자동생성하여 재검색을 수행함으로써 검색 성능을 최적화할 수 있는 방안을 찾는 데 있다. 이용자가 입력한 초기질의를 사용하여 정보필터링 시스템이 검색한 문헌에 대해 이용자가 적합성 여부를 온라인으로 입력하도록 하고, 이 피드백 결과를 토대로 '중복제거법'과 '저빈도제거법' 두 가지 방법에 의해각각 17개의 수정질의를 생성하여 재검색한 결과를 초기 검색결과와 비교 분석하였다. 수정질의는 각각의 방법마다 17개 패턴의 불논리 질의형태를 미리 만든 다음 초기질의에 디스크립터와 분류기호를 결합하여 생성하였으며, 재검색 결과에 대한 적합성 평가를 통해 최적의 수정질의식을 도출하였다.

Abstract

In this study an information filtering system was implemented and a series of relevance feedback experiments were conducted using the system. For the relevance feedback, the original queries were searched against the database and the results were reviewed by the researchers. Based on users' online relevance judgements a pair of 17 refined queries were generated using two methods called "co-occurrence exclusion method" and "lower frequencies exclusion method." In order to generate them, the original queries. the descriptors and category codes appeared in either relevant or irrelevant document sets were applied as elements. Users' relevance judgments on the search results of the refined queries were compared and analyzed against those of the original queries. [ 더 많은 내용 보기 ]

13

정보검색 성능 향상을 위한 단어 중의성 해소 모형에 관한 연구

정영미(연세대학교) ; 이용구(계명대학교) 2005, Vol.22, No.2, pp.125-145 https://doi.org/10.3743/KOSIM.2005.22.2.125

초록보기

초록

이 연구에서는 문헌 및 질의의 내용을 대표하는 주제어의 중의성 해소를 위해 대표적인 지도학습 모형인 나이브 베이즈 분류기와 비지도학습 모형인 EM 알고리즘을 각각 적용하여 검색 실험을 수행한 다음, 주제어의 중의성 해소를 통해 검색 성능의 향상을 가져올 수 있는지를 평가하였다. 실험문헌 집단은 약 12만 건에 달하는 한국어 신문기사로 구성하였으며, 중의성 해소 대상 단어로는 한국어 동형이의어 9개를 선정하였다. 검색 실험에는 각 중의성 단어를 포함하는 18개의 질의를 사용하였다. 중의성 해소 실험 결과 나이브 베이즈 분류기는 최적의 조건에서 평균 92%의 정확률을 보였으며, EM 알고리즘은 최적의 조건에서 평균 67% 수준의 클러스터링 성능을 보였다. 중의성 해소 알고리즘을 통합한 의미기반 검색에서는 나이브 베이즈 분류기 통합 검색이 약 39.6%의 정확률을 보였고, EM 알고리즘 통합 검색이 약 36%의 정확률을 보였다. 중의성 해소 모형을 적용하지 않은 베이스라인 검색의 정확률 37%와 비교하면 나이브 베이즈 통합 검색은 약 7.4%의 성능 향상률을 보인 반면 EM 알고리즘 통합 검색은 약 3%의 성능 저하율을 보였다.

Abstract

This paper presents a semantic vector space retrieval model incorporating a word sense disambiguation algorithm in an attempt to improve retrieval effectiveness. Nine Korean homonyms are selected for the sense disambiguation and retrieval experiments. The total of approximately 120,000 news articles comprise the raw test collection and 18 queries including homonyms as query words are used for the retrieval experiments. A Naive Bayes classifier and EM algorithm representing supervised and unsupervised learning algorithms respectively are used for the disambiguation process. The Naive Bayes classifier achieved 92% disambiguation accuracy, while the clustering performance of the EM algorithm is 67% on the average. The retrieval effectiveness of the semantic vector space model incorporating the Naive Bayes classifier showed 39.6% precision achieving about 7.4% improvement. However, the retrieval effectiveness of the EM algorithm-based semantic retrieval is 3% lower than the baseline retrieval without disambiguation. It is worth noting that the performances of disambiguation and retrieval depend on the distribution patterns of homonyms to be disambiguated as well as the characteristics of queries.

14

XML 기반의 동영상콘텐츠 검색 시스템 설계 및 구현

김용(전북대학교) ; 소민호(KAIST 학술정보처) 2009, Vol.26, No.4, pp.113-128 https://doi.org/10.3743/KOSIM.2009.26.4.113

초록보기

초록

일반적으로 동영상콘텐츠에 대한 탐색과정에서 이용자는 해당 기관 또는 사이트에서 제공하는 간단하게 요약한 요약 동영상과 텍스트 정보를 이용하여 원하는 동영상을 선택한다. 이러한 이용자의 동영상콘텐츠의 탐색과정에서의 정확하고 신속한 동영상콘텐츠의 검색을 위하여 본 연구에서는 동영상을 구성하는 자막과 이미지 정보를 이용하여 동영상을 내용에 따라 여러 개의 클립으로 구분하고, 이용자의 필요에 따라 원하는 클립을 빠르게 검색하여 제공하는 시스템을 제안한다. 동영상콘텐츠 및 관련 메타데이터의 관리 및 제어를 위해서 본 연구에서는 XML 계층구조를 생성하여 저장한다. 한편, 이용자의 검색 요청시 XML을 기반으로 동영상콘텐츠의 계층구조를 반영하는 Xpath를 생성하여 질의처리를 수행함으로써 이용자가 원하는 동영상콘텐츠에 대한 검색결과를 제공한다. 제안된 방법을 기반으로 본 연구에서는 XML 계층구조를 이용한 동영상 검색 시스템을 설계하고 구현하였다.

Abstract

Generally, a user uses briefly summarized video data and text information to search video contents. To provide fast and accurate search tool for video contents in the process of searching video contents, this study proposes a method to search video clips which was partitioned from video contents. To manage and control video contents and metadata, the proposed method creates structural information based on XML on a video and metadata, and saves the information into XML database. With the saved information, when a user try to search video contents, the results of user's query to retrieve video contents would be provided through creating Xpath which has class structure information. Based on the proposed method, an information retrieval system for video clips was designed and implemented.

15

디지털도서관 구축과정에서 TREC 텍스트 문서의 시각적 표현에 관한 연구

정기태(Assistant Professor University of Oklahoma School of Library and Information Studies) ; 박일종(계명대학교) 2004, Vol.21, No.3, pp.1-14 https://doi.org/10.3743/KOSIM.2004.21.3.001

초록보기

초록

이용자들은 유사문서를 검색할 때, 각 가지 문서의 시각적표현을 통하여 도움을 얻게 되며, 모든 정보검색에 관한 연구는 이용자들의 다양한 요구를 충족시키기 위한 여러 가지의 해결책을 제시하고 있다. 제안되어진 해결책은 알파벳 순서로 만들어 진 파피루스 문서로부터 카드목록, 마이크로 필름을 이용한 저장, 컴퓨터 디스크를 이용한 파일 보관 등에 이르기까지 다양한 방법들을 들 수 있을 것이다. 또한 대부분의 정보검색 시스템들은 Document Surrogate(문헌을 대체할 수 있는 것들), 즉 요약문, 목차, 초록, 리뷰한 내용, 기계가독형목록(MARC) 기록물 등과 같은 서지자료들을 전체논문을 대체하여 이용하게 된다.본 논문에서는 또 다른 형태의 Document Surrogate로서 용어 리스트의 집단화 방법을 이용해서 찾아보았다. 이 Document Surrogate들은 Multidimensional Scaling (MDS)을 이용해서 2차원 그래프 위에 좌표로써 표현되어지고 있다. 사용된 2차원의그래프 위에서 좌표간의 거리는 문헌들의 유사성을 나타낸다고 해석할 수 있으며 거리가 가까우면 가까울수록 두 문서는 더욱 유사한내용을 포함하고 있다고 해석할 수 있는 것으로 밝혀졌다.

Abstract

Visualization of documents will help users when they do search similar documents, and all research in information retrieval addresses itself to the problem of a user with an information need facing a data source containing an acceptable solution to that need. In various contexts, adequate solutions to this problem have included alphabetized cubbyholes housing papyrus rolls, microfilm registers, card catalogs and inverted files coded onto discs. Many information retrieval systems rely on the use of a document surrogate. Though they might be surprise to discover it, nearly every information seeker uses an array of document surrogates. Summaries, tables of contents, abstracts, reviews, and MARC recordsthese are all document surrogates. That is, they stand infor a document allowing a user to make some decision regarding it, whether to retrieve a book from the stacks, whether to read an entire article, etc.In this paper another type of document surrogate is investigated using a grouping method of term list. Using Multidimensional Scaling Method (MDS) those surrogates are visualized on two-dimensional graph. The distances between dots on the two-dimensional graph can be represented as the similarity of the documents. More close the distance, more similar the documents.

16

과학기술분야 학위논문 내용목차에 따른 주제어 출현빈도에 관한 연구

이혜영(한국과학기술원) ; 곽승진(충남대학교) 2008, Vol.25, No.1, pp.191-210 https://doi.org/10.3743/KOSIM.2008.25.1.191

초록보기

초록

일반적으로 문헌을 검색하고 접근하기 위하여 주제색인과 같은 주제어를 활용하곤 한다. 그렇다면 문헌의 내용과 문헌의 주제어는 분명히 어떤 밀접한 상관관계가 있을 것으로 예측해볼 수 있다. 본 연구는 이러한 의문점에서 출발하여, 디지털콘텐트의 본문내용이 비교적 짜임새 있게 정형화되어 있는 석사 학위논문을 연구문헌으로 한정하여 학위논문 전문에서 나타나는 학위논문의 주제어 분포도를 연구하였다. 학위논문의 주제어는 논문 저자가 부여한 주제어를 사용하되, 학위논문 전문은 ‘목차’, ‘서론’, ‘이론배경’, ‘본론’, ‘결론’, ‘참고문헌’의 내용위치로 분할하여 내용위치에 따른 주제어의 출현율을 확인하였다. 연구대상 학위논문 전문은 1226.3개의 용어, 5152.3번의 용어 출현을 보였다. 학위논문 저자가 부여한 주제어는 12~13개 용어로 구성되어 있었다. 연구결과, 전문 내용위치에 따른 주제어의 출현율은 ‘목차’ 11.4%와 ‘서론’ 11.2%에서 가장 높았으며(11%), 다음 순위는 내용위치 ‘결론’ 9.8%이었다.

Abstract

We would generally use subject terms such as subject indexing for searching and accessing documents. So then, there must be any relationship between document's full-text and its subject terms. This study is started in this question. Master's theses in field of science and technology are worked with because full-text is relatively formatted. This study is to study locations of subject term on Thesis, distribution patterns of subject terms on content of full-text; ‘Contents’, ‘Introduction’, ‘Theory’, ‘Main subject’, ‘Conclusion’ and ‘References’. Thesis were averagely composed of 1226.3 terms. And Subject terms were averagely compose of 12~13 terms. As a result, ‘Contents’ and ‘Introduction’ have had the most frequency of subject.

17

미국의 기록물 기술 내용표준에 대한 비교분석 - APPM2와 DACS를 중심으로 -

박진희(전북대학교) 2005, Vol.22, No.4, pp.129-151 https://doi.org/10.3743/KOSIM.2005.22.4.129

초록보기

초록

웹, XML, EAD 등을 포함한 새로운 기술과 기술(description)도구의 출현으로 기존의 도서관자료와 기록물을 통합 관리할 수 있고, 정보교환을 위한 다양한 검색보조도구를 모두 수용할 수 있는 내용표준에 대한 필요성이 부각되고 있다. 이에 본 연구에서는 기록물업계의 국제적 동향을 반영하여 ISAD(G) 및 ISAAR(CPF)와 조화를 이루고, MARC과 EAD 등 다양한 검색보조도구의 데이터구조 표현을 수용할 수 있도록 제정된 DACS와 기존의 기술규칙인 APPM2를 비교분석하였다. 분석결과를 토대로 우리나라 기록물 기술규칙 제정시 고려해야 할 사항들을 제시하였다.

Abstract

The advent of new technologies and descriptive tools, including the Web, XML, and EAD, have highlighted the necessity of content standard which can integrate and manage to library materials and archives and accommodate various finding aids for information exchange. This research analyzes both APPM2 and DACS. The former is an old established description rule. On the other hand, the latter keeps step with ISAD(G) and ISAAR(CPF) as adopting the international trend of archives and is able to describe the data structure of a variety of finding aids including MARC, EAD, etc. As a result, it presents the points to take into consideration as making descriptive rules about our archives.

18

정보의 모호성이 정보탐색 노력에 미치는 영향요인에 관한 연구

윤중현(전주공업대) 2003, Vol.20, No.3, pp.17-30 https://doi.org/10.3743/KOSIM.2003.20.3.017

초록보기

초록

정보기술의 발전과 더불어 국가경쟁력 및 대국민 서비스 강화를 위한 공공정보시스템의 구축이 확산되고 있다. 또한 막대한 공공재원이 투자되는 공공정보시스템은 효율성에 대한 정기적인 평가를 통한 존재가치의 입증 및 문제점 진단을 통한 성능개선의 요구도 증대되고 있다. 따라서 본 연구는 국내의 대표적인 웹 기반 공공고용정보시스템 워크넷(Work-Net)의 데이터 및 서비스 품질을 102명의 이용자를 대상으로 평가하였다. 측정도구는 학계에서 많이 사용해온 정보내용, 정확성, 적시성, 출력형태, 사용의 용이성, 지원성으로 구성하였으며 t-test와 일원변량분석(one-way ANOVA)을 실시하여 이용자의 특성 및 검색정보에 따른 차이를 살펴보았다. 또한 최근 구직자들의 고용정보시스템에 대한 요구사항들을 분석하였다. 본 연구결과는 공공 및 민간운영 고용정보시스템 운영체제 효율화에 기여할 수 있는 유용한 기초자료를 제공할 것으로 본다.

Abstract

19

국가기록물 접근성 및 활용성 향상을 위한 국가기록포털 연계 개선방안

강윤아(전북대학교) ; 조영준(㈜ 아이와즈, 솔루션개발팀) ; 김민정(국가기록원 기록서비스부 서비스정책과) ; 오효정(전북대학교) 2022, Vol.39, No.4, pp.99-125 https://doi.org/10.3743/KOSIM.2022.39.4.099

초록보기

초록

하나의 기록을 이해하기 위해서는 기록의 내용뿐 아니라 그 기록의 생산배경과 업무 맥락을 함께 파악해야 한다. 또한 다양한 부서와 기관에 흩어져 있는 관련 기록들을 쉽게 찾을 수 있는 기능도 필요하다. 따라서 기록 검색서비스에서 정보 간의 ‘연계’는 갈수록 중요해지고 있다. 국가기록원 역시 영구기록관리시스템의 검색서비스 기능으로 ‘연계’를 강조하고 있지만, 정작 국가기록포털을 살펴보면 전거정보의 연계 부족, 독립적으로 존재하는 내부 서비스와 콘텐츠, 유관 기관과의 연계 부재 등의 문제가 확인된다. 본 연구는 이와 같은 국가기록포털의 한계를 해결하고자, 이상적인 연계서비스를 구축한 기록관리 선진기관을 선정하여 이들 기관의 연계구조를 전반적으로 확인하고, 연계구조 중 타 기록관리 기관에서 볼 수 없었던 특이점과 국가기록포털에서 차용 가능한 요소를 도출하였다. 다음으로, 국가기록포털 연계구조의 현황을 분석하여 연계가 이루어지지 않은 부분을 비롯해 연계방식을 개선해야 할 항목 등을 규명, 이를 해결하기 위한 구체적인 개선방안을 제시하였다. 본 연구의 목적은 국가기록포털의 개선된 연계 서비스를 통해 이용자의 검색 만족도를 높이고, 소장기록 및 기 구축된 내부 서비스와 콘텐츠의 접근성 및 활용성을 향상시키는 데에 있다.

Abstract

In order to understand a record, not only the contents of the record but also the production background and work context of the record must be grasped. It also requires a function that makes it easy to find related records scattered across various departments and agencies. Accordingly, the ‘linkage’ of information in archival information services is becoming more important. NAK also emphasizes ‘linkage’ as a search service function of the archives management system, but some problems were identified at the National Archives Portal Service (NAPS) such as a lack of linkage with authority data, disruption of internal service, and absence of linkage with other related organizations. To solve the limitations of the NAPS, we selected and analyzed advanced record management institutions that have built an ideal linkage service; checked the overall linkage structure of these institutions; and identified characteristics that could not be seen by other institutions. Also, elements that can be adopted from the NAPS were derived. Next, the current status of the NAPS linkage structure was analyzed to identify the parts that were not linked and the items that need to be improved in the linkage method, and specific advancement plans were suggested to solve these problems. The purpose of this study is to increase users’ satisfaction with search and to advance the accessibility and utilization of records and internal services through improved linkage services of NAPS.

20

국내 대학기록관 웹사이트에 관한 메타분석

서은경(한성대학교) ; 손주영(한성대학교) 2012, Vol.29, No.1, pp.351-374 https://doi.org/10.3743/KOSIM.2012.29.1.351

초록보기

초록

최근 디지털 기록관리 패러다임 속에서 중추적 역할을 수행하는 대학기록관 웹사이트는 대학기록을 찾고 활용하는 이용자들에게 더욱 더 중요하게 간주되고 있다. 본 연구는 대학기록관 웹사이트가 질적 향상을 이루기 위해서는 다각적 평가와 분석이 필요하다고 보고 국내 9개의 대학기록관을 선정한 후, 이들에 대하여 웹사이트 구성 내용분석, 웹사이트가 지닌 기능적 유용성, 사용성 및 정보품질에 대한 이용자 만족도, 그리고 웹표준 준수여부 등 다섯 가지의 각도에서 메타 분석을 수행하였다. 분석 결과를 토대로 제시된 대학기록관 웹사이트의 개선방안은 다음과 같다. 첫째, 기록정보의 이용을 보다 활성화하기 위해서는 대학기록관의 소개 및 대학기록 활용성에 대한 홍보와 웹사이트에서의 이용자간의 커뮤니케이션을 높일 수 있는 방안을 모색해야 한다. 둘째, 웹사이트의 콘텐츠 빈약함은 이용자의 이용도를 떨어뜨리므로 체계적인 대학기록의 수집과 함께 디지털화하는 작업에 집중적인 노력이 필요하다. 셋째, 가장 불만족스럽게 나타난 ‘검색 기능’을 보완하기 위해서는 이용자 중심의 검색인터페이스를 개발이 필요하다. 넷째, 이용자의 만족도에 가장 큰 영향을 미치는 요인이 웹사이트에 대한 ‘호감도’이므로 이 점을 고려하여 대학기록관 웹사이트를 디자인해야 할 것이다. 마지막으로 웹사이트는 웹표준을 준수하여 보편적 웹접근성을 높여야 할 것이다.

Abstract

Recently, the website of university archives is becoming greatly significant to users who retrieve and utilize university archives and records. This study aims to meta-analyze university archives in Korea in terms of usefulness, usability, information quality, and web accessibility in order to improve the functionality and usefulness of university archives websites. The result of the study suggests that university archives should increase the number of the digital records and to develop communication chanels, such as blog, chatting room, digital reference services for activating use of university records. Secondly, university archives make efforts to develop systematic collecting and managing procedures and to build relevant digital collections. Thirdly, the websites of university archives develops more user-friendly interfaces and web design to improve user satisfaction. Finally, the websties of university archives should follow the standards on web accessibility.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지