정보관리학회지, 한국정보관리학회

31

고영만(성균관대학교) ; 송인석(한국과학기술정보연구원) 2011, Vol.28, No.1, pp.145-170 https://doi.org/10.3743/KOSIM.2011.28.1.145

초록보기

초록

본 연구는 연구문헌의 지식구조를 반영하는 의미기반 지식조직체계의 실험적 모형을 제시하는 것을 목적으로 한다. 이를 위해 한국연구재단의 기초학문자료센터에 대한 사례분석을 하였다. 기초학문자료센터 연구성과물 DB와 학술용어 DR의 개념클래스 및 인스턴스를 대상으로 연구문헌의 지식구조를 파악하였으며, 기초학문자료센터 시스템의 학술적 이해형성 기능을 분석하였다. 또한 연구문헌의 지식구조와 색인어의 관계를 분석하였다. 이러한 분석을 통해 지식구조와 색인어의 관계구조, 26개의 연구문헌 지식구조 공리 및 11개의 의미관계 추론규칙으로 구성되는 온톨로지 모형, 즉 연구문헌의 지식구조와 그 의미관계에 의한 실험적 지식조직체계 모형을 제시하였다.

Abstract

The purpose of this paper is to suggest a pilot model of knowledge organizing system which reflects the knowledge structure of research papers, using a case analysis on the “Korean Research Memory” of the National Research Foundation of Korea. In this paper, knowledge structure of the research papers in humanities and social science is described and the function of the “Korean Research Memory” for scholarly sense-making is analysed. In order to suggest the pilot model of the knowledge organizing system, the study also analysed the relation between indexed keyword and knowledge structure of research papers in the Korean Research Memory. As a result, this paper suggests 24 axioms and 11 inference rules for an ontology based on semantic relation of the knowledge structure.

32

토픽 모델링 기반 과학적 지식의 불확실성의 흐름에 관한 연구

허고은(연세대학교) 2019, Vol.36, No.1, pp.191-213 https://doi.org/10.3743/KOSIM.2019.36.1.191

초록보기

초록

과학적 지식을 얻는 과정은 연구자의 연구를 통해 이루어진다. 연구자들은 과학의 불확실성을 다루고 과학적 지식의 확실성을 구축해나간다. 즉, 과학적 지식을 얻기 위해서 불확실성은 반드시 거쳐가야 하는 필수적인 단계로 인식되고 있다. 현존하는 불확실성의 특성을 파악하는 연구는 언어학적 접근의 hedging 연구를 통해 소개되었으며 컴퓨터 언어학에서 수작업 기반으로 불확실성 단어 코퍼스를 구축해왔다. 기존의 연구들은 불확실성 단어의 단순 출현 빈도를 기반으로 특정 학문 영역의 불확실성의 특성을 파악해오는데 그쳤다. 따라서 본 연구에서는 문장 내 생의학적 주장이 중요한 역할을 하는 생의학 문헌을 대상으로 불확실성 단어 기반 과학적 지식의 패턴을 시간의 흐름에 따라 살펴보고자 한다. 이를 위해 생의학 온톨로지인 UMLS에서 제공하는 의미적 술어를 기반으로 생의학 명제를 분석하였으며, 학문 분야의 패턴을 파악하는데 용이한 DMR 토픽 모델링을 적용하여 생의학 개체의 불확실성 기반 토픽의 동향을 종합적으로 파악하였다. 시간이 흐름에 따라 과학적 지식의 표현은 불확실성이 감소하는 패턴으로 연구의 발전이 이루어지고 있음을 확인하였다.

Abstract

The process of obtaining scientific knowledge is conducted through research. Researchers deal with the uncertainty of science and establish certainty of scientific knowledge. In other words, in order to obtain scientific knowledge, uncertainty is an essential step that must be performed. The existing studies were predominantly performed through a hedging study of linguistic approaches and constructed corpus with uncertainty word manually in computational linguistics. They have only been able to identify characteristics of uncertainty in a particular research field based on the simple frequency. Therefore, in this study, we examine pattern of scientific knowledge based on uncertainty word according to the passage of time in biomedical literature where biomedical claims in sentences play an important role. For this purpose, biomedical propositions are analyzed based on semantic predications provided by UMLS and DMR topic modeling which is useful method to identify patterns in disciplines is applied to understand the trend of entity based topic with uncertainty. As time goes by, the development of research has been confirmed that uncertainty in scientific knowledge is moving toward a decreasing pattern.

33

문화예술기관 기본정보의 품질개선과 연계를 위한 지식그래프 구축

선은택(중앙대학교 일반대학원 문헌정보학과 정보학전공 석사과정) ; 김학래(중앙대학교 문헌정보학과) 2023, Vol.40, No.4, pp.329-349 https://doi.org/10.3743/KOSIM.2023.40.4.329

초록보기

초록

정보통신 기술이 빠르게 발전하면서 데이터의 생산 속도가 급증하였고, 이는 빅데이터라는 개념으로 대표되고 있다. 단시간에 데이터 규모가 급격하게 증가한 빅데이터에 대해 품질과 신뢰성에 대한 논의도 진행되고 있다. 반면 스몰데이터는 품질이 우수한 최소한의 데이터로, 특정 문제 상황에 필요한 데이터를 의미한다. 문화예술 분야는 다양한 유형과 주제의 데이터가 존재하며 빅데이터 기술을 활용한 연구가 진행되고 있다. 하지만 문화예술기관의 기본정보가 정확하게 제공되고 활용되는지를 탐색한 연구는 부족하다. 기관의 기본정보는 대부분의 빅데이터 분석에서 사용하는 필수적인 근거일 수 있고, 기관을 식별하기 위한 출발점이 된다. 본 연구는 문화예술 기관의 기본정보를 다루는 데이터를 수집하여 공통 메타데이터를 정의하고, 공통 메타데이터를 중심으로 기관을 연계하는 지식그래프 형태로 스몰데이터를 구축하였다. 이는 통합적으로 문화예술기관의 유형과 특징을 탐색할 수 있는 방안이 될 수 있다.

Abstract

With the rapid development of information and communication technology, the speed of data production has increased rapidly, and this is represented by the concept of big data. Discussions on quality and reliability are also underway for big data whose data scale has rapidly increased in a short period of time. On the other hand, small data is minimal data of excellent quality and means data necessary for a specific problem situation. In the field of culture and arts, data of various types and topics exist, and research using big data technology is being conducted. However, research on whether basic information about culture and arts institutions is accurately provided and utilized is insufficient. The basic information of an institution can be an essential basis used in most big data analysis and becomes a starting point for identifying an institution. This study collected data dealing with the basic information of culture and arts institutions to define common metadata and constructed small data in the form of a knowledge graph linking institutions around common metadata. This can be a way to explore the types and characteristics of culture and arts institutions in an integrated way.

34

온톨로지의 개념간 관계 설정을 위한 AGROVOC 시소러스의 분석에 관한 연구

유영준(나사렛대학교) 2005, Vol.22, No.1, pp.125-144 https://doi.org/10.3743/KOSIM.2005.22.1.125

초록보기

초록

이 연구에서는 AGROVOC 시소러스의 개념간 관계를 분석하여 시소러스의 의미 관계의 모호성과 비일관성을 밝히고, 이러한 단점들을 개선한 온톨로지의 개념간 관계를 제시하였다. 개념간 관계 분석의 결과로 온톨로지의 개념간 관계의 핵심 요소인 개념 모형과 의미론적으로 발전된 개념간 관계 유형을 제시하였다. 이 관계들은 부분적으로 추론 기능을 수행할 수 있으며 보다 명확한 의미 관계를 기반으로 하는 지식조직시스템에 적용할 수 있을 것이다. 그리고 시소러스의 개념간 관계 유형을 확장하는데 이용할 수 있는 새로운 관계 유형들을 밝혀내었고, 이 관계 유형들이 법률분야 관련어집과 같은 기존 시소러스에도 활용할 수 있음을 확인하였다.

Abstract

This study uncovered ambiguity and inconsistency of the semantic relationships of the existing thesaurus by analyzing the concept relationships of AGROVOC and proposed the concept relationships of ontology in partially overcoming these limitations. By the results of analyzing the concept relationships, the study proposed conceptual model as most important part of conecept relationships of ontology and semantically developed concept relationship types. These relationships partially can perform inferences and must be useful for information knowledge system based on more exact semantic relationships. Also the study found out new relationship types and they will be useful for extension of the concept relationships of existing thesaurus. And these relationship types showed that they were useful for the existing thesaurus as Legal Thesaurus.

35

차세대 검색서비스의 속성에 관한 연구

이수상(부산대학교) ; 이순영(부산대학교) 2009, Vol.26, No.4, pp.93-112 https://doi.org/10.3743/KOSIM.2009.26.4.093

초록보기

초록

최근 정보검색 환경은 검색 2.0으로 대표되는 차세대 검색서비스에 대한 논의들이 활발해지고 있다. 따라서 이 연구에서는 정보검색의 발전과 진화에 대한 다양한 논의들을 토대로 정보검색의 발전 과정을 구분하였다. 그리고 현재 거론되고 있는 차세대 검색서비스의 등장 배경, 주요 개념, 그리고 관련 사례와 속성을 파악하였으며, 이러한 속성과 사례에 대한 데이터를 통해 차세대 검색서비스를 설명하는 핵심적인 키워드를 확인하기 위한 군집 분석을 수행하였다. 군집 분석의 결과 차세대 검색서비스를 대표하는 주요 키워드는 소셜 검색, 지능형 의미 검색, 그리고 관계기반 검색 등으로 나타났다.

Abstract

Recently in the area of the information environment, there are lively discussions about search 2.0 which is representative of the next generation search services. In this study, we divide information search model into matching and linking models according the developmental stages. Therefore, on the one hand, we analyze the background, main concepts, related attributes and cases of the next generation search services and the other, we identify the representative keywords by the group analysis of various attributes and cases of it. The result shows that the main keywords such as social search, artificial intelligence and semantic search, and relation/network based search are representative of the search 2.0.

36

한국학 연구 논문의 텍스트 구조 기반 메타데이터 검색 시스템 개발 연구

송민선(성균관대학교 정보관리연구소) ; 고영만(성균관대학교) ; 이승준(성균관대학교 정보관리연구소) 2016, Vol.33, No.3, pp.155-176 https://doi.org/10.3743/KOSIM.2016.33.3.155

초록보기

초록

본 연구는 한국학 연구 논문 텍스트의 의미 구조를 기반으로 하는 메타데이터를 적용한 학술정보시스템을 구축하여 기존 유사 시스템과의 비교를 통해, 텍스트 구조 기반 메타데이터의 활용 가능성을 확인해 보고자 하는 것을 목적으로 한다. 이를 위해 한국학술지인용색인(Korea Citation Index, KCI)에서 일정 기준을 충족하는 한국학 분야 연구 논문 데이터를 대상으로 의미 구조 메타데이터 항목을 적용한 시범적 검색 시스템(Korean Studies Metadata Database, KMD)을 구축하였으며, 동일한 검색 키워드를 적용하여 기존의 KCI 시스템과 비교했을 때 어떤 특징과 차이점을 갖는지 비교해 보았다. 연구 결과, KMD 시스템이 KCI에 비해 이용자의 검색 의도에 맞는 결과를 보다 효율적으로 보여주는 것으로 확인되었다. 즉 검색하고자 하는 키워드의 조합이나 조건식이 기존 시스템과 동일하더라도 검색 결과를 통해 최종적으로 연구 진행과 관련해 찾고자 하는 연구 목적, 연구의 대상 데이터나 시공간적 배경 등에 따른 검색 결과를 다양하게 보여줄 수 있는 것으로 나타났다.

Abstract

This study aims to develope a scholarly metadata information system based on conceptual elements of text structure of Korean studies research articles and to identify the applicability of text structure based metadata as compared with the existing similar system. For the study, we constructed a database(Korean Studies Metadata Database, KMD) with text structure based on metadata of Korean Studies journal articles selected from the Korea Citation Index(KCI). Then we verified differences between KCI system and KMD system through search results using same keywords. As a result, KMD system shows the search results which meet the users’ intention of searching more efficiently in comparison with the KCI system. In other words, even if keyword combinations and conditional expressions of searching execution are same, KMD system can directly present the content of research purposes, research data, and spatial-temporal contexts of research et cetera as search results through the search procedure.

37

LOD기반의 문화콘텐츠 정보서비스 확장에 관한 연구: K-Food 분야를 중심으로

유현경(전북대학교 기록관리학과) ; 육혜인(전북대학교) ; 한희정(전북대학교) ; 김용(전북대학교) 2015, Vol.32, No.1, pp.109-134 https://doi.org/10.3743/KOSIM.2015.32.1.109

초록보기

초록

한류3.0 시대를 맞이하여 기존 미디어 중심의 한류문화에서 벗어나 다양한 한류문화콘텐츠 개발을 통해 한국문화의 세계화와 지속가능성을 높일 수 있는 전략적 방안을 마련할 필요가 있다. 따라서 본 연구는 LOD기반의 음식문화콘텐츠 서비스 제공을 통해 음식문화 관련 정보뿐만 아니라 다른 문화콘텐츠들과도 망라적으로 연결시켜 다양한 한류문화콘텐츠들이 발전할 수 있는 기반을 마련하는데 목적이 있다. 이를 위해 문헌연구 및 사례분석을 통해 음식문화의 개념을 정립하고 분류하였으며, LOD기반의 음식문화콘텐츠 서비스의 적용가능성을 분석하였다. 나아가 음식문화 LOD 구축 프로세스 및 서비스 모형을 제안함으로써 LOD기반의 한류문화서비스 확장에 관한 기초 연구를 제공하고자 하였다.

Abstract

In the Korean wave 3.0 age, it is needed to prepare how to globalize and hold Korean culture through development of various Korean wave culture contents from existing contents focused on media. The goal of this study is to establish the foundation for developing the various Korean wave culture contents as linking information about other culture contents as well as food culture by extending Korean culture contents service based on LOD. For this purpose, this study established and assorted the concept of food culture through the literature review and case study and analyzed the applicability of the services of food culture contents based on LOD. Futhermore, this study provides the basis on extension of Korean wave culture service and suggests the process of implementation of food culture LOD and service model.

38

온톨로지 기반 상황인지 모델링 연구: u-Convention을 중심으로

김성혁(숙명여자대학교) 2011, Vol.28, No.3, pp.123-139 https://doi.org/10.3743/KOSIM.2011.28.3.123

초록보기

초록

유비쿼터스 컴퓨팅의 주요 기술인 상황인지는 환경을 구성하는 다양한 종류의 정보 기기로부터 전달되는 상황 정보를 이해하고 처리하며, 다양한 도메인에 유연하게 적용할 수 있는 상황인지 모델을 필요로 한다. 시맨틱 웹 기술 기반의 온톨로지는 구조화된 공통의 포맷을 이용하고 의미적인 정보의 표현이 가능하므로, 시스템이 상황 정보를 공유하고 이해, 추론함으로써 효과적인 상황인지가 가능하다. 따라서 온톨로지를 이용한 상황인지 모델이 여러 연구에서 제시되어 왔는데, 본 논문에서는 이러한 기존 연구들에 대한 분석을 바탕으로 상황인지 모델의 범용성과 확장성을 위해 온톨로지의 구조를 계층화하고 이를 기반으로 상황인지 시스템을 구현하여 실제 u-Convention 도메인에 적용하였다. 또한 OWL-DL의 기술논리와 SWRL 규칙 추론을 결합함으로써 복합적인 상황을 효과적으로 추론하는 방법을 제시하였다.

Abstract

Context-awareness as a key technology of ubiquitous computing needs a context model that understands and processes situational information coming from diverse sensors and devices, and can be applied diversely in various domains. Semantic web based ontologies use structured standard format and express meaning of information, so it is possible to recognize effectively context-awareness situations, allowing the system to share information and understand situation by inference. In this paper, we propose a layered ontology model to support generality and scaleability of the context-awareness system, and applied the model to u-Convention domain. In addition, we propose a effective reasoning method to handle compound situation by combining OWL-DL and SWRL rules.

39

LOD 기반 한국사 콘텐츠 서비스 구축에 관한 연구

윤소영(국사편찬위원회) 2013, Vol.30, No.3, pp.297-315 https://doi.org/10.3743/KOSIM.2013.30.3.297

초록보기

초록

역사에 관심 있는 대한민국 국민 누구나 우리 역사에 쉽게 접근하여 재미있게 배울 수 있으며 정확하고 신뢰도 높은 역사정보를 제공하기 위한 콘텐츠 서비스 구축에 관심이 높아지고 있다. 또한 시맨틱 웹 구축을 통해 정보의 공유 및 재활용에 대한 수요가 증가하고 있으며 이는 링크드 데이터를 통해 구체화되고 있다. 기존의 전문연구자 중심의 원문 DB구축에서 탈피하여 일반인도 쉽게 이해하고 이용할 수 있는 대중적 콘텐츠 구축은 여러 기관, 포털, 그리고 일부 개인을 중심으로 구축되고 있으나 정보 공유 및 활용성 측면에 대한 고려 없이 개별적으로 중복 구축되고 있다. 본 연구에서는 원문사료에 대한 접근성을 높이고 정보공유 및 연결을 통한 정보유통 체계를 확보하여 웹상의 다양한 데이터와의 연결로 풍부한 정보제공 환경을 구축하기 위한 방안으로 LOD 기반 한국사 콘텐츠 서비스 시스템 구축을 제안하였다.

Abstract

Anyone curious to easily access and learn Korean history has become interested in Korean history data bases, which will provide accurate and reliable historical information. Furthermore, user demands for information sharing and reusability, available through setting up a semantic web, have been increased, which have taken the shape of linked data. Efforts have been made to construct public data bases containing readily usable contents a user can understand and utilize with ease. They have been produced by several organizations, portal sites, and individuals, trying to deviate from existing mainstreams - expert-based text data bases. A problem with those data bases is that they have not considered such vital factors as the sharing and utilizing of information as a whole. This study suggests a LOD-based Korean history contents implementation system, providing rich information environment by way of multi-dimensional web-data connections. In doing so, this system has tried a historic information circulation service system which is based on information sharing and connecting.

40

정보검색 성능 향상을 위한 단어 중의성 해소 모형에 관한 연구

정영미(연세대학교) ; 이용구(계명대학교) 2005, Vol.22, No.2, pp.125-145 https://doi.org/10.3743/KOSIM.2005.22.2.125

초록보기

초록

이 연구에서는 문헌 및 질의의 내용을 대표하는 주제어의 중의성 해소를 위해 대표적인 지도학습 모형인 나이브 베이즈 분류기와 비지도학습 모형인 EM 알고리즘을 각각 적용하여 검색 실험을 수행한 다음, 주제어의 중의성 해소를 통해 검색 성능의 향상을 가져올 수 있는지를 평가하였다. 실험문헌 집단은 약 12만 건에 달하는 한국어 신문기사로 구성하였으며, 중의성 해소 대상 단어로는 한국어 동형이의어 9개를 선정하였다. 검색 실험에는 각 중의성 단어를 포함하는 18개의 질의를 사용하였다. 중의성 해소 실험 결과 나이브 베이즈 분류기는 최적의 조건에서 평균 92%의 정확률을 보였으며, EM 알고리즘은 최적의 조건에서 평균 67% 수준의 클러스터링 성능을 보였다. 중의성 해소 알고리즘을 통합한 의미기반 검색에서는 나이브 베이즈 분류기 통합 검색이 약 39.6%의 정확률을 보였고, EM 알고리즘 통합 검색이 약 36%의 정확률을 보였다. 중의성 해소 모형을 적용하지 않은 베이스라인 검색의 정확률 37%와 비교하면 나이브 베이즈 통합 검색은 약 7.4%의 성능 향상률을 보인 반면 EM 알고리즘 통합 검색은 약 3%의 성능 저하율을 보였다.

Abstract

This paper presents a semantic vector space retrieval model incorporating a word sense disambiguation algorithm in an attempt to improve retrieval effectiveness. Nine Korean homonyms are selected for the sense disambiguation and retrieval experiments. The total of approximately 120,000 news articles comprise the raw test collection and 18 queries including homonyms as query words are used for the retrieval experiments. A Naive Bayes classifier and EM algorithm representing supervised and unsupervised learning algorithms respectively are used for the disambiguation process. The Naive Bayes classifier achieved 92% disambiguation accuracy, while the clustering performance of the EM algorithm is 67% on the average. The retrieval effectiveness of the semantic vector space model incorporating the Naive Bayes classifier showed 39.6% precision achieving about 7.4% improvement. However, the retrieval effectiveness of the EM algorithm-based semantic retrieval is 3% lower than the baseline retrieval without disambiguation. It is worth noting that the performances of disambiguation and retrieval depend on the distribution patterns of homonyms to be disambiguated as well as the characteristics of queries.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지