정보관리학회지, 한국정보관리학회

권한신청
P-ISSN1013-0799
E-ISSN2586-2073
KCI

검색어: 의미관계, 검색결과: 4

변우영(명지대학교 기록정보관리학과) ; 임진희(명지대학교 기록정보과학전문대학원) 2022, Vol.39, No.1, pp.195-217 https://doi.org/10.3743/KOSIM.2022.39.1.195

초록보기

초록

SIARD_KR은 스위스 연방 기록보존소에서 개발한 관계형 데이터베이스 컨텐츠의 장기보존에 이용하는 기술인 SIARD를 우리나라의 실정에 맞게 일부 수정한 행정정보 데이터세트 보존 도구이다. 기존의 선행연구는 SIARD가 얼마나 관계형 데이터베이스안에 들어있는 모든 데이터를 손실 없이 잘 추출할 수 있는지에 초점이 맞춰져 있다. 하지만 데이터베이스에 들어있는 데이터 전부가 의미 있는 정보, 즉 행정정보 데이터세트는 아니다. 따라서 이 논문은 SIARD_KR이 행정정보 데이터세트의 특성을 반영하고 있는가에 대한 문제의식에서 시작한다. SIARD_KR이 단순히 DB에 저장된 데이터를 추출하는 도구가 아니고 의미 있는 정보만을 식별하여 추출할 수 있을지, 본래의 시스템에서 유리되어도 의미 있는 정보를 유지할 수 있을지 확인하려 한다. 본 논문은 SIARD_KR의 구조를 분석하고, 예상되는 문제점을 도출하여 그에 대한 개선방안을 제시하는 것을 목적으로 한다.

Abstract

SIARD_KR is an administrative information dataset preservation tool. It is a partially modified version of SIARD, technology used for long-term preservation of relational databases developed by the Swiss Federal Archives, to suit Korea’s situation better. Previous studies have focused on how SIARD is able to effectively extract all data contained in the relational database without loss. However, not all data contained in the database is meaningful information, that is, an administrative information dataset. This paper began, therefore, with the awareness of the problem of whether SIARD_KR reflects the characteristics of the administrative information dataset. SIARD_KR is not only a tool for extracting data stored in the DB. We want to see if it is capable of identifying and extracting only meaningful information, and maintaining meaningful information, even if it is separated from the original system. The purpose of this paper is to analyze the structure of SIARD_KR, identify expected problems, and suggest improvement measures for them.

LDA와 BERTopic을 이용한 토픽모델링의 증강과 확장 기법 연구

김선욱(경북대학교 사회과학대학 문헌정보학과) ; 양기덕(영남고문헌아카이브센터) 2022, Vol.39, No.3, pp.99-132 https://doi.org/10.3743/KOSIM.2022.39.3.099

초록보기

초록

본 연구의 목적은 LDA 토픽모델링 결과와 BERTopic 토픽모델링 결과를 합성하는 방법론인 Augmented and Extended Topics(AET)를 제안하고, 이를 사용해 문헌정보학 분야의 연구주제를 분석하는 데 있다. AET의 실제 적용결과를 확인하기 위해 2001년 1월부터 2021년 10월까지의 Web of Science 내 문헌정보학 학술지 85종에 게재된 학술논문 서지 데이터 55,442건을 분석하였다. AET는 서로 다른 토픽모델링 결과의 관계를 WORD2VEC 기반 코사인 유사도 매트릭스로 구축하고, 매트릭스 내 의미적 관계가 유효한 범위 내에서 매트릭스 재정렬 및 분할 과정을 반복해 증강토픽(Augmented Topics, 이하 AT)을 추출한 뒤, 나머지 영역에서 코사인 유사도 평균값 순위와 BERTopic 토픽 규모 순위에 대한 조화평균을 통해 확장토픽(Extended Topics, 이하 ET)을 결정한다. 최적 표준으로 도출된 LDA 토픽모델링 결과와 AET 결과를 비교한 결과, AT는 LDA 토픽모델링 토픽을 한층 더 구체화하고 세분화하였으며 ET는 유효한 토픽을 발견하였다. AT(Augmented Topics)의 성능은 LDA 이상이었으며 ET(Extended Topics)는 일부 경우를 제외하고 대부분 LDA와 유사한 수준의 성능을 나타내었다.

Abstract

The purpose of this study is to propose AET (Augmented and Extended Topics), a novel method of synthesizing both LDA and BERTopic results, and to analyze the recently published LIS articles as an experimental approach. To achieve the purpose of this study, 55,442 abstracts from 85 LIS journals within the WoS database, which spans from January 2001 to October 2021, were analyzed. AET first constructs a WORD2VEC-based cosine similarity matrix between LDA and BERTopic results, extracts AT (Augmented Topics) by repeating the matrix reordering and segmentation procedures as long as their semantic relations are still valid, and finally determines ET (Extended Topics) by removing any LDA related residual subtopics from the matrix and ordering the rest of them by (BERTopic topic size rank, Inverse cosine similarity rank). AET, by comparing with the baseline LDA result, shows that AT has effectively concretized the original LDA topic model and ET has discovered new meaningful topics that LDA didn’t. When it comes to the qualitative performance evaluation, AT performs better than LDA while ET shows similar performances except in a few cases.

문화재 중심 기록물 서비스 개선을 위한 온톨로지 설계: 황룡사 관련 기록물 중심으로

김시정(대구대학교 기록물관리 전문요원) ; 최상희(대구가톨릭대학교) 2022, Vol.39, No.4, pp.241-268 https://doi.org/10.3743/KOSIM.2022.39.4.241

초록보기

초록

문화재 관련 기록물은 문화재에 대한 구체적인 증거이며 보존에 있어 중요한 근거자료 역할을 하므로 문화재만큼이나 중요한 의미가 있다. 특히 국가적이나 사회적으로 중요한 가치를 가진 특정 문화재인 경우 해당 문화재가 하나의 주제로 다양한 연구가 진행되고 문화재를 주제로 한 프로그램이 기획되는 경우가 많다. 그러나 유명한 문화재를 중심으로 생산되는 기록물은 긴 시간 동안 발생하면서 분산되어 관리되어 왔고 다양한 형태로 나타나고 있어 해당 기록물의 범위와 소재, 내용을 파악하기 어렵다. 이와 같은 문제들의 해결 방안으로, 이 연구는 황룡사와 같이 사회적, 역사적 가치를 가지는 주요 문화재를 중심으로 발생하는 관련 기록물을 11개 공공기관 및 웹서비스에서 수집하여 기록물의 유형, 기록물과 관련된 활동, 메타데이터 분석을 통해 전체 기록물의 범위와 관계를 파악할 수 있는 온톨로지 설계를 하여 특정 문화재 중심으로 기록물을 이해할 수 있도록 하고자 하였다.

Abstract

Records related to a certain cultural heritage are concrete evidence that prove the value of the cultural heritage and become a criterion for long-term preservation of its records. The value of the records is as important as cultural heritage value. In the case of specific cultural heritage with national or socially important values, various studies are conducted on cultural heritage as one theme, and various programs about cultural heritage are developed. However, it is difficult to grasp the scope, record types, and contents of the records because they have been distributed and managed in many institutes. They also appear in various forms. As a solution to these problems, this study collected records of a major cultural heritage with social and historical values such as Hwangnyongsa from 11 public institutions and web services and analyzed the types of records, activities related to the records, and metadata. Through data analysis, an ontology that can understand the range and relationship of the entire record was suggested so that the record can be understood with a focus on specific cultural heritage.

북한이탈주민의 정보빈곤에 관한 연구: Chatman의 정보빈곤이론을 기반으로

민수진(성균관대학교 문헌정보학과) ; 이용정(성균관대학교) 2022, Vol.39, No.3, pp.241-261 https://doi.org/10.3743/KOSIM.2022.39.3.241

초록보기

초록

본 연구는 Chatman(1996)의 정보빈곤이론(Theory of Information Poverty)을 바탕으로 정보 빈곤이 북한이탈주민의 한국사회적응에 미치는 영향을 알아보고자 하였다. 연구를 위해 정보빈곤이론을 기반으로 정보빈곤의 개념을 은폐(Secrecy), 기만(Deception), 위험감수(Risk-taking), 상황적 관련성(Situational relevance)에 따른 정보 수용이라는 네 가지 변인으로 구성하였고, 선행연구 분석 결과를 바탕으로 한국사회적응을 사회적 적응과 심리적 적응으로 구분하였다. 또한 생명윤리위원회(IRB)의 승인을 거쳐 2021년 8월 4일부터 8월 30일까지 북한이탈주민 지원 단체 <우리온>을 통해 국내 입국 후 최소 1년이 경과한 민법상 성년인 만 19세 이상의 북한이탈주민을 대상으로 설문조사를 실시하였다. 수집된 100개의 유효한 데이터를 빈도 분석, 신뢰도 분석, 상관관계 분석, 다중회귀분석을 통해 분석한 결과, 정보빈곤은 북한이탈주민의 사회적 적응과 심리적 적응에 유의한 영향을 미치는 것으로 나타났다. 특히, “기만” 변수는 북한이탈주민의 사회적 적응과 심리적 적응에 유의한 부(-)의 영향을 미치는 것으로 나타났다. 본 연구는 북한이탈주민을 정보빈곤층으로 정의하고, 그들의 한국사회적응을 Chatman의 정보빈곤이론을 기반으로 설명하였다는 점에서 학문적 의의가 있다. 무엇보다도, 질적 연구를 수행한 선행연구들과 달리 변수의 조작화를 통해 양적 연구를 시도하였다는 점에서 의미가 있다.

Abstract

The present study aims to investigate the effects of information poverty on North Korean refugees’ social adaptation to South Korea based on Chatman’s Theory of Information Poverty (1996). Based on the Theory of Information Poverty, information poverty consists of four variables: Secrecy, Deception, Risk-taking, and information acceptance in response to situational relevance. And based on the previous studies, adaptation to South Korean life is divided into social adaptation and psychological adaptation. From August 4 to August 30, 2021, after approval by the IRB through the North Korean refugee support organization <Urion>, surveys were conducted with North Korean refugees who had lived in South Korea for at least one year and were aged 19 or older. The 100 collected valid data were analyzed using frequency analysis, reliability analysis, correlation analysis, and multiple linear regression analysis. Findings of the study indicated that information poverty had significant effects on North Korean refugees’ social and psychological adaptation. In particular, the “deception” variable had negative effects on social and psychological adaptation. The study has theoretical implications that it explains North Korean refugees’ adaptation to South Korea based on Theory of Information Poverty by defining them as information poor. Above all, it attempts a quantitative approach through operationalization of key concepts unlike previous studies that were conducted with qualitative approaches.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지