정보관리학회지, 한국정보관리학회

1

강인수(경성대학교) 2008, Vol.25, No.3, pp.27-39 https://doi.org/10.3743/KOSIM.2008.25.3.027

초록보기

초록

동일한 인명을 갖는 서로 다른 실세계 사람들이 존재하는 현실은 인터넷 세계에서 인명으로 표현된 개체의 신원을 식별해야 하는 문제를 발생시킨다. 상기의 문제가 학술정보 내의 저자명 개체로 제한된 경우를 저자식별이라 부른다. 저자식별은 식별 대상이 되는 저자명 개체 사이의 유사도 즉 저자유사도를 계산하는 단계와 이후 저자명 개체들을 군집화하는 단계로 이루어진다. 저자유사도는 공저자, 논문제목, 게재지정보 등의 저자식별자질들의 자질유사도로부터 계산되는데, 이를 위해 기존에 교사방법과 비교사방법들이 사용되었다. 저자식별된 학습샘플을 사용하는 교사방법은 비교사방법에 비해 다양한 저자식별자질들을 결합하는 최적의 저자유사도함수를 자동학습할 수 있다는 장점이 있다. 그러나, 기존 교사방법 연구에서는 SVM, MEM 등의 일부 기계학습기법만이 시도되었다. 이 논문은 다양한 기계학습기법들이 저자식별에 미치는 성능, 오류, 효율성을 비교하고, 공저자와 논문제목 자질에 대해 자질값 추출 및 자질 유사도 계산을 위한 여러 기법들의 비교분석을 제공한다.

Abstract

In bibliographic data, the use of personal names to indicate authors makes it difficult to specify a particular author since there are numerous authors whose personal names are the same. Resolving same-name author instances into different individuals is called author resolution, which consists of two steps: calculating author similarities and then clustering same-name author instances into different person groups. Author similarities are computed from similarities of author-related bibliographic features such as coauthors, titles of papers, publication information, using supervised or unsupervised methods. Supervised approaches employ machine learning techniques to automatically learn the author similarity function from author-resolved training samples. So far, however, a few machine learning methods have been investigated for author resolution. This paper provides a comparative evaluation of a variety of recent high-performing machine learning techniques on author disambiguation, and compares several methods of processing author disambiguation features such as coauthors and titles of papers.

2

대학생들의 정보매체활용에 따른 학습효율성에 관한 연구

박재용(신라대학교) 2007, Vol.24, No.4, pp.119-132 https://doi.org/10.3743/KOSIM.2007.24.4.119

초록보기

초록

본 연구는 대학생을 대상으로 정보매체활용에 따른 학습효율성의 차이를 분석하였다. 연구를 위한 설문조사 표본은 모두 106개 이었고, 단순회귀분석결과 컴퓨터활용능력과 정보매체활용에 대하여는 t=2.990(p=0.003), sig=0.05, 정보매체활용과 학습효율성에서는 t=41.758(p=0.000), sig=0.05으로 유의적으로 나타났다. 반면, 컴퓨터활용능력과 학습효율성에 관해서는 t=-1.756(p=0.082), sig=0.05.로 비유의적으로 나타났다. 이에 본 연구는 정보매체를 활용한 수업방식에 있어서 보다 효과적인 교수법에 대한 기초자료를 제시하였다. 아울러 대학에서 다양하게 적용되고 있는 정보매체를 활용한 수업에 고려해야 할 사항들을 제시함으로써 효과적인 정보화교육 및 교수방법에 새로운 방향을 모색하였다.

Abstract

This study analyzed the difference of learning efficiency by using information media applications for undergraduate students. The survey samples for research were 106 and the results showed significant by simple regulation analysis on computer applications and information media applications with t=2.990(p=0.003), sig=0.05 and on information media applications and learning efficiency with t=41.758(p=0.000), sig=0.05. Otherwise, the result showed no significant on computer applications and learning efficiency with t=-1.756(p=0.082), sig=0.05. As a result, this study provided basic materials on more effective teaching methods than a class using information applications. As providing facts to be consider a class using information media this study found to be new directions on effective information education and teaching methods.

3

기술과학 분야 학술문헌에 대한 학습집합 반자동 구축 및 자동 분류 통합 연구

김선우(경기대학교 문헌정보학과) ; 고건우(경기대학교 문헌정보학과) ; 최원준(한국과학기술정보연구원 콘텐츠 큐레이션센터) ; 정희석(한국과학기술정보연구원 콘텐츠 큐레이션센터) ; 윤화묵(한국과학기술정보연구원 콘텐츠큐레이션센터) ; 최성필(경기대학교) 2018, Vol.35, No.4, pp.141-164 https://doi.org/10.3743/KOSIM.2018.35.4.141

초록보기

초록

최근 학술문헌의 양이 급증하고, 융복합적인 연구가 활발히 이뤄지면서 연구자들은 선행 연구에 대한 동향 분석에 어려움을 겪고 있다. 이를 해결하기 위해 우선적으로 학술논문 단위의 분류 정보가 필요하지만 국내에는 이러한 정보가 제공되는 학술 데이터베이스가 존재하지 않는다. 이에 본 연구에서는 국내 학술문헌에 대해 다중 분류가 가능한 자동 분류 시스템을 제안한다. 먼저 한국어로 기술된 기술과학 분야의 학술문헌을 수집하고 K-Means 클러스터링 기법을 활용하여 DDC 600번 대의 중분류에 맞게 매핑하여 다중 분류가 가능한 학습집합을 구축하였다. 학습집합 구축 결과, 메타데이터가 존재하지 않는 값을 제외한 총 63,915건의 한국어 기술과학 분야의 자동 분류 학습집합이 구축되었다. 이를 활용하여 심층학습 기반의 학술문헌 자동 분류 엔진을 구현하고 학습하였다. 객관적인 검증을 위해 수작업 구축한 실험집합을 통한 실험 결과, 다중 분류에 대해 78.32%의 정확도와 72.45%의 F1 성능을 얻었다.

Abstract

Recently, as the amount of academic literature has increased rapidly and complex researches have been actively conducted, researchers have difficulty in analyzing trends in previous research. In order to solve this problem, it is necessary to classify information in units of academic papers. However, in Korea, there is no academic database in which such information is provided. In this paper, we propose an automatic classification system that can classify domestic academic literature into multiple classes. To this end, first, academic documents in the technical science field described in Korean were collected and mapped according to class 600 of the DDC by using K-Means clustering technique to construct a learning set capable of multiple classification. As a result of the construction of the training set, 63,915 documents in the Korean technical science field were established except for the values in which metadata does not exist. Using this training set, we implemented and learned the automatic classification engine of academic documents based on deep learning. Experimental results obtained by hand-built experimental set-up showed 78.32% accuracy and 72.45% F1 performance for multiple classification.

4

딥러닝 언어 모델을 이용한 연구보고서의 참고문헌 자동추출 연구

한유경(정보통신정책연구원) ; 최원석(정보통신정책연구원) ; 이민철(카카오엔터프라이즈) 2023, Vol.40, No.2, pp.115-135 https://doi.org/10.3743/KOSIM.2023.40.2.115

초록보기

초록

본 연구는 단행본, 학술지, 보고서 등 다양한 종류의 발간물로 구성된 연구보고서의 참고문헌 데이터베이스를 효율적으로 구축하기 위한 것으로 딥러닝 언어 모델을 이용하여 참고문헌의 자동추출 성능을 비교 분석하고자 한다. 연구보고서는 학술지와는 다르게 기관마다 양식이 상이하여 참고문헌 자동추출에 어려움이 있다. 본 연구에서는 참고문헌 자동추출에 널리 사용되는 연구인 메타데이터 추출과 더불어 참고문헌과 참고문헌이 아닌 문구가 섞여 있는 환경에서 참고문헌만을 분리해내는 원문 분리 연구를 통해 이 문제를 해결하였다. 자동 추출 모델을 구축하기 위해 특정 연구기관의 연구보고서 내 참고문헌셋, 학술지 유형의 참고문헌셋, 학술지 참고문헌과 비참고문헌 문구를 병합한 데이터셋을 구성했고, 딥러닝 언어 모델인 RoBERTa+CRF와 ChatGPT를 학습시켜 메타데이터 추출과 자료유형 구분 및 원문 분리 성능을 측정하였다. 그 결과 F1-score 기준 메타데이터 추출 최대 95.41%, 자료유형 구분 및 원문 분리 최대 98.91% 성능을 달성하는 등 유의미한 결과를 얻었다. 이를 통해 비참고문헌 문구가 포함된 연구보고서의 참고문헌 추출에 대한 딥러닝 언어 모델과 데이터셋 유형별 참고문헌 구축 방향을 제안하였다.

Abstract

The purpose of this study is to assess the effectiveness of using deep learning language models to extract references automatically and create a reference database for research reports in an efficient manner. Unlike academic journals, research reports present difficulties in automatically extracting references due to variations in formatting across institutions. In this study, we addressed this issue by introducing the task of separating references from non-reference phrases, in addition to the commonly used metadata extraction task for reference extraction. The study employed datasets that included various types of references, such as those from research reports of a particular institution, academic journals, and a combination of academic journal references and non-reference texts. Two deep learning language models, namely RoBERTa+CRF and ChatGPT, were compared to evaluate their performance in automatic extraction. They were used to extract metadata, categorize data types, and separate original text. The research findings showed that the deep learning language models were highly effective, achieving maximum F1-scores of 95.41% for metadata extraction and 98.91% for categorization of data types and separation of the original text. These results provide valuable insights into the use of deep learning language models and different types of datasets for constructing reference databases for research reports including both reference and non-reference texts.

5

위키 환경을 활용한 학습자의 협력학습 기반 그룹 프로젝트 활동 분석: 구글 사이트 활용 사례를 중심으로

정영숙(한국방송통신대학교) ; 박옥남(성균관대학교) 2009, Vol.26, No.3, pp.239-259 https://doi.org/10.3743/KOSIM.2009.26.3.239

초록보기

초록

본 연구에서는 교육 도구로써 위키의 가능성을 탐색하기 위해서, 위키 환경인 구글 사이트에서 협력학습 기반 그룹 프로젝트를 수행하는 학습자들의 행태와 인식을 조사하였다. 이를 위해 파일 업로드, 웹 페이지 사용, 댓글, 네비게이션 바와 관련된 개별 학습자 및 그룹별 형태 분석과 구글 사이트 사용에 대한 설문조사를 실시하였다. 연구 결과를 토대로 구글 사이트를 활용한 학습자들의 협력학습 기반 그룹 프로젝트 활동의 특징을 논의하였다. 또한 협력학습의 교육적 효과성과 학습활동 진행 과정 평가의 용이성을 증대시키기 위하여 위키 환경 시스템이 어떻게 개선되어야 하는지를 제시하였다.

Abstract

The study aims at investigating students' behaviors and perceptions regarding the collaborative learning based group project using the wiki environment. The study utilized Google Sites as a case, and analyzed file unloads, the use of web pages, navigation bars, and comments as well as surveys. The study discusses main characteristics of students' activities in the collaborative learning group project, which are drawn from the analysis of students' behaviors and perceptions. The study also provides implications for improvement of wiki environment to support collaborative learning in education.

6

대학 수업에서 소셜 북마킹의 활용: 학생 인식 및 행태를 중심으로

박옥남() ; 정영숙(한국방송통신대학교) 2009, Vol.26, No.2, pp.65-82 https://doi.org/10.3743/KOSIM.2009.26.2.065

초록보기

초록

Abstract

This exploratory study describes the social bookmarking perceptions and behaviors of students in university courses. Although an emerging discussion regarding the value of social bookmarking tools exists, how users adopt tools in practice is not well known. Students were asked to utilize the bookmarking tool del.icio.us to store information relating to course projects. They were also asked to comment how they employed del.icio.us for course projects. The study analyzed student perceptions and behaviors when using social bookmarking tools for university coursework. The study noted that the use of tags, notes, and networking within these social bookmarking tools remained less active and social bookmarking services in Web 2.0 as shared collaboration, shared communities, and vertical search were less present. Utilizing social bookmarking tools to facilitate personal information management includes the activities of information use, information re-use, and mobility.

7

정보추출을 이용한 학습기반의 웹 인터페이스 에이전트

이말례(여수대학교) ; 배금표(중앙대학교) 2002, Vol.19, No.1, pp.5-22 https://doi.org/10.3743/KOSIM.2002.19.1.005

초록보기

초록

사용자는 원하는 자료를 검색하기 위해서 각 위치에 대한 정보를 저장하고 있는 검색엔진을 이용하는 경우가 대부분이다. 하지만 자료의 양이 방대해짐에 따라 사용자에게 실제로 필요한 정보가 아닐 경우가 많이 발생한다. 본 논문에서는 이러한 문제를 해결할 수 있는 개인형 웹 인터페이스 에이전트 시스템인 웹 가이드를 제안하였다. 웹 가이드는 사용자의 행동과 에이전트의 방문을 키워드를 중심으로 각각의 사례로 저장하는 사례기반 학습 방법을 이용, 특정 개인 사용자가 웹 상에서 검색하고자 하는 자료를 입력받은 후부터 사용자의 방문 행동을 학습하여 보다 빠른 시간 내에 원하고자 하는 자료를 검색할 수 있도록 도와주는 에이전트 시스템이다.

Abstract

Users usually search for the required information via search engines which contain locations of the information. However, as the amount of data gets large, the result of the search is often not the information that users actually want. In this paper a web guide is proposed in order to resolve this problem. The web guide uses case-based learning method which stores and utilizes cases based on the keywords of user’s action and agent’s visit. The proposed agent system learns the user’s visiting actions following the input the data to be searched, and then helps rapid searches of data wanted.

8

기계학습을 이용한 기록 텍스트 자동분류 사례 연구

김해찬솔(아카이브랩) ; 안대진(명지대학교 기록정보과학전문대학원, (주)아카이브랩 대표) ; 임진희(서울특별시청) ; 이해영(명지대학교) 2017, Vol.34, No.4, pp.321-344 https://doi.org/10.3743/KOSIM.2017.34.4.321

초록보기

초록

기록이나 문헌의 자동분류에 관한 연구는 오래 전부터 시작되었다. 최근에는 인공지능 기술이 발전하면서 기계학습이나 딥러닝을 접목한 연구로 발전되고 있다. 이 연구에서는 우선 문헌의 자동분류와 인공지능의 학습방식이 발전해 온 과정을 살펴보았다. 또 기계학습 중 특히 지도학습 방식의 특징과 다양한 사례를 통해 기록관리 분야에 인공지능 기술을 적용해야 할 필요성에 대해 알아보았다. 그리고 실제로 지도학습 방식으로 서울시의 결재문서를 ETRI의 엑소브레인을 통해 정부기능분류체계로 자동분류해 보았다. 이를 통해 기록을 다양한 방식의 분류체계로 자동분류하기 위한 각 과정의 고려사항을 도출하였다.

Abstract

Research on automatic classification of records and documents has been conducted for a long time. Recently, artificial intelligence technology has been developed to combine machine learning and deep learning. In this study, we first looked at the process of automatic classification of documents and learning method of artificial intelligence. We also discussed the necessity of applying artificial intelligence technology to records management using various cases of machine learning, especially supervised methods. And we conducted a test to automatically classify the public records of the Seoul metropolitan government into BRM using ETRI’s Exobrain, based on supervised machine learning method. Through this, we have drawn up issues to be considered in each step in records management agencies to automatically classify the records into various classification schemes.

9

풍부한 정보 환경에서 정보와 함께 하는 학습: 인지기술 활용을 중심으로

정진수() 2003, Vol.20, No.4, pp.135-158 https://doi.org/10.3743/KOSIM.2003.20.4.135

초록보기

초록

본 연구는 정보이용이 학습에 영향을 미치는 과정을 탐구하였다. 특히 정보이용 괴정 중에 활용된 학습자들의 인지기술 분석을 통한 그들의 의미 있는 학습 과정에 초점을 맞추어, 성공적인 정보이용이란 단순한 사실과 개념의 검색이라는 일반적 가정을 넘어서 학습자에게 개인적으로 의미 있는 학습이라 정의한다. 구성주의적 패라다임에 근본을 둔 질적 연구 방법론을 이용한 본 연구는 앤더슨과 크래스울(Anderson and Krathwohl, 2001)이 제시한 개정된 블룸의 택사노미라는 개념적 틀에 기초하여 실제정보이용 환경에서는 어떠한 인지기술이 활용되고 결과물에 반영되는지를 탐구하였다. "설득력 있는 화법(Persuasive Speech)"이라는 영어과 교과 중 하나의 과목의 우등반 학생 21명의 참여로 두 가지 방법론의 혼합적 활용이 - 개념도와 개별 인터뷰 - 제안되어 학습자들의 자연스런 정보 환경 속에서 시도되었다. 연구 결과는 학생들이 정보이용 과정 중에 네 가지 패턴의 변환을 거치면서 개정된 블룸의 택사노미에 제시된 모든 단계의 학습에 필요한 인지기술의 활용이 발견이 되었고 특히 풍부한 정보 환경에서는 고급 학습단계의 인지기술의 활용이 중요하다는 것이 밝혀졌다.

Abstract

The purpose of this study is to investigate how information use contributes to learning. Conducted as part of a larger study. this study focuses on learning by analyzing students' use of cognitive skills during the process of using information. Within the broad methodological framework of qualitative research in constructivist paradigm (Guba and Lincoln. 1998), the study applied the revised Bloom's taxonomy (Anderson and Krathwohl, 2000) as a particular framework to understand the phenomenon. Participants included 21 high school juniors in an honors' class of persuasive speech. The study's combinational use of two techniques concept mapping and individual interview - in a naturalistic setting proved to be the unique methods for researching the reflection of information use in learning products. The results revealed that changes in students' understanding occured in four types - simple, analytic, organizational, and holistic changes. The analysis using the revised Bloom's taxonomy showed that a variety of cognitive skills were used during the whole process of information use and that the use of higher levels of cognitive skills is particularly crucial. [ 더 많은 내용 보기 ]

10

기계학습을 통한 디스크립터 자동부여에 관한 연구

김판준(신라대학교) 2006, Vol.23, No.1, pp.279-299 https://doi.org/10.3743/KOSIM.2006.23.1.279

초록보기

초록

학술지 논문에 디스크립터를 자동부여하기 위하여 기계학습 기반의 접근법을 적용하였다. 정보학 분야의 핵심 학술지를 선정하여 지난 11년간 수록된 논문들을 대상으로 문헌집단을 구성하였고, 자질 선정과 학습집합의 크기에 따른 성능을 살펴보았다. 자질 선정에서는 카이제곱 통계량(CHI)과 고빈도 선호 자질 선정 기준들(COS, GSS, JAC)을 사용하여 자질을 축소한 다음, 지지벡터기계(SVM)로 학습한 결과가 가장 좋은 성능을 보였다. 학습집합의 크기에서는 지지벡터기계(SVM)와 투표형 퍼셉트론(VPT)의 경우에는 상당한 영향을 받지만 나이브 베이즈(NB)의 경우에는 거의 영향을 받지 않는 것으로 나타났다.

Abstract

This study utilizes various approaches of machine learning in the process of automatically assigning descriptors to journal articles. After selecting core journals in the field of information science and organizing test collection from the articles of the past 11 years, the effectiveness of feature selection and the size of training set was examined. In the regard of feature selection, after reducing the feature set by χ2 statistics(CHI) and criteria which prefer high-frequency features(COS, GSS, JAC), the trained Support Vector Machines(SVM) performs the best. With respective to the size of the training set, it significantly influences the performance of Support Vector Machines(SVM) and Voted Perceptron(VTP). but it scarcely affects that of Naive Bayes(NB).

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지