Journal of the Korean Society for Information Management, Korean Society for Information Management

51

육지희 (Department of Library and Information Science, Graduate School, Yonsei University) ; 송민 (Yonsei University) 2018, Vol.35, No.2, pp.63-88 https://doi.org/10.3743/KOSIM.2018.35.2.063

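Abstract (Korean, translated)

This study selected features using the LDA topic model and Doc2Vec, a word-embedding technique that applies deep learning, and evaluated how classification performance differs by the size and type of the feature set and by classification algorithm. It also determined an appropriate feature-set size and, by composing feature sets from different locations in the document, identified which feature set performs best for classification. Finally, the deep learning experiments compared classification performance by number of training iterations and by the presence or absence of context inference information. For the test collection, biomedical scholarly documents provided by PMC were collected and divided according to a disease category scheme to build Disease-35083. The study identified the type and size of the feature set with the highest performance and, by demonstrating efficiency in training time, proposed feature sets with the potential to be extended as features. It also compared deep learning with existing methods and proposed a suitable method for each classification environment.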

Abstract

This study selected features using the LDA topic model and Doc2Vec, a word-embedding method based on deep learning, and evaluated how classification performance varies with the size and type of the feature set and with the classification algorithm. To find the feature set that yields the highest classification performance, experiments varied the size of the feature set and composed it from different locations within the document. In the deep learning experiments, classification performance was also compared across the number of training iterations and the presence or absence of context inference information. The test collection, Disease-35083, was built from biomedical scholarly documents provided by PMC and categorized by disease category. The study identifies the type and size of feature set that produce the highest performance and, on the basis of training-time efficiency, suggests feature sets that could be extended as features. It also compares deep learning with existing methods and suggests an appropriate method for each classification environment.

52

A Study on Word Sense Disambiguation Models for Improving Information Retrieval Performance

정영미 (Yonsei University) ; 이용구 (Keimyung University) 2005, Vol.22, No.2, pp.125-145 https://doi.org/10.3743/KOSIM.2005.22.2.125

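Abstract (Korean, translated)

To disambiguate the subject terms that represent the content of documents and queries, this study applied a Naive Bayes classifier, a representative supervised learning model, and the EM algorithm, an unsupervised learning model, in retrieval experiments, and then evaluated whether disambiguating subject terms improves retrieval performance. The test collection consisted of about 120,000 Korean newspaper articles, and nine Korean homonyms were selected as disambiguation targets. The retrieval experiments used 18 queries, each containing one of the ambiguous words. In the disambiguation experiments, the Naive Bayes classifier achieved an average accuracy of 92% under optimal conditions, and the EM algorithm achieved an average clustering performance of about 67% under optimal conditions. In sense-based retrieval with an integrated disambiguation algorithm, retrieval with the Naive Bayes classifier showed a precision of about 39.6%, and retrieval with the EM algorithm showed a precision of about 36%. Compared with the 37% precision of the baseline retrieval without a disambiguation model, Naive Bayes retrieval showed an improvement of about 7.4%, whereas EM retrieval showed a degradation of about 3%.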

Abstract

This paper presents a semantic vector space retrieval model that incorporates a word sense disambiguation algorithm in an attempt to improve retrieval effectiveness. Nine Korean homonyms were selected for the sense disambiguation and retrieval experiments. Approximately 120,000 news articles make up the raw test collection, and 18 queries containing the homonyms as query words were used for the retrieval experiments. A Naive Bayes classifier and the EM algorithm, representing supervised and unsupervised learning respectively, were used for disambiguation. The Naive Bayes classifier achieved 92% disambiguation accuracy, while the clustering performance of the EM algorithm was 67% on average. The semantic vector space model incorporating the Naive Bayes classifier showed 39.6% precision, an improvement of about 7.4%. However, the retrieval effectiveness of the EM algorithm-based semantic retrieval was 3% lower than that of the baseline retrieval without disambiguation. It is worth noting that disambiguation and retrieval performance depend on the distribution patterns of the homonyms to be disambiguated as well as on the characteristics of the queries.
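
The improvement and degradation rates reported for this study appear to be relative changes against the baseline precision of 37% given in the Korean abstract, rather than percentage-point differences. A quick check under that assumption, using the rounded precision values from the abstract (the symbols $P_{\text{base}}$ and $P_{\text{model}}$ are introduced here for illustration only):

```latex
\[
\Delta_{\text{rel}} = \frac{P_{\text{model}} - P_{\text{base}}}{P_{\text{base}}}
\]
\[
\Delta_{\text{NB}} = \frac{39.6 - 37}{37} \approx +7.0\%, \qquad
\Delta_{\text{EM}} = \frac{36 - 37}{37} \approx -2.7\%
\]
```

These values land close to the reported 7.4% and 3%. The remaining gap is what one would expect if the published precision figures are rounded, though the abstract does not give the unrounded values.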

53

Analysis of Deep Learning Research Trends Using Ego-Centered Topic Citation Analysis

이재윤 (Myongji University) 2017, Vol.34, No.4, pp.7-32 https://doi.org/10.3743/KOSIM.2017.34.4.007

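Abstract (Korean, translated)

Deep learning has recently been spreading rapidly across many fields as an innovative machine learning technique. To analyze research trends in deep learning, this study applied a modified form of ego-centered topic citation analysis. Starting from a small number of seed documents among those retrieved from Web of Science with the query ‘deep learning’, the documents to be analyzed were obtained through citation relations. Recent papers citing the seed documents were set as the ego document set, reflecting current research in deep learning. Prior studies frequently cited by the ego documents were set as the citation identity document set, representing the research topics of the field. For the ego document set, quantitative analyses including co-authorship network analysis were carried out to identify major countries and research institutions. For the citation identity documents, co-citation analysis was performed, and the citation image keywords, which are the main keywords of the papers citing each resulting document cluster, were identified to reveal key documents and key research topics. Finally, a citation growth index (CGI), which reflects the growth trend in citation influence on a specific topic, was proposed and measured to show how the leading research topics in deep learning have changed.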

Abstract

Recently, deep learning has been spreading rapidly as an innovative machine learning technique in various domains. This study explored the research trends of deep learning through a modified ego-centered topic citation analysis. To do so, a few seed documents were selected from among the documents retrieved from Web of Science with the keyword ‘deep learning’, and the related documents were obtained through citation relations. Recent papers citing the seed documents were set as ego documents reflecting current research in the field of deep learning. Prior studies cited frequently in the ego documents were set as the citation identity documents that represent the specific themes in the field of deep learning. For the ego documents, which are the result of current research activities, quantitative analyses including co-authorship network analysis were performed to identify major countries and research institutes. For the citation identity documents, co-citation analysis was conducted, and key documents and key research themes were identified by investigating the citation image keywords, which are the major keywords of the papers citing the citation identity document clusters. Finally, we proposed and measured the citation growth index, which reflects the growth trend of the citation influence on a specific topic, and showed the changes in the leading research themes in the field of deep learning.

54

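Proposed Quality Evaluation Criteria for Large Language Models from the Perspective of Research Data

한나은 (Korea Institute of Science and Technology Information) ; 서수정 (Korea Institute of Science and Technology Information) ; 엄정호 (Korea Institute of Science and Technology Information) 2023, Vol.40, No.3, pp.77-98 https://doi.org/10.3743/KOSIM.2023.40.3.077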

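Abstract (Korean, translated)

Among the large language models proposed to date, this study focuses on the data quality of models that use research data as their main pre-training data, such as LLaMA and LLaMA-based models, analyzes current evaluation criteria, and proposes quality evaluation criteria from the perspective of research data. To this end, quality evaluation is discussed with a focus on validity, functionality, and reliability among the data quality evaluation factors, and the LLaMA, Alpaca, Vicuna, and ChatGPT models are compared to understand the characteristics and limitations of large language models. To analyze the evaluation criteria currently in wide use for large language models, the study examines the criteria of Holistic Evaluation of Language Models and then discusses their limitations. On this basis, the study presents quality evaluation criteria for large language models that use research data as their main pre-training data and discusses directions for future development, contributing a knowledge base for the further development of large language models.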

Abstract

Large Language Models (LLMs) are becoming the major trend in the natural language processing field. These models were built on research data, but information such as the types, limitations, and risks of using research data is unknown. This study presents how to analyze and evaluate, from the perspective of research data, LLMs that were built with research data: LLaMA and LLaMA-based models such as Stanford's Alpaca and Vicuna from the Large Model Systems Organization, together with ChatGPT from OpenAI. The quality evaluation focuses on the validity, functionality, and reliability dimensions of Data Quality Management (DQM). Furthermore, we examined the Holistic Evaluation of Language Models (HELM) to understand its evaluation criteria and then discussed its limitations. This study presents quality evaluation criteria for LLMs using research data and directions for future development.

55

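A Perception Survey for Establishing a Comprehensive Operation Plan for the Incheon Global Campus Library

곽승진 (Department of Library and Information Science, Chungnam National University) ; 노영희 (Department of Library and Information Science, Konkuk University) ; 고재민 (Department of Interior Architecture Design, School of Convergence Design, Suwon Science College) ; 강봉숙 (Department of Library and Information Science, Cheongju University) ; 김정택 (Academic Information Team, Pai Chai University) 2022, Vol.39, No.2, pp.255-273 https://doi.org/10.3743/KOSIM.2022.39.2.255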

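Abstract (Korean, translated)

As a foundation for establishing a comprehensive operation plan for the Incheon Global Campus Library in preparation for the Fourth Industrial Revolution and the post-COVID era, this study surveyed Incheon Global Campus students on the library's current status, their satisfaction, and their needs, and on that basis proposed directions for the plan. The results are as follows. First, for the collection, a mid- to long-term plan for continuous expansion is needed, particularly for major-related materials. Second, for services, the library should support users' research by providing information services tailored to each research stage, researcher-customized information services, and department-dedicated research support services, and it needs to offer information literacy education programs. Third, for space, IGC users showed very high demand for study and research space, so the education and related spaces that users want should be improved through future space reorganization; along with more study space, creative collaboration space closely tied to students' lives, for discussion and collaboration, seminars, and rest, should also be expanded. Finally, regarding the website and information systems, revitalizing the website first requires adding the varied and up-to-date content that users want, together with a plan for providing domestic electronic journals and databases.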

Abstract

This is a basic study for establishing a comprehensive operation plan for the Incheon Global Campus Library in preparation for the Fourth Industrial Revolution and the post-COVID era. It surveyed Incheon Global Campus students on the library's current status, their satisfaction, and their needs, and on that basis proposes directions for the plan. First, for the collection, a mid- to long-term plan for continuous expansion is required, and in particular major-related collections need to be expanded. Second, for services, the library needs to support users' research by providing information services customized for each stage of research, information services customized for researchers, and dedicated research support services for departments, and it needs to provide information literacy education programs. Third, for space, IGC users have very high demand for learning and research space, so the education and related spaces that users want need to be improved through future space reorganization. Along with more learning space, the creative collaboration space also needs to be expanded as a place closely related to students' lives, for discussion and collaboration, seminars, and rest. Lastly, regarding the homepage and information system, activating the homepage first requires expanding the varied and up-to-date content that users want. In addition, a plan for providing domestic electronic journals and databases should be put in place.

56

Developing a BSC Model Based on Strategic Planning: The Case of Rod Library

조윤희(University of Northern Iowa) 2006, Vol.23, No.1, pp.159-179 https://doi.org/10.3743/KOSIM.2006.23.1.159

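Abstract (Korean, translated)

The University of Northern Iowa library has as its mission to support a personalized learning environment that underpins a strong liberal arts curriculum and to support diverse intellectual and cultural communities. It began strategic planning in 1987 and now draws up a strategic plan every five years; recently, the need arose to measure how far the strategy is being achieved. This study therefore analyzed the library's most recent strategic plan, for 2004-2009, and developed a BSC model and strategy map that translate the strategy into the four perspectives of the balanced scorecard. It also developed and presented key performance indicators that measure the strategic objectives of each perspective, along with a cause-and-effect model of the drivers that lead them.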

Abstract

The mission of the University of Northern Iowa Rod Library is to provide a personalized learning environment founded on a strong liberal arts curriculum and to support an intellectually and culturally diverse community. The Rod Library has been developing strategic plans since 1987, currently on a five-year cycle. Recently, a need has arisen to measure how far the strategy is being achieved. This study analyzed the Rod Library's strategic plan for 2004-2009 and developed a BSC model and strategy map that translate the strategy into the four perspectives of the BSC. In addition, this study presented key performance indicators that measure the strategic goals of each perspective, together with a cause-and-effect model of the drivers that lead those indicators.

57

A Study on the Correlation between Information Literacy Self-Efficacy and Academic Achievement

김성원 (Chungnam National University) 2011, Vol.28, No.3, pp.31-46 https://doi.org/10.3743/KOSIM.2011.28.3.031

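Abstract (Korean, translated)

An individual's information literacy can greatly affect personal competitiveness by helping to solve immediate problems in the short term and enabling lifelong learning in the long term. This study tested, with a group of university students, whether information literacy, whose importance is increasingly emphasized, is correlated with individual performance. Because the subjects were university students, grade point average, a measure of academic achievement, was adopted as the indicator of individual performance. The test confirmed a correlation between self-efficacy in information literacy and academic achievement. It also confirmed that this correlation persists. The study confirmed that information literacy affects individual performance, which can provide a justification for offering information literacy courses.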

Abstract

Individuals' information literacy significantly affects their competitiveness by providing problem-solving skills in the short run and by enabling lifelong learning in the long run. This study examines whether information literacy is related to individuals' achievement level through an experiment with a group of college students. As the measure of individual achievement, we adopted students' grade point average (GPA). The results confirmed that information literacy and academic achievement have a positive relationship. Additionally, this relationship was found to persist over a significant period. These results can serve as a rationale for providing information literacy courses in the academic curriculum.

58

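An Analysis of Syllabi for Data Science-Related Courses: Focusing on ALA-Accredited Library and Information Science Programs

박형주 (Department of Library and Information Science, Chungnam National University) 2022, Vol.39, No.1, pp.119-143 https://doi.org/10.3743/KOSIM.2022.39.1.119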

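Abstract (Korean, translated)

This study investigated the content of data science-related courses offered by library and information science programs accredited by the American Library Association (ALA). Its purpose was to examine, through content analysis of syllabi, the course titles, course descriptions, learning objectives, and weekly topics covered in those courses. The required and elective data science-related courses in these programs spanned the whole field of data science, including introduction to data science, data mining, databases, data analysis, data visualization, data curation and management, machine learning, metadata, and computer programming. The results are expected to serve as basic material and a starting point for discussion when library and information science programs establish or revise data science curricula, and to be used to strengthen their operational capacity.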

Abstract

This preliminary study examined the status of data science-related course syllabi in Library and Information Science (LIS) programs accredited by the American Library Association (ALA). The purpose of this study was to explore LIS course syllabi related to data science by examining course titles, course descriptions, learning outcomes, and weekly topics. LIS programs offer a variety of data science topics, such as introduction to data science, data mining, databases, data analysis, data visualization, data curation and management, machine learning, metadata, and computer programming. This study helps instructors develop or revise course materials to improve course competencies related to data science in ALA-accredited LIS programs.

59

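Structural Relationships between Knowledge Management Influence Factors and Performance in a Public Enterprise: Focusing on Korea Railroad Corporation

이홍재 (Anyang University) 2009, Vol.26, No.3, pp.189-211 https://doi.org/10.3743/KOSIM.2009.26.3.189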

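Abstract (Korean, translated)

This study sought to identify the structural relationships among the factors influencing knowledge management in a public enterprise, knowledge management activities, and their performance. To this end, it conducted an empirical analysis based on a survey of employees of Korea Railroad Corporation, the first government organization to introduce knowledge management and one regarded as a leader in public-sector knowledge management. Hypothesis testing showed that learning-oriented culture, knowledge quality, and KMS quality significantly influence both knowledge creation and knowledge sharing activities, whereas individual attitude significantly influences only knowledge creation. Knowledge creation significantly influences knowledge sharing, and both knowledge creation and knowledge sharing significantly influence performance. Based on these results, the study proposes ways to promote knowledge creation and sharing.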

Abstract

The purpose of this study is to examine the causal relationships among knowledge management (KM) influence factors (leadership, learning culture, members' attitude, reward, knowledge quality, KMS quality), KM activities (knowledge creation, knowledge sharing), and KM performance in a public enterprise. The results of data analysis by structural equation modeling (SEM) indicate that learning culture, knowledge quality, and KMS quality significantly influence knowledge creation and sharing. Members' attitude has a significant influence only on knowledge creation. Knowledge creation and sharing both have a significant effect on KM performance. Based on the results, the potential implications for an effective knowledge management strategy are discussed.

60

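Analyzing Research Topics in Information Literacy Education with Topic Modeling and Proposing Educational Content

윤지혜 (Department of Education, Hannam University) ; 정유경 (Hannam University) 2022, Vol.39, No.4, pp.1-21 https://doi.org/10.3743/KOSIM.2022.39.4.001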

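Abstract (Korean, translated)

The purpose of this study is to identify research topics in the field of information literacy education and to examine whether they yield elements that can be proposed for information literacy education. First, topic modeling was applied to 97 library and information science papers related to information literacy education, identifying major topics such as ‘media literacy’, ‘teaching and learning methods for information literacy education’, and ‘use and evaluation of information sources’. Based on the results, content for information literacy education was proposed, drawing on the Big 6 information model for educational content and, for teaching methods, on the information literacy standards of the American Association of School Librarians and the Association of College and Research Libraries' guidelines for developing information literacy instruction. The study is significant in that it identified educational elements linked to research topics across information literacy education and, on that basis, placed greater emphasis on content related to ‘use and evaluation of information sources’ and ‘information ethics’.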

Abstract

The aim of this study is to identify research topics and suggest an information literacy curriculum by analyzing research articles on information literacy. For this purpose, we applied topic modeling to 97 scholarly articles and identified the core contents of information literacy education, such as media literacy, information literacy instruction, and the use of information resources. Based on the analysis results, we suggested an information literacy curriculum by considering the Big 6 model, the information literacy standards of the American Association of School Librarians, and the Association of College and Research Libraries' information literacy competencies. This study is significant in that it emphasized ‘use of information resources’ and ‘information ethics’ in suggesting content for information literacy education.
