정보관리학회지, 한국정보관리학회

1

박서정(연세대학교 문헌정보학과) ; 이수빈(연세대학교 문헌정보학과) ; 김우정(연세대학교 의과대학 용인세브란스병원 정신건강의학교실) ; 송민(연세대학교 문헌정보학과) 2022, Vol.39, No.1, pp.91-117 https://doi.org/10.3743/KOSIM.2022.39.1.091

초록보기

초록

국내를 비롯하여 전 세계적으로 우울증 환자 수가 매년 증가하는 추세이다. 그러나 대다수의 정신질환 환자들은 자신이 질병을 앓고 있다는 사실을 인식하지 못해서 적절한 치료가 이루어지지 않고 있다. 우울 증상이 방치되면 자살과 불안, 기타 심리적인 문제로 발전될 수 있기에 우울증의 조기 발견과 치료는 정신건강 증진에 있어 매우 중요하다. 이러한 문제점을 개선하기 위해 본 연구에서는 한국어 소셜 미디어 텍스트를 활용한 딥러닝 기반의 우울 경향 모델을 제시하였다. 네이버 지식인, 네이버 블로그, 하이닥, 트위터에서 데이터 수집을 한 뒤 DSM-5 주요 우울 장애 진단 기준을 활용하여 우울 증상 개수에 따라 클래스를 구분하여 주석을 달았다. 이후 구축한 말뭉치의 클래스 별 특성을 살펴보고자 TF-IDF 분석과 동시 출현 단어 분석을 실시하였다. 또한, 다양한 텍스트 특징을 활용하여 우울 경향 분류 모델을 생성하기 위해 단어 임베딩과 사전 기반 감성 분석, LDA 토픽 모델링을 수행하였다. 이를 통해 문헌 별로 임베딩된 텍스트와 감성 점수, 토픽 번호를 산출하여 텍스트 특징으로 사용하였다. 그 결과 임베딩된 텍스트에 문서의 감성 점수와 토픽을 모두 결합하여 KorBERT 알고리즘을 기반으로 우울 경향을 분류하였을 때 가장 높은 정확률인 83.28%를 달성하는 것을 확인하였다. 본 연구는 다양한 텍스트 특징을 활용하여 보다 성능이 개선된 한국어 우울 경향 분류 모델을 구축함에 따라, 한국 온라인 커뮤니티 이용자 중 잠재적인 우울증 환자를 조기에 발견해 빠른 치료 및 예방이 가능하도록 하여 한국 사회의 정신건강 증진에 도움을 줄 수 있는 기반을 마련했다는 점에서 의의를 지닌다.

Abstract

The number of depressed patients in Korea and around the world is rapidly increasing every year. However, most of the mentally ill patients are not aware that they are suffering from the disease, so adequate treatment is not being performed. If depressive symptoms are neglected, it can lead to suicide, anxiety, and other psychological problems. Therefore, early detection and treatment of depression are very important in improving mental health. To improve this problem, this study presented a deep learning-based depression tendency model using Korean social media text. After collecting data from Naver KonwledgeiN, Naver Blog, Hidoc, and Twitter, DSM-5 major depressive disorder diagnosis criteria were used to classify and annotate classes according to the number of depressive symptoms. Afterwards, TF-IDF analysis and simultaneous word analysis were performed to examine the characteristics of each class of the corpus constructed. In addition, word embedding, dictionary-based sentiment analysis, and LDA topic modeling were performed to generate a depression tendency classification model using various text features. Through this, the embedded text, sentiment score, and topic number for each document were calculated and used as text features. As a result, it was confirmed that the highest accuracy rate of 83.28% was achieved when the depression tendency was classified based on the KorBERT algorithm by combining both the emotional score and the topic of the document with the embedded text. This study establishes a classification model for Korean depression trends with improved performance using various text features, and detects potential depressive patients early among Korean online community users, enabling rapid treatment and prevention, thereby enabling the mental health of Korean society. It is significant in that it can help in promotion.

2

딥러닝 기반의 BERT 모델을 활용한 학술 문헌 자동분류

김인후(중앙대학교 문헌정보학과 대학원) ; 김성희(중앙대학교 문헌정보학과) 2022, Vol.39, No.3, pp.293-310 https://doi.org/10.3743/KOSIM.2022.39.3.293

초록보기

초록

본 연구에서는 한국어 데이터로 학습된 BERT 모델을 기반으로 문헌정보학 분야의 문서를 자동으로 분류하여 성능을 분석하였다. 이를 위해 문헌정보학 분야의 7개 학술지의 5,357개 논문의 초록 데이터를 학습된 데이터의 크기에 따라서 자동분류의 성능에 어떠한 차이가 있는지를 분석, 평가하였다. 성능 평가척도는 정확률(Precision), 재현율(Recall), F 척도를 사용하였다. 평가결과 데이터의 양이 많고 품질이 높은 주제 분야들은 F 척도가 90% 이상으로 높은 수준의 성능을 보였다. 반면에 데이터 품질이 낮고 내용적으로 다른 주제 분야들과 유사도가 높고 주제적으로 확실히 구별되는 자질이 적을 경우 유의미한 높은 수준의 성능 평가가 도출되지 못하였다. 이러한 연구는 미래 학술 문헌에서 지속적으로 활용할 수 있는 사전학습모델의 활용 가능성을 제시하기 위한 기초자료로 활용될 수 있을 것으로 기대한다.

Abstract

In this study, we analyzed the performance of the BERT-based document classification model by automatically classifying documents in the field of library and information science based on the KoBERT. For this purpose, abstract data of 5,357 papers in 7 journals in the field of library and information science were analyzed and evaluated for any difference in the performance of automatic classification according to the size of the learned data. As performance evaluation scales, precision, recall, and F scale were used. As a result of the evaluation, subject areas with large amounts of data and high quality showed a high level of performance with an F scale of 90% or more. On the other hand, if the data quality was low, the similarity with other subject areas was high, and there were few features that were clearly distinguished thematically, a meaningful high-level performance evaluation could not be derived. This study is expected to be used as basic data to suggest the possibility of using a pre-trained learning model to automatically classify the academic documents.

3

BERTopic을 활용한 불면증 소셜 데이터 토픽 모델링 및 불면증 경향 문헌 딥러닝 자동분류 모델 구축

고영수(연세대학교 문헌정보학과 석사과정) ; 이수빈(연세대학교 문헌정보학과 박사과정) ; 차민정(연세대학교 소셜오믹스 연구센터) ; 김성덕(연세대학교 문헌정보학과 석사과정) ; 이주희(연세대학교 문헌정보학과 석사과정) ; 한지영(연세대학교 문헌정보학과 석사과정) ; 송민(연세대학교 문헌정보학과) 2022, Vol.39, No.2, pp.111-129 https://doi.org/10.3743/KOSIM.2022.39.2.111

초록보기

초록

불면증은 최근 5년 새 환자가 20% 이상 증가하고 있는 현대 사회의 만성적인 질병이다. 수면이 부족할 경우 나타나는 개인 및 사회적 문제가 심각하고 불면증의 유발 요인이 복합적으로 작용하고 있어서 진단 및 치료가 중요한 질환이다. 본 연구는 자유롭게 의견을 표출하는 소셜 미디어 ‘Reddit’의 불면증 커뮤니티인 ‘insomnia’를 대상으로 5,699개의 데이터를 수집하였고 이를 국제수면장애분류 ICSD-3 기준과 정신의학과 전문의의 자문을 받은 가이드라인을 바탕으로 불면증 경향 문헌과 비경향 문헌으로 태깅하여 불면증 말뭉치를 구축하였다. 구축된 불면증 말뭉치를 학습데이터로 하여 5개의 딥러닝 언어모델(BERT, RoBERTa, ALBERT, ELECTRA, XLNet)을 훈련시켰고 성능 평가 결과 RoBERTa가 정확도, 정밀도, 재현율, F1점수에서 가장 높은 성능을 보였다. 불면증 소셜 데이터를 심층적으로 분석하기 위해 기존에 많이 사용되었던 LDA의 약점을 보완하며 새롭게 등장한 BERTopic 방법을 사용하여 토픽 모델링을 진행하였다. 계층적 클러스터링 분석 결과 8개의 주제군(‘부정적 감정’, ‘조언 및 도움과 감사’, ‘불면증 관련 질병’, ‘수면제’, ‘운동 및 식습관’, ‘신체적 특징’, ‘활동적 특징’, ‘환경적 특징’)을 확인할 수 있었다. 이용자들은 불면증 커뮤니티에서 부정 감정을 표현하고 도움과 조언을 구하는 모습을 보였다. 또한, 불면증과 관련된 질병들을 언급하고 수면제 사용에 대한 담론을 나누며 운동 및 식습관에 관한 관심을 표현하고 있었다. 발견된 불면증 관련 특징으로는 호흡, 임신, 심장 등의 신체적 특징과 좀비, 수면 경련, 그로기상태 등의 활동적 특징, 햇빛, 담요, 온도, 낮잠 등의 환경적 특징이 확인되었다.

Abstract

Insomnia is a chronic disease in modern society, with the number of new patients increasing by more than 20% in the last 5 years. Insomnia is a serious disease that requires diagnosis and treatment because the individual and social problems that occur when there is a lack of sleep are serious and the triggers of insomnia are complex. This study collected 5,699 data from ‘insomnia’, a community on ‘Reddit’, a social media that freely expresses opinions. Based on the International Classification of Sleep Disorders ICSD-3 standard and the guidelines with the help of experts, the insomnia corpus was constructed by tagging them as insomnia tendency documents and non-insomnia tendency documents. Five deep learning language models (BERT, RoBERTa, ALBERT, ELECTRA, XLNet) were trained using the constructed insomnia corpus as training data. As a result of performance evaluation, RoBERTa showed the highest performance with an accuracy of 81.33%. In order to in-depth analysis of insomnia social data, topic modeling was performed using the newly emerged BERTopic method by supplementing the weaknesses of LDA, which is widely used in the past. As a result of the analysis, 8 subject groups (‘Negative emotions’, ‘Advice and help and gratitude’, ‘Insomnia-related diseases’, ‘Sleeping pills’, ‘Exercise and eating habits’, ‘Physical characteristics’, ‘Activity characteristics’, ‘Environmental characteristics’) could be confirmed. Users expressed negative emotions and sought help and advice from the Reddit insomnia community. In addition, they mentioned diseases related to insomnia, shared discourse on the use of sleeping pills, and expressed interest in exercise and eating habits. As insomnia-related characteristics, we found physical characteristics such as breathing, pregnancy, and heart, active characteristics such as zombies, hypnic jerk, and groggy, and environmental characteristics such as sunlight, blankets, temperature, and naps.

4

대학생의 학습공간 선택에 영향을 미치는 요인에 관한 연구: 대학도서관의 효과적인 공간 구성을 위한 제언

이나리(연세대학교 교육대학원 사서교육전공) ; 박지홍(연세대학교) 2022, Vol.39, No.2, pp.61-86 https://doi.org/10.3743/KOSIM.2022.39.2.061

초록보기

초록

본 연구의 목적은 물리적 환경의 품질을 측정하는 도구로서의 서비스스케이프 개념을 이용하여 학습공간의 서비스스케이프 요인이 이용자 만족과 지속의도에 미치는 영향과 학습활동유형의 조절효과를 확인하는데 있다. 선행연구 및 심층면담을 통해 청결성, 쾌적성, 편의성, 심미성, 접근성, 유연성의 6개의 학습공간 서비스스케이프 요인을 선정하였고, 수도권 지역의 대학생을 대상으로 설문조사를 진행하였다. 연구 결과, 청결성, 쾌적성, 편의성과 접근성 요인이 이용자 만족에 유의한 영향을, 이용자 만족은 지속의도에 유의한 영향을 미치는 것으로 나타났다. 또한 학습활동유형은 청결성, 쾌적성 요인과 이용자 만족 관계에 부(-)적 조절효과가 있는 것으로 나타났다. 본 연구는 학습공간으로의 대학도서관에 대한 이용자 만족을 높이는 물리적인 환경 구성의 기초자료를 제공하였다는데 의의를 둔다.

Abstract

The purpose of this study is to investigate the effect of learning space Servicescape on the user satisfaction level and continuance intention and to identify moderating effect of the learning activity. The six Servicescape factors are selected after literature review and in-depth interviews; cleanliness, comfort, convenience, aesthetics, accessibility, and flexibility. The online survey is given to the university students at four-year private universities in Seoul metropolitan area. The result shows that among the learning space Servicescape factors, cleanliness, comfort, convenience, and accessibility have a significant impact on the user’s satisfaction and the user’s satisfaction response determines the continuance intention to the learning space. It is also found that the factors of cleanliness and comfort have a negative moderating effect on user satisfaction. This study implies that the result provides methods to develop the space arrangement for university libraries that provide the better-support to students’ learning experience.

5

데이터사이언스 관련 교과목의 강의 계획서 분석: ALA의 인가를 받은 문헌정보학 프로그램을 중심으로

박형주(충남대학교 문헌정보학과) 2022, Vol.39, No.1, pp.119-143 https://doi.org/10.3743/KOSIM.2022.39.1.119

초록보기

초록

본 연구는 미국도서관협회(American Library Association, ALA)의 인가를 받은 문헌정보학 프로그램에서 제공하는 데이터사이언스와 관련된 수업의 내용을 조사했다. 연구의 목적은 강의 계획서의 내용 분석을 통해 해당 수업에서 다뤄지는 교과목 명, 교과 설명, 학습 목표, 주차 별 주제를 살펴보는 것이다. 문헌정보학 프로그램에서의 데이터사이언스와 관련된 필수 과목 및 선택 과목은, 데이터사이언스 개론, 데이터 마이닝, 데이터베이스, 데이터 분석, 데이터 시각화, 데이터 큐레이션 및 관리, 머신 러닝, 메타데이터, 컴퓨터 프로그래밍 등 데이터사이언스 전 분야에 걸쳐 다양하게 교과목이 개설되어 있었다. 본 연구의 결과는 문헌정보학 프로그램에서 데이터사이언스 교과 과정을 개설 및 개정할 때 논의의 시작점이 될 수 있는 기초 자료가 되어 운영 역량을 강화하는데 활용되기를 기대한다.

Abstract

This preliminary study examined the status of data science-related course syllabi in the American Library Association (ALA) accredited Library and Information Science (LIS) programs. The purpose of this study was to explore LIS course syllabi related to data science, such as course title, course description, learning outcomes, and weekly topics. LIS programs offer various topics in data science such as the introduction to data science, data mining, database, data analysis, data visualization, data curation and management, machine learning, metadata, and computer programming. This study contributes to helping instructors develop or revise course materials to improve course competencies related to data science in the ALA-accredited LIS programs.

6

인천글로벌캠퍼스도서관 종합운영계획 수립을 위한 인식조사 연구

곽승진(충남대학교 문헌정보학과) ; 노영희(건국대학교 문헌정보학과) ; 고재민(수원과학대학교 융합디자인학부 실내건축디자인학과) ; 강봉숙(청주대학교 문헌정보학과) ; 김정택(배재대학교 학술정보팀) 2022, Vol.39, No.2, pp.255-273 https://doi.org/10.3743/KOSIM.2022.39.2.255

초록보기

초록

본 연구는 4차 산업혁명과 포스트 코로나 시대를 대비한 인천글로벌캠퍼스 도서관의 종합운영계획 수립을 위한 기초연구로, 인천글로벌캠퍼스 학생을 대상으로 현재 도서관의 현황 및 만족도, 수요도 조사를 실시하고 이를 기반으로 향후 종합운영계획 수립 방향을 제안하고자 하였다. 연구 결과, 첫째 장서의 경우 지속적인 장서 확충을 위한 중장기적 계획이 필요하며, 특히 전공 관련 장서 확충이 필요할 것으로 보인다. 둘째 서비스의 경우, 이용자들의 연구단계별 맞춤형 정보서비스, 연구자 맞춤정보서비스, 학과전담 연구지원서비스 등을 제공하여 이용자들의 연구를 지원해야 하며, 정보활용교육프로그램 제공이 필요하다. 셋째, 공간 부분의 경우, IGC 이용자들은 학습 및 연구 공간에 대한 요구도가 매우 높게 나타나, 향후 공간 재구성을 통해 이용자들이 원하는 교육 및 관련 공간 개선이 필요하며, 학습 공간 확대와 더불어 토론 및․협업 활동, 세미나, 휴식 등 학생들의 삶과 밀접한 장소로서의 창의 협업 공간 확대도 필요하다. 마지막으로 홈페이지 및 정보시스템과 관련하여 홈페이지 활성화를 위해서는 가장 우선적으로 홈페이지 내에 이용자들이 원하는 다양한 콘텐츠 및 최신성을 갖춘 자료 확충이 필요하며, 이와 함께 국내 전자저널 및 DB 제공 계획이 이루어져야 할 것으로 보인다.

Abstract

This study is a basic study for establishing a comprehensive operation plan for the Incheon Global Campus Library in preparation for the 4th Industrial Revolution and the post-corona era. Based on this, it was intended to propose a direction for establishing a comprehensive operation plan in the future. As a result of the study, in the case of the first collection, a mid- to long-term plan for continuous expansion of the collection is required, and in particular, it seems that the expansion of major-related collections is necessary. In the case of the second service, it is necessary to support users’ research by providing information services customized for each stage of research by users, information services customized for researchers, and research support services for departments, and it is necessary to provide information utilization education programs. Third, in the case of space, IGC users have very high demands for learning and research space, so it is necessary to improve education and related spaces that users want through space reorganization in the future. It is also necessary to expand the creative collaboration space as a place closely related to the lives of students, such as rest, etc. Lastly, in order to activate the homepage in relation to the homepage and information system, it is necessary to first expand the various contents and up-to-date data that users want on the homepage. In addition, it seems that the domestic electronic journal and DB provision plan should be implemented.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지