바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

An Experimental Study on the Automatic Classification of Korean Journal Articles through Feature Selection

Journal of the Korean Society for Information Management / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2022, v.39 no.1, pp.69-90
https://doi.org/10.3743/KOSIM.2022.39.1.069
Pan Jun Kim
  • Downloaded
  • Viewed

Abstract

As basic data that can systematically support and evaluate R&D activities as well as set current and future research directions by grasping specific trends in domestic academic research, I sought efficient ways to assign standardized subject categories (control keywords) to individual journal papers. To this end, I conducted various experiments on major factors affecting the performance of automatic classification, focusing on feature selection techniques, for the purpose of automatically allocating the classification categories on the National Research Foundation of Korea’s Academic Research Classification Scheme to domestic journal papers. As a result, the automatic classification of domestic journal papers, which are imbalanced datasets of the real environment, showed that a fairly good level of performance can be expected using more simple classifiers, feature selection techniques, and relatively small training sets.

keywords
자동분류, 텍스트 범주화, 자질선정, 필터, 학술지 논문, automatic classification, text categorization, feature selection, filter, journal articles
Submission Date
2022-02-14
Revised Date
2022-02-24
Accepted Date
2022-03-04

Journal of the Korean Society for Information Management