정보관리학회지, 한국정보관리학회

1

배영활(계명대학교) ; 오동근(계명대학교) 2002, Vol.19, No.4, pp.53-76 https://doi.org/10.3743/KOSIM.2002.19.4.053

초록보기

초록

이 연구는 특정 주제 분야를 사회학적 관련 이론 및 기법을 적용하여 고찰하고 있는 문헌의 분류를 효율적으로 수행하기 위해 국내에서 가장 많이 사용되고 있는 한국십진분류법(KDC)과 듀이십진분류법(DDC)의 특수사회학 분류항목들을 비교 분석하였다. 특히 특수사회학 관련 분류항목들을 종교, 예체능, 과학, 언어, 사회, 지역 등 6개 분야로 구분하여 분속한 결과 분류항목 설정이 상이함으로 인한 문제점과 분류항목이 설정되지 않음으로 인한 문제점이 나타났다. 이를 토대로 이 연구에서는 특정 주제의 이론 및 기법 적용에 일관성을 기하도록 하고 분류번호 부여를 위한 추가의 항목 전개 등 분류 실무자의 판단을 도와주는 한편 나아가 KDC의 제5판 개정에 일조하는데 그 의의를 두었다.

Abstract

This study compares and analyzes the classes in the major special areas in the sociology, called "branch sociology" included in the Korean Decimal Classification 4th edition and Dewey Decimal Classification 21st edition. Especially it analyzes the related classes of specified areas(branch sociology) of sociology including those of arts and sports, sciences, languages, society, region, etc. class by class. In this analysis two systems show many differences in the classes included and in the locations of same classes. This analysis can be useful for the future revision of KDC.

2

메타데이터를 활용한 기록물 자동분류 성능 요소 비교

김영범(전남대학교 대학원 기록관리학 석사) ; 장우권(전남대학교 문헌정보학과 교수) 2023, Vol.40, No.3, pp.99-118 https://doi.org/10.3743/KOSIM.2023.40.3.099

초록보기

초록

이 연구의 목적은 기록물의 맥락정보를 담고 있는 메타데이터를 활용하여 기록물 자동분류 과정에서의 성능요소를 파악하는데 있다. 연구를 위해 2022년 중앙행정기관 원문정보 약 97,064건을 수집하였다.수집한 데이터를 대상으로 다양한 분류 알고리즘과 데이터선정방법, 문헌표현기법을 적용하고 그 결과를 비교하여 기록물 자동 분류를 위한 최적의 성능요소를 파악하고자 하였다. 연구 결과 분류 알고리즘으로는 Random Forest가, 문헌표현기법으로는 TF 기법이 가장 높은 성능을 보였으며, 단위과제의 최소데이터 수량은 성능에 미치는 영향이 미미하였고 자질은 성능변화에 명확한 영향을 미친다는 것이 확인되었다.

Abstract

The objective of this study is to identify performance factors in the automatic classification of records by utilizing metadata that contains the contextual information of records. For this study, we collected 97,064 records of original textual information from Korean central administrative agencies in 2022. Various classification algorithms, data selection methods, and feature extraction techniques are applied and compared with the intent to discern the optimal performance-inducing technique. The study results demonstrated that among classification algorithms, Random Forest displayed higher performance, and among feature extraction techniques, the TF method proved to be the most effective. The minimum data quantity of unit tasks had a minimal influence on performance, and the addition of features positively affected performance, while their removal had a discernible negative impact.

3

음식문화 분야 인터넷자원 분류체계 분석을 통한 한국십진분류법의 항목명 확장에 관한 연구

정연경(이화여자대학교) ; 이미화(한성대학교) 2010, Vol.27, No.4, pp.49-69 https://doi.org/10.3743/KOSIM.2010.27.4.049

초록보기

초록

듀이십진분류법(DDC)은 문헌분류체계로 도서관에서 뿐만 아니라 인터넷자원을 분류하는 기반으로 사용되고 있는데, 이는 DDC가 주기적이며 지속적인 용어 확장을 통해 최신성과 실용성을 유지하기 때문이다. 반면, 한국십진분류법(KDC)은 비정기적인 개정 주기로, 용어의 최신성과 실용성이 떨어진다. KDC가 도서관뿐만 아니라 인터넷자원 분류에도 활용 가능하기 위해서는 실용적인 분류 항목명이 반영되어야 한다. 본 연구에서는 인터넷 자원의 디렉토리 분류체계와 KDC에서 사용하고 있는 분류항목명을 비교 분석하고 KDC에 추가할만한 분류항목명을 확장 제안하였다. 네이버, 야후, 교보문고, 아마존의 디렉토리 분류체계에서 음식문화 분야의 용어를 분석하였으며, 다른 분류체계를 참조하여 KDC로의 적용 방안을 제안하였다. KDC에 추가적인 분류항목명이 필요한 분야는 식품위생, 음료기술, 식품공학, 식품과 음료, 식사 및 식탁차림, 주방, 식당 공간이었으며 부족한 항목명은 음식 관련 용어 및 한식 관련 요리명이 주를 이루었다. 본 연구를 통해 KDC의 부족한 항목명과 적용방안을 제시함으로써 KDC가 도서관과 인터넷자원 분류에 활용될 수 있는 기반을 마련하였다.

Abstract

Library classification system is based upon academic disciplines, However, it is difficult to classify for Internet resources due to its lack of up-to-datedness and practicality. Especially, headings of Korean Decimal Classification need to reflect practical aspects and it should be also developed for classification of web based resources. The purposes of this study are to analyze the structures of directory classifications in Internet resources and to suggest additional headings of KDC as a practical library classification as well as a classification system for internet resources. Directory classification systems of Naver, Yahoo, Kyobo Internet book store, Amazon were selected and their food and culture subjects were analyzed for this study. The headings of KDC were compared to them and new possible headings were suggested with reference of NDC and DDC in the area of food and culture. This study provided a way of developing KDC for a classification system for Internet resources as well as library materials.

4

용어관계의 분류 모형 개발에 관한 연구

백지원(이화여자대학교) ; 정연경(이화여자대학교) 2006, Vol.23, No.1, pp.63-81 https://doi.org/10.3743/KOSIM.2006.23.1.063

초록보기

초록

본 연구는 모든 지식조직체계의 근간인 용어관계가 동일성, 계층성, 연관성이라는 세 가지 포괄적인 기준에 의해 정의되어 사용됨으로써 정보의 정확성이 중시되는 오늘날의 정보 환경에서 제 기능을 다하지 못하고 있으므로, 그 해결 방안의 하나로 용어관계의 분류 모형을 제시하고자 하는데 목적이 있다. 이를 위해 기존의 여러 지식조직체계에 나타나는 각종 용어관계의 사례와 용어관계에 대한 이론적 연구들을 광범위하게 수집하여 다양한 용어관계의 유형을 파악하였다. 그리고 이를 바탕으로 용어관계를 명확하게 정의하고 범주화할 수 있는 용어관계 분류의 근거를 세우고 용어관계의 분류 모형을 개발하였다. 더 나아가 이 분류 모형을 정보검색을 비롯한 다양한 방면에 활용할 수 있는 방안을 모색하고 향후 용어관계 분류 연구에 대한 제언을 했다.용어관계의 분류 모형 개발에 관한 연구*

Abstract

The purpose of this study is to present the limitation of terminological relationships in the current information environment and to propose a solution to result the richer and refined terminological resources. For this, various kinds of terminological relationships in knowledge organization systems and theoretical researches were collected and analyzed. Based upon the analysis, a methodology for classification of terminological relationships was suggested and classification models were presented. Additionally, four suggestions were made for the practical uses of the classification models.

5

KDC 광고분야의 분류체계 개선에 관한 연구

김정현(전남대학교) ; 배주연(전남대학교) 2005, Vol.22, No.4, pp.5-22 https://doi.org/10.3743/KOSIM.2005.22.4.005

초록보기

초록

광고산업과 미디어의 발전에 따라 광고물의 증가뿐만 아니라 광고와 관련한 많은 연구가 이루어지고 있으며, 그 결과 광고 관련 자료가 꾸준히 증가하고 있는 추세이다. 그런데 KDC 광고분야의 분류체계를 살펴보면 광고관련 분류항목이 너무나 미비하며, 더욱이 주류 배치에 있어 관련 학문과의 연관성을 고려하지 않아 사서나 이용자 모두에게 혼란을 초래하고 있다. 이러한 점을 감안하여 본 연구에서는 광고의 유형 및 학문적 특성에 대해 고찰하고, KDC를 비롯한 NDC, DDC, LCC와 같은 문헌분류법 및 광고 전문도서관, 그리고 인터넷 포털사이트와 인터넷서점 등의 분류체계를 분석함으로써 KDC 광고분야의 분류체계에 대한 문제점을 알아보고 이에 대한 개선방안을 제시하였다.

Abstract

As the development of advertising industry and media the research about an advertisement get accomplished. As the result information resources called on the advertising materials are on an increasing trend. However, it looks into the classification system in advertising field of KDC, the problems are as the follows: ① the classification items are too incomplete, ② the main class is badly arranged. The reason have no regard for the correlation with a science. So, it gives rise to confusion to the librarian and user. The purpose of the study is to present the improvement plan on the classification system in advertising field of KDC. In order to build the improvement plan, the four steps are utilized. The first step is to investigate the characteristic of sciences on advertising and a type. The second one is to survey the current status of the library classification as KDC, NDC, DDC, and LCC. The third one is to analyse the classification system of library and web site on the advertising. The forth one is to grasp the problems on the classification system in advertising field of KDC.

6

문헌간 유사도를 이용한 자동분류에서 미분류 문헌의 활용에 관한 연구

김판준(신라대학교) ; 이재윤(경기대학교) 2007, Vol.24, No.1, pp.251-271 https://doi.org/10.3743/KOSIM.2007.24.1.251

초록보기

초록

문헌간 유사도를 자질로 사용하는 분류기에서 미분류 문헌을 학습에 활용하여 분류 성능을 높이는 방안을 모색해보았다. 자동분류를 위해서 다량의 학습문헌을 수작업으로 확보하는 것은 많은 비용이 들기 때문에 미분류 문헌의 활용은 실용적인 면에서 중요하다. 미분류 문헌을 활용하는 준지도학습 알고리즘은 대부분 수작업으로 분류된 문헌을 학습데이터로 삼아서 미분류 문헌을 분류하는 첫 번째 단계와, 수작업으로 분류된 문헌과 자동으로 분류된 문헌을 모두 학습 데이터로 삼아서 분류기를 학습시키는 두 번째 단계로 구성된다. 이 논문에서는 문헌간 유사도 자질을 적용하는 상황을 고려하여 두 가지 준지도학습 알고리즘을 검토하였다. 이중에서 1단계 준지도학습 방식은 미분류 문헌을 문헌유사도 자질 생성에만 활용하므로 간단하며, 2단계 준지도학습 방식은 미분류 문헌을 문헌유사도 자질 생성과 함께 학습 예제로도 활용하는 알고리즘이다. 지지벡터기계와 나이브베이즈 분류기를 이용한 실험 결과, 두 가지 준지도학습 방식 모두 미분류 문헌을 활용하지 않는 지도학습 방식보다 높은 성능을 보이는 것으로 나타났다. 특히 실행효율을 고려한다면 제안된 1단계 준지도학습 방식이 미분류 문헌을 활용하여 분류 성능을 높일 수 있는 좋은 방안이라는 결론을 얻었다

Abstract

This paper studies the problem of classifying documents with labeled and unlabeled learning data, especially with regards to using document similarity features. The problem of using unlabeled data is practically important because in many information systems obtaining training labels is expensive, while large quantities of unlabeled documents are readily available. There are two steps in general semi-supervised learning algorithm. First, it trains a classifier using the available labeled documents, and classifies the unlabeled documents. Then, it trains a new classifier using all the training documents which were labeled either manually or automatically. We suggested two types of semi-supervised learning algorithm with regards to using document similarity features. The one is one step semi-supervised learning which is using unlabeled documents only to generate document similarity features. And the other is two step semi-supervised learning which is using unlabeled documents as learning examples as well as similarity features. Experimental results, obtained using support vector machines and naive Bayes classifier, show that we can get improved performance with small labeled and large unlabeled documents then the performance of supervised learning which uses labeled-only data. When considering the efficiency of a classifier system, the one step semi-supervised learning algorithm which is suggested in this study could be a good solution for improving classification performance with unlabeled documents.

7

브라운의 주제분류법 연구

곽철완(강남대학교) 2018, Vol.35, No.4, pp.37-50 https://doi.org/10.3743/KOSIM.2018.35.4.037

초록보기

초록

이 연구의 목적은 브라운의 주제분류법 초판을 분석하여 오늘의 분류법 연구에 대한 시사점을 파악하는 것이다. 이를 위해 1906년에 발표한 주제분류법 초판을 분석 대상으로 삼았다. 분석 결과는 다음과 같다. 첫째, 분류체계의 구성에서 주제분류법의 주류는 크게 11가지로 구분되며, 각 주류는 000에서 999로 세분되어 열거식으로 나열되었다. 둘째, 분류기호 합성 방법은 크게 3가지가 있다. 셋째, 새로운 주제 처리 방법으로 본표에 없는 새로운 주제가 나타나면 적절한 위치에 새로운 분류기호를 삽입할 수 있는 유연성이 있었다. 분류법 연구에 대한 시사점은 크게 네 가지로 구분할 수 있다. 첫째, 이전의 분류법에는 없었던 혁신적인 방법인 복합 주제에 대한 분류기호 합성 방법을 제시하였다. 둘째, 패싯을 지원하는 보조표 운영을 통하여 주제를 다양한 측면에서 설명하였다. 셋째, 자관별로 유연한 분류체계를 가질 수 있도록 한 분류법으로 분류체계에 새로운 주제를 쉽게 삽입할 수 있거나 도서관 장서 규모에 따라 간략한 분류기호를 사용할 수 있도록 하였다. 넷째, 디지털 자료에 대한 접근점으로 고려할 수 있는 디렉토리를 제공하였다.

Abstract

The purpose of this study is to analyze the first edition of Brown’s Subject Classification and to understand the implications of today’s library classification. For this purpose, the first edition of the Subject Classification published in 1906 was analyzed. The analysis results are divided into three main areas. First, SC is divided into eleven main classes and each class is subdivided into enumerated subdivisions from 000 to 999. Second, As a method of synthesizing the classification numbers, there were three methods of synthesis. There was a flexibility to insert a new classification number at the appropriate location when a new topic that does not appear in the main table appeared. Implications for classification studies can be divided into four main categories. First, SC proposed a method of classification number synthesis for composite topics, which is an innovative method that was not available in previous library classification. Second, the subject matter was explained in various aspects through the operation of auxiliary tables supporting the facets. Third, it is possible to easily insert a new topic into the classification system by using the SC that can have a flexible classification system for each library, or to use a short classification number according to the size of the library collection. Fourth, it provided a directory that can be considered as access points for digital materials.

8

건설기록물 분류체계 모형에 관한 연구

박용부(PAX자료정보연구소) ; 김태수(연세대학교) 2011, Vol.28, No.3, pp.83-101 https://doi.org/10.3743/KOSIM.2011.28.3.083

초록보기

초록

국제 표준인 ISO 15489와 관련코드에 따르면 공공조직이든 민간조직이든지를 막론하고 안정적인 기록물 분류체계를 구현하게 하는 기능 분류 사용을 권고하고 있다. 기업에서도 이를 따라 업무 수행 기록물을 분류․축적하여 검색․활용할 수 있는 체계를 구축하는 것은 기업 성장을 위해서 중요하다. 따라서 기업의 기록물 분류체계 개발에 적용할 수 있는 분류기준이나 개발 방법론 연구 및 모형의 연구가 반드시 필요하다. 본 연구에서는 우리나라 대기업 3개 회사와 중소기업 4개 회사 등 총 7개의 종합건설기업의 기록물 분류체계 사례연구를 통하여 분류체계의 내용을 비교 분석하였다. 사례연구를 통하여 도출한 분류원칙을 정리 종합하여 핵심적인 건설기록물 분류기준을 제시하고, 건설기록물 분류체계 모형을 도출하기 위하여 대기업 사례기업의 본사조직 및 프로젝트조직의 대분류 및 중분류 항목 구성을 상호 비교하면서 표준적인 본사조직 및 프로젝트조직 기록물 분류 항목을 도출하는 과정을 기술하고 그 결과로 개발된 건설기록물 분류체계 모형을 제시하였다.

Abstract

The international standards, ISO 15489 and Family Code, recommend using functional classification method both in public and private organizations. In this study made a comparative analysis of the details of classification systems through case studies on records classification systems of a total of seven comprehensive construction companies in Korea including three large corporations and four small and medium-size businesses. Findings of this study suggester the direction of developing construction records classification system and its methodology. By summarizing classification standards derived from these case studies, key construction records classification standards were presented.

9

한국십진분류법 웹 버전 개발을 위한 기능요건 연구

양정윤(부산대학교 대학원 문헌정보학과 석사졸업, 진주교육대학교 도서관 사서) 2023, Vol.40, No.4, pp.147-165 https://doi.org/10.3743/KOSIM.2023.40.4.147

초록보기

초록

4차 산업혁명을 대표하는 신기술들이 이미 도서관 서비스에 구현이 되고 있다. 그러나 전통적인 사서 업무이자 향후 지속해야 하는 ‘분류’ 업무에 새로운 기술을 도입하여 업무 효율을 증대하고자 하는 방안 연구는 활발하지 않다. 해외 웹 버전 분류법인 WebDewey, Classification Web, UDC Online은 2000년대 초반에 개발되어 현재는 인쇄본보다 웹 버전이 더 활발히 사용되고 있고, 2018년 이후 듀이십진분류법(DDC)은 더 이상 인쇄본을 발간하지 않고 있다. 본 연구는 WebDewey, Classification Web, UDC Online 사례를 분석하고, 한국십진분류법(KDC) 웹 버전 개발을 위해 필요한 기능을 도출하여, AHP 분석을 통해 KDC 웹 버전 개발에 타당한 최종적인 기능을 제안했다.

Abstract

New technologies representing the Fourth Industrial Revolution are already being realized in library services. There is not, however, active research on measures to increase work efficiency by introducing a new technology in the work of “classification” that is part of the traditional librarian jobs they should continue in the future. The Dewey Decimal Classification (DDC) has not issued a print version since 2018. This study analyzes cases of WebDewey, Classification Web, and UDC Online. The functions required for the development of the Korean Decimal Classification (KDC) web version were derived, and the final functions suitable for the development of the KDC web version were proposed through AHP analysis.

10

자동 분류 기법과 지적 구조 분석 기법을 융합한 처방적 분석 시스템 구현 방안 연구

정도헌(덕성여자대학교) 2017, Vol.34, No.4, pp.33-57 https://doi.org/10.3743/KOSIM.2017.34.4.033

초록보기

초록

본 연구는 새로운 분석법으로 떠오르는 처방적 분석 기법을 소개하고, 이를 분류 기반의 시스템에 효율적으로 적용하는 방안을 제시하는 것을 목적으로 한다. 처방적 분석 기법은 분석의 결과를 제시함과 동시에 최적화된 결과가 나오기까지의 과정 및 다른 선택지까지 제공한다. 새로운 개념의 분석 기법을 도입함으로써 문헌 분류를 기반으로 하는 응용 시스템을 더욱 쉽게 최적화하고 효율적으로 운영하는 방안을 제시하였다. 최적화의 과정을 시뮬레이션하기 위해, 대용량의 학술문헌을 수집하고 기준 분류 체계에 따라 자동 분류를 실시하였다. 처방적 분석 개념을 적용하는 과정에서 대용량의 문헌 분류를 위한 동적 자동 분류 기법과 학문 분야의 지적 구조 분석 기법을 동시에 활용하였다. 실험의 결과로 효과적으로 서비스 분류 체계를 수정하고 재적용할 수 있는 몇 가지 최적화 시나리오를 효율적으로 도출할 수 있음을 보여 주었다.

Abstract

This study aims to introduce an emerging prescriptive analytics method and suggest its efficient application to a category-based service system. Prescriptive analytics method provides the whole process of analysis and available alternatives as well as the results of analysis. To simulate the process of optimization, large scale journal articles have been collected and categorized by classification scheme. In the process of applying the concept of prescriptive analytics to a real system, we have fused a dynamic automatic-categorization method for large scale documents and intellectual structure analysis method for scholarly subject fields. The test result shows that some optimized scenarios can be generated efficiently and utilized effectively for reorganizing the classification-based service system.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지