정보관리학회지, 한국정보관리학회

51

양동민(전북대학교 기록관리학과) ; 최광훈(알엠소프트) ; 김지혜(전북대학교 기록관리학과 박사과정) ; 유남희(전북대학교 기록관리학과) 2023, Vol.40, No.4, pp.167-200 https://doi.org/10.3743/KOSIM.2023.40.4.167

초록보기

초록

국내 행정정보 데이터세트 기록관리에서는 행정정보 데이터세트를 이관할 때 이관규격으로 SIARD를 활용할 것을 권고하고 있다. 그러나 행정정보 데이터세트의 기록관리 단위, SIARD를 지원하는 도구의 기술적 한계, 공공기관의 현실적인 상황 등으로 인해 SIARD 적용이 적합하지 않은 경우가 다수 발생하고 있다. 본 연구에서는 SIARD 이외에 행정정보 데이터세트의 이관규격을 다양화하는 방안을 제안하고자 한다. 행정정보 데이터세트의 기록관리에서는 데이터세트와 연계된 사용자 인터페이스의 재현에 대한 필요성에 대한 논의는 지속되고 있지만 구체적으로 제시되고 있지 않다. 본 연구에서는 필수보존속성(Significant Properties) 관점에서 사용자 인터페이스도 함께 보존되어야 할 속성임을 확인하고, 사용자 인터페이스를 효과적으로 재현하는 방안을 제시하고, 실제 검증한 사례를 제공하고자 한다.

Abstract

For the record management of administrative information datasets in Korea, it is recommended to utilize SIARD as a transfer specification when transferring administrative information datasets. However, there are many cases where the application of SIARD is not suitable due to the record management unit of administrative information datasets, technical limitations of tools that support SIARD, and the realistic situation of public institutions. In this study, we propose a plan to diversify the transfer specifications of administrative information datasets other than SIARD. In the record management of administrative information datasets, the need to reproduce the user interface associated with the dataset has been discussed but not specifically presented. This study confirms that the user interface is a property to be preserved from the perspective of Significant Properties, proposes a method to effectively reproduce the user interface, and provides an example of actual verification.

52

동시출현단어분석을 통한 데이터과학 분야의 지적구조에 관한 연구

김현정(서울여자대학교) 2017, Vol.34, No.4, pp.101-126 https://doi.org/10.3743/KOSIM.2017.34.4.101

초록보기

초록

최근 문헌정보학의 관련 분야로 주목받고 있는 데이터과학은 오랫동안 문헌정보학에서 해오던 정보의 수집, 저장, 조직, 분석, 활용 등의 활동을 데이터에 적용하여 그 가치를 이해하려는 학문이며, 통계학과 컴퓨터공학 등 다른 학문분야와의 연계가 필요한 분야이다. 이러한 데이터과학 분야의 연구 영역을 파악하기 위하여 동시출현단어 분석을 사용하여 Web of Science 핵심컬렉션에 수록된 문헌들 중 데이터과학 관련 자료들을 수집하고, 그 주제범주를 활용하여 네트워크분석을 실시하였다. 총 667건의 자료에 대한 159개의 주제범주를 기술분석하여 데이터과학 관련 연구가 많이 이루어지고 있는 학문분야를 조사하였고, 네트워크분석을 통해 데이터과학 분야 연구영역의 지적구조를 시각적으로 파악하였다. 분석결과, 데이터과학 분야의 연구들은 2개 영역 9개 군집으로 구분되었으며, 주제범주의 용어들 중 중심성이 높은 용어들을 통해 각 군집의 대표적인 주제들을 선정하였다. 연구의 결과는 데이터과학 분야의 연구들에 대한 지적구조를 파악하는데 도움이 될 수 있고, 문헌정보학과의 연계융합전공으로서의 데이터과학 교과과정 개발에 방향성을 제시할 수도 있을 것이다.

Abstract

Data Science is emerging as a closely related field of study to Library and Information Science (LIS), and as an interdisciplinary subject combining LIS, statistics and computer science in an attempt to understand the value of data by applying what LIS has been doing for collecting, storing, organizing, analyzing, and utilizing information. To investigate which subject fields other than LIS, statistics, and computer science are related to Data Science, this study retrieved 667 materials from Web of Science Core Collection, extracted terms representing Web of Science Categories, examined subject fields that are studying Data Science using descriptive analysis, analyzed the intellectual structure of the field by co-word analysis and network analysis, and visualized the results as a Pathfinder network with clustering created with the PNNC clustering algorithm. The result of this study might help to understand the intellectual structure of the Data Science field, and may be helpful to give an idea for developing relatively new curriculum.

53

해외 비영리기관 소장 학술 데이터베이스 현황 조사 및 분석 연구

홍현진(전남대학교) ; 노영희(건국대학교) ; 정혜경(KDI국제정책대학원대학교) ; 이미영(성북정보문화센터) 2005, Vol.22, No.1, pp.87-104 https://doi.org/10.3743/KOSIM.2005.22.1.087

초록보기

초록

본 연구에서는 해외 비영리기관의 학술 데이터베이스를 도입하기 위해 학술 데이터베이스 현황을 조사하고 그 품질을 평가한 뒤, 실제적인 도입가능성과 방법을 제시한다. 특히 지금까지 국내에서 공동 활용이 불가능한 비영어권 국가의 해외 비영리기관 소장 학술 데이터베이스를 제공함으로써 기존의 학술 데이터베이스와는 다른 차원의 다양한 유형의 자료 발굴 및 자료 범위 확대를 목적으로 한다. 이러한 목적하에 진행된 본 연구는 지금까지 영리기관의 상용 데이터베이스에 거의 의존해왔던 해외 정보 자료수집과정을 저비용-고효율 구조로 개선시켜, 학술 연구의 생산성을 제고시킬 수 있을 것이다.

Abstract

The purpose of this study was to delve into the academic databases of overseas nonprofit organizations, to assess their quality and to discuss whether or not it's possible to introduce them in the nation and in which way that could be done. And it's also attempted to provide information on the academic databases of nonprofit organizations in nonEnglish-speaking countries in a bid to prepare a wide variety of academic materials about broader fields that would be distinguished from those offered by existing academic databases, since it's not currently possible to take advantage of academic materials possessed by such nations. The efforts by this study was expected to gather international information at a lower cost and in a more efficient way and eventually to contribute to improving the productivity of academic research.

54

상용 학술데이터베이스의 텍스트 기반 검색과 비주얼검색의 사용성에 관한 연구

김종애(경기대학교) 2009, Vol.26, No.3, pp.111-129 https://doi.org/10.3743/KOSIM.2009.26.3.111

초록보기

초록

본 연구는 시각화 정보검색시스템이 실제 정보검색환경에서 이용자에게 원활하게 수용될 수 있는지에 대한 경험적인 분석을 제공하고자, 상용 학술데이터베이스의 텍스트 기반 검색과 비주얼검색의 사용성을 비교․평가하고, 실험순서에 따라 사용성 평가에 있어 차이가 있는지 분석하였다. 검색소요시간과 처리동작횟수에 있어서 텍스트 기반 검색이 비주얼검색보다 더 효율적인 것으로 나타났으며, 통계적으로 유의한 차이가 있는 것으로 나타났다. 또한 사용성에 대한 인식에 있어서도 텍스트 기반 검색이 비주얼 검색보다 전체적으로 더 높게 나타났으며 통계적으로 유의한 차이가 있는 것으로 나타났다.

Abstract

This study examined the usability of text-based search and visual search of a large multidisciplinary library database to provide an empirical analysis of the acceptability of visual systems in the information retrieval environment. It also examined if there are differences in the usability assessment based on experimental order. The results indicated that the text-based search supported users' search behaviors more efficiently than the visual search. Also the text-based search was rated higher than the visual search in terms of user perceptions of four usability factors.

55

PREMIS 데이터모델 적용을 위한 사무문서 컨텐츠모형 설계 연구

문주영(숭의여자대학) ; 김태수(연세대학교) 2011, Vol.28, No.1, pp.43-68 https://doi.org/10.3743/KOSIM.2011.28.1.043

초록보기

초록

본 연구에서는 OAIS 참조 모형을 구체적으로 발전시킨, 사실상의 보존 메타데이터 표준인 PREMIS 데이터모델과 데이터사전을 사무문서에 적용하기 위한 사무문서 컨텐츠모형을 개발하였다. 대상 문서는 ‘A사 B국 해외 석유사업 및 유전개발 문서’로 국가 차원 이상의 영구 보존 가치를 지니는 문서 컬렉션이다. PREMIS 데이터모델을 사무문서에 구체적으로 적용하기 위하여 PREMIS 모델 내의 지적개체에 대한 문서 차원의 개념 정립과 이해를 시도하였다. 즉, 문서 컨텐츠의 계층을 구분하는 원칙과 구조를 설계하였고 그에 맞추어 사무문서 컨텐츠를 대상으로 한 계층 모형을 만들어 사무문서 컨텐츠모형을 도출하였다. 이 과정에서 기록물 기술 규칙을 준수하였다.

Abstract

This study presents a contents model designed for business records that require long-term preservation. The contents model is based on the PREMIS(Preservation Metadata: Implementation Strategies) data model and the ISAD(G)(General International Standard Archival Description). The study selected the record collection of “the records of the overseas petroleum business and oil field development of A company located in B country.” This collection requires permanent preservations by the nation and even beyond. It was attempted to establish the concepts of intellectual objects in the PREMIS data model to apply the PREMIS data model to the business records specifically. In other words, the study established the principles for differentiation of the classes in the record contents and the hierarchy structure, and the hierarchy model was developed for business records contents to derive the business records model based on those principles.

56

대용량 음악콘텐츠 환경에서의 데이터마이닝 기법을 활용한 추천시스템에 관한 연구

김용(KT 인프라) ; 문성빈(연세대학교) 2007, Vol.24, No.2, pp.89-104 https://doi.org/10.3743/KOSIM.2007.24.2.089

초록보기

초록

본 연구는 대용량 음악콘텐츠환경에서 개인화 추천 서비스를 위한 기반구조의 제공을 위하여 시도되었다. 추천서비스를 위한 기존의 많은 연구와 상용프로그램에도 불구하고 대규모의 쇼핑몰들은 개인화 추천서비스와 실시간으로 대용량의 데이터를 처리할 수 있는 추천시스템을 필요로 하고 있다. 이를 위하여 본 연구에서는 데이터마이닝 기술과 새로은 패턴매칭 알고리즘을 제안하고 있다. 콘텐츠 주제분야에 대한 이용자의 선호도를 이용한 이용자 분할을 위하여 군집화 기법이 사용되었다. 다음으로는 군집화를 통하여 생성된 분할된 이용자 그룹에서 개별 이용자의 콘텐츠에 대한 접근 패턴의 추출을 위하여 순차패턴 마이닝기법을 적용하였다. 최종적으로 각각의 이용자 군집의 콘텐츠 접근 패턴과 콘텐츠 선호도에 기반한 제안된 추천 알고리즘에 의해 추천이 이루어진다. 이러한 추천을 위하여 기반구조와 함께, 전처리과정과 원본 데이터의 형식변환이 데이터베이스에서 수행되어진다. 본 연구에서 제안하고 있는 기반구조의 적절성을 보여주기 위하여 제안된 시스템을 구현하였다. 실제 이용자에 의해 이용된 데이터를 실험에 적용하였으며, 해당 실험에서 추천은 실시간으로 이루어졌으며 추천결과에 있어서는 적절한 정확성을 보여주고 있다.

Abstract

This study attempts to give a personalized recommendation framework in large-sized music contents environment. Despite of many existing studies and commercial solutions for a recommendation service, large online shopping malls are still looking for a recommendation system that can serve personalized recommendation and handle large data in real-time.This research utilizes data mining technologies and new pattern matching algorithm. A clustering technique is used to get dynamic user segmentations using user preference to contents categories. Then a sequential pattern mining technique is used to extract contents access patterns in the user segmentations. Finally, the recommendation is given by our recommendation algorithm using user contents preference history and contents access patterns of the segment. In the framework, preprocessing and data transformation and transition are implemented on DBMS. The proposed system is implemented to show that the framework is feasible. In the experiment using real-world large data, personalized recommendation is given in almost real-time and shows acceptable correctness.

57

최대 개념강도 인지기법을 이용한 데이터베이스 자동선택 방법에 관한 연구

정도헌(한국과학기술정보연구원) 2010, Vol.27, No.3, pp.265-281 https://doi.org/10.3743/KOSIM.2010.27.3.265

초록보기

초록

본 연구에서 제안하는 기법은 최대 개념강도 인지기법(Maximal Concept-Strength Recognition Method: MCR)이다. 신규 데이터베이스가 입수되어 자동분류가 필요한 경우에, 기 구축된 여러 데이터베이스 중에서 최적의 데이터베이스가 어떤 것인지 알 수 없는 상태에서 MCR 기법은 가장 유사한 데이터베이스를 선택할 수 있는 방법을 제공한다. 실험을 위해 서로 다른 4개의 학술 데이터베이스 환경을 구성하고 MCR 기법을 이용하여 최고의 성능값을 측정하였다. 실험 결과, MCR을 이용하여 최적의 데이터베이스를 정확히 선택할 수 있었으며 MCR을 이용한 자동분류 정확률도 최고치에 근접하는 결과를 보여주었다.

Abstract

The proposed method in this study is the Maximal Concept-Strength Recognition Method(MCR). In case that we don't know which database is the most suitable for automatic-classification when new database is imported, MCR method can support to select the most similar database among many databases in the legacy system. For experiments, we constructed four heterogeneous scholarly databases and measured the best performance with MCR method. In result, we retrieved the exact database expected and the precision value of MCR based automatic-classification was close to the best performance.

58

빅데이터 분석을 통해 본 한국 위키피디아의 지식형성 과정에 관한 연구

이정연(이화여자대학교 이화사회과학원) ; 전수현(우아한형제들 데이터애널리스트) 2020, Vol.37, No.2, pp.171-195 https://doi.org/10.3743/KOSIM.2020.37.2.171

초록보기

초록

본 연구는 대표적인 온라인 협업커뮤니티인 한국 위키피디아의 초기 2002년부터 2019년까지의 편집로그 빅데이터를 해체하여 공동협업과정을 시계열적으로 분석하였다. 공개된 오픈데이터의 표준화된 XML 문서편집 기록을 활용해 Phython과 R을 이용하여 분석 요소를 추출하여 이를 활용하였다. 연구 분석 결과 한국 위키피디아 편집자의 참여 방법, 데이터 내용의 특징, 문서 생성의 추이 등을 설명할 수 있었다. 소수 편집자들의 적극적 활동과 대다수 편집자들의 느슨한 참여도 밝혀졌으며, 온라인에서도 나타나는 사회 문화적 특징이 한국 위키피디아에서도 나타났다. 집단지성을 지속화시키기 위해서는 새롭고 다양한 외부자원이 필수인데 신규 진입자들이 공동편집 커뮤니티에 안착하기 위한 다각적인 고려가 필요하며, 관리자 그룹의 고착화를 탈피하여 순환구조를 통한 개방성이 필요함을 제언하였다.

Abstract

This study analyzed the collaborative process in time series by dismantling the edit log big data of Wikipedia Korea, a representative online collaboration community, from early 2002 to 2019. Analysis elements were extracted from the document edit records, formatted in standardized XML, and analyzed using Python and R. The ways of editors’ contribution, the characteristics of data contents, and the trend of document creation were explained by the analysis. An active contribution of a small set of editors and a loose participation of the majority were revealed. In addition, sociocultural characteristics that appear in online communities were also found in Wikipedia Korea. A new, diverse set of external resources is necessary to sustain the collective intelligence. An effort to settle new editors into the wikipedia community and an openness through circulation structure to avoid the exclusiveness of the management group are suggested.

59

형사사법정보의 빅데이터 활용방안 연구: 구조화 범주화 관점으로

김미령(서울지방경찰청 사서) ; 노윤주(경찰청 사서) ; 김성훈(성균관대학교 문헌정보학과 초빙교수) 2019, Vol.36, No.4, pp.253-277 https://doi.org/10.3743/KOSIM.2019.36.4.253

초록보기

초록

4차 산업혁명시대를 맞아 데이터의 중요성은 심화되고 있으나, 개인정보보호 등의 문제로 데이터의 활용이 쉽지 않은 경우가 많이 있다. 형사사법정보는 범죄 예측 및 예방, 범죄수사 과학화, 양형합리화 등 다양한 활용가치가 예상됨에도 현재 개인정보보호와 형사사법정보 관련 법률적 해석 문제로 활용이 상당히 제한되고 있다. 본 연구는 형사사법정보의 구조화․범주화를 통해 ‘범죄데이터’로 전환하여 빅데이터로서 활용하도록 제안하였으며, ‘범죄데이터’ 활용시 법률적 문제, 활용가치, 데이터 생성 및 활용시 고려사항을 전문가를 통해 검증하고 향후 전략적 발전방안을 도출하였다. 연구결과, ‘범죄데이터’는 개인정보보호문제는 해결된 것으로 보여지나, 형사사법정보 관련법에 명시할 필요는 있으며, 빅데이터 활용을 위해 분석 가능하도록 표준화된 형태로 정리되는 것이 시급함이 밝혀졌다. 향후 진행방향으로는 데이터 요소 도출, 용어사전 시소러스 구축, 데이터 등급화를 위한 개인민감정보 정의 및 등급지정, 비정형데이터의 정형화를 위한 알고리즘 개발 등을 제시하였다.

Abstract

In the era of the 4th Industrial Revolution, the importance of data is intensifying, but there are many cases where it is not easy to use data due to personal information protection. Although criminal justice information is expected to have various useful values such as crime prediction and prevention, scientific investigation of criminal investigations, and rationalization of sentencing, the use of criminal justice information is currently limited as a matter of legal interpretation related to privacy protection and criminal justice information. This study proposed to convert criminal justice information into ‘crime data’ and use it as big data through the structuralization and categorization of criminal justice information. And when using “crime data,” legal issues, value in use, considerations for data generation and use were verified by experts, and future strategic development plans were identified. Finally we found that ‘crime data’ seems to have solved the privacy problem, but it is necessary to specify in the criminal justice information related law and it is urgent to be organized in a standardized form for analysis to use big data. Future directions are to derive data elements, construct a dictionary thesaurus, define and classify personal sensitive information for data grading, and develop algorithms for shaping unstructured data.

60

아카이빙 데이터의 활용성 증진을 위한 전략연구: 국내외 학술논문을 중심으로

정영임(한국과학기술정보연구원) ; 최호남(한국과학기술정보연구원) ; 최선희(한국과학기술정보연구원) 2010, Vol.27, No.1, pp.185-206 https://doi.org/10.3743/KOSIM.2010.27.1.185

초록보기

초록

핵심 학술저널이 디지털화하면서 도서관에서 구독한 자료에 대한 항구 접근 및 장기 보존에 대한 요구가 증대하였다. 이러한 요구에 부응하여 국내외에서는 다양한 기관 및 기구를 수행 주체로 하여 디지털 학술자료의 보존 활동을 해오고 있다. 본 논문은 국가출연기관인 KISTI가 국내외 학술논문 아카이빙 데이터의 활용성을 증진시킬 수 있는 시스템 및 서비스 전략을 중심으로 논의하고자 한다. 또한 KISTI의 전략을 국내외 연구와 비교하였고, NDA 체제 구축의 비용편익 분석을 통해 본 연구에서 제안한 연구의 타당성을 밝히고자 하였다. 마지막으로 아카이빙 데이터 활용성 증진을 위해 정책적, 법제적 기반 마련 방안과 아카이빙 데이터의 고부가가치 서비스 제공 방안을 제안함으로써 향후 연구 방향을 제시하였다.

Abstract

Since core scholarly journals have been digitalized, demands on the perpetual access and long-term preservation of subscribed digital information resources by libraries are increasing. Various institutions and organizations have performed the preservation activities of digital scholarly resources for the purpose of responding those demands. This paper illustrates the National Digital Archive(NDA) system proposed and developed by KISTI and discusses on the NDA strategies which aim to improve the usability of archived journal articles. In addition, NDA strategies of KISTI have been compared with those of international researches and the economic validity of NDA has been verified by analyzing NPV, BCR and IRR. Legal system for improving the application of the archived data should be studied next and high value-added data services have been suggested as our future studies in the final section.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지