정보관리학회지, 한국정보관리학회

21

텍스트 마이닝 기법을 이용한 컴퓨터공학 및 정보학 분야 연구동향 조사: DBLP의 학술회의 데이터를 중심으로

김수연(연세대학교) ; 송성전(연세대학교 문헌정보학과) ; 송민(연세대학교) 2015, Vol.32, No.1, pp.135-152 https://doi.org/10.3743/KOSIM.2015.32.1.135

초록보기

초록

Abstract

The goal of this paper is to explore the field of Computer and Information Science with the aid of text mining techniques by mining Computer and Information Science related conference data available in DBLP (Digital Bibliography & Library Project). Although studies based on bibliometric analysis are most prevalent in investigating dynamics of a research field, we attempt to understand dynamics of the field by utilizing Latent Dirichlet Allocation (LDA)-based multinomial topic modeling. For this study, we collect 236,170 documents from 353 conferences related to Computer and Information Science in DBLP. We aim to include conferences in the field of Computer and Information Science as broad as possible. We analyze topic modeling results along with datasets collected over the period of 2000 to 2011 including top authors per topic and top conferences per topic. We identify the following four different patterns in topic trends in the field of computer and information science during this period: growing (network related topics), shrinking (AI and data mining related topics), continuing (web, text mining information retrieval and database related topics), and fluctuating pattern (HCI, information system and multimedia system related topics).

22

뉴스 빅데이터를 이용한 우리나라 언론의 기록관리 분야 보도 특성 분석: 1999~2018 뉴스를 중심으로

한승희(서울여자대학교) 2018, Vol.35, No.3, pp.41-75 https://doi.org/10.3743/KOSIM.2018.35.3.041

초록보기

초록

이 연구에서는 1999년 1월부터 2018년 6월 현재까지 약 20년 간의 기록관리를 주제로 한 뉴스 빅데이터 4,680 건을 ‘빅카인즈’에서 추출하여, 이를 대상으로 우리나라 언론의 기록관리 주제에 대해 시계열 기반으로 보도 특성을 분석하고자 하였다. 먼저, 기록관리에 대한 언론 보도량의 차이를 살펴보기 위해 시기별, 주제별, 언론사 유형별 보도량을 분석하였다. 또한 기록관리 주제에 대한 언론 보도 내용의 차이에 대한 특성을 분석하기 위해 단어빈도 기반 내용 분석과 언어 네트워크 분석을 수행하여 언론 보도 내용의 시기별, 주제별, 언론사 유형별 차이를 분석하였다. 분석 결과, 기록관리 분야 뉴스 보도는 보도량과 보도 내용에 있어 시기별, 주제별, 언론사별로 차이가 있는 것으로 나타났다. 뉴스 보도량은 2007년 대통령기록물관리법이 제정된 이후부터 증가하기 시작하여 2013년에 가장 많은 뉴스가 보도된 것으로 나타났으며, 정치와 사회 주제를 중심으로 중앙지와 경제지가 가장 많은 양의 뉴스를 보도한 것으로 나타났다. 또한 뉴스 보도 내용의 분석 결과, 기록관리가 도입된 처음 10년 동안은 기록관리의 현장 적용과 확산 과정에서 발생하는 이슈들을 중심으로 뉴스 주제가 형성되다가, 대통령기록물관리법 제정 이후로 기록관리가 정치적, 사회적 이슈의 주요 요인이 되면서 정치, 사회 분야의 뉴스가 많이 보도된 것으로 나타났다.

Abstract

The purpose of this study is to analyze the characteristics of Korean media on the topic of archives & records management based on time-series analysis. In this study, from January, 1999 to June, 2018, 4,680 news articles on archives & records management topics were extracted from BigKinds. In order to examine the characteristics of the media coverage on the archives & records management topic, this study was analyzed to the difference of the press coverage by period, subject, and type of the media. In addition, this study was conducted word-frequency based content analysis and semantic network analysis to investigate the content characteristics of media on the subject. Based on these results, this study was analyzed to the differences of media coverage by period, subject, and type of media. As a result, the news in the field of records management showed that there was a difference in the amount of news coverage and news contents by period, subject, and type of media. The amount of news coverage began to increase after the Presidential Records Management Act was enacted in 2007, and the largest amount of news was reported in 2013. Daily newspapers and financial newspapers reported the largest amount of news. As a result of analyzing news reports, during the first 10 years after 1999, news topics were formed around the issues arising from the application and diffusion process of the concept of archives & records management. However, since the enactment of the Presidential Records Management Act, archives & records management has become a major factor in political and social issues, and a large amount of political and social news has been reported.

23

국내 대학도서관의 연구데이터관리서비스 개발 방안에 관한 연구: 서울대학교 소속 연구자들의 요구 분석을 중심으로

심윤희(이화여자대학교 일반대학원 문헌정보학) ; 김지현(이화여자대학교) 2019, Vol.36, No.3, pp.61-80 https://doi.org/10.3743/KOSIM.2019.36.3.061

초록보기

초록

본 연구는 대학도서관의 연구데이터관리서비스 개발을 위하여 수행되었다. 본 연구에서는 연구데이터관리서비스의 요소와 제공 수준을 알아보고, 국내에서 연구비 규모가 가장 큰 대학인 서울대학교 소속 연구자들을 대상으로 인터뷰를 진행하여 연구자들의 연구데이터관리 및 공유와 이용, 서비스에 대한 요구를 분석하였다. 인터뷰 참여자들은 해외 연구비지원기관 또는 학술 저널에서 제시하는 데이터 공유 의무조항에 대한 인식과 이행 경험이 부족하고 데이터를 체계적으로 관리하는데 어려움을 겪고 있었다. 그러나 상당수의 연구자들이 데이터 관리 및 연구데이터관리서비스 관련 교육에 대한 필요성에 대해 동감하고 있었다. 이를 바탕으로, 연구데이터관리서비스를 교육서비스, 전문 컨설팅 서비스, 큐레이션 기술 서비스 요소로 나누어 각 요소별 이용자의 요구를 반영한 서비스를 제안하였다. 본 연구결과는 향후 국내 대학도서관 및 연구데이터관리서비스를 계획하고 있는 기관에서 서비스 개발의 기초자료로 활용할 수 있을 것이다.

Abstract

This study aimed to develop Research Data Management (RDM) Services in a domestic university library of Korea. In this study, elements and levels of RDM services are examined and in-depth interview was conducted with university researchers affiliated in Seoul National University, which has the largest amount of research fund among universities in Korea. Interview was conducted to analyze their data management practices and needs of RDM services. Interview results show researchers’ lack of awareness toward Data Management Policy and data sharing obligations of funding agencies and academic journal publishers. Also, they had trouble managing research data systematically. However, many of the researchers understand the necessary of research data management and education of data management. Based on the interview result, service elements and contents are suggested for RDM services which is consisted of education services, professional consulting services, curation technical services. This study result will help to guide for the planning the future RDM service in university library of Korea.

24

스마트폰 무선신호를 이용한 공공도서관 이용자의 공간이용행태 분석

박성재(한성대학교) 2019, Vol.36, No.1, pp.295-313 https://doi.org/10.3743/KOSIM.2019.36.1.295

초록보기

초록

본 연구의 목적은 이용자의 스마트폰 무선신호를 이용하여 이용자가 공공도서관 공간을 어떻게 이용하는지에 대한 이용행태를 분석하는 것이다. 공간이용 데이터를 수집하는 방법으로 이용자의 스마트폰 무선신호를 감지하여 이용자의 동선을 추적하였고 수집된 데이터를 로데이터로 하여 추가적인 분석을 진행하였다. 서울 시내 한 구립도서관에서 4개월 동안 수집된 이용자 공간이용 데이터를 분석한 결과, 전월 대비 평균 37.9%의 이용자들이 익월에도 이용을 하는 것으로 나타났고 이용자 중의 50%는 7분 미만으로 도서관에 머무는 것으로 나타났다. 또한 도서관을 이용하는 시간을 분석한 결과, 오후 2-3시 사이에 이용자들이 가장 많았으며 주말 오후 5시 이후에는 이용자가 매우 적게 나타났다. 층간 공간이동을 분석한 결과, 서가가 위치한 3층과 4층 사이의 공간이동이 유사하게 높게 나타났다. 이러한 결과는 스마트폰 무선신호를 이용한 도서관 공간이용행태를 분석하는 방법론이 기존에 주로 사용되었던 관찰을 통한 분석보다 효과적임을 제시하고 있다. 따라서 향후 도서관 공간이용 분석에 적극 활용된다면 도서관 공간운영의 활용성을 높일 것으로 기대된다.

Abstract

The purpose of this study is to analyze library space use patterns through users’ smartphone WiFi. This study is applied a method to detect WiFi signal of users’ smartphone to analyze the in-library wayfinding of users. The library usage data were collected for four months in a library in Seoul, Korea. The results show that the average 37.9% of library users revisits the library the next month. Half of users stay under 7 minutes in the library. Users mainly visit the library between 2 and 3 o’clock, and few users visit the library after 5 pm on weekends. The floor moving pattern result shows that the co-visit rate between the third and fourth floor is higher than others, in that these two floors are mainly composed of book shelves. These results indicate that the method to detect the WiFi signal for spatial pattern analysis could be more effective than observation which was used in previous research. It, therefore, is expected that this method would be applied in other libraries to analyze and enhance the library space usage.

25

데이터 활용률 제고를 위한 기술 용어의 상호 네트워크 생성과 통제

정도헌(덕성여자대학교) 2018, Vol.35, No.1, pp.157-182 https://doi.org/10.3743/KOSIM.2018.35.1.157

초록보기

초록

빅 데이터 시대에 접어들면서 저장 기술과 처리 기술이 급속도로 발전함에 따라, 과거에는 간과되었던 롱테일(long tail) 데이터가 많은 기업과 연구자들에게 관심의 대상이 되고 있다. 본 연구는 롱테일 법칙의 영역에 존재하는 데이터의 활용률을 높이기 위해 텍스트 마이닝 기반의 기술 용어 네트워크 생성 및 통제 기법을 제안한다. 특히 텍스트 마이닝의 편집 거리(edit distance) 기법을 이용해 학문 분야에서 사용되는 기술 용어의 상호 네트워크를 자동으로 생성하는 효과적인 방안을 제시하였다. 데이터의 활용률 향상 실험을 위한 데이터 수집을 위해 LOD(linked open data) 환경을 이용하였으며, 이 과정에서 효과적으로 LOD 시스템의 데이터를 활용하는 기법과 용어의 패턴 처리 알고리즘을 제안하였다. 마지막으로, 생성된 기술 용어 네트워크의 성능 측정을 통해 제안한 기법이 롱테일 데이터의 활용률 제고에 효과적이었음을 확인하였다.

Abstract

As data management and processing techniques have been developed rapidly in the era of big data, nowadays a lot of business companies and researchers have been interested in long tail data which were ignored in the past. This study proposes methods for generating and controlling a network of technical terms based on text mining technique to enhance data utilization in the distribution of long tail theory. Especially, an edit distance technique of text mining has given us efficient methods to automatically create an interlinking network of technical terms in the scholarly field. We have also used linked open data system to gather experimental data to improve data utilization and proposed effective methods to use data of LOD systems and algorithm to recognize patterns of terms. Finally, the performance evaluation test of the network of technical terms has shown that the proposed methods were useful to enhance the rate of data utilization.

26

OWL을 이용한 온톨로지 기반의 목록시스템 설계 연구

이현실(원광대학교) ; 한성국(원광대학교) 2004, Vol.21, No.2, pp.249-267 https://doi.org/10.3743/KOSIM.2004.21.2.249

초록보기

초록

MARC는 목록 데이터를 상세하게 정의할 수 있는 장점이 있지만, 개념요소가 구조화 되어 있지 않고 표현체계가 복잡하기 때문에 단순 계층구조의 의미 어휘 체계를 지원하는 XML DTD나 RDF/S로는 그 구조를 모델화하기가 어렵다. 본 연구에서는 MARC의 데이터 요소를 추상화하여 목록 데이터의 개념 구조를 표현하는 서지 온톨로지를 구축하였으며, 개념간의 논리 관계와 프로퍼티의 카디널리티 및 프로퍼티 값에 대한 논리적 제한을 부가할 수 있는 OWL을 이용하여 MRAC 필드의 복합 구조를 모델링하여 구축한 목록 온톨로지를 구현하였다. 온톨로지 언어를 이용한 MARC 데이터를 기술 방법은 목록 데이터에 대한 메타데이터 구성과 목록의 호환성 문제를 해결할 수 있는 기초적 방안이 되며, 시맨틱 웹 서비스를 기반으로 하는 차세대 문헌 정보서비스 시스템 구현의 토대가 될 것이다.

Abstract

Although MARC can define the detail cataloguing data, it has complex structures and frameworks to represent bibliographic information. On account of these idiosyncratic features of MARC, XML DTD or RDF/S that supports simple hierarchy of conceptual vocabularies cannot capture MARC formalism effectively. This study implements bibliographic ontology by means of abstracting conceptual relationships between bibliographic vocabularies of MARC. The bibliographic ontology is formalized with OWL that can represent the logical relations between conceptual elements and specify cardinality and property value restrictions. The bibliographic ontology in this study will provide metadata for cataloguing data and resolve compatibility problems between cataloguing systems. And it can also contribute the development of next generation bibliographic information system using semantic Web services.

27

우리나라 공공도서관의 디지털참고봉사에 대한 종단적 분석

장혜란(상명대학교) 2007, Vol.24, No.2, pp.105-122 https://doi.org/10.3743/KOSIM.2007.24.2.105

초록보기

초록

공공도서관에서 제공하고 있는 디지털참고봉사의 현황과 발전을 이해하기 위하여, 전국의 공공도서관 홈페이지를 직접 접속하여 관찰하고 이용 데이터를 수집하여 분석하였으며. 2003년에 수집하였던 데이터와 비교하였다. 모두 404개의 디지털참고봉사 사이트에 대하여, 접근수준, 서비스방식, 링크명칭, 서비스정책, 웹폼, FAQ 등 서비스 제공 관련 특성을 분석한 후, 15일간 수행된 질문응답 데이터를 수집하여 이용도서관, 이용수준, 응답비율, 질문유형 등 서비스 성능을 분석하였다. 서비스 현황에 대한 이해와 문제점, 그리고 4년 동안에 걸친 변화가 식별되었으며, 향후 발전을 위한 제언이 이루어졌다.

Abstract

To understand the present status and the development of the digital reference service in Korean public libraries, a nationwide site observation was attempted in 2007. The collected data was analyzed, then compared with the previous analysis based on a 2003 data. For the 404 sites offering digital reference, operational characteristics, such as access level, service mode, link description, policy, web form, and FAQ, are analyzed. Performance analysis focused on the presence of question posting, volume of usage, response rate, and types of the questions, for the data collected for 15 days through question and answer transcript recording. Results reveal findings on the present situation as well as changes over 4 years.Related problems are identified. The conclusion includes suggestions for improving digital reference service.

28

Web of Science 데이터학술지 게재 데이터논문의 지적구조 규명

정은경(이화여자대학교 사회과학대학 문헌정보학과 교수) 2020, Vol.37, No.1, pp.153-177 https://doi.org/10.3743/KOSIM.2020.37.1.153

초록보기

초록

오픈과학의 흐름에서 데이터 공유와 재이용은 중요한 연구자의 활동이 되어가고 있다. 데이터 공유와 재이용에 관한 여러 논의 중에서 데이터학술지와 데이터논문의 발간이 가시적인 결과를 보여주고 있다. 데이터학술지는 여러 학문 분야에서 발간되고 있으며, 논문의 수도 점차 증가하고 있다. 데이터논문은 데이터 자체와는 다르게 인용을 주고 받는 활동이 포함되어, 따라서 이들이 형성하는 고유한 지적구조가 생겨나게 된다. 본 연구는 데이터학술지와 데이터논문이 학술커뮤니티에서 구성하는 지적구조를 규명하고자 Web of Science에 색인된 14종의 데이터학술지와 6,086건의 데이터논문과 인용된 참고문헌 84,908건을 분석하였다. 저자사항과 함께 동시인용분석과 서지결합분석을 네트워크로 시각화하여 데이터논문이 형성한 세부 주제 분야를 규명하였다. 분석결과, 저자, 저자소속기관, 국가를 추출하여 출현빈도를 살펴보면, 전통적인 학술지 논문과 다른 양상을 보인다. 이러한 결과는 데이터의 생산이 용이한 기관과 국가에 주로 데이터논문을 출간하기 때문이라고 해석될 수 있다. 동시인용분석와 서지결합분석 모두 분석도구, 데이터베이스, 게놈구성 등이 주된 세부 주제 영역으로 나타났다. 동시인용분석결과는 9개의 군집으로 형성되었는데, 특정 주제 분야로 나타난 영역은 수질과 기후 등의 분야이다. 서지결합분석은 총 27개의 컴포넌트로 구성되었는데, 수질, 기후 이 외에도 해양, 대기 등의 세부 주제 영역이 파악되었다. 특기할만한 사항으로는 사회과학 분야의 주제 영역도 나타났다는 점이다.

Abstract

In the context of open science, data sharing and reuse are becoming important researchers’ activities. Among the discussions about data sharing and reuse, data journals and data papers shows visible results. Data journals are published in many academic fields, and the number of papers is increasing. Unlike the data itself, data papers contain activities that cite and receive citations, thus creating their own intellectual structures. This study analyzed 14 data journals indexed by Web of Science, 6,086 data papers and 84,908 cited references to examine the intellectual structure of data journals and data papers in academic community. Along with the author’s details, the co-citation analysis and bibliographic coupling analysis were visualized in network to identify the detailed subject areas. The results of the analysis show that the frequent authors, affiliated institutions, and countries are different from that of traditional journal papers. These results can be interpreted as mainly because the authors who can easily produce data publish data papers. In both co-citation and bibliographic analysis, analytical tools, databases, and genome composition were the main subtopic areas. The co-citation analysis resulted in nine clusters, with specific subject areas being water quality and climate. The bibliographic analysis consisted of a total of 27 components, and detailed subject areas such as ocean and atmosphere were identified in addition to water quality and climate. Notably, the subject areas of the social sciences have also emerged.

29

데이터 인용의 현황과 제언

김지현(이화여자대학교) ; 정은경(이화여자대학교) ; 윤정원(University of South Florida) ; 이재윤(명지대학교) 2017, Vol.34, No.1, pp.7-29 https://doi.org/10.3743/KOSIM.2017.34.1.007

초록보기

초록

학술 커뮤니티 내에서 논문의 인용은 보편적인 규범으로 자리 잡은 데 비해 데이터의 인용은 아직 초보적인 단계에 머물러 있다. 이를 개선하기 위해 제기되고 있는 데이터 인용의 필요성 및 원칙과 가이드라인에 대해서 살펴보았다. 또한 데이터 인용체계 구축 사례에서는 데이터 인용 요소들을 정의하고 서비스를 제공하는 DataCite, Dataverse Network, Data Citation Index 사례를 중심으로 살펴보았다. 마지막으로 한국종합사회조사 데이터 인용 분석을 통해 국내 데이터세트 인용/이용 정보 제공 실태를 조사하였다.

Abstract

Data citation remains in its infancy, although providing the citation to a journal article is a typical norm in an academic community. This study examines the need for data citation, its principles and guidelines for improving the issue. In addition, the study investigates cases that established data citation mechanism, including DataCite, Dataverse Network and Data Citation Index that define elements of data citation and provide relevant services. At the end, it explores the current state of data citation in Korea through the analysis of citations to dataset from Korean General Social Survey.

30

전문대학 도서관 이용자들의 웹 기반 OPAC 이용실태에 관한 연구

김태승(경기대학교) ; 이동규(대림대학) 2005, Vol.22, No.4, pp.79-95 https://doi.org/10.3743/KOSIM.2005.22.4.079

초록보기

초록

본 연구는 2년제 전문대학 학생들을 대상으로 웹기반 온라인목록의 이용특성을 조사 연구한 것이다. 연구방법으로 이용자들의 특성을 분석하기 위하여 질문지법과 면접조사법을 통해 데이터를 수집하였으며, 수집된 데이터의 처리는 통계처리 프로그램인 SPSSWIN 10.1을 사용하여 분석하였다. 연구결과 이용행태, 검색결과 만족도, 웹 온라인목록의 선호도, 검색어 선정, 문헌정보학 전공자와 비전공자 간의 탐색성과 차이, 웹 온라인목록의 이용자교육의 필요성 등에 관한 결과를 얻었다. 이러한 분석결과를 근거로 하여 웹 온라인목록 이용 중에 발생하는 문제점과 어려움을 느끼는 기능들에 대해 개선방안을 제시하여 이용자들로 하여금 웹 온라인목록 이용의 효율성을 돕고자 하였다.

Abstract

The aims of this study is to analyse the user's behavior, satisfaction, difficulties and selection of retrieval keywords for the use of Web-based OPAC in the College students. The methods of the questionnaire and the interview was applied to get the data and processed by using SPSSWIN 10.1. Several research results was proved the hypothesis such as differences between major subject of students in their fields. Furthermore, based on the result of this analysis, another purpose is to come up with the improvements of functions prompting difficulties and answers to problems found in the Web OPAC, helping them to use the Web OPAC efficiently.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지