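정보관리학회지 (Journal of the Korean Society for Information Management), 한국정보관리학회 (Korean Society for Information Management)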

1

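황성욱 (Department of Records Management, Jeonbuk National University); 정예용 (Department of Records Management, Jeonbuk National University); 김수정 (Department of Library and Information Science, Jeonbuk National University); 오효정 (Department of Library and Information Science, Jeonbuk National University) 2020, Vol.37, No.2, pp.23-45 https://doi.org/10.3743/KOSIM.2020.37.2.023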

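Abstract (translated from the Korean)

Recently, in the face of the unprecedented COVID-19 disaster, the world has been watching the Korean government’s proactive response through transparent disclosure of information. As this shows, opening public data is essential to raising diverse social and economic value by increasing public awareness of, and access to, specific information. This study examines the current state of the SNS that countries operate to promote the use of their public data portals, where data collected and released under government leadership are made available, and proposes improvements. To this end, it surveyed the SNS operations of public data portals in Korea and abroad, selected the services of three leading cases (India, the United States, and Korea), and carried out quantitative analysis, feedback analysis, time-series analysis, and information-type analysis. From the results, it identified information types and user needs, drew implications, and proposed concrete improvements for promoting the use of public data.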

Abstract

The world is paying attention to the South Korean government’s proactive COVID-19 response, a key element of which is transparency and openness in sharing information. Opening up government information is essential to enhancing its social and economic value through increased awareness and accessibility. The purpose of this study is to investigate the current status of the social networking services (SNS) operated by national open data portals, which make government-collected and government-disclosed data available, and to suggest improvements that promote the use of those portals. To do this, the study compared three national open data portals, those of India, the U.S.A., and Korea, by performing quantitative analysis, user feedback analysis, time-series analysis, and information type analysis. Based on the identified information types and user needs, the study suggests concrete ways to facilitate the use of open data portals.

2

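A Study on the Data Needs and Data Reuse Behavior of Researchers in the Social Sciences

김나연 (Master’s graduate, Department of Library and Information Science, Graduate School, Ewha Womans University); 정은경 (Professor, Department of Library and Information Science, Ewha Womans University) 2020, Vol.37, No.4, pp.1-26 https://doi.org/10.3743/KOSIM.2020.37.4.001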

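Abstract (translated from the Korean)

In today’s increasingly data-intensive scholarly environment, data is establishing itself as a foundation of scholarly communication, as a research output rather than a research by-product. However, expanding the supply of data or securing its accessibility is not enough by itself to ensure actual data reuse. Overcoming this limit requires an in-depth understanding of academic researchers’ data reuse behavior and data needs. This study therefore sought to identify researchers’ main data reuse behaviors and data needs. To this end, authors of KCI-listed papers among the data reuse literature of the Korea Social Science Data Archive (KOSSDA) for the most recent three years were selected as the study population, and in-depth interviews were conducted with the 12 researchers who agreed to be interviewed. The analysis showed that the factors behind data reuse appeared in all of the personal, economic, technical, and social aspects, and that, depending on the purpose of reuse, researchers used either the data itself or the contextual information the data carried. They acquired data mainly from web-based information sources, but in some cases learned of it through informal communication. As for the data needs that arise during reuse, researchers preferred institutions as the unit of production, English as the language, and the United States as the country. They also prioritized quantitative data collected through interviewer-administered face-to-face interview surveys. They viewed raw-level data with ample metadata and identification information positively, but viewed data with controlled access and use negatively because it was hard to be confident of the data’s value. No clear preference appeared regarding the size or recency of data, because no similar data were available to choose from.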

Abstract

In today’s increasingly data-intensive academic environment, data is becoming the foundation of scholarly communication as a research outcome rather than a research by-product. However, expanding the data supply or securing accessibility is not enough by itself to guarantee actual data reuse. To overcome this limit, it is necessary to understand researchers’ data reuse behavior and data needs in depth. This study therefore attempted to identify the major data reuse behaviors and data needs of researchers. To this end, the authors of KCI-listed papers among the data reuse literature of the Korea Social Science Data Archive (KOSSDA) for the most recent three years were targeted, and in-depth interviews were conducted with the 12 researchers who agreed to participate. The interviews showed that the factors behind data reuse spanned personal, economic, technical, and social aspects, and that researchers used either the data itself or the contextual information of the data, depending on the purpose of reuse. Researchers mainly acquired data from web-based information sources, although some learned of data through informal communication. In terms of data needs, they preferred institutional producers, English as the language, and the United States as the country. They also gave priority to quantitative data collected through interviewer-administered face-to-face surveys, and viewed raw data with rich metadata and identification information positively. However, they viewed data with controlled access and use negatively because it was hard to be confident of its value. No clear preference emerged regarding data size or recency, because no comparable data were available to choose from.

3

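Identifying the Intellectual Structure of Data Papers Published in Data Journals Indexed in Web of Science

정은경 (Professor, Department of Library and Information Science, College of Social Sciences, Ewha Womans University) 2020, Vol.37, No.1, pp.153-177 https://doi.org/10.3743/KOSIM.2020.37.1.153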

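Abstract (translated from the Korean)

In the open science movement, data sharing and reuse are becoming important activities for researchers. Among the various discussions of data sharing and reuse, the publication of data journals and data papers is showing visible results. Data journals are published in many disciplines, and the number of papers is gradually increasing. Unlike data itself, data papers give and receive citations, and so form an intellectual structure of their own. To identify the intellectual structure that data journals and data papers form in the scholarly community, this study analyzed 14 data journals indexed in Web of Science, 6,086 data papers, and 84,908 cited references. Together with author information, co-citation analysis and bibliographic coupling analysis were visualized as networks to identify the detailed subject areas formed by data papers. When authors, author affiliations, and countries were extracted and their frequencies examined, the pattern differed from that of traditional journal articles. This result can be interpreted as arising because data papers are mainly published by institutions and countries where data are easy to produce. In both the co-citation analysis and the bibliographic coupling analysis, analytical tools, databases, and genome composition emerged as the main detailed subject areas. The co-citation analysis formed nine clusters, and the areas that appeared as specific subject fields included water quality and climate. The bibliographic coupling analysis consisted of 27 components in total, and detailed subject areas such as ocean and atmosphere were identified in addition to water quality and climate. Notably, subject areas in the social sciences also appeared.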

Abstract

In the context of open science, data sharing and reuse are becoming important activities for researchers. Among the discussions about data sharing and reuse, data journals and data papers show visible results. Data journals are published in many academic fields, and the number of papers is increasing. Unlike the data itself, data papers cite and receive citations, thus creating their own intellectual structures. This study analyzed 14 data journals indexed by Web of Science, 6,086 data papers, and 84,908 cited references to examine the intellectual structure of data journals and data papers in the academic community. Along with author details, the co-citation analysis and bibliographic coupling analysis were visualized as networks to identify the detailed subject areas. The results show that the most frequent authors, affiliated institutions, and countries differ from those of traditional journal papers. These results may arise mainly because data papers are published by those who can readily produce data. In both the co-citation and bibliographic coupling analyses, analytical tools, databases, and genome composition were the main subtopic areas. The co-citation analysis resulted in nine clusters, with specific subject areas being water quality and climate. The bibliographic coupling analysis consisted of a total of 27 components, and detailed subject areas such as ocean and atmosphere were identified in addition to water quality and climate. Notably, subject areas in the social sciences have also emerged.
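
As a rough illustration of the two measures named in this abstract, the sketch below counts co-citations (pairs of references cited by the same paper) and bibliographic coupling strength (references shared by a pair of citing papers). The paper and reference identifiers and the function names are hypothetical toy values, not data or code from the study, and the network visualization step is not shown.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: each citing paper maps to the set of references it cites.
citations = {
    "paper_A": {"ref_1", "ref_2", "ref_3"},
    "paper_B": {"ref_2", "ref_3"},
    "paper_C": {"ref_3", "ref_4"},
}


def cocitation_counts(citations):
    """Count how often each pair of references is cited together by one paper."""
    counts = Counter()
    for refs in citations.values():
        for pair in combinations(sorted(refs), 2):
            counts[pair] += 1
    return counts


def coupling_strengths(citations):
    """Count the references shared by each pair of citing papers."""
    strengths = {}
    for a, b in combinations(sorted(citations), 2):
        shared = len(citations[a] & citations[b])
        if shared:
            strengths[(a, b)] = shared
    return strengths


cocited = cocitation_counts(citations)
coupled = coupling_strengths(citations)
```

Co-citation links the cited references, while bibliographic coupling links the citing papers, which is one reason the two analyses can yield different numbers of clusters or components.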

4

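A Study on the Knowledge Formation Process of Korean Wikipedia through Big Data Analysis

이정연 (Ewha Institute for Social Sciences, Ewha Womans University); 전수현 (Data Analyst, Woowa Brothers) 2020, Vol.37, No.2, pp.171-195 https://doi.org/10.3743/KOSIM.2020.37.2.171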

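Abstract (translated from the Korean)

This study broke down the edit-log big data of Korean Wikipedia, a representative online collaborative community, from its early days in 2002 through 2019, and analyzed the collaborative process as a time series. Analysis elements were extracted with Python and R from the standardized XML document edit records in the publicly released open data. The analysis made it possible to describe how Korean Wikipedia editors participate, the characteristics of the data content, and the trend in document creation. It also revealed active contribution by a small number of editors and loose participation by the majority, and sociocultural characteristics seen online in general also appeared in Korean Wikipedia. Sustaining collective intelligence requires new and diverse external resources, so the study suggests that multifaceted consideration is needed to help newcomers settle into the collaborative editing community, and that openness through a rotation structure is needed to break the entrenchment of the administrator group.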

Abstract

This study analyzed the collaborative process as a time series by breaking down the edit-log big data of Wikipedia Korea, a representative online collaboration community, from its early days in 2002 to 2019. Analysis elements were extracted from the document edit records, formatted in standardized XML, and analyzed using Python and R. The analysis explained how editors contribute, the characteristics of the data contents, and the trend of document creation. Active contribution by a small set of editors and loose participation by the majority were revealed. In addition, sociocultural characteristics that appear in online communities were also found in Wikipedia Korea. A new, diverse set of external resources is necessary to sustain collective intelligence. The study suggests an effort to help new editors settle into the Wikipedia community, and openness through a rotation structure to avoid the exclusiveness of the management group.
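
To make the extraction step more concrete, here is a minimal sketch of counting edits per editor and per year from an XML edit history. The sample XML, its element layout, the user names, and the function names are simplified assumptions for illustration; real dump files are far larger and use XML namespaces, and this is not the authors' code.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical, simplified edit-history snippet (no namespaces, made-up editors).
SAMPLE_XML = """
<history>
  <page>
    <title>Example</title>
    <revision><timestamp>2002-10-11T00:00:00Z</timestamp><contributor><username>alice</username></contributor></revision>
    <revision><timestamp>2003-01-05T00:00:00Z</timestamp><contributor><username>alice</username></contributor></revision>
    <revision><timestamp>2019-06-30T00:00:00Z</timestamp><contributor><username>bob</username></contributor></revision>
  </page>
  <page>
    <title>Another</title>
    <revision><timestamp>2019-07-01T00:00:00Z</timestamp><contributor><username>alice</username></contributor></revision>
  </page>
</history>
"""


def edits_per_editor(xml_text):
    """Count revisions per named contributor."""
    root = ET.fromstring(xml_text.strip())
    counts = Counter()
    for revision in root.iter("revision"):
        name = revision.findtext("contributor/username")
        if name:
            counts[name] += 1
    return counts


def edits_per_year(xml_text):
    """Count revisions per calendar year, taken from the timestamp prefix."""
    root = ET.fromstring(xml_text.strip())
    counts = Counter()
    for revision in root.iter("revision"):
        timestamp = revision.findtext("timestamp")
        if timestamp:
            counts[timestamp[:4]] += 1
    return counts


editor_counts = edits_per_editor(SAMPLE_XML)
year_counts = edits_per_year(SAMPLE_XML)
```

Sorting `editor_counts` by value would show how concentrated the edits are among a few contributors, which is the kind of pattern the abstract reports.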

5

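A Quantitative Analysis of PLoS ONE Articles by Researchers Affiliated with Korean Institutions

심원식 (Department of Library and Information Science, Sungkyunkwan University); 안병군 (Department of Library and Information Science, Graduate School, Sungkyunkwan University); 박성은 (Department of Library and Information Science, Graduate School, Sungkyunkwan University); 김현수 (Department of Library and Information Science, Graduate School, Sungkyunkwan University) 2020, Vol.37, No.2, pp.47-69 https://doi.org/10.3743/KOSIM.2020.37.2.047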

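Abstract (translated from the Korean)

This study presents a quantitative analysis of the publishing activity of researchers affiliated with Korean institutions in PLoS ONE, a multidisciplinary journal among the representative open access journals. Korean researchers published about 6,500 research articles in PLoS ONE, a representative mega journal, between 2006 and 2019, which places Korea 11th in the world by country. Most PLoS ONE articles by authors at Korean institutions are concentrated in the biomedical fields. Recently, publication in PLoS ONE has declined, and a move to competing mega journals such as Scientific Reports and BMJ Open can be detected. This change appears to have been influenced by delays in review time and a falling impact factor. Looking at the overall research output of Korean corresponding authors who published 10 or more articles in PLoS ONE, open access publications account for about 30%, which suggests substantial acceptance of open access. However, there is variation of up to 50% or more among researchers. Among the usage metrics provided by PLoS ONE, the number of saves has a high correlation coefficient with the numbers of views and citations, whereas the number of shares has relatively low correlation coefficients with views, citations, and saves. These results are significant in that they are based on concrete data about Korean researchers’ open access publishing, and a follow-up survey of researchers who published in the journal is planned to collect and analyze concrete data on their reasons for publishing open access, the review process, and related matters.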

Abstract

This research provides a quantitative analysis of research articles published in PLoS ONE, a multidisciplinary open access journal, by authors affiliated with Korean institutions. Korean authors published more than 6,500 research articles in the mega journal between 2006 and 2019, which ranks Korea 11th among countries in terms of articles published in the journal. Most articles by Korean authors are concentrated in the biomedical fields. In recent years, publication in PLoS ONE has decreased, and a shift to competing mega journals such as Scientific Reports and BMJ Open can be detected. The change might have been affected in part by delays in the review period and a falling impact factor. Among Korean corresponding authors with 10 or more PLoS ONE articles, open access publications account for about 30% of their overall research output. However, the share varies considerably among researchers, with differences of up to 50% or more. Among the usage metrics provided by PLoS ONE, saves are highly correlated with views and citations. In contrast, shares show relatively low correlation with the other usage metrics. A follow-up, questionnaire-based survey of researchers who have published in PLoS ONE is planned in order to investigate author motivation and experience of the review process.
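
The correlation finding can be illustrated with a small sketch. The abstract does not say which correlation coefficient was used, so Pearson's r is an assumption here, and the metric values are made-up numbers for illustration only, not results from the study.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Made-up per-article usage metrics (hypothetical values, four articles).
metrics = {
    "views": [100, 200, 300, 400],
    "saves": [10, 20, 30, 40],
    "shares": [5, 1, 4, 2],
}

r_views_saves = pearson(metrics["views"], metrics["saves"])
r_views_shares = pearson(metrics["views"], metrics["shares"])
```

In this toy data, saves rise in step with views while shares do not, which mirrors the direction of the reported pattern without reproducing its actual values.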

6

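A Study on the Automatic Classification of Social Science Books Using Table-of-Contents Information and a kNN Classifier

이용구 (Associate Professor, Department of Library and Information Science, Keimyung University) 2020, Vol.37, No.1, pp.1-21 https://doi.org/10.3743/KOSIM.2020.37.1.001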

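Abstract (translated from the Korean)

In this study, automatic classification using table-of-contents information was applied to 6,253 social science books from a university library’s list of new arrivals. The kNN algorithm was used as the classifier, and the divisions of DDC class 300 that the library had assigned to the books were used as the classification categories. Book titles and tables of contents were used as classification features, and the tables of contents were obtained from an Internet bookstore through an Open API. The experiments showed that table-of-contents features are good features that improve both classification recall and classification precision. With their rich features, tables of contents also mitigated the overfitting problem of imbalanced data. The experiments also showed that law and education are highly specific within the social sciences, so title features alone yield good classification performance for them.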

Abstract

This study applied automatic classification using table of contents (TOC) text to 6,253 social science books from a university library’s list of newly arrived books. The k-nearest neighbors (kNN) algorithm was used as the classifier, and the ten divisions on the second level of DDC main class 300, as assigned to the books by the library, were used as classes (labels). The features used in this study were keywords extracted from the titles and TOCs of the books. The TOCs were obtained through the Open API of an Internet bookstore. The results showed that the TOC features improved both classification recall and precision. With its rich features, the TOC was also shown to reduce the overfitting problem of imbalanced data. Law and education have high topic specificity within the social sciences, so title features alone can bring good classification performance in these fields.
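
The classification setup described here can be sketched in a few lines. This is a minimal kNN example under stated assumptions: whitespace tokenization, raw term counts, cosine similarity, and k=3 are illustrative choices that the abstract does not specify, and the training texts are invented. Only the DDC division numbers (330 economics, 340 law, 370 education) are standard.

```python
from collections import Counter
from math import sqrt


def vectorize(text):
    """Bag-of-words term counts from whitespace tokens (an assumed tokenizer)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def knn_classify(query, training, k=3):
    """Return the majority label among the k most similar training texts."""
    q = vectorize(query)
    ranked = sorted(training, key=lambda item: cosine(q, vectorize(item[0])), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Invented (title + TOC text, DDC division) pairs for illustration only.
training = [
    ("constitutional law courts and civil procedure", "340"),
    ("criminal law evidence and courts", "340"),
    ("curriculum design teaching methods schools", "370"),
    ("classroom teaching and student assessment", "370"),
    ("labor markets wages and economic growth", "330"),
]

law_label = knn_classify("introduction to criminal procedure chapter 1 courts chapter 2 evidence", training)
edu_label = knn_classify("teaching methods for the classroom", training)
```

Appending TOC text to the title simply gives each book more terms to match on, which is one plausible reading of why richer features helped the sparsely populated classes.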

7

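A Study on the Design and Implementation of a Library Chatbot for Non-Face-to-Face Reference Services

유지윤 (Administrative Officer, Seoul National University Central Library) 2020, Vol.37, No.4, pp.151-179 https://doi.org/10.3743/KOSIM.2020.37.4.151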

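Abstract (translated from the Korean)

This study designed and implemented a library chatbot for university library users in order to explore a new non-face-to-face digital reference service for the contact-free (“untact”) era. User needs and library services were analyzed through data analysis, a suitable chatbot development method was selected, and scenarios were designed. For user-friendly interaction, the chatbot’s personality and the user interface were designed, and usability was evaluated. Accuracy was verified through response accuracy and performance evaluations of the chatbot, and its usefulness was evaluated through a user satisfaction survey. To manage chatbot operation and maintain service quality, conversations between users and the chatbot were monitored periodically, and user feedback was used to improve the service. By analyzing the development process and results from multiple angles, the study aimed to present concrete measures for designing and implementing a library chatbot.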

Abstract

This study explores the potential of a library chatbot to improve non-face-to-face digital reference services for academic library users by designing and implementing such a chatbot. User needs and library services were analyzed through data analysis, an appropriate development method was selected, and scenarios were designed. For user-friendly interaction, the chatbot’s personality and user interface were designed, and usability was evaluated. In addition, accuracy was verified through response accuracy and performance evaluations of the chatbot, and the effectiveness of the chatbot was evaluated through a user satisfaction survey. To manage operation and maintain service quality, user-chatbot conversations were monitored periodically and user feedback was used to improve the service. Based on these findings, recommendations for designing and implementing a library chatbot were made to help improve library reference services.
