정보관리학회지, 한국정보관리학회

11

Scientific Data 학술지 분석을 통한 데이터 논문 현황에 관한 연구

정은경(이화여자대학교) 2019, Vol.36, No.1, pp.117-135 https://doi.org/10.3743/KOSIM.2019.36.1.117

초록보기

초록

데이터 학술지와 데이터 논문이 오픈과학 패러다임에서 데이터 공유와 재이용이라는 학술활동이 등장하여 지속적으로 성장하고 있다. 본 논문은 영향력있는 다학제적 분야의 데이터 학술지인 Scientific Data에 게제된 총 713건의 논문을 대상으로 저자, 인용, 주제분야 측면을 분석하였다. 그 결과 저자의 주된 주제 영역은 생명공학, 물리학 등으로 나타났으며, 공저자 수는 평균 12명이다. 공저 형태를 네트워크로 살펴보면, 특정 연구자 그룹이 패쇄적으로 공저활동을 수행하는 것으로 나타났다. 인용의 주제영역을 살펴보면, 데이터 논문 저자의 주제영역과 크게 다르지 않게 나타났으나, 방법론을 주로 다루는 학술지의 인용 비중이 높은 것은 데이터 논문의 특징으로 볼 수 있다. 데이터 논문 저자의 키워드를 사용하여 동시출현단어분석 네트워크로 살펴본 데이터 논문의 주제영역은 생물학이 중심이며, 구체적으로 해양생태, 암, 게놈, 데이터베이스, 기온 등의 세부 주제 영역을 확인할 수 있다. 이러한 결과는 다학제학문 분야를 다루는 데이터 학술지이지만, 데이터 학술지 출간에 관한 논의를 일찍부터 시작해온 생명공학 분야에 집중된 현상을 보여준다.

Abstract

Data journals and data papers have grown and considered an important scholarly practice in the paradigm of open science in the context of data sharing and data reuse. This study investigates a total of 713 data papers published in Scientific Data in terms of author, citation, and subject areas. The findings of the study show that the subject areas of core authors are found as the areas of Biotechnology and Physics. An average number of co-authors is 12 and the patterns of co-authorship are recognized as several closed sub-networks. In terms of citation status, the subject areas of cited publications are highly similar to the areas of data paper authors. However, the citation analysis indicates that there are considerable citations on the journals specialized on methodology. The network with authors’ keywords identifies more detailed areas such as marine ecology, cancer, genome, database, and temperature. This result indicates that biology oriented-subjects are primary areas in the journal although Scientific Data is categorized in multidisciplinary science in Web of Science database.

12

사회과학 분야 연구자의 데이터요구와 데이터 재이용 행위에 관한 연구

김나연(이화여자대학교 일반대학원 문헌정보학과 석사) ; 정은경(이화여자대학교 문헌정보학과 교수) 2020, Vol.37, No.4, pp.1-26 https://doi.org/10.3743/KOSIM.2020.37.4.001

초록보기

초록

오늘날 점차 데이터 집약적으로 변모하는 학문 환경 속에서 데이터는 연구부산물이 아닌 연구성과물로써 학술 커뮤니케이션의 기반으로 자리 잡아가고 있다. 그러나 데이터 공급의 확대나 접근가능성의 확보만으로는 실제적인 데이터 재이용을 담보하는 데 한계가 있다. 이를 극복하기 위해서는 학술연구자의 데이터 재이용 행위와 데이터요구를 심층적으로 파악할 필요성이 있다. 따라서 본 연구는 연구자의 주요 데이터 재이용 행위와 데이터요구를 규명하고자 하였다. 이를 위해 한국사회과학자료원(KOSSDA)의 최근 3개년 데이터 재이용문헌 중 KCI 등재 논문의 저자를 연구대상으로 선정하고, 인터뷰를 수락한 연구자 12명과의 심층면담을 수행하였다. 심층면담 분석결과, 데이터를 재이용하는 요인은 개인적, 경제적, 기술적, 사회적 측면 모두에서 나타났으며, 데이터 재이용 목적에 따라 데이터 그 자체를 이용하거나 데이터가 지닌 맥락정보를 활용하였다. 웹 기반의 정보원으로부터 데이터를 주로 습득하였으나 비공식적인 커뮤니케이션을 통해 파악하는 경우도 있었다. 한편 데이터 재이용 시에 발생하는 학술연구자의 데이터요구를 살펴보면 생산 단위는 기관을, 언어는 영어를, 국가로는 미국을 선호하였다. 또한 조사원 기입식 대인면접 조사 방식으로 수집된 양적 데이터를 우선시하였다. 메타데이터와 식별정보를 충분히 포함한 원자료 수준의 데이터를 긍정적으로 인식하였으나, 접근 및 이용이 통제된 데이터는 데이터가 지닌 가치에 대한 확신을 갖기 어려워 부정적으로 받아들였다. 그러나 데이터의 규모나 최신성과 관련된 선호는 뚜렷하게 나타나지 않았는데 이는 선택 가능한 유사 데이터가 부재하였기 때문이었다.

Abstract

In today’s increasingly data-intensive academic environment, data is becoming the foundation of academic communication as a research outcome rather than a research by-product. However, there is a limit to guaranteeing actual data reuse only by expanding the data supply or securing accessibility. In order to overcome this, it is necessary to understand the data reuse behavior and data needs in-depth. Therefore, this study attempted to identify the major data reuse behavior and data needs among researchers. To this end, the authors of KCI papers among the data reuse documents of the Korea Social Science Data Archive (KOSSDA) for the past 3 years were targeted. An in-depth interview was conducted with 12 researchers who accepted the interview. As a result, factors considered when reusing data were personal, economic, technical, and social aspects, and it was found that the data itself was used or contextual information of the data was used depending on the purpose of data reuse. The path to acquiring data is a web-based source of information, and a path through informal communication can also be found. In terms of the data needs, it was found that they prefer English, the United States, and institutional producers. Also they have a clear preference for quantitative data from an interviewer-filled interpersonal interview survey method, rich metadata along with raw data, and data that contains identification information. However, due to the lack of confidence in the value, it is negative for the use of data with controlled access and use, and it is difficult to confirm a clear preference because there is no similar data available for selection in terms of size and freshness.

13

내용기반 음악검색 시스템의 비교 분석

노정순(한남대학교) 2013, Vol.30, No.3, pp.23-48 https://doi.org/10.3743/KOSIM.2013.30.3.023

초록보기

초록

본 연구는 웹에서 접근 가능한 내용기반 음악검색(CBMR) 시스템들을 조사하여, 탐색질의의 종류, 접근점, 입출력, 탐색기능, 데이터베이스 성격과 크기 등의 관점에서 특성을 비교 분석하고자 하였다. 비교 분석에 사용된 특성을 추출하기 위해 내용기반 음악정보의 특성과 시스템 구축에 필요한 파일의 변환, 멜로디 추출 및 분할, 색인자질 추출과 색인, 매칭에 사용되는 기술들을 선행연구로 리뷰하였다. 15개의 시스템을 분석한 결과 다음과 같은 특성과 문제점이 분석되었다. 첫째, 도치색인, N-gram 색인, 불리언 탐색, 용어절단검색, 키워드 및 어구 탐색, 음길이 정규화, 필터링, 브라우징, 편집거리, 정렬과 같은 텍스트 정보 검색 기법이 CBMR에서도 검색성능을 향상시키는 도구로 사용되고 있었다. 둘째, 시스템들은 웹에서 크롤링하거나 탐색질의를 DB에 추가하는 등으로 DB의 성장과 실용성을 위한 노력을 하고 있었다. 셋째, 개선되어야 할 문제점으로 선율이나 주선율을 추출하는데 부정확성, 색인자질을 추출할 때 사용되는 불용음(stop notes)을 탐색질의에서도 자동 제거할 필요성, 옥타브를 무시한 solfege 검색의 문제점 등이 분석되었다.

Abstract

This study compared and analyzed 15 CBMR (Content-based Music Retrieval) systems accessible on the web in terms of DB size and type, query type, access point, input and output type, and search functions, with reviewing features of music information and techniques used for transforming or transcribing of music sources, extracting and segmenting melodies, extracting and indexing features of music, and matching algorithms for CBMR systems. Application of text information retrieval techniques such as inverted indexing, N-gram indexing, Boolean search, truncation, keyword and phrase search, normalization, filtering, browsing, exact matching, similarity measure using edit distance, sorting, etc. to enhancing the CBMR; effort for increasing DB size and usability; and problems in extracting melodies, deleting stop notes in queries, and using solfege as pitch information were found as the results of analysis.

14

음향학적 자질을 활용한 비디오 스피치 요약의 자동 추출과 표현에 관한 연구

김현희(명지대학교) 2012, Vol.29, No.4, pp.191-208 https://doi.org/10.3743/KOSIM.2012.29.4.191

초록보기

초록

스피치 요약을 생성하는데 있어서 두 가지 중요한 측면은 스피치에서 핵심 내용을 추출하는 것과 추출한 내용을 효과적으로 표현하는 것이다. 본 연구는 강의 자료의 스피치 요약의 자동 생성을 위해서 스피치 자막이 없는 경우에도 적용할 수 있는 스피치의 음향학적 자질 즉, 스피치의 속도, 피치(소리의 높낮이) 및 강도(소리의 세기)의 세 가지 요인을 이용하여 스피치 요약을 생성할 수 있는지 분석하고, 이 중 가장 효율적으로 이용할 수 있는 요인이 무엇인지 조사하였다. 조사 결과, 강도(최대값 dB과 최소값 dB간의 차이)가 가장 효율적인 요인으로 확인되었다. 이러한 강도를 이용한 방식의 효율성과 특성을 조사하기 위해서 이 방식과 본문 키워드 방식간의 차이를 요약문의 품질 측면에서 분석하고, 이 두 방식에 의해서 각 세그먼트(문장)에 할당된 가중치간의 관계를 분석해 보았다. 그런 다음 추출된 스피치의 핵심 세그먼트를 오디오 또는 텍스트 형태로 표현했을 때 어떤 특성이 있는지 이용자 관점에서 분석해 봄으로써 음향학적 특성을 이용한 스피치 요약을 효율적으로 추출하여 표현하는 방안을 제안하였다.

Abstract

Two fundamental aspects of speech summary generation are the extraction of key speech content and the style of presentation of the extracted speech synopses. We first investigated whether acoustic features (speaking rate, pitch pattern, and intensity) are equally important and, if not, which one can be effectively modeled to compute the significance of segments for lecture summarization. As a result, we found that the intensity (that is, difference between max DB and min DB) is the most efficient factor for speech summarization. We evaluated the intensity-based method of using the difference between max-DB and min-DB by comparing it to the keyword-based method in terms of which method produces better speech summaries and of how similar weight values assigned to segments by two methods are. Then, we investigated the way to present speech summaries to the viewers. As such, for speech summarization, we suggested how to extract key segments from a speech video efficiently using acoustic features and then present the extracted segments to the viewers.

15

이용자 중심의 주제어 기반 분류를 위한 주제명 개발에 관한 연구: 지식조직체계 분석을 바탕으로

백지원(이화여자대학교) 2011, Vol.28, No.1, pp.171-193 https://doi.org/10.3743/KOSIM.2011.28.1.171

초록보기

초록

본 연구는 도서관 장서의 분류를 위하여 기존의 문헌 분류체계 대신 주제어 기반의 분류를 적용하고자 할 때 필수적인 주제명 개발의 필요성을 논하고, 개발 방법론의 하나로 기존의 다양한 지식조직체계의 주제어를 활용하는 방법의 가능성을 모색하는데 목적이 있다. 이를 위하여 분석 대상 저작을 선정하고 이에 대하여 부여된 문헌분류, 주제명표목, 국내외 대형 서점의 분류, 서가명 및 주제어, 이용자 태그 등 다양한 지식조직체계의 주제어를 수집하여 그 특성을 비교 분석하였다. 이러한 분석의 결과, 전통적인 도서관 중심의 지식조직체계와 상업성이 중심이 되는 지식조직체계의 성격과 범주화의 방식이 다름을 확인할 수 있었다. 한편, 이용자 태그는 최상위 빈도수의 태그인 경우 전통적인 지식조직체계 및 상업적 영역의 지식조직체계와 어휘의 측면에서 거의 차이가 없는 결과를 나타냈으나, 이용자 중심의 주제어로서 독특한 특성을 가지고 있음을 파악하였다. 이러한 분석을 바탕으로 분류를 대체하는 주제명 작성을 위해 기존의 지식조직체계를 활용할 때 고려해야 할 각각의 특성 및 상호 관계를 분석하였고, 국내에서의 적용을 위한 실질적인 고려사항을 제안하였다.

Abstract

This study aims to analyse the necessity of the subject heading construction for the word based classification and to suggest a methodology that uses various knowledge organization systems(KOS). For this purpose, six kinds of KOS were collected for the 20 selected works in each subject. The collected subjects were analysed in terms of constructing a subject heading for the word based classification. The result of the analysis shows that there is a noticeable difference between the library oriented KOS and commercial oriented KOS. In addition, user oriented tags are more similar to the commercial sector's concerning subject categorization than the library oriented ones. However, there is no noticeable difference among the library oriented KOS, commercial sector oriented KOS, and user oriented tags regarding the subject vocabulary. Some practical implications were suggested for the application to the Korean libraries based on the findings of this study.

16

여대생의 인터넷 생식건강정보 탐색에 영향을 미치는 요인 연구

윤현수(성균관대학교 문헌정보학과 박사과정) ; 오상희(성균관대학교 문헌정보학과 부교수) ; 이영미(성균관대학교 문헌정보학과 석사과정) 2024, Vol.41, No.1, pp.389-409 https://doi.org/10.3743/KOSIM.2024.41.1.389

초록보기

초록

본 연구의 목적은 여대생들의 생식건강정보 탐색행위에 영향을 미치는 요인을 살펴보고 그 관계성을 살펴보는 것이다. 건강신념모델(HBM)과 계획된행동이론(TPB)을 기반으로 지각된 민감성, 지각된 심각성, 지각된 이익, 지각된 장애, 주관적 규범, 지각된 행동통제, 감정적 평가를 주요 요인으로 정의하고 연구를 설계하였다. 대학생 온라인 커뮤니티인 ‘에브리타임’을 통해 서울 소재 4년제 대학교의 여대생을 대상으로 온라인 설문을 실시하여 데이터를 수집하였다. 연구결과, 여대생들은 지각된 민감성, 지각된 이익, 주관적 규범이 높을수록, 반면에 지각된 장애는 낮을수록, 인터넷을 통해 생식건강정보를 탐색할 의도가 높은 것으로 나타났다. 또한 여대생들의 인터넷 생식건강 탐색에 영향을 미치는 요인들은 여대생들의 성경험 유무, 생식기계 질환 경험 유무, 건강관심도 등에 따른 그룹 간의 차이를 보이기도 했다. 본 연구결과는 여대생들을 대상으로 하는 대학도서관이나 보건기관 등이 온라인 건강정보 문해교육이나 관련 서비스 프로그램을 개발하는데 있어 여대생들의 생식건강 인식 정도를 파악하는데 기여할 수 있을 것으로 기대한다.

Abstract

The purpose of this study is to identify the factors affecting female college students’ behaviors in seeking reproductive health information on the Internet and to explore the relationships among these factors. Based on the Health Belief Model(HBM) and the Theory of Planned Behavior(TPB), perceived sensitivity, perceived severity, perceived benefit, perceived barriers, subjective norms, perceived behavioral control, and affective evaluation were defined as key factors, and the study was designed accordingly. An online survey was distributed to female college students in Seoul through the university student’s online community, ‘Everytime.’ The results showed that the intention of female college students to seek reproductive health information via the Internet was associated with higher perceived sensitivity, perceived benefit, and subjective norms, and lower perceived barriers. There were statistically significant differences between groups in terms of sexual experiences, experience with reproductive system disorders, and the level of health interest. We believe that this research outcome will contribute to assessing the level of awareness regarding reproductive health among female college students, thereby aiding in the development of online health information literacy education or related service programs by university libraries, health institutions, and similar entities targeting female college students.

17

차세대디지털도서관서비스에 대한 Y세대 이용자의 요구분석 연구

노영희(건국대학교) 2014, Vol.31, No.3, pp.29-63 https://doi.org/10.3743/KOSIM.2014.31.3.029

초록보기

초록

본 연구에서는 Y세대의 특징을 밝히고 Y세대가 요구하는 차세대디지털도서관서비스를 도출하고자 하였으며, 이들의 요구가 베이비붐세대와 어느 정도 차이를 보이는지를 비교하고자 하였다. 연구결과, 첫째, Y세대가 가장 많이 이용하는 디지털기기는 휴대폰 또는 스마트폰으로 나타났고, 다음으로 데스크탑 PC, 노트북 PC, 디지털 카메라 순으로 나타났으며, 사용비율에 있어서 약간의 차이는 있지만 그 순위는 베이비붐세대와 거의 유사하게 나타났다. 둘째, 이용하는 디지털서비스에 있어서 Y세대와 베이비붐세대는 상당한 차이를 보이고 있는 것으로 분석되었으며, Y세대는 인터넷 포털을 가장 많이 이용하고 베이비붐세대는 이메일서비스를 가장 많이 이용하는 것으로 나타났다. 셋째, Y세대와 베이비붐세대가 차세대디지털도서관에 요구하는 서비스를 클라우드서비스, 무한창조공간, 빅데이터, 증강현실, 구글글래스, 상황인식기술, 시맨틱서비스, SNS서비스, 디지털교과서서비스, RFID 및 QRCode 서비스, 도서관공간구성, 최첨단디스플레이기술, 기타 획기적인 서비스로 구분하여 조사한 결과, Y세대가 가장 높은 요구도를 보인 서비스는 빅데이터서비스였고, 베이비붐세대는 디지털교과서서비스였다.

Abstract

This study attempted to reveal the characteristics of the Y generation, to derive the services of the next generation digital library, and to compare differences between the demands of the baby boom generation and the Y generation to some extent. As a result, first, it is shown that the digital device the Y generation uses the most, was a cell phone or smartphone, followed by desktop PC, notebook PC, and digital camera. Although there were some differences, the Y generation’s use ratio of digital devices was substantially similar to the baby boomers’. Second, there was a significant difference between the Y generation and baby boom generation in terms of using digital services. While the Y generation used internet portals the most, the baby boom generation used e-mail service the most. Third, we surveyed the services which the Y generation and baby boom generation require for the next generation digital libraries, by grouping as follows: the cloud service, infinite creative space (maker space), big data, augmented reality, Google Glass, context-aware technologies, semantic services, SNS service, digital textbook service, RFID and QRCode service, library space configuration, a state-of-the-art display technology, and other innovative services. While the most demanded service by the Y generation was big data service, the baby boom generation most demanded digital textbook service.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지