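Journal of the Korean Society for Information Management (정보관리학회지), Korean Society for Information Management (한국정보관리학회)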

31

심지영(연세대학교 대학도서관발전연구소) 2022, Vol.39, No.4, pp.347-373 https://doi.org/10.3743/KOSIM.2022.39.4.347

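Abstract (translated from the Korean)

To expand the access points for reading materials, this study devised a classification scheme for reading materials based on attributes of book use. The attributes of books that users may consider in a reading situation were identified through content analysis and reflected in subject headings. Through network analysis, the headings adjacent to each subject heading were grouped and presented alongside it as related subject headings. The Reading Material Classification (RMC) developed in this study can serve as a tool that provides diverse access points to support users' searching within reading information systems, including library OPACs.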

Abstract

To expand the access points for reading materials, this study devised a reading material classification (RMC) system based on the facets of book use. The facets of books that users may consider in a reading situation were content-analyzed and reflected in subject headings. In addition, through network analysis, the subject headings adjacent to each subject heading were grouped as related subject headings. The RMC developed in this study can be used as a tool that provides various access points to help book users search in library OPACs and other reading information systems.
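The grouping of adjacent headings into related subject headings can be sketched as follows. This is a minimal illustration only: the heading names, the edge weights, and the `min_weight` cutoff are all hypothetical, and the abstract does not state how adjacency was actually thresholded.

```python
from collections import defaultdict

# Hypothetical co-occurrence weights between subject headings (illustrative only).
edges = [
    ("healing", "essay", 5),
    ("healing", "psychology", 3),
    ("essay", "travel", 1),
    ("psychology", "self-help", 4),
]

def related_headings(edges, min_weight=2):
    """Map each heading to its adjacent headings, strongest link first."""
    neighbors = defaultdict(list)
    for a, b, weight in edges:
        if weight >= min_weight:  # assumed cutoff: ignore weak links
            neighbors[a].append((weight, b))
            neighbors[b].append((weight, a))
    return {
        heading: [name for _, name in sorted(pairs, key=lambda p: (-p[0], p[1]))]
        for heading, pairs in neighbors.items()
    }

related = related_headings(edges)
print(related["healing"])
```

Each heading would then be displayed together with its list of related headings, which is the kind of extra access point the abstract describes.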

32

Constructing Research Fronts Using a Factor Graph Model: Based on Biomedical Literature

김혜진(연세대학교) ; 송민(연세대학교) 2017, Vol.34, No.1, pp.177-195 https://doi.org/10.3743/KOSIM.2017.34.1.177

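Abstract (translated from the Korean)

A research front is a research area in which citations among papers occur frequently and development is ongoing. Early prediction of research fronts that are likely to grow into core fields where research activity concentrates is a useful social resource that can greatly benefit academia, industry, government agencies, and national science and technology development. This study proposes a model that infers research fronts using heterogeneous features. Documents were represented with heterogeneous features so that documents likely to develop into core research areas could be included, and the model learned from those features to predict the likelihood that newly published documents belong to a research front. Documents were represented with a heterogeneous feature set comprising bibliographic, network, and content features, and a probabilistic factor graph model was applied to infer documents likely to be highly cited. The extracted features were represented as variables of the factor graph, and research fronts were inferred by applying the sum-product algorithm and the junction tree algorithm. The research fronts inferred and constructed with the factor graph model differed markedly from the baseline research fronts constructed with a bibliographic coupling strength of 4 or higher. The factor-graph-based research front group showed stronger direct connections among documents than the bibliographic-coupling-based group, and also stronger betweenness, that is, a stronger tendency to bridge two documents that are not otherwise connected.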

Abstract

This study attempts to infer research fronts (RFs) using a factor graph (FG) model based on heterogeneous features. The proposed model infers research fronts containing documents with the potential to be cited many times in the future. To this end, the documents are represented by bibliographic, network, and content features. Bibliographic features contain bibliographic information such as the number of authors, the number of institutions to which the authors belong, proceedings, the number of keywords the authors provide, funds, the number of references, the number of pages, and the journal impact factor. Network features include degree centrality, betweenness, and closeness within the document network. Content features include keywords extracted from the title and abstract using keyphrase extraction techniques. The model learns these features of a publication and infers whether the document would belong to an RF, using the sum-product algorithm and the junction tree algorithm on a factor graph. We experimentally demonstrate that the FG model predicted more densely connected documents than RFs constructed using a traditional bibliometric approach. Our results also indicate that FG-predicted documents exhibit stronger degree centrality and betweenness among RFs.
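To make the sum-product step concrete, here is a toy sketch on a two-variable factor graph. It is not the authors' model: the two binary variables, the potentials `phi1`, `phi2`, and `psi`, and their numeric values are assumptions chosen only to show how a message is passed and then checked against brute-force enumeration.

```python
from itertools import product

# Hypothetical potentials. Index 0 = "not in a research front", 1 = "in one".
phi1 = [0.6, 0.4]               # unary factor on document y1 (stands in for its features)
phi2 = [0.3, 0.7]               # unary factor on document y2
psi = [[0.8, 0.2], [0.2, 0.8]]  # pairwise factor: linked documents tend to agree

def marginal_y1_sum_product():
    """Marginal of y1 from one message passed y2 -> pairwise factor -> y1."""
    msg = [sum(psi[a][b] * phi2[b] for b in (0, 1)) for a in (0, 1)]
    belief = [phi1[a] * msg[a] for a in (0, 1)]
    z = sum(belief)
    return [v / z for v in belief]

def marginal_y1_brute_force():
    """Same marginal by enumerating the full joint distribution."""
    joint = {
        (a, b): phi1[a] * psi[a][b] * phi2[b]
        for a, b in product((0, 1), repeat=2)
    }
    z = sum(joint.values())
    return [sum(p for (a, _), p in joint.items() if a == v) / z for v in (0, 1)]

sp = marginal_y1_sum_product()
bf = marginal_y1_brute_force()
print(sp, bf)
```

On a tree-shaped graph like this one, message passing and enumeration agree exactly. Graphs with cycles are where a junction tree construction becomes relevant.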

33

A Study on the Characteristics and Intellectual Structure of Research Data in Sociology

최형욱(이화여자대학교 일반대학원 문헌정보학과) ; 정은경(이화여자대학교) 2017, Vol.34, No.3, pp.109-124 https://doi.org/10.3743/KOSIM.2017.34.3.109

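Abstract (translated from the Korean)

Interest in sharing and reusing data is growing across many disciplines. The practice of reusing other researchers' data in new research and crediting it through citation is gradually taking hold. Reflecting this change, Thomson Reuters began offering the Data Citation Index (DCI), a data citation index database service, in 2012. The DCI tallies data citations across all disciplines in a manner similar to citations of journal articles. This study analyzed cited research data in sociology, a field in which data citation is active, to identify the characteristics and intellectual structure of the field. To this end, the results were compared with the intellectual structure of sociology based on article citations, and the characteristics and distinctive intellectual structure of research data in sociology were examined. Two data sets were collected. First, a topic search for ‘Sociology’ in the DCI yielded a total of 8,365 cited data records. Second, for comparison with article citation analysis, a topic search for ‘Sociology’ in Web of Science yielded a total of 12,132 records. Co-word analysis of author keywords on the two data sets showed that data-based sociology consisted of 2 areas and 15 clusters, whereas article-based sociology consisted of 3 areas and 17 clusters. In terms of content, unlike article-based sociology, which can be seen as representing the traditional intellectual structure of the field, research data in sociology showed active links with medicine, with public health and psychology as the central areas.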

Abstract

Across a wide variety of disciplines, data access and reuse have increased recently. Researchers increasingly use data sets produced by other researchers and give scholarly credit through citation. In response to this practice, Thomson Reuters launched the Data Citation Index (DCI) in 2012. With the DCI, citations to research data published by researchers are collected and analyzed in a way similar to citations to journal articles. The purpose of this study is to identify, based on research data, the characteristics and intellectual structure of sociology, one of the fields that actively cite data. To accomplish this purpose, two data sets were collected and analyzed. First, a total of 8,365 cited data records in the field of sociology were collected from the DCI. Second, a total of 12,132 records were collected from Web of Science through a topic search for ‘Sociology’. Co-word analysis of author-provided keywords for both data sets showed that the intellectual structure of research-data-based sociology was composed of two areas and 15 clusters, and that of article-based sociology of three areas and 17 clusters. More importantly, the medical sciences were found to be actively studied in research-data-based sociology, and public health and psychology were identified as the central areas in data citation.
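The first step of a co-word analysis, counting how often two author keywords appear on the same record, can be sketched as below. The three records and their keywords are invented for illustration. The clustering into areas and clusters reported in the abstract is a later step that is not shown.

```python
from collections import Counter
from itertools import combinations

# Hypothetical author-keyword lists, one list per record (illustrative only).
records = [
    ["Public Health", "survey", "aging"],
    ["public health", "aging"],
    ["survey", "inequality"],
]

def coword_counts(records):
    """Count unordered keyword pairs that co-occur within a record."""
    counts = Counter()
    for keywords in records:
        # Normalize case and drop duplicates so each pair counts once per record.
        unique = sorted({k.lower() for k in keywords})
        counts.update(combinations(unique, 2))
    return counts

pairs = coword_counts(records)
print(pairs.most_common(2))
```

Sorting the keywords before pairing keeps each pair in one canonical order, so `("aging", "public health")` is never split across two keys.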

34

Design and Implementation of an Application Profile Core Ontology

한성국(원광대학교) ; 이현실(원광대학교) 2007, Vol.24, No.3, pp.245-269 https://doi.org/10.3743/KOSIM.2007.24.3.245

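Abstract (translated from the Korean)

In ubiquitous information environments, standard metadata schemes are used to describe the structure and content of information resources so that they can be shared and exchanged. In real application domains, application systems are built by reusing many metadata elements in a mix-and-match manner, which raises problems such as the refinement of metadata elements and interoperability. The application profile approach is used to resolve these problems. This paper presents an Application Profile Core Ontology that can fulfil the purposes and functions of an application profile, and describes the construction of a metadata application system based on it.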

Abstract

Standard metadata systems are widely used to describe the structure and semantics of information resources so that information can be shared and exchanged in a global ubiquitous environment. In real application domains, various metadata elements are reused together in a mix-and-match manner. An application system that uses diverse metadata systems must therefore deal with the refinement and interoperability of metadata elements. The application profile is the general approach to resolving the problems that occur in metadata application systems. This paper proposes an Application Profile Core Ontology (APO) that can achieve the goals and functions of an application profile, and describes a metadata application system based on APO.

35

An Analysis of the Attributes of Reading Materials Based on Book Use Data

심지영(연세대학교 대학도서관발전연구소) 2023, Vol.40, No.4, pp.279-306 https://doi.org/10.3743/KOSIM.2023.40.4.279

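Abstract (translated from the Korean)

To understand the attributes of reading materials, in which user needs from diverse perspectives are intermingled, this study analyzed the bibliographic attributes related to the selection and use of reading materials, based on book co-use (co-borrowing and co-purchase) data. Co-occurrence matrices of bibliographic attribute terms were generated for 26 sub-attribute units related to KDC subject, target readership, and user age, and network analysis was performed. This identified the detailed content of the bibliographic attributes of reading materials and the attributes that play a prominent mediating role. The results will help in designing facets for reading information systems, including library OPACs.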

Abstract

To understand the properties of reading materials, which reflect complex user needs from various perspectives, this study analyzed bibliographic attributes related to the selection and use of reading materials, based on data on books borrowed or purchased together. Co-occurrence matrices of bibliographic attribute terms were created for 26 sub-attribute units related to KDC main class, target reader, and user age, and network analyses were performed. As a result, the details and the prominent mediating roles of the bibliographic attributes of reading materials were identified. The results of this study will be helpful in designing facets of reading information systems, including library OPACs.
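One common way to quantify a "mediating role" in a network is betweenness centrality. The abstract does not say which measure the study used, so the sketch below is an assumption: it applies Brandes' algorithm to a small unweighted, undirected graph whose attribute terms are invented for illustration.

```python
from collections import deque

# Hypothetical attribute-term network (undirected adjacency sets, illustrative only).
adj = {
    "children": {"literature"},
    "literature": {"children", "history"},
    "history": {"literature", "science"},
    "science": {"history"},
}

def betweenness(adj):
    """Brandes' algorithm for an unweighted, undirected graph."""
    score = {v: 0.0 for v in adj}
    for source in adj:
        order = []                         # nodes in order of discovery
        preds = {v: [] for v in adj}       # predecessors on shortest paths
        sigma = {v: 0 for v in adj}        # number of shortest paths from source
        dist = {v: -1 for v in adj}
        sigma[source], dist[source] = 1, 0
        queue = deque([source])
        while queue:                       # breadth-first search
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):          # accumulate dependencies backwards
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != source:
                score[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: s / 2 for v, s in score.items()}

centrality = betweenness(adj)
print(centrality)
```

Terms with high scores lie on many shortest paths between other terms, which is one reading of a term that mediates between otherwise separate groups of attributes.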

36

A Content Analysis of Research Data Management Training Programs at North American University Libraries: Focusing on Data Literacy Competencies

김지현(이화여자대학교) 2018, Vol.35, No.4, pp.7-36 https://doi.org/10.3743/KOSIM.2018.35.4.007

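Abstract (translated from the Korean)

This study analyzed the content of research data management training programs offered by 51 of the 121 North American university libraries that provide research data management services, based on 12 data literacy competencies, and drew implications from the results. For the content analysis, 317 titles of in-person training programs and 42 top-level table-of-contents headings of online tutorials were collected and coded according to the 12 data literacy competencies proposed in previous studies. Among in-person training programs, those on the data processing and analysis competency were the most numerous, while the largest number of institutions offered training on the data management and organization competency. Data visualization and representation was the third most frequently covered competency among in-person programs. However, very few programs addressed the remaining nine competencies, indicating that program content is concentrated on particular competencies. Five institutions offered self-developed online tutorials without in-person training. Analysis of their headings showed that they focused on data preservation, ethics and data citation, and data management and organization, which differs from the competencies emphasized in in-person programs. Operating effective research data management training programs requires understanding and support not only for the competencies that university librarians have traditionally taught and emphasized, but also for the data literacy competencies that researchers need to produce research results, such as data processing and analysis and data visualization and representation. Educational resources that support the continuing education of librarians involved in research data management services also need to be developed.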

Abstract

This study aimed to analyze the content of Research Data Management (RDM) training programs provided by 51 out of 121 university libraries in North America that implemented RDM services, and to draw implications from the results. For the content analysis, 317 titles of classroom training programs and 42 top-level headings from the tables of contents of online tutorials were collected and coded based on 12 data literacy competencies identified in previous studies. Among classroom training programs, those on the data processing and analysis competency were offered the most. The largest number of libraries provided classroom training programs on the data management and organization competency. The data visualization and representation competency was the third most frequently covered in classroom training programs. However, each of the remaining nine competencies was covered by only a few classroom training programs, which implies that classroom training focused on particular data literacy competencies. Five university libraries developed and provided their own online tutorials without classroom training. The analysis of the headings showed that these tutorials mainly covered data preservation, ethics and data citation, and data management and organization, which differs from the competencies stressed by the classroom training programs. For effective RDM training programs, it is necessary to understand and support education in the data literacy competencies that researchers need to produce research results, in addition to the competencies that university librarians have traditionally taught and emphasized. Educational resources that support continuing education for the librarians involved in RDM services also need to be developed.

37

A Review of Network Analysis Papers: Focusing on Bibliometric Analysis and Content Analysis

정은경(이화여자대학교 문헌정보학과) 2021, Vol.38, No.1, pp.169-190 https://doi.org/10.3743/KOSIM.2021.38.1.169

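Abstract (translated from the Korean)

Research using network analysis techniques is being conducted in a wide range of disciplines. This study performed bibliometric analysis and content analysis on a total of 2,187 network analysis papers published in Korean journals from 2003 to 2021. In terms of paper production, education, interdisciplinary research, computer science, library and information science, public administration, and business administration were dominant. At the journal level, mega-journals were prominent. In terms of citation-based impact, however, public administration, library and information science, and education stood out clearly. The author-level analysis likewise showed the dominance of journalism and communication studies, public administration, and library and information science. Of the 1,537 authors identified, only a very small number were actively publishing, which points to the need to broaden the researcher base. The content analysis showed that the most common network form was a weighted, undirected network built from papers as the data set. Nodes were typically words and links were co-occurrences, and KrKwic, UCINET, NetMiner, and NetDraw were the most prominently used analysis tools.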

Abstract

Research using network analysis techniques has been conducted in a growing number of academic fields. This study performed bibliometric analysis and content analysis on a total of 2,187 network analysis papers published in Korean journals from 2003 to 2021. The results showed that Pedagogy, Interdisciplinary Research, Computer Science, Library and Information Science, Public Administration, and Business Administration produced the largest numbers of research papers. At the journal level, mega-journals were the most productive. However, in terms of citation-based impact, the strength of Public Administration, Library and Information Science, and Pedagogy is clearly revealed. The author-level analysis also confirms the higher impact of Journalism, Public Administration, and Library and Information Science. Of the 1,537 authors identified, very few are active in research, confirming the need to expand the researcher base. The content analysis showed that the most common network type was a weighted, undirected network using research papers as the data set. Nodes are generally words and links are co-occurrence relationships. For network analysis, the use of KrKwic, UCINET, NetMiner, and NetDraw is the most prominent.
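The most common network form reported here, a weighted, undirected network with words as nodes and co-occurrence as links, can be sketched in plain Python. The word pairs and weights are hypothetical, and the function names `build_network`, `degree_centrality`, and `strength` are illustrative, not taken from any of the tools named above.

```python
from collections import defaultdict

# Hypothetical word co-occurrence counts (illustrative only).
cooccurrence = {
    ("network", "centrality"): 3,
    ("network", "keyword"): 2,
    ("keyword", "library"): 1,
}

def build_network(pair_weights):
    """Store each link in both directions so the network is undirected."""
    graph = defaultdict(dict)
    for (a, b), weight in pair_weights.items():
        graph[a][b] = graph[a].get(b, 0) + weight
        graph[b][a] = graph[b].get(a, 0) + weight
    return dict(graph)

def degree_centrality(graph):
    """Share of the other nodes that each node is directly linked to."""
    n = len(graph)
    if n < 2:
        return {node: 0.0 for node in graph}
    return {node: len(nbrs) / (n - 1) for node, nbrs in graph.items()}

def strength(graph):
    """Weighted degree: the sum of link weights at each node."""
    return {node: sum(nbrs.values()) for node, nbrs in graph.items()}

graph = build_network(cooccurrence)
print(degree_centrality(graph))
print(strength(graph))
```

Degree centrality ignores the weights while strength uses them, so reporting both shows whether a word is connected to many words or strongly connected to a few.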

38

A Study on Building an Identifier-Based Distribution System for Digital Content

석중호 2003, Vol.20, No.4, pp.195-210 https://doi.org/10.3743/KOSIM.2003.20.4.195

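Abstract (translated from the Korean)

With the recent rapid development of information technology and the internet, knowledge and information resources are being digitized and distributed over the internet. However, changes in the location and content of digital content cause problems for user access and use, so research is needed on identifying and distributing digital content using a standard identification system. The purpose of this study is to contribute to building an identifier-based information distribution system for the efficient management and secure distribution of digital content. To that end, the study surveys an overview of identification systems, application cases, and distribution models, analyzes the current state of information distribution, and proposes plans for building the KISTI unique identification system, an identification system, and a distribution system.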

Abstract

With the rapid development of information technology and the internet, knowledge and information resources have been digitized and distributed on the internet. However, changes in the location and content of digital content have caused problems for user access and services. Research on identifying and distributing digital content using a standardized identification system is therefore necessary. This study intends to contribute to the implementation of an information system based on a standard digital identifier for the effective management and safe distribution of digital content. The study first surveys an overview of SDI, practical application cases, and distribution business models, and analyzes the status of information distribution. Finally, it proposes a plan for establishing KISTI's SDI, content identification system, and content distribution system.

39

A Study on a Standard Model for Electronic Document Archiving

이원영(국회기록보존소) ; 강진영(한국정보보호진흥원) 2005, Vol.22, No.2, pp.147-164 https://doi.org/10.3743/KOSIM.2005.22.2.147

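Abstract (translated from the Korean)

The Act on the Management of Records by Public Institutions (공공기관의기록물관리에관한법률) and its Enforcement Decree, as revised in 2003, specify obligations to produce and preserve electronic documents, but legal provisions and standards concerning long-term preservation are minimal and need reinforcement. This study therefore aims to lay a foundation for preserving electronic documents by providing standard elements for their long-term preservation. To establish a management strategy, the study took the extraction of long-term preservation elements at the point of creation as its basis. For the long-term preservation of electronic documents in the current and semi-current stages it analyzed the management elements of ISO 15489, and for the archive stage it analyzed the ISO 14721 OAIS (Open Archival Information System) reference model. On that basis it proposed legislation that reflects long-term preservation functions and an improved system environment.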

Abstract

Requirements concerning the production and storage of electronic documents are stipulated in the act on document management of public institutions, revised in 2003. However, provisions and standards for the long-term preservation of electronic documents are insufficient and need strengthening. This study aims to provide standard factors for the long-term preservation of electronic documents and thus lay a foundation for it. To establish a management strategy, the ISO 15489 management factors are analyzed as a framework for the long-term preservation of electronic records from the production stage. Preservation description information is derived from ISO 14721, the OAIS reference model for archival institutions. Through this case study, standard registry factors reflecting ISO 15489 and ISO 14721 are suggested in an attempt to improve the act and the system environment for long-term preservation and archiving.

40

An Experimental Study on Automatic Summarization of Multiple News Articles

김용광(연세대학교) ; 정영미(연세대학교) 2006, Vol.23, No.1, pp.83-98 https://doi.org/10.3743/KOSIM.2006.23.1.083

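Abstract (translated from the Korean)

This study presents a template-based summarization method that uses the semantic categories of sentences to summarize multiple news articles automatically. In the training phase, the semantic categories of the core information to be included in summaries of event/accident news articles are identified, and cue words are selected for each slot of the template. In the summarization phase, the input news articles are categorized by event/accident, and key sentences are extracted from each article to fill the template slots. Finally, the sentences are split into simple sentences to revise the template content, and a summary is generated from it. In the evaluation of the automatically generated summaries, summary precision and summary recall were 0.541 and 0.581, respectively, and the summary sentence redundancy rate was 0.116.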

Abstract

This study proposes a template-based method for automatic summarization of multiple news articles using the semantic categories of sentences. First, the semantic categories of the core information to be included in a summary are identified from a training set of documents and their summaries. Then, cue words for each slot of the template are selected for later classification of news sentences into the relevant slots. When a news article is input, its event/accident category is identified, and key sentences are extracted from the article and placed in the relevant slots. The template, filled with simple sentences rather than the original long sentences, is used to generate a summary for an event/accident. In the user evaluation of the generated summaries, the results showed a recall of 54.1% and a precision of 58.1% in essential information extraction, and a redundancy ratio of 11.6%.
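The slot-filling step can be illustrated with a small sketch. Everything specific in it is an assumption: the slot names, the English cue words in `CUE_WORDS`, the regular-expression sentence splitter, and the sample articles are invented, and the study's training procedure, Korean-language processing, and splitting into simple sentences are not reproduced.

```python
import re

# Hypothetical template slots and cue words (illustrative only).
CUE_WORDS = {
    "casualties": ["killed", "injured", "dead"],
    "cause": ["caused by", "due to"],
    "location": ["in downtown", "near"],
}

def split_sentences(text):
    """Naive splitter: break after sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def fill_template(articles):
    """Assign each sentence to every slot whose cue words it contains."""
    template = {slot: [] for slot in CUE_WORDS}
    for article in articles:
        for sentence in split_sentences(article):
            lowered = sentence.lower()
            for slot, cues in CUE_WORDS.items():
                matched = any(cue in lowered for cue in cues)
                # Skip sentences already present to limit redundancy.
                if matched and sentence not in template[slot]:
                    template[slot].append(sentence)
    return template

articles = [
    "A fire broke out near the central market. Three people were injured.",
    "Three people were injured. Officials said the fire was caused by faulty wiring.",
]
template = fill_template(articles)
for slot, sentences in template.items():
    print(slot, sentences)
```

The duplicate check matters for multi-document input, because several articles about the same event tend to repeat the same facts.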
