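Journal of the Korean Society for Information Management (정보관리학회지), Korean Society for Information Management (한국정보관리학회)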

1

강인수 (Kyungsung University) 2008, Vol.25, No.3, pp.27-39 https://doi.org/10.3743/KOSIM.2008.25.3.027

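Abstract (translated from the Korean)

Because different real-world people share the same personal name, entities referred to by personal name on the Internet must have their identities resolved. When this problem is restricted to author-name entities in scholarly information, it is called author resolution. Author resolution consists of a step that computes the similarity between the author-name entities to be resolved (the author similarity) and a subsequent step that clusters those entities. Author similarity is computed from the feature similarities of author resolution features such as coauthors, paper titles, and publication venue information, and both supervised and unsupervised methods have been used for this purpose. Supervised methods, which use author-resolved training samples, have the advantage over unsupervised methods of being able to learn automatically an optimal author similarity function that combines diverse author resolution features. However, previous studies of supervised methods have tried only a few machine learning techniques, such as SVM and MEM. This paper compares the performance, errors, and efficiency of a variety of machine learning techniques on author resolution, and provides a comparative analysis of several techniques for feature value extraction and feature similarity computation for the coauthor and paper title features.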

Abstract

In bibliographic data, identifying authors by personal name makes it difficult to single out a particular author, because many authors share the same name. Resolving same-name author instances into distinct individuals is called author resolution. It consists of two steps: calculating author similarities, and then clustering same-name author instances into groups that each correspond to one person. Author similarities are computed from the similarities of author-related bibliographic features, such as coauthors, paper titles, and publication information, using supervised or unsupervised methods. Supervised approaches employ machine learning techniques to learn the author similarity function automatically from author-resolved training samples. So far, however, only a few machine learning methods have been investigated for author resolution. This paper provides a comparative evaluation of a variety of recent high-performing machine learning techniques for author resolution, and compares several methods of processing author resolution features such as coauthors and paper titles.
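The two-step procedure described in this abstract (pairwise author similarity, then clustering) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the record fields, the Jaccard feature similarities, the fixed weights (a supervised method would learn the combination from author-resolved samples), the threshold, and single-link clustering via union-find.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard overlap of two collections, used here as a feature similarity."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def author_similarity(r1, r2, w_coauthor=0.6, w_title=0.4):
    """Combine feature similarities with fixed, hypothetical weights.

    A supervised approach would learn this combination instead.
    """
    coauthor_sim = jaccard(r1["coauthors"], r2["coauthors"])
    title_sim = jaccard(r1["title"].lower().split(), r2["title"].lower().split())
    return w_coauthor * coauthor_sim + w_title * title_sim


def cluster(records, threshold=0.3):
    """Single-link clustering: merge any pair whose similarity meets the threshold."""
    parent = list(range(len(records)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(records)), 2):
        if author_similarity(records[i], records[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(records)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

Each returned group lists the indices of same-name records that the sketch attributes to one person; the threshold controls how readily records are merged.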

2

A Study on a National Authority Data Construction Model Using VIVO (VIVO를 활용한 국가적 전거구축모델에 관한 연구)

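오삼균 (Department of Library and Information Science, Sungkyunkwan University) ; 한상은 (Department of Library and Information Science, Sungkyunkwan University) ; 손태익 (Academic Information Center, Sungkyunkwan University) ; 김성훈 (Department of Library and Information Science, Sungkyunkwan University) 2018, Vol.35, No.3, pp.165-187 https://doi.org/10.3743/KOSIM.2018.35.3.165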

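Abstract (translated from the Korean)

Although many studies in Korea have aimed at the cooperative construction of authority data, a collaborative environment for building national authority data has in practice been blocked by unmet preconditions, such as establishing a standard authority format, establishing standard principles for authority construction, reorganizing the work of existing authority-building institutions, and selecting a lead institution for the cooperative work. Building national authority data cooperatively and using it smoothly requires a realistic collaboration plan that does not disrupt the work of existing authority-building institutions, the participation of a national institution with sustained driving force, and a standard identification system that makes it possible to merge data from many institutions. The purpose of this study is to identify through a literature review what is essential for creating the conditions for cooperative construction of national authority data, and to propose a feasible national authority construction model by using the VIVO ontology model, which is built on the Semantic Web and offers strong interoperability.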

Abstract

Despite repeated efforts to develop a methodological foundation for cooperatively building authority data in South Korea, unresolved issues have kept this research from producing a cooperative push for nationwide authority data and from progressing toward concrete implementation. These issues include establishing a standard authority model, establishing standard principles for authority construction, and reorganizing the work of the institutions that already build authority data. Building a collaborative and well-used collection of national authority data accordingly calls for 1) a practical approach that supports both established authority data contributors and newly organized avenues of mutual participation in authority building, 2) committed involvement by national institutions capable of giving the project sustained support, and 3) a standard identification system that allows multiple organizations to merge their data. This study addresses these challenges by taking stock of the key components necessary for creating collaborative authority data, and by using the Semantic Web-based, interoperable VIVO ontology model to propose a viable national authority data framework.

3

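A Study on Metadata Design for Managing Person and Organization Names in the National Debt Redemption Movement Digital Archive (국채보상운동 디지털 아카이브의 개인/단체명 관리를 위한 메타데이터 설계에 관한 연구)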

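한상은 (Lecturer, Librarian Training Institute, Sungkyunkwan University) ; 도슬기 (Assistant Professor, School of Creative Humanities, Hansung University) 2024, Vol.41, No.1, pp.509-536 https://doi.org/10.3743/KOSIM.2024.41.1.509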

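Abstract (translated from the Korean)

The purpose of this study is to develop a metadata application profile (AP) for managing person and organization authority data in the National Debt Redemption Movement Digital Archive, a small-scale digital archive. Design principles and core metadata elements were derived by analyzing the person/organization metadata standards, implementation cases, and guidelines of libraries and archives. These were mapped to the National Debt Redemption Movement person/organization name thesaurus data and a Wikidata-linked metadata model, finally yielding 10 elements in the identification area, 14 in the content area, 8 in the relationship area, and 4 in the control area. A simple-structure schema was adopted so that small institutions can also apply it; for interoperability the schema was proposed with reference to the DublinCore and SKOS schemes, and its applicability was confirmed on actual data. The results can serve as basic material for institutions that understand the importance of data management but find practical application difficult, when they seek to establish an authority data management system.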

Abstract

The purpose of this study is to develop a metadata application profile (AP) for managing person and organization name authority data in the National Debt Redemption Movement Digital Archive, a small-scale digital archive. Design principles and core metadata elements were derived by analyzing the person/organization (group or corporateBody) metadata standards, implementation cases, and guidelines of libraries and archives. These were then mapped to the National Debt Redemption Movement person/organization name thesaurus data and to a Wikidata-linked metadata model, yielding 10 elements in the identification area, 14 in the content area, 8 in the relationship area, and 4 in the control area. A simple schema structure was adopted so that even small institutions can apply it; for interoperability, the schema was proposed with reference to the DublinCore and SKOS schemes, and its applicability was confirmed against actual data. The results can serve as a basis for institutions that recognize the importance of data management but find it difficult to apply in practice, when they set out to establish a system for managing their own authority data.

4

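A Study on the Applicability of the International Standard Name Identifier (ISNI) to Authority Control (전거제어를 위한 국제표준이름식별자(ISNI)의 활용가능성에 관한 연구)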

이미화 (Kongju National University) 2014, Vol.31, No.3, pp.133-151 https://doi.org/10.3743/KOSIM.2014.31.3.133

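Abstract (translated from the Korean)

Recognizing the importance of ISNI as a bridge identifier that covers the entire information industry, this paper examines the concept of ISNI and explores its applicability to authority control. ISNI is a bridge identifier for identifying the public identities of parties involved across the whole information media content industry, throughout the flow of creation, production, management, and content distribution, and it is needed for comprehensive worldwide name authority control. The paper first surveys ISNI's concept, purpose, terminology, identifier structure, allocation principles, administration, and metadata, and on that basis explores its use in authority control. First, introducing the ISNI concept should be considered for cooperative authority control in Korea. Building KISNI, an ISNI system suited to Korea, would allow all authority data produced in libraries and the information industry to be used mutually. Second, linking multiple identifiers through ISNI as a bridge identifier will make it possible to build linked data. Third, KORMARC should be extended so that ISNI identifiers can be recorded in bibliographic and authority records.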

Abstract

This study examined the concept of ISNI and explored its applicability to authority control, recognizing the importance of ISNI as a bridge identifier spanning all the information media content industries. ISNI is a bridge identifier for the public identities of parties involved throughout the information media content industries, across the creation, production, management, and content distribution chains, and it is needed for global, comprehensive name authority control. The study first surveyed ISNI's concept, purpose, terms and definitions, identifier structure and syntax, allocation principles, system administration, and metadata. On that basis, it suggests how ISNI can be applied to authority control. First, adopting the ISNI concept should be considered for cooperative authority control in Korea: building a KISNI system would allow the authority data created in libraries and other information industries to be shared. Second, linking various identifiers through ISNI as a bridge identifier would make it possible to build linked data. Third, KORMARC needs to be extended so that ISNI identifiers can be recorded in KORMARC bibliographic and authority records.

5

A Study on Building an Author Role Term Dictionary and Work Clustering (저자역할용어사전 구축 및 저작군집화에 관한 연구)

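윤재혁 (Department of Library and Information Science, Graduate School, Sungkyunkwan University) ; 도슬기 (Department of Library and Information Science, Graduate School, Sungkyunkwan University) ; 오삼균 (Department of Library and Information Science, Sungkyunkwan University) 2020, Vol.37, No.2, pp.197-223 https://doi.org/10.3743/KOSIM.2020.37.2.197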

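Abstract (translated from the Korean)

This study analyzes the issues that arose in clustering bibliographic records, written in the Korean Machine Readable Cataloging (KORMARC) integrated format for bibliographic data, into FRBR Work units, and devises solutions to them. In previous studies in particular, the criteria for identifying and handling the representative author were not clearly stated, or the method for selecting the representative author of a derivative-work record was not sufficiently discussed. This study therefore focuses on devising a method for clearly identifying the representative author when many people contributed to creating a work. To this end, it develops a standardized author role term dictionary based on role terms extracted from the statement-of-responsibility subfields (▼d, ▼e) of the title and statement of responsibility field (245), and uses it to determine the representative author. It also adopts a method that computes author-name similarity and title similarity separately and clusters records into the same work when both similarities are at or above a given level. Because the same-work decision rests on each similarity, the study adjusted data cleaning conditions such as whitespace, handling of prefatory title words (관제), and parenthesis removal, and compared clustering accuracy across six patterns; accuracy was highest when the author-name and title similarities were both 80 percent or higher. This is an experimental study that developed a role term dictionary for selecting the representative author and attempted work clustering by measuring representative-author and title similarity separately. Follow-up research will devise ways to improve the accuracy of title similarity measurement, to extend the approach to the other FRBR Group 1 entities (expression, manifestation, item), and to apply it to other forms of MARC data used in Korea.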

Abstract

The purpose of this study is to analyze the issues that arise when grouping KORMARC records by the FRBR Work concept and to suggest a new method. Previous studies did not sufficiently address the criteria or processes for identifying the representative author of a record or of its derivative works. Our study therefore focused on devising a method for identifying the representative author when a work has multiple contributors. It developed a method that identifies representative authors using an author role dictionary constructed by extracting role terms from the statement of responsibility field (245). We also designed a way to group records into a work by calculating similarity measures for authors and for titles. Work-grouping accuracy was highest when blank spaces, parentheses, and prefatory title words (관제) were removed from titles and the measured similarities of both authors and titles were 80 percent or higher. This was an experimental study in which we developed an author role dictionary for selecting a representative author and measured author and title similarity separately in order to group KORMARC records into works effectively. Future work will attempt to improve the title similarity measure, extend the approach to the other FRBR Group 1 entities (expression, manifestation, and item), and apply it to other forms of MARC data used in Korea.
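The same-work decision described in this abstract (separate author and title similarities, both at 80 percent or higher, after data cleaning) can be sketched as follows. The abstract does not name the similarity measure, so `difflib.SequenceMatcher` is a stand-in assumption; the record fields and the `normalize` helper are hypothetical, and handling of prefatory title words (관제) is omitted.

```python
import re
from difflib import SequenceMatcher


def normalize(text):
    """Cleaning step: drop parenthesised parts and all whitespace, lowercase."""
    text = re.sub(r"\([^)]*\)", "", text)
    return re.sub(r"\s+", "", text).lower()


def similarity(a, b):
    """String similarity in [0, 1]; SequenceMatcher is an assumed stand-in measure."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def same_work(rec_a, rec_b, threshold=0.8):
    """Treat two records as the same work only if BOTH similarities meet the threshold."""
    return (
        similarity(rec_a["author"], rec_b["author"]) >= threshold
        and similarity(rec_a["title"], rec_b["title"]) >= threshold
    )
```

Requiring both conditions keeps identically titled works by different authors apart, at the cost of missing works whose author names are recorded very differently.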

6

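A Study on Expanding the Scope of Identification and Linking of Person Information in Linked Data: Focusing on the Linked Data of the National Library of Korea (링크드 데이터에서 인물 정보의 식별 및 연계 범위 확장에 관한 연구: 국립중앙도서관 링크드 데이터를 중심으로)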

이성숙 (Chungnam National University) ; 박지영 (Hansung University) ; 이혜원 (Seoul Women's University) 2017, Vol.34, No.3, pp.7-21 https://doi.org/10.3743/KOSIM.2017.34.3.007

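Abstract (translated from the Korean)

This study analyzes how person information is represented and linked in the linked data of the National Library of Korea and proposes ways to extend it. The analysis shows that person information for authors is described in the linked data in connection with a vocabulary for representing persons, whereas persons represented as subjects are treated only as concepts. Nor could it be confirmed whether any supplementary information was added during construction of the linked data beyond the conversion of existing authority information. The study therefore judges that the quality of the linked data can be raised by including person information as a subject, as well as person information as an author, in the bibliographic information, and by linking the two. It also proposes building additional link information related to persons and using it to expand the access points for bibliographic data retrieval.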

Abstract

This study analyzed how person information is represented and linked in the linked data of the National Library of Korea and suggested ways to expand the scope of identifying and linking that information. The analysis showed that a person as a subject has been treated only as a concept, whereas a person as a contributor has been linked with a vocabulary for describing persons. In addition, it could not be confirmed whether any information had been added during the construction of the linked data beyond what was converted from existing authority data. This study therefore suggested that linking person information as a subject with person information as a contributor is essential for the quality of linked data. We also proposed providing additional person-related information in the linked data to expand the scope of access points in information discovery.

7

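A Study on Building a Personal Name Access Point Control System to Improve the Search Function of Institutional Repositories (기관 리포지터리의 검색기능 향상을 위한 인명 접근점제어 시스템 구축 연구)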

김미향 (Seoul National University) ; 김태수 (Yonsei University) 2010, Vol.27, No.3, pp.125-146 https://doi.org/10.3743/KOSIM.2010.27.3.125

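Abstract (translated from the Korean)

To address the problems of personal name searching in institutional repositories, whose metadata is built primarily through self-archiving, this study constructed personal name access point control data. The data draw on existing library authority data but do not recognize an authorized form; instead, a grouping method uses every form recorded in the information sources as an access point, and, to handle different persons with the same name, a new method extends the data with the author's subject field and information on the author's works. Building the data on this basis and applying them to a system improved the search function. In the future the approach could also be used to improve the search function of all metadata overseen by libraries, beyond institutional repositories.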

Abstract

This study developed a name access point control system to improve information retrieval from institutional repositories, whose metadata is generated by authors through self-archiving. The name access point control data for the system were built from existing library authority data. Unlike that authority data, however, they do not designate any authorized form; instead, every form of a name found in the information sources is used as an access point. To distinguish between persons who share the same name, the author's field of activity (subject) and information on the author's works were used. The results showed that the system improved retrieval performance. The system is also expected to be applicable, beyond institutional repositories, to other metadata managed by libraries in order to improve their search functions.

8

A Study on Lubetzky's Cataloging Thought (Lubetzky의 목록법 사상 연구)

이강산다정 (Chung-Ang University) 2015, Vol.32, No.3, pp.155-182 https://doi.org/10.3743/KOSIM.2015.32.3.155

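Abstract (translated from the Korean)

This paper derives the library thought and cataloging theory of Seymour Lubetzky, who laid the foundations of modern cataloging, through an analysis of his life and writings. Applying literature review and historical research methods, it investigates the social, intellectual, and cultural background of the period. Building on fragmentary research results from Korea and abroad, it aims at a comprehensive study and analyzes the relationships of influence surrounding Lubetzky's cataloging thought, and thereby derives his library and cataloging thought. Lubetzky's cataloging theory consists of the design of cataloging principles and the establishment of bibliographic relationships. First, the principles of descriptive cataloging should embody necessity, simplicity, uniformity, consistency, purposefulness, and interrelatedness, and should state the purpose of the catalog. The bibliographic relationships are formed on the basis of distinguishing the work, an intellectual product, from the book, a concrete entity. In addition, the main entry is made under the author's name to bring works together, and the concept of author is extended by including corporate names and anonymous works under the author name.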

Abstract

This study derived the library thought and cataloging theory of Seymour Lubetzky, who founded the principles of cataloging in the twentieth century, by analyzing his life and writings. It investigated the social, ideological, and cultural context of his era using a literature survey and historical research methods. Building on the fragmentary results of earlier domestic and foreign studies, it aimed at a comprehensive account and analyzed the relationships of influence surrounding Lubetzky's cataloging thought. From these findings, Lubetzky's cataloging theory consists of the design of cataloging principles and the establishment of bibliographic relationships. First, the principles of descriptive cataloging should embody necessity, simplicity, uniformity, consistency, purposefulness, and interrelatedness, and should state the purpose of the catalog. The bibliographic relationships rest on distinguishing the work, an intellectual product, from the book, a concrete physical entity. Moreover, the main entry is made under the author's name in order to bring a work's records together, and the concept of author is extended by including corporate names and anonymous works under author names.

9

Bibliographic Author Coupling Analysis: A New Approach to Analyzing Research Trends (서지적 저자결합분석 - 연구동향 분석을 위한 새로운 접근 -)

이재윤 (Kyonggi University) 2008, Vol.25, No.1, pp.173-190 https://doi.org/10.3743/KOSIM.2008.25.1.173

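Abstract (translated from the Korean)

Author co-citation analysis has been widely used as a means of identifying research topics and trends in a field. Because of citation lag, however, it is known to be somewhat limited in showing recent trends or identifying active, currently working researchers. This study proposes bibliographic author coupling analysis as a new method for analyzing the latest research trends while also identifying active researchers. The technique is based on the bibliographic coupling proposed by Kessler but takes the author, not the document, as the unit of analysis. That is, it rests on the assumption that authors who cite the same authors are likely to work on similar research topics. When the results of existing studies that used author co-citation analysis were compared with the results of applying bibliographic author coupling analysis, the proposed technique reflected recent research trends better than author co-citation analysis and allowed an interpretation centered on active, currently working researchers.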

Abstract

Author co-citation analysis (ACA) has been widely used to identify research areas and trends in a discipline. Because of citation delay, however, the technique has limitations in analyzing current trends and identifying active researchers. This study proposes a new method, Bibliographic Author Coupling Analysis (BACA), to overcome those limitations. BACA is based on Kessler's bibliographic coupling but takes authors, not documents, as the unit of analysis. Simply stated, BACA assumes that authors who cite the same authors have similar research interests. To compare the two techniques, two earlier studies that used author co-citation analysis were re-examined using BACA. The comparison suggests that BACA is promising for analyzing current research trends and identifying active researchers.
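The coupling idea in this abstract (authors who cite the same authors are assumed to share research interests) can be illustrated with a small sketch. The input layout and the choice to measure coupling strength as a simple count of shared cited authors are assumptions for illustration; the paper may weight or normalize the counts differently.

```python
from itertools import combinations


def coupling_strengths(citations):
    """Count shared cited authors for every pair of citing authors.

    citations: hypothetical mapping of citing author -> collection of cited authors.
    Returns {(author_a, author_b): shared_count} for pairs with at least one
    cited author in common, with each pair in alphabetical order.
    """
    strengths = {}
    for a, b in combinations(sorted(citations), 2):
        shared = len(set(citations[a]) & set(citations[b]))
        if shared:
            strengths[(a, b)] = shared
    return strengths
```

The resulting pair counts form an author-by-author matrix that could then be clustered or mapped in the same way as a co-citation matrix, with the difference that the authors being related are the citing ones.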

10

A Study on a Scholarly Paper Recommendation Technique Using a Multidimensional Metadata Space (다차원 메타데이터 공간을 활용한 학술 문헌 추천기법 연구)

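감미아 (Department of Library and Information Science, Yonsei University) ; 이지연 (Department of Library and Information Science, Yonsei University) 2023, Vol.40, No.1, pp.121-148 https://doi.org/10.3743/KOSIM.2023.40.1.121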

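Abstract (translated from the Korean)

The purpose of this study is to propose a high-performing scholarly paper recommendation system based on metadata attribute similarity. It sets out to propose a recommendation technique grounded in library and information science that uses metadata, as treated in information organization, together with the concepts of co-citation, author bibliographic coupling, co-occurrence frequency, and cosine similarity from informetrics. Metadata for a total of 9,643 papers related to 'inequality' and 'divide', collected for the experiment, were refined; relative coordinate values among the author, keyword, and title attributes were derived using cosine similarity; and experiments were run to select well-performing weight conditions and numbers of dimensions. The experimental results were presented to users for evaluation, and on that basis the study analyzed the characteristics of reference nodes and recommendation combinations, performed conjoint analysis and a comparative analysis of results, and discussed the findings around its research questions. Overall, performance was best for combinations that excluded author-related attributes or that used only title-related attributes. If the proposed technique is used and a broad sample is secured, it could help improve recommendation performance not only for literature recommendation in information services but also in various other areas of society.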

Abstract

The purpose of this study is to propose a high-performing scholarly paper recommendation system based on the similarity of metadata attributes. It suggests a recommendation method grounded in Library and Information Science that combines the use of metadata, as treated in information organization, with the concepts of co-citation, author bibliographic coupling, co-occurrence frequency, and cosine similarity from bibliometrics. For the experiments, metadata for a total of 9,643 papers related to “inequality” and “divide” were collected and refined, and relative coordinate values among the author, keyword, and title attributes were derived using cosine similarity. Experiments were then conducted to select the weight conditions and number of dimensions that gave good performance. The results were presented to users for evaluation; on that basis, the study analyzed the characteristics of reference nodes and recommendation combinations, performed conjoint analysis and a comparative analysis of results, and discussed the findings around its research questions. Overall, the study showed that the performance was excellent when author-related attributes were used alone or in combination with title-related attributes. If the proposed technique is used and a broader sample is secured, it could help improve recommendation performance not only for literature recommendation in information services but also in various other fields of society.
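As a rough illustration of combining per-attribute cosine similarities under a set of weights, the sketch below compares two papers attribute by attribute. The bag-of-tokens representation, the attribute names, and the weighted-average combination are assumptions for illustration; the study's actual coordinate derivation and dimension selection are not reproduced here.

```python
import math
from collections import Counter


def cosine(tokens_a, tokens_b):
    """Cosine similarity between two token lists treated as count vectors."""
    va, vb = Counter(tokens_a), Counter(tokens_b)
    dot = sum(va[t] * vb[t] for t in va)
    norm = math.sqrt(sum(v * v for v in va.values())) * math.sqrt(
        sum(v * v for v in vb.values())
    )
    return dot / norm if norm else 0.0


def paper_similarity(p, q, weights):
    """Weighted average of per-attribute cosine similarities.

    weights: hypothetical mapping of attribute name -> non-negative weight,
    with at least one positive weight.
    """
    total = sum(weights.values())
    return sum(w * cosine(p[attr], q[attr]) for attr, w in weights.items()) / total
```

Setting an attribute's weight to zero removes it from the comparison, which is one simple way to try different attribute combinations against each other.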
