정보관리학회지, 한국정보관리학회

11

이태영(전북대학교) 2006, Vol.23, No.4, pp.41-67 https://doi.org/10.3743/KOSIM.2006.23.4.041

초록보기

초록

웹의 보도기사에 관한 자동요약시스템을 구축하기 위하여 담화구조와 지식기반 기법을 적용한 글구조 프레임과 제 규칙들을 작성하였다. 프레임에는 문단과 문장 및 절의 역할, 문단과 문장의 성질, 역할을 구분하는 판별규칙, 주요문장 발췌규칙, 그리고 요약문작성규칙 슬롯이 포함되었다. 문맥정의, 고유명사 등을 안내하는 ‘if-needed'와 변화된 슬롯 값을 알려주는 if-changed 패싯도 구비되었다. 슬롯이나 패싯의 실제 값들을 추출 표현하는 과정에서 문구의 수사적 역할과 단어 최상위 범주 및 줄거리 단위를 참조하였다. 의미흐름의 연결성을 유지하면서 요약 문장들을 통합, 분리, 합성하는 재구성은 유사도공식, 구문정보, 담화구조와 지식기반 방법에서 도출한 제 규칙 및 문맥정의를 이용하였고 비평과 같은 새로운 문장을 생성하였다.

Abstract

The writings frame and various rules based on discourse structure and knowledge-based methods were applied to construct the automatic Ext/Sums (extracts & summaries) system from the straight news in web. The frame contains the slot and facet represented by the role of paragraphs, sentences, and clauses in news and the rules determining the type of slot. Rearrangement like Unification, separation, and synthesis of the candidate sentences to summary, maintaining the coherence of meanings, were also used the rules derived from similar degree measurement, syntactic information, discourse structure, and knowledge-based methods and the context plots defined with the syntactic/semantic signature of noun and verb and category of verb suffix. The critic sentence were tried to insert into summary

12

응용프로파일 코어 온톨로지 설계 및 구현

한성국(원광대학교) ; 이현실(원광대학교) 2007, Vol.24, No.3, pp.245-269 https://doi.org/10.3743/KOSIM.2007.24.3.245

초록보기

초록

유비쿼터스 정보 환경하에서 정보자원의 공유와 상호 교환을 위한 정보자원의 구조와 내용 기술에 표준 메타데이터 체계가 이용되고 있다. 실제 응용 도메인에서는 다수의 메타데이터 요소를 혼합-일치 방식으로 재사용하여 응용 시스템을 구축하게 되는데, 이때 메타데이터 요소의 상세화와 상호 운용성 등의 문제가 발생한다. 메타데이터 활용에서 발생하는 문제 해결에 응용 프로파일 접근 방식이 이용되고 있다. 본 논문에서는 응용 프로파일의 목적과 기능을 달성할 수 있는 응용 프로파일 코어 온톨로지를 제시하고, 이를 기반으로 한 메타데이터 응용 시스템 구축에 대하여 서술하였다.

Abstract

The standard metadata systems are very popular for the description of structures and semantics of information resources to realize sharing and exchanging information in global ubiquitous environment. In real application domain, various metadata elements are reused together with mix-and-match manner. An application system using diverse metadata systems is compelled with refinement and interoperability of metadata elements. Application profile is the general approach to resolve the various problems occurred in metadata application systems. This paper proposes Application Profile Core Ontology (APO) that can achieve the goals and functions of application profile, and describes metadata application system based on APO.

13

시맨틱 웹 환경에서 적합한 문장을 제공하는 이야기 쓰기 도우미에 관한 연구

이태영(전북대학교) 2009, Vol.26, No.4, pp.7-34 https://doi.org/10.3743/KOSIM.2009.26.4.007

초록보기

초록

이야기 쓰기를 돕는 본문 및 문장 검색시스템의 구축을 위해서 (1)이야기와 단락 및 문장의 구조를 분석하고 (2)색인작성과 탐색 질문에 적용되는 언어 추론을 연구하였다. 이야기 쓰기에 필요한 이야기, 단락, 그리고 문장으로 구성된 사항 데이터베이스와 필요한 추론규칙으로 이루어진 지식베이스와 온톨로지가 고안되었다. 추론의 기초인 실례(實例) 파일들은 시맨틱 웹 환경에서 작동될 마크업 언어 형식으로 만들어졌다. 시맨틱 웹 환경에서 실용적인 시스템이 되려면 단락과 문장을 정확히 대변하는 색인 방법론과 이를 정밀하게 지식베이스화 할 수 있는 마크업 언어의 창조가 필수적이라 사료된다.

Abstract

Structures of stories, paragraphs, and sentences and inferences applied to indexing and searching were studied to construct the full-text and sentence retrieval system for storytelling. The system designed the database of stories, paragraphs, and sentences and the knowledge-base of inference rules to aid to write the story. The Knowledge-base comprised the files of story frames, paragraph scripts, and sentence logics made by mark-up languages like SWRL etc. able to operate in semantic web. It is necessary to establish more precise indexing language represented the sentences and to create a mark-up languages able to construct more accurate inference rules.

14

기록 관리 메타데이터의 개념 모델링

이현실(원광대학교) ; 한성국(원광대학교) 2006, Vol.23, No.3, pp.23-48 https://doi.org/10.3743/KOSIM.2006.23.3.023

초록보기

초록

기록 관리 메타데이터 스키마는 기록물 자체에 내재한 정보 요소뿐만 아니라, 기록 업무에 따른 기록물의 생명 주기 관리 등에 필요한 관리 요소를 표현할 수 있는 강고한 구조를 가져야 한다. 이를 위해서 메타데이터 스키마에서는 기록 도메인의 정보 모델과, 기록 관리 업무 및 응용에서 요구되는 의미 상세화와 데이터 요소 특수화 등을 지원하는 메타데이터 프레임워크가 요구된다. 본 연구에서는 메타데이터 스키마의 주요 원리와 특성을 분석하여, 기록 관리 메타데이터 스키마를 체계적이고 효과적으로 개발하기 위한 접근 방식을 제시한다. 이를 위해 ISO 15489와 23081에 제시된 기록 관리 지침과 메타데이터 운용에 근거한 기록 관리 정보 모델을 개발하고 핵심 데이터 요소를 제시하였으며, 기록 관리 프레임워크를 구현하는 방법을 보였다.

Abstract

Record management metadata schema should have robust structure to represent not only elements innate in records itself but also management elements for the life cycle of records according to business activities. To realize these requirement, Information model for record domain is needed and also Metadata framework supporting semantic refinement and data element specialization required in record management business or applications are required. This study analyse main principles and characteristics of metadata scheme, and then suggested a novel method to develope schema systematically and effectively. This study propose information model and set of core data elements of records management based on ISO 15489 and 230381, and show how to implement the record management framework.

15

온톨로지 기반 상황인지 모델링 연구: u-Convention을 중심으로

김성혁(숙명여자대학교) 2011, Vol.28, No.3, pp.123-139 https://doi.org/10.3743/KOSIM.2011.28.3.123

초록보기

초록

유비쿼터스 컴퓨팅의 주요 기술인 상황인지는 환경을 구성하는 다양한 종류의 정보 기기로부터 전달되는 상황 정보를 이해하고 처리하며, 다양한 도메인에 유연하게 적용할 수 있는 상황인지 모델을 필요로 한다. 시맨틱 웹 기술 기반의 온톨로지는 구조화된 공통의 포맷을 이용하고 의미적인 정보의 표현이 가능하므로, 시스템이 상황 정보를 공유하고 이해, 추론함으로써 효과적인 상황인지가 가능하다. 따라서 온톨로지를 이용한 상황인지 모델이 여러 연구에서 제시되어 왔는데, 본 논문에서는 이러한 기존 연구들에 대한 분석을 바탕으로 상황인지 모델의 범용성과 확장성을 위해 온톨로지의 구조를 계층화하고 이를 기반으로 상황인지 시스템을 구현하여 실제 u-Convention 도메인에 적용하였다. 또한 OWL-DL의 기술논리와 SWRL 규칙 추론을 결합함으로써 복합적인 상황을 효과적으로 추론하는 방법을 제시하였다.

Abstract

Context-awareness as a key technology of ubiquitous computing needs a context model that understands and processes situational information coming from diverse sensors and devices, and can be applied diversely in various domains. Semantic web based ontologies use structured standard format and express meaning of information, so it is possible to recognize effectively context-awareness situations, allowing the system to share information and understand situation by inference. In this paper, we propose a layered ontology model to support generality and scaleability of the context-awareness system, and applied the model to u-Convention domain. In addition, we propose a effective reasoning method to handle compound situation by combining OWL-DL and SWRL rules.

16

OWL을 이용한 온톨로지 기반의 목록시스템 설계 연구

이현실(원광대학교) ; 한성국(원광대학교) 2004, Vol.21, No.2, pp.249-267 https://doi.org/10.3743/KOSIM.2004.21.2.249

초록보기

초록

MARC는 목록 데이터를 상세하게 정의할 수 있는 장점이 있지만, 개념요소가 구조화 되어 있지 않고 표현체계가 복잡하기 때문에 단순 계층구조의 의미 어휘 체계를 지원하는 XML DTD나 RDF/S로는 그 구조를 모델화하기가 어렵다. 본 연구에서는 MARC의 데이터 요소를 추상화하여 목록 데이터의 개념 구조를 표현하는 서지 온톨로지를 구축하였으며, 개념간의 논리 관계와 프로퍼티의 카디널리티 및 프로퍼티 값에 대한 논리적 제한을 부가할 수 있는 OWL을 이용하여 MRAC 필드의 복합 구조를 모델링하여 구축한 목록 온톨로지를 구현하였다. 온톨로지 언어를 이용한 MARC 데이터를 기술 방법은 목록 데이터에 대한 메타데이터 구성과 목록의 호환성 문제를 해결할 수 있는 기초적 방안이 되며, 시맨틱 웹 서비스를 기반으로 하는 차세대 문헌 정보서비스 시스템 구현의 토대가 될 것이다.

Abstract

Although MARC can define the detail cataloguing data, it has complex structures and frameworks to represent bibliographic information. On account of these idiosyncratic features of MARC, XML DTD or RDF/S that supports simple hierarchy of conceptual vocabularies cannot capture MARC formalism effectively. This study implements bibliographic ontology by means of abstracting conceptual relationships between bibliographic vocabularies of MARC. The bibliographic ontology is formalized with OWL that can represent the logical relations between conceptual elements and specify cardinality and property value restrictions. The bibliographic ontology in this study will provide metadata for cataloguing data and resolve compatibility problems between cataloguing systems. And it can also contribute the development of next generation bibliographic information system using semantic Web services.

17

인문학 및 사회과학 분야 국내 학술논문의 저자키워드 출현빈도와 피인용횟수의 상관관계 연구

고영만(성균관대학교) ; 송민선(성균관대학교 정보관리연구소) ; 김비연(성균관대학교) ; 민혜령(성균관대학교) 2013, Vol.30, No.2, pp.227-243 https://doi.org/10.3743/KOSIM.2013.30.2.227

초록보기

초록

본 연구의 목적은 저자키워드의 출현빈도와 해당 키워드가 속한 논문들의 총피인용횟수 간 상관관계 여부를 확인하고자 하는 것이다. 연구의 배경은 인문사회과학 분야 학술용어사전을 구축하는데 있어서 실제 연구에서의 활용도가 높고 다른 키워드와의 의미적 연관관계가 많은 학술용어를 추출하기 위한 방법론을 개발해 보고자 하는 것이다. 본 연구의 목적을 이루기 위해 한국연구재단 한국학술지인용색인(KCI)에 수록된 2007년에서 2011년까지의 인문학 및 사회과학 분야 학술지 논문의 저자키워드와 피인용횟수를 분석하였다. 분석 결과 저자키워드의 출현빈도와 해당 키워드가 속한 논문들의 총피인용횟수는 통계적으로 상관관계가 있으며, 저자키워드의 출현빈도가 늘어날수록 논문의 총피인용횟수도 많아지는 것으로 나타났다.

Abstract

The purpose of this study is to verify the correlation between the appearance frequency of author keyword and the number of citation in journal articles. In this study, we were trying to develop a methodology that can select the term having semantic relation with other terms and higher utilization to build a structured scientific glossary. In order to achieve this purpose, we analyzed the number of citation and the author keyword of the humanities and social science journal articles of the Korea Citation Index (KCI) from 2007 to 2011. This study found a correlation between appearance frequency of author keyword and the number of citation of the journal articles, with higher appearance frequency of author keyword of the journal articles being more cited.

18

디지털 특수자료를 위한 XML 스키마 기반의 메타데이터 표현 체계

오삼균(성균관대학교) ; 채진석(인천대학교) 2004, Vol.21, No.4, pp.109-131 https://doi.org/10.3743/KOSIM.2004.21.4.109

초록보기

초록

연구는 서울대학교 디지털도서관 프로젝트의 지원으로 추진되었음.＊＊＊＊성균관대학교 문헌정보학과 부교수(samoh@skku.ac.kr)＊＊＊＊인천대학교 컴퓨터공학과 부교수(jschae@incheon.ac.kr) 논문접수일자 : 2004년 11월 13일 게재확정일자 : 2004년 12월 19일攀攀정보자원의 전달 매체와 형태가 다양화됨에 따라서 이에 대한 관리방법 또한 다양화되어 왔다. 도서관 환경에서는 정보자원를 위한 관리방법으로서 AACR, KCR 등의 목록규칙이 정립되었으며 이러한 목록규칙에 근거한 정보자원관리를 자동화하고자 하는 노력의 결과로서 MARC가 개발되었다. 하지만, MARC 레코드는 서지 레코드가 지니고 있는 의미적 관계의 표현을 지원하지 못하는 구조적 경직성으로 인해 다양하고 상이한 기술적 특성을 지니는 정보자원들을 적절히 기술하는데 제약이 따른다. 즉, MARC의 기본 설계 목적이 몇몇 정보유형에는 비교적 적합하더라도 새로운 형태의 정보유형의 다양성을 지원하는데 어려움이 있다. 또한 MARC를 활용한 정보자원 관리 방식에서는 정보자원 간 연결 관계의 표현을 지원하지 못한다. 즉, MARC의 데이터 모델은 자원기술의 대상을 단일의 객체로 파악하는 단층 데이터 모델이기 때문에 여러 객체들 간의 연결 관계를 설정할 수 있는 다층 데이터 모델을 이용한 정보자원 기술이 필요한 경우는 적절치 못하다. 본 연구에서는 다층 데이터 모델을 지원하는 IFLA FRBR 기본 모델을 기초로 하여 전자도서관에서 사용되는 고서, 고문서, 음악 자료, 학술회의 및 세미나 자료의 관리에 있어서 이용자의 정보요구를 최대한 수용할 수 있는 최적의 메타데이터 모델과 이에 대한 XML 스키마 기반의 표현 체계를 제시하고자 한다.

Abstract

As there are diverse delivery media and forms of information resources, their management schemes are diverse as well. In library community, cataloguing rules for describing information resources such as AACR and KCR have been developed. The efforts to automate management of information resources based on these rules resulted in the development of MARC. However, MARC records are restricted in describing the information resources and MARC has various and distinct characteristics of the structural rigidity, which does not support the representation of extended semantic structures that exist among bibliographic entities. Therefore, since the data model for MARC is single-layer data model, it is not appropriate for describing information resources represented by multi-layer data model which can be used to set up the relationships among various objects in digital libraries. In this paper, we propose an a metadata model for digital libraries based on the IFLA FRBR basic model which supports multi-layer data model and a representation scheme based on XML Schema to manage the metadata about old books, old documents, resource related to music, conferences and seminars.

19

AACR2에서 RDA로 목록규칙 변화에 따른 KCR4의 고려사항에 관한 연구

이미화(이화여자대학교) 2011, Vol.28, No.1, pp.23-42 https://doi.org/10.3743/KOSIM.2011.28.1.023

초록보기

초록

본 연구는 AACR2와 이를 대체하는 새로운 목록규칙인 RDA의 규칙을 비교하여, RDA에 대한 이해를 높이고, 우리나라의 한국목록규칙에서 고려해야 할 사항을 파악하기 위한 것이다. RDA는 모든 유형의 자원을 서지제어할 수 있는 구조로 International Cataloging Principles(2009), FRBR, FRAD를 구현하기 위한 목록규칙이며, 국제적인 환경에 융통성 있게 적용가능하다. RDA는 웹환경에 맞는 시멘틱웹으로 구현이 가능하도록 집중기능과 다양한 관계에 기반을 두고 있어 미래의 목록에 큰 영향을 줄 것이기 때문에 국내에서도 이를 반영하는 연구가 필요하다. 비교는 JSC for Development of RDA의 2008년 RDA 초안을 기반으로 저작, 표현형, 구현형의 기술규칙을 대상으로 분석하였다. 구현형에서는 표제, 자료유형, 책임사항, 판사항, 발행사항, 형태사항, 총서사항의 기술영역별로, 저작과 표현형에서는 저작 유형에 따른 채택접근점을 중심으로 RDA와 AACR2 규칙 중에서 변경된 사항을 중심으로 살펴보았다. 본 연구는 RDA에서 제시한 목록규칙을 바탕으로 앞으로 목록의 발전 방향을 파악할 수 있으며, 국내의 목록규칙 개정 시에도 많은 도움이 될 것이다.

Abstract

This study is to compare the descriptive cataloging rules between AACR2 and RDA, and then to find a direction of future cataloging and KCR 4. RDA is new cataloging rules that embody the International Cataloging Principles(2009), FRBR and FRAD. It is a structure of bibliographic control of all kinds of resources, and the rules can be flexibly applicable in the international cataloging community. It is critical to embody RDA in KCR 4 because RDA is likely to affect the future cataloging through its collocation function and relation function to construct semantic web of OPAC. This study analyzed the descriptive rules of work, expression, and manifestation based on RDA draft(2008) of JSC for Development of RDA. It analyzed the changes in the cataloging rules from AACR2 to RDA in such descriptive areas as title, type of resources, statement of responsibility, edition, publication, physical description and series in the manifestation level, and the preferred access points in both expression and work levels. The findings of this study will provide implications in revising KCR4.

20

위키피디아를 이용한 분류자질 선정에 관한 연구

김용환(연세대학교) ; 정영미(연세대학교) 2012, Vol.29, No.2, pp.155-171 https://doi.org/10.3743/KOSIM.2012.29.2.155

초록보기

초록

텍스트 범주화에 있어서 일반적인 문제는 문헌을 표현하는 핵심적인 용어라도 학습문헌 집합에 나타나지 않으면 이 용어는 분류자질로 선정되지 않는다는 것과 형태가 다른 동의어들은 서로 다른 자질로 사용된다는 점이다. 이 연구에서는 위키피디아를 활용하여 문헌에 나타나는 동의어들을 하나의 분류자질로 변환하고, 학습문헌 집합에 출현하지 않은 입력문헌의 용어를 가장 유사한 학습문헌의 용어로 대체함으로써 범주화 성능을 향상시키고자 하였다. 분류자질 선정 실험에서는 (1) 비학습용어 추출 시 범주 정보의 사용여부, (2) 용어의 유사도 측정 방법(위키피디아 문서의 제목과 본문, 카테고리 정보, 링크 정보), (3) 유사도 척도(단순 공기빈도, 정규화된 공기빈도) 등 세 가지 조건을 결합하여 실험을 수행하였다. 비학습용어를 유사도 임계치 이상의 최고 유사도를 갖는 학습용어로 대체하여 kNN 분류기로 분류할 경우 모든 조건 결합에서 범주화 성능이 0.35%~1.85% 향상되었다. 실험 결과 범주화 성능이 크게 향상되지는 못하였지만 위키피디아를 활용하여 분류자질을 선정하는 방법이 효과적인 것으로 확인되었다.

Abstract

In text categorization, core terms of an input document are hardly selected as classification features if they do not occur in a training document set. Besides, synonymous terms with the same concept are usually treated as different features. This study aims to improve text categorization performance by integrating synonyms into a single feature and by replacing input terms not in the training document set with the most similar term occurring in training documents using Wikipedia. For the selection of classification features, experiments were performed in various settings composed of three different conditions: the use of category information of non-training terms, the part of Wikipedia used for measuring term-term similarity, and the type of similarity measures. The categorization performance of a kNN classifier was improved by 0.35~1.85% in F1 value in all the experimental settings when non-learning terms were replaced by the learning term with the highest similarity above the threshold value. Although the improvement ratio is not as high as expected, several semantic as well as structural devices of Wikipedia could be used for selecting more effective classification features.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지