정보관리학회지, 한국정보관리학회

51

김영범(전남대학교 대학원 기록관리학 석사) ; 장우권(전남대학교 문헌정보학과 교수) 2023, Vol.40, No.3, pp.99-118 https://doi.org/10.3743/KOSIM.2023.40.3.099

초록보기

초록

이 연구의 목적은 기록물의 맥락정보를 담고 있는 메타데이터를 활용하여 기록물 자동분류 과정에서의 성능요소를 파악하는데 있다. 연구를 위해 2022년 중앙행정기관 원문정보 약 97,064건을 수집하였다.수집한 데이터를 대상으로 다양한 분류 알고리즘과 데이터선정방법, 문헌표현기법을 적용하고 그 결과를 비교하여 기록물 자동 분류를 위한 최적의 성능요소를 파악하고자 하였다. 연구 결과 분류 알고리즘으로는 Random Forest가, 문헌표현기법으로는 TF 기법이 가장 높은 성능을 보였으며, 단위과제의 최소데이터 수량은 성능에 미치는 영향이 미미하였고 자질은 성능변화에 명확한 영향을 미친다는 것이 확인되었다.

Abstract

The objective of this study is to identify performance factors in the automatic classification of records by utilizing metadata that contains the contextual information of records. For this study, we collected 97,064 records of original textual information from Korean central administrative agencies in 2022. Various classification algorithms, data selection methods, and feature extraction techniques are applied and compared with the intent to discern the optimal performance-inducing technique. The study results demonstrated that among classification algorithms, Random Forest displayed higher performance, and among feature extraction techniques, the TF method proved to be the most effective. The minimum data quantity of unit tasks had a minimal influence on performance, and the addition of features positively affected performance, while their removal had a discernible negative impact.

52

텍스트마이닝을 활용한 “잊힐 권리”의 토픽 분석

이소현(부산대학교 도서관) ; 구본진(부산대학교) 2022, Vol.39, No.2, pp.275-298 https://doi.org/10.3743/KOSIM.2022.39.2.275

초록보기

초록

본 연구는 잊힐 권리와 관련한 뉴스 기사와 학술지 게재 논문을 대상으로 텍스트마이닝 분석을 활용해 각 문서 내에 나타난 논점과 특성을 살펴보았다. 분석을 위해 ‘잊힐 권리’와 ‘잊혀질 권리’ 키워드를 검색어로 하여 2010년부터 2020년까지의 데이터를 수집하였다. 수집된 데이터를 대상으로 키워드 분석과 토픽모델링 분석을 수행한 결과, 지난 10년간 뉴스 기사와 학술지 논문에서 다루어진 쟁점은 크게 다르지 않으며, 접근 방법 또한 유사한 것으로 나타났다. 다만 뉴스 기사와 학술지 논문 간 비교를 통해 이들 간 공통적으로 나타나는 쟁점과 부분적인 쟁점의 차이가 있음을 확인하였다. 따라서 본 연구에서 도출된 쟁점을 중심으로 기록관리학 분야에서도 적극적인 논의가 이루어져야 할 필요가 있으며, 공통적인 쟁점들을 우선적으로 고려하되, 쟁점 상 이견이 존재하는 경우, 이를 다각적으로 논의하는 것이 필요하다고 볼 수 있다. 본 연구는 국내 기록관리학계에서 잊힐 권리와 관련된 논의가 이루어지고 있지 않은 현재의 상황에서 기록관리학 분야에서 잊힐 권리의 의미와 향후 발생할 수 있는 이슈를 도출해볼 수 있었다는데 의의가 있으며, 본 연구의 결과를 중심으로 기록관리학 분야에서 잊힐 권리에 대한 다양한 논의가 이루어지기를 기대한다.

Abstract

This study examined the issues and characteristics that appeared in news and journal articles related to the ‘right to be forgotten’ using text mining analysis. Data for analysis were collected from 2010 to 2020 with the keyword ‘right to be forgotten’. Keyword analysis and topic modeling analysis were performed on the collected data. As a result, in the last 10 years the issues about ‘right to be forgotten’ are not much different in news and journal articles and the approaches also are similar. However, it confirmed common issues and the partial difference between news and journal articles through comparison. Therefore in Archives and Records Management Studies, it is necessary to discuss derived in this study. In particular common issues are considered first but if there are differences in issues, it is needed to discuss them in various ways. This study is meaningful to understand the meaning and to draw issues that may arise in the future of the ‘right to be forgotten’. The results of this study will contribute to be variously discussed on the ‘right to be forgotten’ in Archives and Records Management Studies.

53

온라인 커뮤니티 사이트에 대한 신뢰가 해당 커뮤니티 내에서 이뤄지는 포럼활동에 미치는 영향에 관한 실증연구

문병석(성균관대학교) ; 이건창(성균관대학교) ; 조창현(성균관대학교) ; 강신장(성균관대학교) 2007, Vol.24, No.1, pp.227-250 https://doi.org/10.3743/KOSIM.2007.24.1.227

초록보기

초록

온라인 커뮤니티 사이트는 최근 크게 발전하고 있다. 그 이유는 인터넷이 개인생활 속에 깊숙이 침투하면서 사회 연결망, 즉 social networking 현상이 활성화되고 그에 따라 많은 사용자들이 특정 온라인 커뮤니티 사이트에서 다양한 정보활동을 하고 있기 때문이다. 본 연구에서는 이러한 온라인 커뮤니티 사이트에 대한 중개자 신뢰와 시스템 신뢰가 해당 커뮤니티 내에서의 포럼활동에 대한 신뢰 및 정보품질 만족에 미치는 영향에 관한 실증분석을 하고자 한다. 실증분석을 위한 자료수집은 삼성경제연구소의 온라인 커뮤니티 사이트인 SERI ( HYPERLINK "http://www.seri.org" www.seri.org)를 대상으로 하였으며, 해당 SERI 사이트 내에서 SERI 포럼활동을 하고 있는 사용자들을 대상으로 하여 591명의 유의한 설문자료를 수집하였다. 실증분석결과 다음과 같은 결과를 얻을 수 있었다. 첫째, SERI의 중개자 신뢰와 시스템 신뢰는 해당 SERI 포럼의 정보품질과 시스템품질, 그리고 인지효과성에 긍정적인 영향을 준다. 둘째, SERI의 중개자 신뢰는 해당 SERI 포럼의 인지위험을 줄이는데 기여를 한다. 반면, SERI의 시스템 신뢰는 해당 SERI 포럼의 인지위험에는 유의한 영향을 주지 못한다. 이는 아무리 온라인 커뮤니티 사이트의 지명도가 높다고 하더라도 이는 해당 온라인 커뮤니티 내의 포럼 사용자가 느끼는 인지위험에는 유의한 영향을 주지 못하다는 것을 의미한다. 셋째, 그러나 SERI의 중개자 신뢰와 시스템 신뢰가 높을수록 해당 SERI 포럼의 신뢰와 정보품질만족에는 긍정적인 영향을 준다.

Abstract

With the advent of social networking activity on the Internet, online community sites are becoming more popular. The main purpose of this study is to empirically investigate the influence of intermediary trust and system trust on the forum activity trust and information quality satisfaction. We assume that the intermediary trust and system trust come from the online community site itself, while the forum activity is made within a specific forum allowed on the online community site, and therefore forum activity trust and information quality satisfaction are related to a specific forum. The 591 valid questionnaire data were gathered from the users acting in forums allowed on the Samsung Economic Research Institute (SERI) (www.seri.org). The empirical results are as follows. First, the SERI intermediary trust and its system trust have positive influence on the SERI forum information quality system quality, and perceived effectiveness. Second, the SERI intermediary trust contributes to reducing the SERI forum perceived risks, while the SERI system quality does not. Third, the higher the SERI intermediary trust is, the higher the SERI forum trust and information quality satisfaction increase.

54

해양과학기술 분야 연구자의 정보이용행태에 관한 연구

한종엽(한국해양과학기술원) ; 서만덕(한국해양과학기술원) 2014, Vol.31, No.1, pp.163-187 https://doi.org/10.3743/KOSIM.2014.31.1.163

초록보기

초록

이 연구의 목적은 해양과학기술 분야 연구자의 정보이용행태를 규명하기 위한 것으로, 연구자의 연령, 학력, 연구분야 등 개인적 특성에 따른 차별화된 정보서비스 수립과 전문도서관 서비스 고도화를 위한 기초자료를 확보하는데 있다. 자료수집은 2014년 1월 중 2주간 국내의 대표적인 해양연구기관 소속 연구자 348명을 대상으로 웹설문지를 배포하고 최총 115명의 데이터를 회수하였다. 분석결과, 연구자가 가장 선호하는 정보유형은 학술논문이며, 국내자료보다 해외자료, 인쇄자료보다 전자자료를 주로 이용하고 있다. 정보입수경로는 ‘인터넷정보원’과 ‘소속 도서관 이용’이 높았고, 자료 수집 시 겪는 문제점은 ‘소속도서관의 전자자원 다양성 부족’과 ‘유료정보에 대한 이용부담’에 대한 의견이 가장 많았다. 도서관 만족도의 주요 영향요인은 ‘전자도서관 시스템’, ‘도서관 직원’, ‘도서관 소장자료’ 순으로 나타났고, 이는 정보이용 만족도와 밀접한 관계가 있음을 보여준다. 마지막으로 전문도서관 정보서비스의 수요를 분석한 결과, 향후 중점적으로 실시해야하는 서비스는 ‘맞춤형 정보검색서비스’, ‘프로젝트지원서비스’, ‘연구동향분석서비스’로 나타났다.

Abstract

The purpose of this study is to explain information usage behavior of researchers in the field of ocean science and technology. The study mainly collected primary data for advancement of special library services as well as establishment of personalized information services based on personal characteristics such as age, education level, and area of research. The data collection was conducted for two weeks during January 2014, through a web survey to 348 researchers in national ocean research institutions in South Korea. Total of 115 researchers replied. The analysis showed that the most preferred type of information medium was a scholarly journal. Researchers used more foreign published journals compared to Korean ones, while favoring digital formats rather than printed ones. The top channels for information collection were ‘web search’ and ‘affiliated libraries.’ Most pointed out difficulties of data collection were ‘lack of variety of digital resources in affiliated libraries’ and ‘reluctance to use charged information.’ Key elements for satisfactory user experience were ranked in the order of ‘digital library system,’ ‘library staff,’ and ‘library collection’ and so on, which proves the close relationship between library service and information usage service satisfaction. The result of an assessment for demands in special libraries showed that ‘personalized information search service,’ ‘project support service,’ and ‘research direction analysis service’ should be implemented in the future.

55

청각장애 대학생의 도서관 이용행태와 정보요구에 대한 연구

장보성(국립중앙도서관 자료개발과) 2015, Vol.32, No.1, pp.297-316 https://doi.org/10.3743/KOSIM.2015.32.1.297

초록보기

초록

본 연구는 청각장애 대학생들의 도서관 이용행태와 정보요구를 파악하여, 그들에게 적절한 도서관 서비스 프로그램 등을 개발하기 위한 기초자료를 수립하는데 목적이 있다. 그 목적을 달성하기 위해 청각장애 대학생에게 설문조사와 면접을 실시, 총 155명의 데이터를 수집하였고, 그 데이터를 빈도분석, 교차검증, t-검증, 일원분산분석으로 분석하였다. 연구결과, 청각장애 대학생의 성별, 학년, 장애등급, 출신학교, 학과, 사용 보장구에 따라 도서관 이용형태(정보수집의 어려움, 도서관 이용횟수, 도서관 이용목적, 도서관을 이용하지 않는 이유) 전 영역에서 유의미한 차이를 발견하였다. 그리고 청각장애 대학생의 사용 보장구의 종류, 출신학교, 장애등급에 따른 정보요구의 차이를 분석한 결과, 사용 보장구에 따른 정보요구(최신 자료 확충, 이용자 교육홍보, 수화통역사, 홈페이지 개선, 열람환경개선)는 전 영역에서 유의미한 차이를 보였고, 출신학교에 따른 정보요구(이용자 교육홍보, 수화통역사 배치)와 장애등급에 따른 정보요구(이용자 교육홍보, 열람환경 개선)에서도 일부 유의미한 차이를 보였다.

Abstract

This study looks into how hearing-impaired college students use libraries and what their information needs are in order to prepare basic materials which would be applied for developing a library service program and others proper enough to be used by the hearing-impaired college students. In order to achieve the research goal, the study gathered data from a total of 155 hearing-impaired college students through a survey and interviews and a frequency analysis, a cross validation, a t-test and a one-way ANOVA were conducted to analyze the data. At the end of its research, the study confirmed that the hearing-impaired college students’ gender, years, degrees of disability, schools, specialties and prosthetic appliances would make significant differences in how the students use the libraries. In addition, the study took a look into differences in the hearing-impaired college students’ information needs caused by types of the students’ prosthetic appliances, schools and degrees of disability and found out that these types of the prosthetic appliances the students use would significantly affect every category of their information needs. The study now also understands that both the schools and the degrees of disability would make significant differences in a few categories of the information needs, and the former influences education and promotion targeting users and arrangement of sign language interpreters while the latter affects education and promotion targeting users and improvements in browsing environments.

56

대학도서관의 디지털참고봉사 제공 및 이용 분석

장혜란(상명대학교) 2003, Vol.20, No.4, pp.49-66 https://doi.org/10.3743/KOSIM.2003.20.4.049

초록보기

초록

대학도서관에서 제공하고 있는 디지털참고봉사의 현황을 이해하기 위하여 전국적인 조사를 수행하여 분석하였다. 2003년 7월 8일부터 7월 22일까지 직접 접속을 통한 관찰과 질문응답 기록을 통하여 데이터를 수집하였다. 우리나라 4년제 대학 중 171개 도서관이 디지털참고봉사를 제공하는 것으로 나타났으며, 접근수준, 명칭, 서비스방식, 웹폼, 서비스정책 등 디지털참고봉사 제공 관련 특성과 이용량, 응답비율, 질문유형 등 이용관련 특성을 분석하였다. 현황과 문제점이 식별되고, 도서관 서비스로 확립되기 위한 제언이 이루어졌다.

Abstract

To understand current state of the digital reference services in Korean academic libraries, a comprehensive examination and analysis was performed. Data was collected from July 8. 2003 to July 22. 2003, through direct site examination and recording the question and answer transcripts. The analysis focused on the provision characteristics and use of the digital reference services. Results revealed interesting findings and related problems. Suggestions for future development are provided.

57

웹 포털 이용자 로그 데이터에 기반한 개인화 검색 서비스 모형의 설계 및 평가

이소영(다음커뮤니케이션) ; 정영미(연세대학교) 2006, Vol.23, No.4, pp.179-196 https://doi.org/10.3743/KOSIM.2006.23.4.179

초록보기

초록

이 연구에서는 한국형 포털에 적합한 커뮤니티 기반 개인화 검색 서비스 모형을 제안하였다. 개인화 검색 서비스 모형은 이용자의 관심 주제를 파악하는 과정과 이를 반영한 검색 결과 재순위화 및 관련 주제 카테고리와 질의어 추천 과정으로 구성된다. 개인화 검색 모형의 유용성을 검증하기 위한 실험에서는 포털 사이트 다음에서 12일간 수집한 이용자 로그 데이터를 사용하였다. 실험 결과 개별 이용자의 주제 카테고리 선정에 사용한 카페 활동성 분석과 신지식 활동성 분석 데이터는 매우 유용한 것으로 나타났으며, 개인화 검색 결과와 추천 서비스에 대한 만족도도 비교적 높게 나타났다.

Abstract

This study proposes an expanded model of personalized search service based on community activities on a Korean Web portal. The model is composed of defining subject categories of users, providing personalized search results, and recommending additional subject categories and queries. Several experiments were performed to verify the feasibility and effectiveness of the proposed model. It was found that users’ activities on community services provide valuable data for identifying their interests, and the personalized search service increases users’ satisfaction.

58

선택적 웹 아카이빙을 위한 메타데이터 요소 개발

김희정(한성대학교) ; 이혜원(서울여자대학교) 2007, Vol.24, No.2, pp.143-160 https://doi.org/10.3743/KOSIM.2007.24.2.143

초록보기

초록

다지털 보존의 중요성에 대한 인식이 확산되면서 웹 아카이빙에 대한 관심도 높아지고 있다. 웹 아카이빙은 수집형태나 운영형태, 또는 아카이빙 대상 컬렉션 범위와 포괄정도에 따라서 설계지침이 달라지게 된다. 본 연구에서는 이 중 선택적 아카이빙을 중심으로 아카이빙을 수행하고자 할 때에 고려해야 할 메타에이터 요소들을 분석하였다. 선택적 아카이빙을 수행한 선행 프로젝트에서 제안한 메타데이터를 기반으로 필요한 요소들을 분석하였으며, 분석 결과 더블린코어의 기본적인 요소들과 함께 관리적인 요소에 해당하는 메타데이터 내용들의 확충이 필요함을 확인할 수 있었다.

Abstract

As digital preservation becomes increasingly important, interest in web archiving has correspondingly increased. The processes of web archiving depend on the types of acquisition methods employed, the organization and storage of data, their completeness, and their scope. This study develops metadata for intensive web archiving. Several web archiving projects are reviewed and analyzed. As a result, administrative metadata has been suggested in addition to the basic elements from the Dublin Core.

59

동시출현단어 분석에 기반한 메타데이터 분야의 지적구조에 관한 연구

최예진(이화여자대학교 문헌정보학과) ; 정연경(이화여자대학교) 2016, Vol.33, No.3, pp.63-83 https://doi.org/10.3743/KOSIM.2016.33.3.063

초록보기

초록

다양한 매체와 유형으로 생산되는 정보자원에 대한 이용이 높아짐에 따라, 정보자원을 기술하기 위한 정보조직의 도구로서 메타데이터에 대한 중요성이 높아지고 있다. 본 연구에서는 메타데이터 분야의 연구 영역을 파악할 수 있도록 동시출현단어 분석을 사용하여 메타데이터 분야의 지적 구조를 규명하고자 하였다. 이를 위하여 1998년 1월 1일부터 2016년 7월 8일까지 Web of Science 핵심컬렉션에 등재된 저널에 게재된 문헌을 대상으로 ‘metadata’라는 질의어로 Topic 검색을 수행하여, 총 727건의 논문에 대한 서지정보를 수집하였다. 이 중 저자 키워드를 가진 410건의 논문의 저자 키워드로 수집하고, 전처리 과정을 거쳐 저자 키워드 총 1,137개를 추출하여 최종적으로 빈도수 6회 이상의 키워드 37개를 분석대상으로 선정하였다. 이후 메타데이터 분야의 지적구조 규명을 위해 첫째, 네트워크 분석을 통하여 2개 영역 9개 군집을 도출하였으며, 메타데이터 분야 키워드들의 지적 관계를 시각화하고, 중심성 분석을 통한 전역 중심 키워드와 지역 중심이 높은 키워드를 제시하였다. 둘째, 군집분석을 실시하여 형성된 6개의 군집을 다차원축적지도상에 표시하였으며, 각 키워드들 간의 상관관계에 따른 지적구조를 제시하였다. 이러한 연구의 결과는 메타데이터 분야의 지적구조를 시각적으로 파악할 수 있게 하며, 향후 메타데이터 관련 교육과 연구의 방향성 모색에 유용하게 사용될 수 있을 것이다.

Abstract

As the usage of information resources produced in various media and forms has been increased, the importance of metadata as a tool of information organization to describe the information resources becomes increasingly crucial. The purposes of this study are to analyze and to demonstrate the intellectual structure in the field of metadata through co-word analysis. The data set was collected from the journals which were registered in the Core collection of Web of Science citation database during the period from January 1, 1998 to July 8, 2016. Among them, the bibliographic data from 727 journals was collected using Topic category search with the query word ‘metadata’. From 727 journal articles, 410 journals with author keywords were selected and after data preprocessing, 1,137 author keywords were extracted. Finally, a total of 37 final keywords which had more than 6 frequency were selected for analysis. In order to demonstrate the intellectual structure of metadata field, network analysis was conducted. As a result, 2 domains and 9 clusters were derived, and intellectual relations among keywords from metadata field were visualized, and proposed keywords with high global centrality and local centrality. Six clusters from cluster analysis were shown in the map of multidimensional scaling, and the knowledge structure was proposed based on the correlations among each keywords. The results of this study are expected to help to understand the intellectual structure of metadata field through visualization and to guide directions in new approaches of metadata related studies.

60

도서관 공공데이터의 품질에 관한 연구: 도서관 정보나루의 도서 상세 조회 API를 중심으로

양수완(중앙대학교 문헌정보학과 박사과정 수료) 2020, Vol.37, No.4, pp.181-206 https://doi.org/10.3743/KOSIM.2020.37.4.181

초록보기

초록

공공데이터의 개방과 제공의 활성화와 함께, 공공도서관이 업무 중에 생산한 서지 데이터와 대출 이력과 같은 데이터가 도서관 공공데이터로 제공되고 있다. 본 논문은 도서관 공공데이터의 품질을 진단하고, 그 결과를 바탕으로 도서관 공공데이터의 품질을 높일 개선방안을 제안하고자 한다. 먼저, 문헌정보학 영역에서 공공데이터에 관해 이루어진 연구를 개괄한다. 그다음으로, 도서관 공공데이터 개방 플랫폼인 도서관 정보나루의 오픈 API를 통해 확보한 도서관 공공데이터의 완전성과 정확성을 진단한다. 마지막으로, 데이터 품질 진단 결과에 바탕을 개선방안을 도출한다. 완전성을 진단한 결과, 도서의 식별과 검색을 위 필수적인 서지 요소에서 다수의 공백이 확인되었다. 정확성을 진단한 결과, 값의 유형, 값의 범위, 제한조건을 따르지 않는 부정확한 서지 요소가 확인되었다. 본 연구는 데이터 품질 진단 분석 결과를 바탕으로, 도서관 정보나루의 데이터 수집 절차 개선, 데이터별 스키마 구축, 데이터 수집과 데이터 처리에 관한 안내 제공, 원자료 공개를 제언하였다.

Abstract

With the popularization of open government data, Library-related open government data is also open and utilized to the public. The purpose of this paper is to diagnose the quality of library-related open government data and propose improvement measures to enhance the quality based on the diagnosis result. As a result of diagnosing the completeness of the data, a number of blanks are identified in the bibliographic elements essential for identifying and searching a book. As a result of diagnosing the accuracy of the data, the bibliographic elements that are not compliant with the data schema have been identified. Based on the result of data quality diagnosis, this study suggested improving the data collection procedure, establishing data set schema, providing details on data collection and data processing, and publishing raw data.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지