Journal of the Korean Society for Information Management, Korean Society for Information Management

P-ISSN 1013-0799
E-ISSN 2586-2073
KCI

Search term: hyperlink, search results: 5

이지숙 (NHN Corp.); 정영미 (Yonsei University) 2007, Vol.24, No.3, pp.201-218 https://doi.org/10.3743/KOSIM.2007.24.3.201


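Abstract (Korean)

This study proposes an effective topic distillation algorithm that retrieves websites relevant to a query, following the definition of topic distillation given by TREC, and evaluates its performance experimentally. The algorithm first selects relevant websites from the web page search results for a query and then uses the structure of each selected site to compute a relevance score for the query. In retrieval experiments using the TREC .GOV test collection and the queries and relevance judgment lists of TREC-2004, the algorithm retrieved at least two relevant sites within the top 10, a relatively high level of performance. An analysis of the TREC-2004 relevance judgment lists also confirmed that the definition of topic distillation had not always been strictly applied when the relevant documents were selected; when performance was re-evaluated with a revised relevance list, the performance of the proposed algorithm improved markedly.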

Abstract

This study proposes a topic distillation algorithm that ranks the relevant sites selected from retrieved web pages, and evaluates the performance of the algorithm. The algorithm calculates the topic score of a site using its hierarchical structure. The TREC .GOV test collection and a set of TREC-2004 queries for the topic distillation task are used for the experiment. The experimental results showed that the algorithm returned at least two relevant sites in the top ten retrieval results. We performed an in-depth analysis of the relevant sites list provided by TREC-2004 and found that the definition of topic distillation was not strictly applied in selecting relevant sites. When we re-evaluated the retrieved sites and sub-sites using the revised list of relevant sites, the performance of the proposed algorithm improved significantly.
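As a rough illustration of scoring a site from its hierarchical structure, the sketch below groups retrieved pages by host and sums their page scores, discounted by URL path depth. The depth discount, the `decay` parameter, the function names, and the sample URLs are all assumptions made for illustration; the abstract does not state the paper's actual scoring formula.

```python
from collections import defaultdict
from urllib.parse import urlparse


def path_depth(url):
    """Count the non-empty path segments, e.g. /a/b/page.html -> 3."""
    return len([seg for seg in urlparse(url).path.split("/") if seg])


def site_scores(page_results, decay=0.5):
    """Aggregate page-level scores into per-site scores.

    page_results: iterable of (url, page_score) pairs from a page-level search.
    decay: hypothetical discount per level of path depth, so that pages near
           the site root contribute more than deeply nested pages.
    """
    totals = defaultdict(float)
    for url, score in page_results:
        host = urlparse(url).netloc
        totals[host] += score * (decay ** path_depth(url))
    # Highest-scoring sites first.
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


# Hypothetical page-level results for one query.
results = [
    ("http://www.example.gov/", 1.0),
    ("http://www.example.gov/topics/energy.html", 0.8),
    ("http://data.example.gov/reports/2004/a.html", 0.9),
]
ranking = site_scores(results)
```

Here `ranking` lists hosts in descending order of aggregated score; a real system would also need a rule for deciding where one site or sub-site ends and another begins.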

Statistical Metadata for Web Users: A Case Study Evaluating the Level of Metadata Provision on Statistical Information Websites

오정선 (University of North Carolina, USA) 2007, Vol.24, No.2, pp.161-179 https://doi.org/10.3743/KOSIM.2007.24.2.161


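Abstract (Korean)

As the forms and types of information resources provided through digital libraries diversify, defining and providing an adequate level of metadata for each type of material has emerged as a further challenge. Because statistical data, unlike ordinary text, require interpretation of data expressed as numbers, metadata in the statistical domain are recognized as an essential tool not only for retrieving statistical data but also for accurately understanding and using the data retrieved. However, existing research on statistical metadata has focused on the specialized needs of the agencies that produce or analyze statistics, so discussion from the perspective of general users who access statistical data over the Internet is comparatively lacking. As a starting point for discussing statistical metadata for general users, this study evaluated the current state of the metadata offered to users who access statistical data over the Internet, through a content analysis of the websites of two U.S. federal statistical agencies, the Bureau of Labor Statistics (BLS, http://www.bls.gov/) and the Energy Information Administration (EIA, http://www.eia.doe.gov/). The results of this case study show that, despite the vast amount of data offered through these websites, the level of metadata provision falls short of the minimum level defined by international organizations, reaffirming the need for user-centered metadata design.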

Abstract

As increasingly diverse kinds of information materials become available on the Internet, it becomes a challenge to define an adequate level of metadata provision for each type of material in the context of digital libraries. This study explores issues of metadata provision for a particular type of material, statistical tables. Statistical data always involve numeric values that should be interpreted with an understanding of the underlying concepts and constructs. Because of these unique data characteristics, metadata in the statistical domain is essential not only for finding and discovering relevant data, but also for understanding and using the data found. However, in statistical metadata research, more emphasis has been put on the question of what metadata is necessary for processing the data and less on what metadata should be presented to users. In this study, a case study was conducted to gauge the status of metadata provision for statistical tables on the Internet. The websites of two federal statistical agencies in the United States were selected and analyzed using a content analysis method. The results, which show insufficient and inconsistent provision of metadata, demonstrate the need for more discussion of statistical metadata from the ordinary web user's perspective.

A Study on Extracting Article Body Text from News Web Pages

이용구 (University of Pittsburgh) 2009, Vol.26, No.1, pp.305-320 https://doi.org/10.3743/KOSIM.2009.26.1.305


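Abstract (Korean)

News pages provided on the web contain not only the needed information but also a great deal of unnecessary information. This unnecessary information degrades the performance and efficiency of systems that process news. This study presented sentence-based and block-based news article extraction methods for extracting news content from web pages, and also sought a way to combine them for optimal performance. In the experiments, applying the sentence-based extraction method after removing hyperlink text from the web pages was effective, and combining it with the block-based extraction method gave still better results. The sentence-based extraction method was found to raise extraction recall.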

Abstract

News pages provided through the web contain unnecessary information, which lowers the performance and efficiency of news processing systems. This study proposes news content extraction methods based on sentence identification and on block-level tags in news web pages. To obtain optimal performance, combinations of these methods were applied. The results showed good performance for an extraction method that eliminated hyperlink text from web pages and then applied sentence identification. Moreover, this method showed better results when combined with the extraction method based on block-level tags. Extraction methods that used sentence identification were effective in raising the extraction recall ratio.
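The step of eliminating hyperlink text and then keeping sentence-like text can be sketched with Python's standard `html.parser`. This is a minimal sketch under stated assumptions, not the paper's method: the sentence test (terminal punctuation plus a minimum word count), the `min_words` threshold, the class and function names, and the sample HTML are all hypothetical.

```python
import re
from html.parser import HTMLParser


class LinkFreeText(HTMLParser):
    """Collect page text, skipping anything inside <a>, <script>, or <style>."""

    SKIP = {"a", "script", "style"}

    def __init__(self):
        super().__init__()
        self.skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside the skipped elements.
        if self.skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_sentences(html, min_words=5):
    """Return text chunks that look like sentences (assumed heuristic)."""
    parser = LinkFreeText()
    parser.feed(html)
    parser.close()
    ends_like_sentence = re.compile(r"[.!?]$")
    return [
        chunk for chunk in parser.chunks
        if ends_like_sentence.search(chunk) and len(chunk.split()) >= min_words
    ]


sample = (
    "<div><a href='/'>Home</a> <a href='/sports'>Sports</a></div>"
    "<p>The city council approved the new budget on Monday.</p>"
    "<p>Read more: <a href='/x'>Council delays vote again.</a></p>"
    "<div>Copyright 2009</div>"
)
body = extract_sentences(sample)
```

On this sample, the navigation links, the linked "read more" headline, and the copyright line are dropped, leaving only the article sentence. A block-based method would instead score whole block-level elements such as `<div>` or `<p>`.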

A Study on Providing a LibraryLookup Service Using Bookmarklets

구중억 (Korea Basic Science Institute); 이응봉 (Chungnam National University) 2006, Vol.23, No.3, pp.49-68 https://doi.org/10.3743/KOSIM.2006.23.3.049


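Abstract (Korean)

To provide barrier-free information services to library users, it is necessary to improve the accessibility, usability, and searchability of the OPAC and to increase the value of the ISBN as a tool for searching, identifying, and browsing books. A bookmarklet is a small piece of JavaScript that can be saved by adding it to a web browser's favorites or dragging it to the toolbar. The open-source bookmarklet is a simple but powerful search tool that extracts an ISBN from a web page and then searches the library's OPAC for the book with that ISBN. Overseas library system vendors, libraries, OCLC, and others provide bookmarklets that let users look up library holdings and loan information in real time while browsing an online bookstore's web pages. This study therefore analyzed application cases of four types of bookmarklets developed and used overseas and summarized their features, strengths, and weaknesses. From this analysis it derived the basic requirements and an application model for bookmarklets, and proposed a way to provide a LibraryLookup service using bookmarklets for the OPACs of Korean libraries and for online bookstores.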

Abstract

It is necessary to enhance the value of the ISBN as a tool for book search, identification, and browsing, and to improve the accessibility and search capability of the library OPAC. A bookmarklet is a small piece of JavaScript that can be saved as a URL in a web browser bookmark or a web page hyperlink. An open-source bookmarklet can extract an ISBN from a web page and search for the book in a library OPAC using that ISBN, so it is recognized as a simple but powerful search tool. Overseas, commercial library system vendors, libraries, OCLC, and others provide bookmarklets that allow a user to look up library holdings and loan information in real time while browsing an online bookshop's web pages. This paper therefore compared and analyzed international examples of bookmarklet applications and proposed a LibraryLookup service in which library OPACs and online bookshops can make use of bookmarklets.
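The core logic of such a lookup, extracting an ISBN from page text and building an OPAC search URL, can be sketched as follows. A real bookmarklet runs as JavaScript in the browser; Python is used here only to show the logic. The candidate pattern, the function names, the `opac.example.org` address, and the `isbn` query parameter are hypothetical and do not correspond to any particular library system. The check-digit rules are the standard ISBN-10 and ISBN-13 ones.

```python
import re
from urllib.parse import urlencode

# Loose candidate pattern: digits, hyphens, spaces, and a possible final X.
CANDIDATE = re.compile(r"[0-9][0-9Xx -]{8,16}[0-9Xx]")


def is_valid_isbn10(s):
    """ISBN-10: weights 10..1, with 'X' as 10 in the last place, sum mod 11 == 0."""
    if not re.fullmatch(r"\d{9}[\dX]", s):
        return False
    total = sum((10 - i) * (10 if ch == "X" else int(ch)) for i, ch in enumerate(s))
    return total % 11 == 0


def is_valid_isbn13(s):
    """ISBN-13: alternating weights 1 and 3, sum mod 10 == 0."""
    if not re.fullmatch(r"\d{13}", s):
        return False
    total = sum(int(ch) * (1 if i % 2 == 0 else 3) for i, ch in enumerate(s))
    return total % 10 == 0


def find_isbns(text):
    """Return the valid ISBNs found in text, normalized, in order of appearance."""
    found = []
    for match in CANDIDATE.finditer(text):
        isbn = re.sub(r"[ -]", "", match.group()).upper()
        if (is_valid_isbn10(isbn) or is_valid_isbn13(isbn)) and isbn not in found:
            found.append(isbn)
    return found


def opac_search_url(isbn, base="https://opac.example.org/search"):
    """Build a search URL for a hypothetical OPAC that accepts ?isbn=..."""
    return base + "?" + urlencode({"isbn": isbn})


page_text = "ISBN-13: 978-0-306-40615-7 and ISBN-10: 0-306-40615-2, order no. 1234567890."
isbns = find_isbns(page_text)
url = opac_search_url(isbns[0])
```

Validating the check digit keeps order numbers and similar digit strings from being mistaken for ISBNs; the actual URL format would have to follow whatever the target OPAC accepts.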

An Empirical Study of the Effect of Trust in an Online Community Site on Forum Activity within That Community

문병석 (Sungkyunkwan University); 이건창 (Sungkyunkwan University); 조창현 (Sungkyunkwan University); 강신장 (Sungkyunkwan University) 2007, Vol.24, No.1, pp.227-250 https://doi.org/10.3743/KOSIM.2007.24.1.227


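Abstract (Korean)

Online community sites have grown greatly in recent years. The reason is that, as the Internet has penetrated deeply into personal life, social networking has become active, and many users accordingly engage in a variety of information activities on particular online community sites. This study empirically analyzes how intermediary trust and system trust in such an online community site affect trust in forum activity within the community and satisfaction with information quality. Data for the empirical analysis were collected from SERI (www.seri.org), the online community site of the Samsung Economic Research Institute; 591 valid survey responses were gathered from users participating in SERI forums on the site. The empirical analysis yielded the following results. First, SERI's intermediary trust and system trust positively affect the information quality, system quality, and perceived effectiveness of the SERI forums. Second, SERI's intermediary trust contributes to reducing the perceived risk of the SERI forums, whereas SERI's system trust has no significant effect on that perceived risk. This means that however well known an online community site may be, its reputation does not significantly affect the perceived risk felt by forum users within the community. Third, nevertheless, the higher SERI's intermediary trust and system trust, the more positive the effect on trust in the SERI forums and on satisfaction with information quality.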

Abstract

With the advent of social networking activity on the Internet, online community sites are becoming more popular. The main purpose of this study is to empirically investigate the influence of intermediary trust and system trust on forum activity trust and information quality satisfaction. We assume that intermediary trust and system trust come from the online community site itself, while forum activity takes place within a specific forum hosted on the online community site, and therefore forum activity trust and information quality satisfaction are related to a specific forum. A total of 591 valid questionnaire responses were gathered from users active in forums hosted by the Samsung Economic Research Institute (SERI) (www.seri.org). The empirical results are as follows. First, SERI intermediary trust and system trust have a positive influence on SERI forum information quality, system quality, and perceived effectiveness. Second, SERI intermediary trust contributes to reducing the perceived risk of SERI forums, while SERI system trust does not. Third, the higher SERI intermediary trust and system trust are, the higher SERI forum trust and information quality satisfaction become.
