In recent years, theories of image and sound analysis have been proposed to work with text retrieval systems and have progressed quickly with the rapid progress in data processing speeds. This study proposes a common representation format for multimedia documents (CRFMD) composed of both images and text to form a single data structure. It also shows that image classification of a given test set is dramatically improved when text features are encoded together with image features. CRFMD might be applicable to other areas of multimedia document retrieval and processing, such as medical image retrieval, World Wide Web searching, and museum collection retrieval.
Record management metadata schema should have robust structure to represent not only elements innate in records itself but also management elements for the life cycle of records according to business activities. To realize these requirement, Information model for record domain is needed and also Metadata framework supporting semantic refinement and data element specialization required in record management business or applications are required. This study analyse main principles and characteristics of metadata scheme, and then suggested a novel method to develope schema systematically and effectively. This study propose information model and set of core data elements of records management based on ISO 15489 and 230381, and show how to implement the record management framework.
Several performance factors which have applied to the automatic indexing with controlled vocabulary and text categorization based on Rocchio algorithm were examined, and the simple method for performance improvement of them were tried. Also, results of the methods using Rocchio algorithm were compared with those of other learning based methods on the same conditions. As a result, keeping with the strong points which are implementational easiness and computational efficiency, the methods based Rocchio algorithms showed equivalent or better results than other learning based methods(SVM, VPT, NB). Especially, for the semi-automatic indexing(computer-aided indexing), the methods using Rocchio algorithm with a high recall level could be used preferentially.
Recent advancements in information technology and the Internet have caused an explosive increase in the information available and the means to distribute it. However, such information overflow has made the efficient and accurate search of information a difficulty for most users. To solve this problem, an information retrieval and filtering system was developed as an important tool for users. Libraries and information centers have been in the forefront to provide customized services to satisfy the user's information needs under the changing information environment of today. The aim of this study is to propose an efficient information service for libraries and information centers to provide a personalized recommendation system to the user. The proposed method overcomes the weaknesses of existing systems, by providing a personalized hybrid recommendation method for multimedia contents that works in a large-scaled data and user environment. The system based on the proposed hybrid method uses an effective framework to combine Association Rule with Collaborative Filtering Method.
This study aims to analyze substantive theories in user studies on Web-based information seeking in order to explore the implications of building a unified theory in this field. Substantive theory is defined as research which generalized user behaviors during information seeking on the Web or suggested significant relationships between variables in Web-based information seeking context. The sample of research articles were published in four core journals from 1995 to 2005. Among 42 articles of user-based information seeking on the Web, ten articles were contributed to the development of substantive theories. Findings show that substantive theories used thirteen dependent and ten independent variables and 22 relationships were considered as significant. They have a tendency to focus on individual differences of users and on some specific narrow aspects(i.e. searching or searching time) of user behaviors. Findings suggested a range of implications for a unified theory construction.
In this study, experiments for selection of association terms were conducted in order to discover the optimum method in selecting additional terms that are related to an initial query term. Association term sets were generated by using support, confidence, and lift measures of the Apriori algorithm, and also by using the similarity measures such as GSS, Jaccard coefficient, cosine coefficient, and Sokal & Sneath 5, and mutual information. In performance evaluation of term selection methods, precision of association terms as well as the overlap ratio of association terms and relevant documents' indexing terms were used. It was found that Apriori algorithm and GSS achieved the highest level of performances.
The h-index, also called as Hirsch-index, is a new tool for measuring research outputs by citations. This h-index is not only easy to calculate, but also robust enough to handle various citation data. After its suggestion by Hirsch in 2005, many researchers applied the h-index to their own areas, and some others tried to improve the weak points of the h-index such as low discriminating power. Firstly, several of these efforts are reviewed in the present article, and then novel indexes are suggested to measure research outputs by citations more fairly and reasonably. Calculating these indexes on both artificial data and real data showed that the newly suggested indexes in this article can replace the h-index and its variants.
The main purpose of the archives is to maintain a history of the organization's functions and activities. Selecting valuable records for the permanent archives through objective appraisal from among many records that reflects the activities of organization is very important but also very difficult. The quantity and complexity of comtemporary records has rapidly expanded because of electronic storage, and with the information environment, and it is now possible to manage and control the records an entire lifetime. In this study, it is proposed the appraisal methods based on a business analysis that is combined the macro-appraisal factor and micro-appraisal factor; The former is functions and organizations as the objective guideline of selecting valuable records from the beginningand the latter is a contents appraisal (evidential value) of the individual records.
To investigate the present states and analyze the library anxiety among academic library users, A University library users sampled systematically and surveyed. Data collected from 707 students are analyzed and compared with the previous studies. The average LAS score is 2.55 and show differences in sex, grade levels, and major areas of study. Library use instruction reduces anxiety subscale of barriers with staff. Library anxiety is also differed according to the frequency of library visit. Especially, the anxiety subscale of affective barriers is decreased according to the grade level and frequency of library visit. Based on the results, recommendations are suggested to overcome the library anxiety.
As the fact that science is the driving force behind technological development and that technological innovation contributes to economic development has been proved empirically convincing, the interaction between science and technology is highly emphasized in advanced countries. But, Korea has not been active in conducting research on science-based technological development and on the scientific fields that have strong relationships with Korean technology. This study attempts to explore the influence of scientific research papers cited in US patents by Koreans on other US patents and identify the interactions between scientific research papers and patents, by examining the scientific references cited in the Korean-originated US patents.
This study suggested indispensable elements of metadata for electronic resource management and effectively manage of electronic resource in library. Therefore, this research analyzed into the data elements of DLF ERMI's ERMS data structure, foreign three universities. The Data elements are verified by domestic ERM specialist. As result of study, Trial categories are 12 elements, Consortium categories are 15 elements, License categories are 33 elements, Electronic resource information categories are 21 elements, Access/Administrative information categories are 20 elements, Usage statistics categories are 13 elements, Workflow categories are 14 elements, Contact information categories are 18 elements.