바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

Investigating the Efficient Method for Constructing Audio Surrogates of Digital Video Data

Journal of the Korean Society for Information Management / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2009, v.26 no.3, pp.169-188
https://doi.org/10.3743/KOSIM.2009.26.3.169

  • Downloaded
  • Viewed

Abstract

The study proposed the algorithm for automatically summarizing the audio information from a video and then conducted an experiment for the evaluation of the audio extraction that was constructed based on the proposed algorithm. The research results showed that first, the recall and precision rates of the proposed method for audio summarization were higher than those of the mechanical method by which audio extraction was constructed based on the sentence location. Second, the proposed method outperformed the mechanical method in summary making tasks, although in the gist recognition task(multiple choice), there is no statistically difference between the proposed and mechanical methods. In addition, the study conducted the participants' satisfaction survey regarding the use of audio extraction for video browsing and also discussed the practical implications of the proposed method in Internet and digital library environments.

keywords
tag, annotation, digital library, 비디오 요약, 오디오 요약, 영상 요약, 텍스트 초록, 멀티미디어 기반 요약, 텍스트 기반 요약, 소셜 메타데이터, tag, annotation, digital library

Reference

1.

김재곤. (2000). 효율적인 비디오 브라우징을 위한 동적 요약 및 요약 기술구조. 방송공학회논문지, 5(1), 82-93.

2.

정영미. (2005). 정보검색연구:구미무역 출판부.

3.

진성호. (2005). 개인화된 의미기반 컨텐츠 소비를 위한 지능형방송 시스템과 서비스. 방송공학회 논문지, 10(3), 422-435.

4.

Edmunson, H. P. (1969). New methods in automatic extracting. Journal of the ACM, 16(2), 265-285.

5.

Furini, M. (2006). An Audio- video smmarisation scheme based on audio and video analysis (1209-1213). Proceedings of the IEEE Consumer Communications and Networking Conference.

6.

Gunther, R. (2004). Using 3D sound as a navigational aid in virtual environments. Behaviour and Information Technology, 23(6), 435-446.

7.

Hauptmann, A. G. (2005). Lessons for the future from a decade of informedia video analysis research. http://www.informedia.cs.cmu.edu/documents/CIVR05_Hauptmann.pdf.

8.

Kristin, B. (2006). Audio surrogation for digital video: A design framework. UNC School of Information and Library Science.

9.

Kupiec, J. (1995). A trainable document summarizer (68-73). Proceedings of the Eighteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

10.

Luhn, H. P. (1958). The automatic creation of literature abstracts. IBM Journal of Research and Development, 2(2), 159-165.

11.

Mani, I. (2001). Automatic summarization:John Benjamins Publishing Co.

12.

Marchionini, G. (2006). The Open Video Digital Library: A Möbius strip of research and practice. Journal of the American Society for Information Science and Technology, 57(12), 1623-1643.

13.

Money, A. G. (2008). Video summarisation: A conceptual framework and survey of the state of the art. Journal of visual communication and image representation, 19(2), 121-143.

14.

Money, A. G. (2009). Analysing user physiological responses for affective video summarisation. Displays, 30, 59-70.

15.

Myaeng, S. H. (1999). Development and evaluation of a statistically-based document summarization system in Advances in automatic text summarization:The MIT Press.

16.

Over, P. (2005). TRECVID, 2005: An introduction (1-14). Proceedings of the TRECVID.

17.

Schmandt, C. Audio- Streamer: Exploiting simultaneity for listening. http://doi.acm.org.libproxy.lib.unc.edu/10.1145/223355.223533.

18.

Smeaton, A. F. (2007). Techniques used and open challenges to the analysis, in- dexing and retrieval of digital video. Information Systems, 32, 545-559.

19.

Smeaton, A. F. (2006). A usage study of retrieval modalities for video shot retrieval. Information Processing and Management, 42(5), 1330-1344.

20.

Song, Y. (2007). Effects of audio and visual surrogates for making sense of digital video (867-876). Proceedings of CHI 2007.

21.

Sparck Jones, K. (2007). Automatic summarising: The state of the art. Information Processing and Management, 43, 1449-1481.

22.

Witbrock, M. (1998). Speech recognition for a digital video library. Journal of the American Society for Information Science and Technology, 49(7), 619-632.

23.

Yang, M. (2005). Deci- phering visual gist and its implications for video retrieval and interface de- sign (2-7). Conference on Human Factors in Computing Systems(CHI).

Journal of the Korean Society for Information Management