바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

Combining Ego-centric Network Analysis and Dynamic Citation Network Analysis to Topic Modeling for Characterizing Research Trends

Journal of the Korean Society for Information Management / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2015, v.32 no.1, pp.153-169
https://doi.org/10.3743/KOSIM.2015.32.1.153

  • Downloaded
  • Viewed

Abstract

The combined approach of using ego-centric network analysis and dynamic citation network analysis for refining the result of LDA-based topic modeling was suggested and examined in this study. Tow datasets were constructed by collecting Web of Science bibliographic records of White LED and topic modeling was performed by setting a different number of topics on each dataset. The multi-assigned top keywords of each topic were re-assigned to one specific topic by applying an ego-centric network analysis algorithm. It was found that the topical cohesion of the result of topic modeling with the number of topic corresponding to the lowest value of perplexity to the dataset extracted by SPLC network analysis was the strongest with the best values of internal clustering evaluation indices. Furthermore, it demonstrates the possibility of developing the suggested approach as a method of multi-faceted research trend detection.

keywords
Ego-centric network, SPLC, Topic Modeling, DBI, Index S, 자아중심 네트워크, 토픽 모델링

Reference

1.

박자현. (2013). 토픽모델링을 활용한 국내 문헌정보학 연구동향 분석. 정보관리학회지, 30(1), 7-32. http://dx.doi.org/10.3743/KOSIM.2013.30.1.007.

2.

서은경. (2013). Detecting Research Trends in Korean Information Science Research, 2000-2011. 정보관리학회지, 30(4), 215-239. http://dx.doi.org/10.3743/KOSIM.2013.30.4.215.

3.

유소영. (2013). 문헌 단위 인용 네트워크 구조와 Topic Descriptor Profile을 활용한 연구경향 분석에 관한 연구 (39-58). 2013 한국정보관리학회 추계 학술대회 논문집.

4.

이재윤. (2011). 계량서지적 기법을 활용한 LED 핵심 주제영역의 연구 동향 분석. Journal of Information Science Theory and Practice, 42(3), 1-26.

5.

정우성. (2013). 과학계량학 연구동향 및 과학기술 정책 분야 응용가능성. 한국과학기술기획평가원.

6.

한국과학기술정보연구원. 미래기술백서 2013.

7.

한국과학기술정보연구원. 미래기술백서 2014.

8.

Davies, D. L.. (1979). A cluster separation measure. Pattern Analysis and Machine Intelligence. IEEE Transactions on, 2, 224-227.

9.

de Nooy, W.. (2011). Exploratory social network analysis with Pajek:Cambridge University Press.

10.

Ding, W.. (2014). Dynamic topic detection and tracking: A comparison of HDP, C-word, and cocitation methods. Journal of the Association for Information Science and Technology, 65(10), 2084-2097.

11.

Garfield, E.. (2001). From computational linguistics to algorithmic historiography. Lazerow lecture held in conjunction with panel on “Knowledge and language: Building large-scale knowledge bases for intelligent applications,” presented at the University of Pittsburgh. http://garfield.library.upenn.edu/papers/pittsburgh92001.pdf.

12.

Garfield, E.. (2001). From bibliographic coupling to co-citation analysis via algorithmic historio-bibliography: A citationist’s tribute to Belver C. Griffith. http://garfield.library.upenn.edu/papers/drexelbevergrif?th92001.pdf.

13.

Garfield, E.. (2002). Algorithmic citation-linked historiography-Mapping the literature of science (14-24). Proceedings of the American Society for Information Science and Technology Annual Meeting.

14.

Garfield, E.. (2003). Why do we need algorithmic historiography?. Journal of the American Society for Information Science and Technology, 54(5), 400-412.

15.

Griffiths, T. L.. (2004). Finding scientific topics (5228-5235). Proceedings of the National Academy of Sciences.

16.

Huang, W.. A Neural Probabilistic Model for Context Based Citation Recommendation.

17.

Hansen, D.. (2010). Analyzing social media networks with NodeXL: Insights from a connected world:Morgan Kaufmann.

18.

Hummon, N. P.. (1989). Connectivity in a citation network: The development of DNA theory. Social Networks, 11, 39-63.

19.

Jiang, Z.. (2015). Chronological scientific information recommendation via supervised dynamic topic modeling (453-458). In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining. ACM.

20.

Mccallum, A.. (2009). Rethinking LDA: Why priors matter (1973-1981). In Advances in Neural Information Processing Systems.

21.

Ramage, D.. (2009). Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora (248-256). In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics.

22.

Ramage, D.. (2009). Topic modeling for the social sciences (-). In NIPS 2009 Workshop on Applications for Topic Models: Text and Beyond.

23.

Rousseeuw, P. J.. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 53-65.

24.

Saka, A.. (2014). Science Map 2010&2012. .

25.

Song, Z.. (2010). Research on text categorization based on LDA.

26.

Teh, Y. W.. (2006). Hierarchical dirichlet processes. Journal of the American Statistical Association, 101(476), 1566-1581.

27.

Walesiak M.. (2010). The cluster sim package for R. http://keii.ue.wroc.pl/clusterSim.

28.

유소영. (2014). Exploratory Study of Developing a Synchronization-Based Approach for Multi-step Discovery of Knowledge Structures. Journal of Information Science Theory and Practice, 2(2), 16-32. http://dx.doi.org/10.1633/JISTaP.2014.2.2.2.

Journal of the Korean Society for Information Management