文献“大数据”分析软件Citespace和SCI2的对比分析研究
The Comparison of “Big Data” Literatures Analysis Tools: Taking Citespace and SCI2 as Examples
DOI: 10.12677/CSA.2017.76069, PDF, HTML, XML, 下载: 2,514  浏览: 4,975  国家自然科学基金支持
作者: 杨 静*, 程昌秀:北京师范大学环境演变与自然灾害教育部重点实验室,北京;北京师范大学地理科学学部,北京;北京师范大学地理数据与应用分析中心,北京
关键词: 文献计量学数据挖掘CitespaceSCI2Bibliometrics Data Mining Citespace SCI2
摘要: 当今世界,信息爆炸已经累积到了一个可以引发变革的时刻,对学术研究者而言,知识更新迭代周期不断缩短,然而海量、多元的数据并不等于精准的数据贮备。如何快速精准获取重要学术资源,掌握研究前沿、热点与脉络是大数据时代学术研究者共同面临的挑战。本文以Citespace和Science of Science  (SCI2) Tool为例,从数据源、数据预处理、数据可视化几方面总结了二者在文献数据挖掘中的优势互补作用,并加以地震实例分析。二者互补情况主要有以下几点:1) 数据源。Citespace可以分析中英文文献,而SCI2只限英文文献,不过英文数据源更广;2) 数据预处理。SCI2能自动合并拼写相近的单词,但有时会出现误合或漏合的情况;而Citespace则需人工添加同义词对照表,可以避免这种情况,但处理工作量较大;3) 网络输出。SCI2输出的网络可编辑能力强;而Citespace编辑能力相对较差,不过其剪枝功能使得剪枝后的网络研究脉络更清晰;4) 可视化方式。SCI2除无向网络外还可生成有向网络,有向网络能更好地反映研究之间的指向关系;Citespace的网络节点除能表达出现次数多少之外,还能给出节点中心度,并且Citespace的时域图能更好地反映研究随时间变化的特点。
Abstract: Recently, the information is too much to induce revolution. For researchers, the speed of update of knowledge is faster and faster. However, the mass and various data do not mean high accurate. It is a big challenge to get the important academic resources quickly and accurately and master research’s skeleton, frontiers and hot topics. This article took the Citespace and SCI2 as example, then summarized their complementary roles in data acquisition, data preprocessing, data visualization. After that an earthquake case was shown. Their complementarities are as follows: a) Data sources. Citespace can analyze both Chinese and English articles. SCI2 is just for English articles and it has more English data source than Citespace. b) Data preprocessing. SCI2 can merge the synonyms automatically but may miss merged. And as Citespace need a lot of manual work to build a synonym table, it is laborious. c) Network output. The network outputted by SCI2 is editable. However, the network from Citespace cannot do it. But the network of citespace is clear due to its path finder function. d) Visualization. Besides undirected network, SCI2 can output directed network which would express the directed relationship between nodes. The nodes in network of Citespace can express the centrality beside the occurrence frequency. Time zone map is another advantage of Citespace, it could reflect research hot topics changes with the time.
文章引用:杨静, 程昌秀. 文献“大数据”分析软件Citespace和SCI2的对比分析研究[J]. 计算机科学与应用, 2017, 7(6): 580-589. https://doi.org/10.12677/CSA.2017.76069

参考文献

[1] White, H.D. (2003) Pathfinder Networks and Author Cocitation Analysis: A Remapping of Paradigmatic Information Scientist. Journal of the American Society for Information Science and Technology, 54, 423-434.
https://doi.org/10.1002/asi.10228
[2] 陈悦, 陈超美, 刘泽渊. Citespace知识图谱的方法论功能[J]. 科学学研究, 2015, 33(2): 242-253.
[3] 樊瑛, 任素婷. 基于复杂网络的世界贸易格局探测[J]. 北京师范大学学报(自然科学版), 2015(2): 140-143+221.
[4] 邱小花, 李国俊, 肖明. SCI2——一款新的知识图谱分析软件介绍与评价[J]. 图书馆杂志, 2013, 32(9): 79-87.
[5] Small, H. (1999) Visualizing Science by Citation Mapping. Journal of the American Society for Information Science, 50, 799-813.
https://doi.org/10.1002/(SICI)1097-4571(1999)50:9<799::AID-ASI9>3.0.CO;2-G
[6] Cobo, M.J., López-Herrera, A.G., Herrera-Viedma, E., et al. (2011) Science Mapping Software Tools: Review, Analysis, and Cooperative Study among Tools. Journal of the American Society for Information Science, 62, 1382-1402.
https://doi.org/10.1002/asi.21525
[7] Zhang, X., Gao, Y., Yan, X.D., et al. (2015) From E-Learning to Social-Learning: Mapping Development of Studies on Social Media-Supported Knowledge Management. Computer in Human Behavior, 51, 803-811.
https://doi.org/10.1016/j.chb.2014.11.084
[8] Wu, Y. and Duan, Z.G. (2015) Analysis on Evolution and Research Focus in Psychiatry Field. BMC Psychiatry, 12, 105-129.
https://doi.org/10.1186/s12888-015-0482-1
[9] Xie, P. (2015) Study of International Anticancer Research Trends via Co-Word and Co-Citation Visualization Analysis. Scientometrics, 105, 611-622.
https://doi.org/10.1007/s11192-015-1689-0
[10] Fang, Y.Q. (2015) Visualizing the Structure and the Evolving of Digital Medicine: A Scientometrics Review. Scientometrics, 105, 5-21.
https://doi.org/10.1007/s11192-015-1696-1
[11] Gao, Y., Qu, B., Shen, Y., et al. (2015) Bibliometric Profile of Neurogenic Bladder in the Literature: A 20-Year Bibliometric Analysis. Neural Regeneration Research, 10, 797-803.
https://doi.org/10.4103/1673-5374.156985
[12] Light, R.P., Polley, D.E. and Börner, K. (2014) Open Data and Open Code for Big Science of Science Studies. Scientometrics, 101, 1535-1551.
[13] Kim, M.C. and Chen, C.M. (2015) A Scientometric Review of Emerging Trends and New Developments in Recommendation Systems. Scientometrics, 104, 239-263.
https://doi.org/10.1007/s11192-015-1595-5