摘要
传统的基于文本聚类的网络舆情热点追踪算法,在处理海量网页时,文本聚类速度过低,聚合结果较差.提出了一种基于关键词提取的网络舆情热点追踪方案,并根据新闻、论坛和博客的不同特点分别设计了热点分析模型.通过在笔者开发的啄木鸟网络舆情系统上的实际验证表明,该方案行之有效,热点分析模型识别热点准确率高.
Based on text clustering, the traditional hot spot of Internet public opinion tracing algorithm clusters slowly very much. The results of clustering are poor in dealing with massive web pages. This paper shows a hot spot of Internet public opinion tracing scheme based on keyword extraction, and according to the different features of news, BBS, Blog designs hot topic analysis models respectively. Through the experiments on the woodpecker Internet public opinion system developed by us, it shows that the scheme can be effective, and the hot spot analysis model recognizes the hot spots with high accuracy.
出处
《河北大学学报(自然科学版)》
CAS
北大核心
2012年第3期311-315,共5页
Journal of Hebei University(Natural Science Edition)
基金
国家自然科学基金资助项目(60873203)
关键词
网络舆情
关键词
热点追踪
热点分析模型
Internet public opinion
keyword
hot spot tracing
hot spot analysis model
作者简介
张寿华(1980-),男,河北广宗人,河北大学讲师,主要从事分布式数据库、网络舆情传播方向研究.E-mail:zhangshouhua@hbu.edu.cn