河北大学学报(自然科学版) ›› 2018, Vol. 38 ›› Issue (4): 416-422.DOI: 10.3969/j.issn.1000-1565.2018.04.012

• • 上一篇    下一篇

融合时序性和波动性的热点话题发现研究

李汉才1,徐建民2,吴树芳3,4   

  • 收稿日期:2017-10-12 出版日期:2018-07-25 发布日期:2018-07-25
  • 通讯作者: 吴树芳(1979—),女,河北邯郸人,河北大学副教授,博士,主要从事信息检索、不确定信息处理方向研究.E-mail:shufang_44@126.com
  • 作者简介:李汉才(1973—),男,河北枣强人,河北大学教授,主要从事信息检索方向研究. E-mail:hbdxlhc@163.com
  • 基金资助:
    国家社科基金资助项目(17BTQ068);河北省教育厅青年基金资助项目(QN2015099);河北大学中西部提升综合实力专项资金项目;河北省自然科学基金资助项目(F2015201142);中国博士后基金资助项目(2017M621078)

On hot topic detection by merging temporal and volatility

LI Hancai1,XU Jianmin2,WU Shufang3, 4   

  1. 1.Personnel Department, Hebei University, Baoding 071002, China; 2.College of Computer Science and Technology, Hebei University, Baoding 071002, China; 3.College of Management and Economics, Tianjin University, Tianjin 300000, China; 4.College of Management, Hebei University, Baoding 071002, China
  • Received:2017-10-12 Online:2018-07-25 Published:2018-07-25

摘要: 时序性和波动性直接与话题的热度有关,短时间内某话题出现的相关报道越多,则其热度越高;话题的波动幅度越大,则其热度越高.依据积分理论给出了基于时序性的相关报道密度计算和基于波动性的峰值计算,并采用线性调和的方法将二者融合,给出话题热度计算方法.实验采用TDT4语料作为测试集合,验证了该方法的有效性与合理性.

关键词: 热点话题, 发现, 时序性, 波动性

Abstract: Temporal and volatility are directly related to the hot degree of topic, namely, the more related stories the topic has in a certain time distance, and the larger fluctuation the topic is, the higher hot degree of this topic. Applying integration theory, methods of computing the density of related stories and the peak value are put forward based on temporal and volatility respectively. Finally, linear meditated method is used to merge them to compute the hot degree of a topic. Experiments are carried out on TDT4 corpus to testify the validity and rationality of our new method.

Key words: hot topic, detection, temporal, volatility

中图分类号: