Abstract
High-speed keyword retrieval over massive data is important for improving the response speed of network security situational awareness systems, and with it their overall efficiency and security. Network information retrieval algorithms based on the double-array Trie-tree offer high search efficiency, but insertion has high time complexity and the leaf nodes occupy a large amount of storage space. To address this, the paper proposes a double-array Trie-tree construction method based on compressed storage of leaf nodes: the Trie-tree is traversed level by level, branch nodes are stored in the basic double array, and leaf nodes are compressed and stored as a bitmap in a compressed array. The method preserves the query performance of the double-array Trie-tree while improving, to some extent, both insertion efficiency and storage space utilization.
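The abstract gives only the outline of the construction, so the following is a minimal sketch of one possible reading, not the authors' implementation. It assumes that branch nodes (nodes with children) are placed in `base`/`check` arrays during a level-order traversal, and that leaf nodes get no array slot at all: each branch state instead keeps a bitmap recording which characters lead to a leaf. The names `build`, `contains`, `leaf_bits` and `end_bits`, the character coding, and the naive search for a free `base` value are all illustrative assumptions.

```python
from collections import deque


class _Node:
    """Plain trie node used only while building."""
    def __init__(self):
        self.children = {}
        self.is_end = False


def code(ch):
    # Assumed character code; +1 keeps slot 0 reserved for the root state.
    return ord(ch) + 1


def build(keywords):
    # Step 1: build an ordinary trie from the keywords.
    root = _Node()
    for word in keywords:
        node = root
        for ch in word:
            node = node.children.setdefault(ch, _Node())
        node.is_end = True

    base, check = [0], [0]   # state 0 is the root; check == -1 marks a free slot
    leaf_bits = {}           # state -> bitmap of character codes leading to a leaf
    end_bits = 0             # bitmap of branch states that also end a keyword

    # Step 2: level-order traversal; only branch nodes enter the queue.
    queue = deque([(root, 0)])
    while queue:
        node, s = queue.popleft()
        if node.is_end:
            end_bits |= 1 << s
        branch = []
        for ch, child in node.children.items():
            if child.children:
                branch.append((code(ch), child))
            else:
                # Leaf child: record one bit instead of allocating a slot.
                leaf_bits[s] = leaf_bits.get(s, 0) | (1 << code(ch))
        if not branch:
            continue
        # Naive search for the smallest base whose target slots are all free.
        b = 0
        while any(b + k < len(check) and check[b + k] != -1 for k, _ in branch):
            b += 1
        base[s] = b
        for k, child in branch:
            t = b + k
            if t >= len(check):
                grow = t + 1 - len(check)
                base.extend([0] * grow)
                check.extend([-1] * grow)
            check[t] = s
            queue.append((child, t))
    return base, check, leaf_bits, end_bits


def contains(trie, word):
    base, check, leaf_bits, end_bits = trie
    s = 0
    for i, ch in enumerate(word):
        k = code(ch)
        # The last character may land on a leaf, which lives only in the bitmap.
        if i == len(word) - 1 and (leaf_bits.get(s, 0) >> k) & 1:
            return True
        t = base[s] + k
        if t >= len(check) or check[t] != s:
            return False
        s = t
    return bool((end_bits >> s) & 1)


trie = build(["he", "her", "his", "she"])
print(contains(trie, "her"), contains(trie, "hers"))  # True False
```

In this sketch a lookup still follows the usual `base[s] + code` / `check` transition for every branch node, so the query path matches that of a plain double-array Trie-tree; only the final step into a leaf is answered from the bitmap. Because leaves never occupy a slot, fewer entries compete for free positions when a `base` value is chosen.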
Authors
徐国天
张铭
XU Guotian; ZHANG Ming (Cyber Crime Investigation Department, Criminal Investigation Police University of China, Shenyang, Liaoning 110854, China; The Second Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang 150086, China)
Source
Netinfo Security (《信息网络安全》)
CSCD
Peking University Core Journal (北大核心)
2019, No. 4, pp. 55-62 (8 pages)
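Funding
Natural Science Foundation of Liaoning Province [20180550841, 2015020091]
Ministry of Public Security Theory and Soft Science Research Program [2016LLYJXJXY013]
Ministry of Public Security Technology Research Program [2016JSYJB06]
Major Research Project on Economic and Social Development of Liaoning Province [2018LSLKTZD-028]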
Keywords
situational awareness
double-array
Trie-tree
compression
information retrieval
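About the authors
Corresponding author: XU Guotian (b. 1978), male, from Liaoning, associate professor, master's degree; main research interests: network security and electronic evidence; xu_guo_tian888@163.com. ZHANG Ming (b. 1970), male, from Heilongjiang, senior engineer, master's degree; main research interest: computer software.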