Abstract
This paper proposes a new algorithm, based on feature string matching, that dynamically identifies the content of files and recognizes file types quickly and effectively. The technique can be applied to monitoring the flow of information over a network, for example the types of files transferred on an enterprise intranet. Several multiple-pattern string matching algorithms are studied, and a multiple feature string matching algorithm (MFSM) based on Boyer-Moore is proposed to speed up searches over a large set of feature strings, where a feature string is a byte sequence at a fixed offset. Compared with the traditional brute-force algorithm, MFSM is at least twice as fast at matching.
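To make the idea of a feature string concrete, the following is a minimal Python sketch, not the paper's MFSM implementation. It assumes a small, hypothetical signature table (`SIGNATURES`) of well-known file signatures stored as (offset, byte sequence, label) entries, and it uses the Horspool simplification of Boyer-Moore for the general substring search. The function names `identify` and `horspool_find` are illustrative only.

```python
from typing import Optional

# Hypothetical signature table: (fixed offset, byte sequence, label).
# The entries are common file signatures chosen for illustration.
SIGNATURES = [
    (0, b"%PDF-", "pdf"),
    (0, b"\x89PNG\r\n\x1a\n", "png"),
    (0, b"PK\x03\x04", "zip"),
    (0, b"GIF89a", "gif"),
    (257, b"ustar", "tar"),
]


def identify(data: bytes) -> Optional[str]:
    """Return the label of the first signature found at its fixed offset."""
    for offset, pattern, label in SIGNATURES:
        if data[offset:offset + len(pattern)] == pattern:
            return label
    return None


def horspool_find(text: bytes, pattern: bytes) -> int:
    """Boyer-Moore-Horspool search: index of the first match, or -1."""
    m, n = len(pattern), len(text)
    if m == 0:
        return 0
    # Bad-character table: distance from the last occurrence of each byte
    # (excluding the final pattern byte) to the end of the pattern.
    shift = {b: m - 1 - i for i, b in enumerate(pattern[:-1])}
    pos = 0
    while pos <= n - m:
        if text[pos:pos + m] == pattern:
            return pos
        # Skip ahead based on the text byte aligned with the pattern's end.
        pos += shift.get(text[pos + m - 1], m)
    return -1
```

Because each signature is checked only at its own offset, `identify` costs one short comparison per table entry. `horspool_find` shows how a bad-character table lets a search skip several bytes per step instead of advancing one byte at a time, as a brute-force scan does.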
Source
Journal of the Academy of Equipment Command & Technology (《装备指挥技术学院学报》)
2010, No. 6, pp. 102-105 (4 pages)
Keywords
information monitoring
file type identification
feature string
pattern string
About the author
Zhang Jun (张军), female, senior engineer. Main research interest: database technology.