在分析现有程序代码抄袭检测系统的特点及局限性的基础上,提出一种综合文本分析、结构度量和属性计数技术的混合式程序抄袭检测方法。应用文档指纹技术和Winnowing算法计算程序的文本相似度;将程序代码表示成动态控制结构树(Dynamic Con...在分析现有程序代码抄袭检测系统的特点及局限性的基础上,提出一种综合文本分析、结构度量和属性计数技术的混合式程序抄袭检测方法。应用文档指纹技术和Winnowing算法计算程序的文本相似度;将程序代码表示成动态控制结构树(Dynamic Control Structure tree,DCS),运用Winnowing算法计算DCS树相似度,从而得到结构相似度;收集并统计程序中的每个变量信息,应用变量相似度算法分析变量信息节点获取变量相似度;分别赋予文本相似度、结构相似度和变量相似度一个权值,计算得到总体的代码相似度。实验结果表明,所提出的方法能够有效检测出各种抄袭行为。针对不同的抄袭门槛值,使用该方法的检测结果准确度和查全率高于JPLAG系统。特别对于结构简单的程序组,此方法和JPLAG系统检测结果的平均准确度分别为82.5%和69.5%,说明所提的方法更加有效。展开更多
Parameter estimation of the attributed scattering center(ASC) model is significant for automatic target recognition(ATR). Sparse representation based parameter estimation methods have developed rapidly. Construction o...Parameter estimation of the attributed scattering center(ASC) model is significant for automatic target recognition(ATR). Sparse representation based parameter estimation methods have developed rapidly. Construction of the separable dictionary is a key issue for sparse representation technology. A compressive time-domain dictionary(TD) for ASC model is presented. Two-dimensional frequency domain responses of the ASC are produced and transformed into the time domain. Then these time domain responses are cutoff and stacked into vectors. These vectored time-domain responses are amalgamated to form the TD. Compared with the traditional frequency-domain dictionary(FD), the TD is a matrix that is quite spare and can markedly reduce the data size of the dictionary. Based on the basic TD construction method, we present four extended TD construction methods, which are available for different applications. In the experiments, the performance of the TD, including the basic model and the extended models, has been firstly analyzed in comparison with the FD. Secondly, an example of parameter estimation from SAR synthetic aperture radar(SAR) measurements of a target collected in an anechoic room is exhibited. Finally, a sparse image reconstruction example is from two apart apertures. Experimental results demonstrate the effectiveness and efficiency of the proposed TD.展开更多
文摘在分析现有程序代码抄袭检测系统的特点及局限性的基础上,提出一种综合文本分析、结构度量和属性计数技术的混合式程序抄袭检测方法。应用文档指纹技术和Winnowing算法计算程序的文本相似度;将程序代码表示成动态控制结构树(Dynamic Control Structure tree,DCS),运用Winnowing算法计算DCS树相似度,从而得到结构相似度;收集并统计程序中的每个变量信息,应用变量相似度算法分析变量信息节点获取变量相似度;分别赋予文本相似度、结构相似度和变量相似度一个权值,计算得到总体的代码相似度。实验结果表明,所提出的方法能够有效检测出各种抄袭行为。针对不同的抄袭门槛值,使用该方法的检测结果准确度和查全率高于JPLAG系统。特别对于结构简单的程序组,此方法和JPLAG系统检测结果的平均准确度分别为82.5%和69.5%,说明所提的方法更加有效。
基金Project(NCET-11-0866)supported by Education Ministry's new Century Excellent Talents Supporting Plan,China
文摘Parameter estimation of the attributed scattering center(ASC) model is significant for automatic target recognition(ATR). Sparse representation based parameter estimation methods have developed rapidly. Construction of the separable dictionary is a key issue for sparse representation technology. A compressive time-domain dictionary(TD) for ASC model is presented. Two-dimensional frequency domain responses of the ASC are produced and transformed into the time domain. Then these time domain responses are cutoff and stacked into vectors. These vectored time-domain responses are amalgamated to form the TD. Compared with the traditional frequency-domain dictionary(FD), the TD is a matrix that is quite spare and can markedly reduce the data size of the dictionary. Based on the basic TD construction method, we present four extended TD construction methods, which are available for different applications. In the experiments, the performance of the TD, including the basic model and the extended models, has been firstly analyzed in comparison with the FD. Secondly, an example of parameter estimation from SAR synthetic aperture radar(SAR) measurements of a target collected in an anechoic room is exhibited. Finally, a sparse image reconstruction example is from two apart apertures. Experimental results demonstrate the effectiveness and efficiency of the proposed TD.