Abstract
Coreference resolution is a core problem in natural language processing. This paper proposes a coreference resolution method based on semantic features that uses the deep learning mechanism of the deep belief net (DBN) model. The DBN consists of several layers of unsupervised restricted Boltzmann machine (RBM) networks and one supervised back-propagation (BP) layer. The RBM layers preserve as much information as possible when feature vectors are mapped to the next layer, and the final BP layer classifies the feature vectors output by the last RBM layer, yielding a classifier that detects coreference relations between an anaphor and its antecedent. Experiments on the ACE 2004 English NWIRE corpus and the ACE 2005 Chinese NWIRE corpus show that increasing the number of RBM training layers improves system performance. Introducing an abstract layering of the feature set also improves performance.
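The architecture summarized in the abstract (stacked unsupervised RBM layers followed by one supervised output layer) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes binary RBMs trained with one-step contrastive divergence, and a single logistic output unit trained by gradient descent on the top-level RBM features. The layer sizes, learning rates, and the toy mention-pair features are hypothetical, and fine-tuning of the RBM weights is omitted.

```python
import math
import random


def sigmoid(x):
    # Clamp the input so math.exp cannot overflow.
    x = max(-30.0, min(30.0, x))
    return 1.0 / (1.0 + math.exp(-x))


class RBM:
    """Binary RBM trained with one-step contrastive divergence (CD-1)."""

    def __init__(self, n_visible, n_hidden, rng):
        self.rng = rng
        self.W = [[rng.gauss(0.0, 0.1) for _ in range(n_hidden)]
                  for _ in range(n_visible)]
        self.b_vis = [0.0] * n_visible
        self.b_hid = [0.0] * n_hidden

    def hidden_probs(self, v):
        return [sigmoid(bj + sum(vi * row[j] for vi, row in zip(v, self.W)))
                for j, bj in enumerate(self.b_hid)]

    def visible_probs(self, h):
        return [sigmoid(bi + sum(hj * wij for hj, wij in zip(h, row)))
                for bi, row in zip(self.b_vis, self.W)]

    def train(self, data, epochs=20, lr=0.1):
        for _ in range(epochs):
            for v0 in data:
                h0 = self.hidden_probs(v0)
                h_sample = [1.0 if self.rng.random() < p else 0.0 for p in h0]
                v1 = self.visible_probs(h_sample)   # reconstruction
                h1 = self.hidden_probs(v1)
                # CD-1 update: data statistics minus reconstruction statistics.
                for i, row in enumerate(self.W):
                    for j in range(len(row)):
                        row[j] += lr * (v0[i] * h0[j] - v1[i] * h1[j])
                self.b_vis = [b + lr * (a - c)
                              for b, a, c in zip(self.b_vis, v0, v1)]
                self.b_hid = [b + lr * (a - c)
                              for b, a, c in zip(self.b_hid, h0, h1)]


class DBN:
    """Stack of RBMs plus one supervised logistic output unit (a sketch)."""

    def __init__(self, layer_sizes, seed=0):
        rng = random.Random(seed)
        self.rbms = [RBM(a, b, rng)
                     for a, b in zip(layer_sizes, layer_sizes[1:])]
        self.w_out = [0.0] * layer_sizes[-1]
        self.b_out = 0.0

    def transform(self, v):
        # Pass a feature vector up through every RBM layer.
        for rbm in self.rbms:
            v = rbm.hidden_probs(v)
        return v

    def pretrain(self, data, epochs=20, lr=0.1):
        # Greedy layer-wise training: each RBM learns from the layer below.
        for rbm in self.rbms:
            rbm.train(data, epochs, lr)
            data = [rbm.hidden_probs(v) for v in data]

    def predict_proba(self, v):
        x = self.transform(v)
        return sigmoid(self.b_out + sum(w * xi for w, xi in zip(self.w_out, x)))

    def fit_output(self, data, labels, epochs=100, lr=0.5):
        # Simplification: only the output unit is trained here.
        feats = [self.transform(v) for v in data]
        for _ in range(epochs):
            for x, y in zip(feats, labels):
                z = self.b_out + sum(w * xi for w, xi in zip(self.w_out, x))
                err = sigmoid(z) - y   # gradient of cross-entropy w.r.t. z
                self.w_out = [w - lr * err * xi
                              for w, xi in zip(self.w_out, x)]
                self.b_out -= lr * err


# Hypothetical binary features for (anaphor, antecedent) pairs,
# e.g. gender match, number match, same sentence, string match.
pairs = [[1, 1, 1, 0], [1, 1, 0, 1], [0, 0, 1, 0], [0, 1, 0, 0]]
labels = [1, 1, 0, 0]   # 1 = coreferent, 0 = not coreferent

dbn = DBN([4, 5, 3])    # two RBM layers: 4 -> 5 -> 3
dbn.pretrain(pairs)
dbn.fit_output(pairs, labels)
probs = [dbn.predict_proba(p) for p in pairs]
```

Adding an entry to the `layer_sizes` list adds one more RBM layer, which is the kind of depth variation the experiments compare.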
Source
《北京大学学报(自然科学版)》
EI
CAS
CSCD
Peking University Core Journals
2014, Issue 1, pp. 100-110 (11 pages)
Acta Scientiarum Naturalium Universitatis Pekinensis
Funding
Supported by the National Natural Science Foundation of China (61273320, 61003153, 61272257) and the 863 Program (2012AA011102)
Keywords
pronoun resolution
deep learning
deep semantic feature
About the authors
Corresponding author, E-mail: gdzhou@suda.edu.cn