期刊文献+
共找到25篇文章
< 1 2 >
每页显示 20 50 100
针对文本情感分类任务的textSE-ResNeXt集成模型 被引量:9
1
作者 康雁 李浩 +2 位作者 梁文韬 宁浩宇 霍雯 《计算机工程与应用》 CSCD 北大核心 2020年第7期205-209,共5页
针对深度学习方法中文本表示形式单一,难以有效地利用语料之间细化的特征的缺陷,利用中英文语料的不同特性,有区别地对照抽取中英文语料的特征提出了一种新型的textSE-ResNeXt集成模型。通过PDTB语料库对语料的显式关系进行分析,从而截... 针对深度学习方法中文本表示形式单一,难以有效地利用语料之间细化的特征的缺陷,利用中英文语料的不同特性,有区别地对照抽取中英文语料的特征提出了一种新型的textSE-ResNeXt集成模型。通过PDTB语料库对语料的显式关系进行分析,从而截取语料主要情感部分,针对不同中、英文情感词典进行情感程度关系划分以此获得不同情感程度的子数据集。在textSE-ResNeXt神经网络模型中采用了动态卷积核策略,以此对文本数据特征进行更为有效的提取,模型中融合了SEnet和ResNeXt,有效地进行了深层次文本特征的抽取和分类。将不同情感程度的子集上对textSE-ResNeXt模型采用投票集成的方法进一步提高分类效率。分别在中文酒店评论语料和六类常见英文分类数据集上进行实验。实验结果表明了本模型的有效性。 展开更多
关键词 文本情感分类 textsE-ResNeXt 特征划分 集成模型
在线阅读 下载PDF
What Eye Movements Tell About Identifying Compound Words in Reading and Top-Down Effects in Reading Long Texts 被引量:1
2
作者 Jukka Hyn 《心理与行为研究》 2004年第3期497-504,共8页
Two lines of research on eye movements in reading are summarized. One line of research examines how adult readers identify compound words during reading. The other line of research deals with how a specific reading go... Two lines of research on eye movements in reading are summarized. One line of research examines how adult readers identify compound words during reading. The other line of research deals with how a specific reading goal influences the way long expository texts are read. Both lines of research are conducted using Finnish as the source language. With respect to the first research question, it is demonstrated that compound words are recognized either holistically or via their components, depending on the length of the compound word. Readers begin to process whatever information is readily available in the foveal vision(i.e., either the whole-word form or the initial component). The second line of research demonstrates that(1)a specific reading goal is capable of exerting an early effect on readers’ eye fixation patterns,(2)time course analyses based on eye movement patterns can reveal interesting individual differences, and(3)working memory capacity is linked to the efficiency to strategically allocate attention as well as to encode information to and retrieve it from the long-term memory. It is concluded that the eye-tracking technique is an excellent research tool to tap into the workings of the human mind during the comprehension of written texts. 展开更多
关键词 eye movements word recognition COMPOUND WORDS text COMPREHENSION working memory capacity.
在线阅读 下载PDF
RNSQL:融合逆规范化的Text2SQL生成
3
作者 帖军 范子琪 +2 位作者 孙翀 郑禄 朱柏尔 《计算机应用与软件》 北大核心 2025年第9期31-37,86,共8页
Text2SQL是自然语言处理科研领域中的一项重要任务,在研究智能问答系统中发挥关键性的作用,其核心任务是将自然语言描述的问题自动转换为SQL查询语句。当前研究重点为提高SQL子句任务的匹配准确率,但忽略了SQL的句法生成的正确性,涉及... Text2SQL是自然语言处理科研领域中的一项重要任务,在研究智能问答系统中发挥关键性的作用,其核心任务是将自然语言描述的问题自动转换为SQL查询语句。当前研究重点为提高SQL子句任务的匹配准确率,但忽略了SQL的句法生成的正确性,涉及多表连接的SQL生成仍存在大量错误。因此,提出一种基于神经网络的Text2SQL方法,该方法通过逆规范化技术,对数据库模式进行重构,关注SQL句法生成的正确性,称为逆规范化网络(Reverse Normalization SQL,RNSQL)。经理论分析和在公共数据集Spider上实验验证,RNSQL能有效提升Text2SQL任务的质量。 展开更多
关键词 逆规范化 语义解析 Text2SQL 槽填充
在线阅读 下载PDF
基于PE文件无容量限制的信息隐藏技术研究 被引量:8
4
作者 李钱 方勇 +1 位作者 谭登龙 张长山 《计算机应用研究》 CSCD 北大核心 2011年第7期2758-2760,共3页
分析了现有的基于PE文件信息隐藏技术及其不足,提出一种以扩充.text节达到无容量限制的信息隐藏方案。通过对嵌入的信息进行加密、完整性校验、代码伪装、混合原代码等预处理,再根据预处理后的信息大小扩充.text节,并调整随后的各个节... 分析了现有的基于PE文件信息隐藏技术及其不足,提出一种以扩充.text节达到无容量限制的信息隐藏方案。通过对嵌入的信息进行加密、完整性校验、代码伪装、混合原代码等预处理,再根据预处理后的信息大小扩充.text节,并调整随后的各个节以及输入表的位置,以及PE头的各个相应标志的值,保证嵌入信息后的PE文件仍然能正常执行。实验表明,该方案不仅能达到无容量限制的信息隐藏,而且具有一定的隐蔽性和鲁棒性。 展开更多
关键词 信息隐藏 PE文件 text节 密码学 无容量限制
在线阅读 下载PDF
基于Oracle的多权限多格式文档组织与检索系统 被引量:4
5
作者 熊志辉 王德鑫 +1 位作者 王炜 张茂军 《计算机应用》 CSCD 北大核心 2008年第9期2407-2409,共3页
海量异构文档的快速检索和精细颗粒度权限控制的文档存取是面向行业应用的文档管理系统中的关键。在Oracle Text全文检索技术的基础上,基于B/S架构设计并实现了一个密级文档组织与检索系统,实现了对海量异构文档数据的快速检索,并实现... 海量异构文档的快速检索和精细颗粒度权限控制的文档存取是面向行业应用的文档管理系统中的关键。在Oracle Text全文检索技术的基础上,基于B/S架构设计并实现了一个密级文档组织与检索系统,实现了对海量异构文档数据的快速检索,并实现了文档数据的逐文档按角色分等级权限管理。 展开更多
关键词 全文检索 ORACLE TEXT 权限控制 文档组织与检索系统
在线阅读 下载PDF
基于Oracle组件的数据采集与全文检索系统设计与优化 被引量:7
6
作者 袁琴琴 李志勋 吕林涛 《现代电子技术》 北大核心 2016年第8期37-40,44,共5页
从应用系统数据采集与全文检索的需求出发,结合权限控制,提出基于Oracle Transparent Gateway,Oracle Text的数据采集与全文检索的设计和实现方案。基于此方案,着重进行系统框架设计、采集存储及数据库设计,实现创建索引及检索流程,最... 从应用系统数据采集与全文检索的需求出发,结合权限控制,提出基于Oracle Transparent Gateway,Oracle Text的数据采集与全文检索的设计和实现方案。基于此方案,着重进行系统框架设计、采集存储及数据库设计,实现创建索引及检索流程,最后给出系统性能优化方法,并对检索速度和查准率进行测试分析。目前系统已上线运行,取得高效简捷、运行稳定的使用效果。 展开更多
关键词 数据采集 ORACLE TRANSPARENT GATEWAY 全文检索 ORACLE TEXT 性能优化
在线阅读 下载PDF
Java 3D中的Text2D的扩展与应用 被引量:2
7
作者 冯乔生 陈玉华 +1 位作者 刘丹非 段鹏 《计算机工程与应用》 CSCD 北大核心 2003年第20期122-125,共4页
Java3D中Text2D类虽然能生成二维文本,但由于文本的字体、字号、颜色在文本生成后就不可改变,字的排列也只有“从左向右”方式。所以,Text2D不能满足交互式建立虚拟环境中二维文本的需要。Java3D中的Text3D类支持交互式地建立文本,但Tex... Java3D中Text2D类虽然能生成二维文本,但由于文本的字体、字号、颜色在文本生成后就不可改变,字的排列也只有“从左向右”方式。所以,Text2D不能满足交互式建立虚拟环境中二维文本的需要。Java3D中的Text3D类支持交互式地建立文本,但Text3D的三维文本却不能退化为二维文本,这使得Text3D不能替代Text2D。所以,有必要扩展Text2D类以支持交互地建立虚拟环境中二维文本。文章给出实现这一扩展的关键技术,为用Java3D开发交互式虚拟环境建模器的二维文本生成功能提供支持。扩展的Text2D在智能型虚拟汽车驾驶道路环境建模器中的使用表明它是有效的。 展开更多
关键词 JAVA 3D Text2D 交互式虚拟环境建模
在线阅读 下载PDF
Oracle全文检索技术在高校图书馆的应用 被引量:2
8
作者 杨应全 《现代情报》 北大核心 2008年第9期159-161,共3页
本文讨论了一种利用Oracle text全文检索技术在高校图书馆中的应用方法。
关键词 全文检索 oriole TEXT 图书馆
在线阅读 下载PDF
关于SQL Server7.0中BLOB对象的存取技术 被引量:2
9
作者 陈功伟 刘艳霞 +1 位作者 周定康 黄明和 《江西师范大学学报(自然科学版)》 CAS 2001年第1期36-40,共5页
研究了SQLServer 7.0中BLOB(二进制大对象 )的存取方式 ,分析了text和image数据类型的存储结构及T SQL语言对其提供的操作 ,并且结合题卷库系统开发经验 ,引用具体实例 ,介绍在VB中如何运用ADO技术访问数据库中的BLOB对象 .
关键词 BLOB ADO Text Image 二进制大对象 存取方式 SQL SERVER7.0 数据类型 存储结构 多媒体数据库
在线阅读 下载PDF
面向复杂查询请求的SQL自动生成模型 被引量:4
10
作者 余波 彭敦陆 《小型微型计算机系统》 CSCD 北大核心 2021年第11期2446-2451,共6页
将自然语言自动转换成恰当的SQL语句是基于关系数据库智能问答系统的核心,而一个SQL语句执行后能否得到期望的查询结果在很大程度上取决于where子句的表达是否正确.目前,大多数Text2Sql算法只利用了数据库表的列语义向量来提取where子... 将自然语言自动转换成恰当的SQL语句是基于关系数据库智能问答系统的核心,而一个SQL语句执行后能否得到期望的查询结果在很大程度上取决于where子句的表达是否正确.目前,大多数Text2Sql算法只利用了数据库表的列语义向量来提取where子句中出现的值,但是当where子句中存在多列多值时往往无法准确地提取对应的值.本文提出的一种神经网络模型———2-SQL,将提取where子句中值的方式改进为范式转变模式.通过对运算符和值进行枚举,生成一系列的候选查询条件组合,再采用Transformer模型将查询请求语句与查询条件组合进行语义匹配,来实现对候选查询条件的筛选.实验表明,与现有Text2Sql相比较,2-SQL对复杂查询where子句中出现的值的提取具有较好的效果. 展开更多
关键词 Text2Sql 数据库问答系统 语义匹配 2-SQL
在线阅读 下载PDF
用于构建维吾尔文语料库的中文件格式转换技术研究 被引量:2
11
作者 艾斯卡尔.亚克甫 艾孜尔古丽 玉素甫.艾白都拉 《计算机应用与软件》 CSCD 北大核心 2012年第6期14-16,共3页
研究在维吾尔文字语料库建立过程中,从MS-DOS系统上排版的书刊、杂志中获得维吾尔语单词,并转换到Windows环境上RTF格式的一种快速解决方法,然后提出维吾尔文字Unicode代码对应的RTF代码表和动态生成维吾尔文RTF文件的简单方法。实践证... 研究在维吾尔文字语料库建立过程中,从MS-DOS系统上排版的书刊、杂志中获得维吾尔语单词,并转换到Windows环境上RTF格式的一种快速解决方法,然后提出维吾尔文字Unicode代码对应的RTF代码表和动态生成维吾尔文RTF文件的简单方法。实践证明这种方法有助于提高语料库构造中的大量单词收集的效率和质量。 展开更多
关键词 文件转换 RTF(Rich TEXT Format) 维吾尔文
在线阅读 下载PDF
Parallel naive Bayes algorithm for large-scale Chinese text classification based on spark 被引量:22
12
作者 LIU Peng ZHAO Hui-han +3 位作者 TENG Jia-yu YANG Yan-yan LIU Ya-feng ZHU Zong-wei 《Journal of Central South University》 SCIE EI CAS CSCD 2019年第1期1-12,共12页
The sharp increase of the amount of Internet Chinese text data has significantly prolonged the processing time of classification on these data.In order to solve this problem,this paper proposes and implements a parall... The sharp increase of the amount of Internet Chinese text data has significantly prolonged the processing time of classification on these data.In order to solve this problem,this paper proposes and implements a parallel naive Bayes algorithm(PNBA)for Chinese text classification based on Spark,a parallel memory computing platform for big data.This algorithm has implemented parallel operation throughout the entire training and prediction process of naive Bayes classifier mainly by adopting the programming model of resilient distributed datasets(RDD).For comparison,a PNBA based on Hadoop is also implemented.The test results show that in the same computing environment and for the same text sets,the Spark PNBA is obviously superior to the Hadoop PNBA in terms of key indicators such as speedup ratio and scalability.Therefore,Spark-based parallel algorithms can better meet the requirement of large-scale Chinese text data mining. 展开更多
关键词 Chinese text classification naive Bayes SPARK HADOOP resilient distributed dataset PARALLELIZATION
在线阅读 下载PDF
Chinese micro-blog sentiment classification through a novel hybrid learning model 被引量:2
13
作者 LI Fang-fang WANG Huan-ting +3 位作者 ZHAO Rong-chang LIU Xi-yao WANG Yan-zhen ZOU Bei-ji 《Journal of Central South University》 SCIE EI CAS CSCD 2017年第10期2322-2330,共9页
With the rising and spreading of micro-blog, the sentiment classification of short texts has become a research hotspot. Some methods have been developed in the past decade. However, since the Chinese and English are d... With the rising and spreading of micro-blog, the sentiment classification of short texts has become a research hotspot. Some methods have been developed in the past decade. However, since the Chinese and English are different in language syntax, semantics and pragmatics, sentiment classification methods that are effective for English twitter may fail on Chinese micro-blog. In addition, the colloquialism and conciseness of short Chinese texts introduces additional challenges to sentiment classification. In this work, a novel hybrid learning model was proposed for sentiment classification of Chinese micro-blogs, which included two stages. In the first stage, emotional scores were calculated over the whole dataset by utilizing an improved Chinese-oriented sentiment dictionary classification method. Data with extremely high or low scores were directly labeled. In the second stage, the remaining data were labeled by using an integrated classification method based on sentiment dictionary, support vector machine(SVM) and k-nearest neighbor(KNN). An improved feature selection method was adopted to enhance the discriminative power of the selected features. The two-stage hybrid framework made the proposed method effective for sentiment classification of Chinese micro-blogs. Experiments on the COAE2014(Chinese Opinion Analysis Evaluation 2014) dataset show that the proposed method outperforms other schemes. 展开更多
关键词 CHINESE micro-blog SHORT TEXT HYBRID LEARNING SENTIMENT classification
在线阅读 下载PDF
Lazy learner text categorization algorithm based on embedded feature selection 被引量:1
14
作者 Yan Peng Zheng Xuefeng +1 位作者 Zhu Jianyong Xiao Yunhong 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2009年第3期651-659,共9页
To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce the dimensionality of feature space. Although havin... To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce the dimensionality of feature space. Although having been widely used, FS process will generally cause information losing and then have much side-effect on the whole performance of TC algorithms. On the basis of the sparsity characteristic of text vectors, a new TC algorithm based on lazy feature selection (LFS) is presented. As a new type of embedded feature selection approach, the LFS method can greatly reduce the dimension of features without any information losing, which can improve both efficiency and performance of algorithms greatly. The experiments show the new algorithm can simultaneously achieve much higher both performance and efficiency than some of other classical TC algorithms. 展开更多
关键词 machine learning text categorization embedded feature selection lazy learner cosine similarity.
在线阅读 下载PDF
Reading Text Under Normal and Disappearing Presentation Conditions 被引量:2
15
作者 Simon P. Liversedge 《心理与行为研究》 2004年第3期505-512,共8页
In this article I discuss data from a series of experiments in which readers’ eye movements were recorded as they processed sentences in which each word disappeared or was masked 60ms after fixation onset. We used th... In this article I discuss data from a series of experiments in which readers’ eye movements were recorded as they processed sentences in which each word disappeared or was masked 60ms after fixation onset. We used this paradigm to investigate whether we could induce a gap effect during reading, and how visual and linguistic factors affected eye movements under these conditions. The data showed that no gap effect occurred in our experiment. Overall reading times were the same under normal and disappearing presentation conditions. However, readers did adopt a strategy of making fewer but longer fixations when the text disappeared than when it did not. Additionally, clear frequency effects occurred regardless of whether the text was presented normally or disappeared. This finding indicates that while the visual uptake of information is important, cognitive processes associated with the lexical identification of words are a primary influence on when readers move their eyes during reading. The findings are taken to support the E-Z Reader model of eye movement control. 展开更多
关键词 eye movements disappearing TEXT reading.
在线阅读 下载PDF
Texual Cohesion and Coherence 被引量:1
16
作者 Zhang Haiying (Foreign Languages Teaching Department, Northwest Normal University Lanzhou,730000, China) 《兰州大学学报(社会科学版)》 CSSCI 北大核心 2000年第S1期250-256,共7页
Cohesion and coherence are two of the most important components in discourse analysis. This thesis investigales some cohesive devices and coherent means. At zhe same time, it gives an account of how a text is identifi... Cohesion and coherence are two of the most important components in discourse analysis. This thesis investigales some cohesive devices and coherent means. At zhe same time, it gives an account of how a text is identified as a text. It also discusses the relationship between cohesion and coherence. 展开更多
关键词 text cohesion coherence
在线阅读 下载PDF
Numerical Method for Extracting Hydro-geological Parameter by Using Pumping-Affusing Text in Coast Plain Area
17
作者 Xuequn Chen,Li He,Fulin Li,Lijiang Lu 1.Water Conservancy Research Institute of Shandong Province,Jinan 250013,China. 2.School of Municipal and Environmental Engineering,Shandong Jizhu University,Jinan 250013,China 3.Shandong Linyi Water Conservancy Engineering Company,Linyi 276006,China 《地学前缘》 EI CAS CSCD 北大核心 2009年第S1期24-24,共1页
In order to avoid the risk of saltwater intrusion for large amount pumping groundwater,this study used small flux group drilling pumping-pouring text,and simulated the fall of water depth and recovery value with Feflo... In order to avoid the risk of saltwater intrusion for large amount pumping groundwater,this study used small flux group drilling pumping-pouring text,and simulated the fall of water depth and recovery value with Feflow software,and obtained a group of the best hydro-geological parameters.Compared with that of the method for calculating parameter with 展开更多
关键词 pumping-pouring TEXT hydro-geological parameter numerical simulation
在线阅读 下载PDF
A New Text Location Approach Based Wavelet
18
作者 Weihua Li Zhen Fang Shuozhong Wang 《计算机科学》 CSCD 北大核心 2002年第z2期105-106,114,共3页
With the advancement of content-based retrieval technology, the importance of semantics for text information contained in images attracts many researchers. An algorithm which will automatically locate the textual regi... With the advancement of content-based retrieval technology, the importance of semantics for text information contained in images attracts many researchers. An algorithm which will automatically locate the textual regions in the input image will facilitate the retrieving task, and the optical character recognizer can then be applied to only those regions of the image which contain text. In this paper a new text location method based wavelet is described, which can be used to locate textual regions from complex image and video frame. Experimental results show that the textual regions in image can be located effectively and quickly. 展开更多
关键词 TEXT location 2-D wavelet MORPHOLOGICAL operation
在线阅读 下载PDF
The Communist Manifesto,170 Years Later
19
作者 Samir Amin 《学术界》 CSSCI 北大核心 2018年第11期214-227,共14页
No text written in the mid-nineteenth century has held the road until today as well as the Communist Manifesto of 1848.There is no any other text written in the middle of the 21th century which retained validity until... No text written in the mid-nineteenth century has held the road until today as well as the Communist Manifesto of 1848.There is no any other text written in the middle of the 21th century which retained validity until this day as well as the Manifesto of the Communist Party.Even today entire paragraphs of the text correspond to the contemporary reality even better than in 1848.Starting from the premises which were hardly visible in the era,Marx and Engels drew the conclusions which the deployment of 170 years of history fully consolidated.In this article I will give further enlightening examples. 展开更多
关键词 TEXT WRITTEN CENTURY premises
在线阅读 下载PDF
Automatic Calculation of Dimension Chains in AutoCAD
20
作者 ZHANG Xia, YANG Yue, LI Xiong-bing (Central South University, Changsha 410075, China) 《厦门大学学报(自然科学版)》 CAS CSCD 北大核心 2002年第S1期171-172,共2页
In the course of mechanical part designing, process p lanning and assembling designing, we often have to calculate and analyse a dimen sion chain. Traditionally, a dimension chain is established and calculated m anual... In the course of mechanical part designing, process p lanning and assembling designing, we often have to calculate and analyse a dimen sion chain. Traditionally, a dimension chain is established and calculated m anually. With wide computer application in the field of mechanical design and ma nufacture, people began to use a computer to acquire and calculate a dimension c hain automatically. In reported work, a dimension chain can be established and c alculated automatically. However, dimension text values of dimensions composing a dimension chain and these dimensions’ tolerance’s upper values and lower valu es are put into a computer manually, which is inefficient and easy to make mis takes. In order to overcome above difficulties. it is very important to acquir e noted dimensions automatically, furthermore analyse and calculate a dimens ion chain, then show results. At present AutoCAD softwares of Autodesk company h ave been used popularly in mechanical designing. For automatically acquiring noted dimensions, analyzing and calculating a dimension chain in a design draw in AutoCAD, this paper introduces the solvable scheme of automatic dimension acq uisition and dimension chain calculation in AutoCAD by ObjectARX. ObjectARX is a developing tool for AutoCAD. In this paper a dimension chain is expressed b y three matrixes, which respectively stand for dimension text value matrix, tole rance’s upper value matrix and tolerance’s lower value matrix. The developed p rogram can be used to both calculate a assembling dimension chain, and a process dimension chain. When the program running in AutoCAD, noted dimensions comp osing a dimension chain in AutoCAD are selected in turn with a mouse, then the c omputer begin to calculate the dimension chain and results are shown in a dialog box. A running example is given in this paper. 展开更多
关键词 a dimension chian dimension text value matrix A utomatic Calculation
在线阅读 下载PDF
上一页 1 2 下一页 到第
使用帮助 返回顶部