提出了一种利用"bag of words"模型对视频内容进行建模和匹配的方法。通过量化视频帧的局部特征构建视觉关键词(visual words)辞典,将视频的子镜头表示成若干视觉关键词的集合。在此基础上构建基于子镜头的视觉关键词词组的...提出了一种利用"bag of words"模型对视频内容进行建模和匹配的方法。通过量化视频帧的局部特征构建视觉关键词(visual words)辞典,将视频的子镜头表示成若干视觉关键词的集合。在此基础上构建基于子镜头的视觉关键词词组的倒排索引,用于视频片段的匹配和检索。这种方法保留了局部特征的显著性及其相对位置关系,而且有效地压缩了视频的表达,加速的视频的匹配和检索过程。实验结果表明,和已有方法相比,基于"bag of words"的视频匹配方法在大视频样本库上获得了更高的检索精度和检索速度。展开更多
针对干果图像信息量大、分类精度低和耗时多的特点,提出利用Bag of Words模型提取图片的代表特征,并采用朴素贝叶斯分类器指导特征矩阵分类。结果表明,图像分类精度能达到80%,分类处理时间约为2 s。通过增加学习样本来进一步提高分类精...针对干果图像信息量大、分类精度低和耗时多的特点,提出利用Bag of Words模型提取图片的代表特征,并采用朴素贝叶斯分类器指导特征矩阵分类。结果表明,图像分类精度能达到80%,分类处理时间约为2 s。通过增加学习样本来进一步提高分类精度,将Bag of Words应用于干果图像识别和分类是可行的。展开更多
Two lines of research on eye movements in reading are summarized. One line of research examines how adult readers identify compound words during reading. The other line of research deals with how a specific reading go...Two lines of research on eye movements in reading are summarized. One line of research examines how adult readers identify compound words during reading. The other line of research deals with how a specific reading goal influences the way long expository texts are read. Both lines of research are conducted using Finnish as the source language. With respect to the first research question, it is demonstrated that compound words are recognized either holistically or via their components, depending on the length of the compound word. Readers begin to process whatever information is readily available in the foveal vision(i.e., either the whole-word form or the initial component). The second line of research demonstrates that(1)a specific reading goal is capable of exerting an early effect on readers’ eye fixation patterns,(2)time course analyses based on eye movement patterns can reveal interesting individual differences, and(3)working memory capacity is linked to the efficiency to strategically allocate attention as well as to encode information to and retrieve it from the long-term memory. It is concluded that the eye-tracking technique is an excellent research tool to tap into the workings of the human mind during the comprehension of written texts.展开更多
安全是民航业的核心主题。针对目前民航非计划事件分析严重依赖专家经验及分析效率低下的问题,文章提出一种结合Word2vec和双向长短期记忆(bidirectional long short-term memory,BiLSTM)神经网络模型的民航非计划事件分析方法。首先采...安全是民航业的核心主题。针对目前民航非计划事件分析严重依赖专家经验及分析效率低下的问题,文章提出一种结合Word2vec和双向长短期记忆(bidirectional long short-term memory,BiLSTM)神经网络模型的民航非计划事件分析方法。首先采用Word2vec模型针对事件文本语料进行词向量训练,缩小空间向量维度;然后通过BiLSTM模型自动提取特征,获取事件文本的完整序列信息和上下文特征向量;最后采用softmax函数对民航非计划事件进行分类。实验结果表明,所提出的方法分类效果更好,能达到更优的准确率和F 1值,对不平衡数据样本同样具有较稳定的分类性能,证明了该方法在民航非计划事件分析上的适用性和有效性。展开更多
The paper discusses translation of culturally-loaded words(CLW),to which the author maintains that great importance should be attached,especially to translation of literary works.The difficulty of translating CLW is d...The paper discusses translation of culturally-loaded words(CLW),to which the author maintains that great importance should be attached,especially to translation of literary works.The difficulty of translating CLW is determined by complexity of culture.The author argues that in translating Chinese words laden with cultural connotations,foreignization should take priority over other strategies when the source text(ST) is highly literary.Thus,foreignness and local flavor in ST can be retained as much as possible in the target text(TT),hence enhancing target readers' appreciation of ST and promoting intercultural exchanges.展开更多
[目的/意义]在人工智能技术及应用快速发展与深刻变革背景下,机器学习领域不断出现新的研究主题和方法,深度学习和强化学习技术持续发展。因此,有必要探索不同领域机器学习研究主题演化过程,并识别出热点与新兴主题。[方法/过程]本文以...[目的/意义]在人工智能技术及应用快速发展与深刻变革背景下,机器学习领域不断出现新的研究主题和方法,深度学习和强化学习技术持续发展。因此,有必要探索不同领域机器学习研究主题演化过程,并识别出热点与新兴主题。[方法/过程]本文以图书情报领域中2011—2022年Web of Science数据库中的机器学习研究论文为例,融合LDA和Word2vec方法进行主题建模和主题演化分析,引入主题强度、主题影响力、主题关注度与主题新颖性指标识别热点主题与新兴热点主题。[结果/结论]研究结果表明,(1)Word2vec语义处理能力与LDA主题演化能力的结合能够更加准确地识别研究主题,直观展示研究主题的分阶段演化规律;(2)图书情报领域的机器学习研究主题主要分为自然语言处理与文本分析、数据挖掘与分析、信息与知识服务三大类范畴。各类主题之间的关联性较强,且具有主题关联演化特征;(3)设计的主题强度、主题影响力和主题关注度指标及综合指标能够较好地识别出2011—2014年、2015—2018年和2019—2022年3个不同周期阶段的热点主题。展开更多
This paper introduces an English word formation algorithm used in natural language processing system, which is based on affix stripping strategy. The goal is to reduce the redundant information in the machine lexicon ...This paper introduces an English word formation algorithm used in natural language processing system, which is based on affix stripping strategy. The goal is to reduce the redundant information in the machine lexicon and improve the ability to recognize words. In appendix, a formation program written in Turbo-Prolog is presented.展开更多
This paper further discusses the Mutiplicity of Semantic Relations between Words(MSRW)(Lu 2010)in two aspects:the reason why the semantie relations between two words can be multiple and the significance of understandi...This paper further discusses the Mutiplicity of Semantic Relations between Words(MSRW)(Lu 2010)in two aspects:the reason why the semantie relations between two words can be multiple and the significance of understanding the MSRW.Through examples,this paper points out that understanding the MSRW is of particular importance not only in deepening our knowledge about lexical semantics,but also in advancing the methodology of syntactic analysis.The Construction-Chunk Approach,a new methodology of syntactic analysis built on the MSRW,is a case in point.展开更多
This paper, particularly focusing on the pitch of prosodic words,has conducted a contrastive study on the structure of prosodic words in Englishand Mandarin . This paper reports a Mandarin monologue speech corpus-stud...This paper, particularly focusing on the pitch of prosodic words,has conducted a contrastive study on the structure of prosodic words in Englishand Mandarin . This paper reports a Mandarin monologue speech corpus-study, anexperimental phonetic attempt to conduct a study on the pitch of trisyllabic prosodicwords in Mandarin monologue. In addition, taking the characteristics of Englishprosodic words into consideration, the paper makes a contrastive analysis of prosodicwords in English and Mandarin. This study finds that the pitch of trisyllabic prosodicwords in Mandarin is inevitably affected by structural factors. As far as the leftsyllable is concerned, the grammatical category, prosodic hierarchical boundary andthe position of the intonational phrase where the syllable is located, the mid syllableand the right syllable may have influences on the pitch contour of the left syllable.As to the mid syllable, the grammatical category, the left syllable, the right syllableand the position of the intonational phrase where the syllable is located may haveinfluences on the pitch contour of the mid syllable. As for the right syllable, theprosodic hierarchical boundary where the syllable is located and the mid syllable mayhave effects on the pitch contour of the right syllable. Different from the previousfindings of the study on read corpus, this study shows that the mid syllable not onlyhas dissimilatory effects but also has assimilatory effects on the pitch of its precedingsyllable. The left syllable has anticipatory effects on the onset pitch of the mid syllableand the right syllable has coarticulation effects on the offset pitch of the mid syllable.展开更多
Background Generally speaking. Chinese college graduates in the fifties and sixties took Russian as their second language, and those who graduated in the seventies had no second language to speak of. Now, in the years...Background Generally speaking. Chinese college graduates in the fifties and sixties took Russian as their second language, and those who graduated in the seventies had no second language to speak of. Now, in the years of our Open Door Policy, they find they have to learn some English and learn it quickly. They try to learn from radio and TV and many take English courses of 4 to 6 months, with varying degree of success. Their chief stumbling blocks展开更多
The article holds that the problem of the relationship between"words"and"meaning"has always been the philosophical proposition.Through analyzing the relationship between"words"and"me...The article holds that the problem of the relationship between"words"and"meaning"has always been the philosophical proposition.Through analyzing the relationship between"words"and"meaning"and probing into"disoourse","text"and"space-time history",the author thinks that the interpreters'preconceptions,preexistence,pre-structure and different perspectives have resulted in diverse forms of interpretations,and the being of each kind of form has its own rationality.Moreover,the process in which the interpreters interpret"words","discourse","text"and"history"is also the process to interpret themselves.Nevertheless,being able to say and being unable to say are soaked in the whole process of interpretation,and,as a result,human beings will always be confronted with a kind of"say and cannot say"embarrassment.展开更多
文摘提出了一种利用"bag of words"模型对视频内容进行建模和匹配的方法。通过量化视频帧的局部特征构建视觉关键词(visual words)辞典,将视频的子镜头表示成若干视觉关键词的集合。在此基础上构建基于子镜头的视觉关键词词组的倒排索引,用于视频片段的匹配和检索。这种方法保留了局部特征的显著性及其相对位置关系,而且有效地压缩了视频的表达,加速的视频的匹配和检索过程。实验结果表明,和已有方法相比,基于"bag of words"的视频匹配方法在大视频样本库上获得了更高的检索精度和检索速度。
文摘针对干果图像信息量大、分类精度低和耗时多的特点,提出利用Bag of Words模型提取图片的代表特征,并采用朴素贝叶斯分类器指导特征矩阵分类。结果表明,图像分类精度能达到80%,分类处理时间约为2 s。通过增加学习样本来进一步提高分类精度,将Bag of Words应用于干果图像识别和分类是可行的。
文摘Two lines of research on eye movements in reading are summarized. One line of research examines how adult readers identify compound words during reading. The other line of research deals with how a specific reading goal influences the way long expository texts are read. Both lines of research are conducted using Finnish as the source language. With respect to the first research question, it is demonstrated that compound words are recognized either holistically or via their components, depending on the length of the compound word. Readers begin to process whatever information is readily available in the foveal vision(i.e., either the whole-word form or the initial component). The second line of research demonstrates that(1)a specific reading goal is capable of exerting an early effect on readers’ eye fixation patterns,(2)time course analyses based on eye movement patterns can reveal interesting individual differences, and(3)working memory capacity is linked to the efficiency to strategically allocate attention as well as to encode information to and retrieve it from the long-term memory. It is concluded that the eye-tracking technique is an excellent research tool to tap into the workings of the human mind during the comprehension of written texts.
文摘安全是民航业的核心主题。针对目前民航非计划事件分析严重依赖专家经验及分析效率低下的问题,文章提出一种结合Word2vec和双向长短期记忆(bidirectional long short-term memory,BiLSTM)神经网络模型的民航非计划事件分析方法。首先采用Word2vec模型针对事件文本语料进行词向量训练,缩小空间向量维度;然后通过BiLSTM模型自动提取特征,获取事件文本的完整序列信息和上下文特征向量;最后采用softmax函数对民航非计划事件进行分类。实验结果表明,所提出的方法分类效果更好,能达到更优的准确率和F 1值,对不平衡数据样本同样具有较稳定的分类性能,证明了该方法在民航非计划事件分析上的适用性和有效性。
文摘The paper discusses translation of culturally-loaded words(CLW),to which the author maintains that great importance should be attached,especially to translation of literary works.The difficulty of translating CLW is determined by complexity of culture.The author argues that in translating Chinese words laden with cultural connotations,foreignization should take priority over other strategies when the source text(ST) is highly literary.Thus,foreignness and local flavor in ST can be retained as much as possible in the target text(TT),hence enhancing target readers' appreciation of ST and promoting intercultural exchanges.
文摘[目的/意义]在人工智能技术及应用快速发展与深刻变革背景下,机器学习领域不断出现新的研究主题和方法,深度学习和强化学习技术持续发展。因此,有必要探索不同领域机器学习研究主题演化过程,并识别出热点与新兴主题。[方法/过程]本文以图书情报领域中2011—2022年Web of Science数据库中的机器学习研究论文为例,融合LDA和Word2vec方法进行主题建模和主题演化分析,引入主题强度、主题影响力、主题关注度与主题新颖性指标识别热点主题与新兴热点主题。[结果/结论]研究结果表明,(1)Word2vec语义处理能力与LDA主题演化能力的结合能够更加准确地识别研究主题,直观展示研究主题的分阶段演化规律;(2)图书情报领域的机器学习研究主题主要分为自然语言处理与文本分析、数据挖掘与分析、信息与知识服务三大类范畴。各类主题之间的关联性较强,且具有主题关联演化特征;(3)设计的主题强度、主题影响力和主题关注度指标及综合指标能够较好地识别出2011—2014年、2015—2018年和2019—2022年3个不同周期阶段的热点主题。
文摘This paper introduces an English word formation algorithm used in natural language processing system, which is based on affix stripping strategy. The goal is to reduce the redundant information in the machine lexicon and improve the ability to recognize words. In appendix, a formation program written in Turbo-Prolog is presented.
文摘This paper further discusses the Mutiplicity of Semantic Relations between Words(MSRW)(Lu 2010)in two aspects:the reason why the semantie relations between two words can be multiple and the significance of understanding the MSRW.Through examples,this paper points out that understanding the MSRW is of particular importance not only in deepening our knowledge about lexical semantics,but also in advancing the methodology of syntactic analysis.The Construction-Chunk Approach,a new methodology of syntactic analysis built on the MSRW,is a case in point.
文摘This paper, particularly focusing on the pitch of prosodic words,has conducted a contrastive study on the structure of prosodic words in Englishand Mandarin . This paper reports a Mandarin monologue speech corpus-study, anexperimental phonetic attempt to conduct a study on the pitch of trisyllabic prosodicwords in Mandarin monologue. In addition, taking the characteristics of Englishprosodic words into consideration, the paper makes a contrastive analysis of prosodicwords in English and Mandarin. This study finds that the pitch of trisyllabic prosodicwords in Mandarin is inevitably affected by structural factors. As far as the leftsyllable is concerned, the grammatical category, prosodic hierarchical boundary andthe position of the intonational phrase where the syllable is located, the mid syllableand the right syllable may have influences on the pitch contour of the left syllable.As to the mid syllable, the grammatical category, the left syllable, the right syllableand the position of the intonational phrase where the syllable is located may haveinfluences on the pitch contour of the mid syllable. As for the right syllable, theprosodic hierarchical boundary where the syllable is located and the mid syllable mayhave effects on the pitch contour of the right syllable. Different from the previousfindings of the study on read corpus, this study shows that the mid syllable not onlyhas dissimilatory effects but also has assimilatory effects on the pitch of its precedingsyllable. The left syllable has anticipatory effects on the onset pitch of the mid syllableand the right syllable has coarticulation effects on the offset pitch of the mid syllable.
文摘Background Generally speaking. Chinese college graduates in the fifties and sixties took Russian as their second language, and those who graduated in the seventies had no second language to speak of. Now, in the years of our Open Door Policy, they find they have to learn some English and learn it quickly. They try to learn from radio and TV and many take English courses of 4 to 6 months, with varying degree of success. Their chief stumbling blocks
基金the research achievements of 2013 Classroom Teaching Reform Project in Higher Education fiscally aided by Zhejiang province(Grant No.kg2013416)2014 Pre-research for High-Level Project in Humanities and Social Sciences financially supported by Huzhou University(Grant No.2014SKYY07)2015 Monographic Research Project financed by Zhejiang Association of Foreign Languages&Literatures(Grant No.ZWYB2015003)
文摘The article holds that the problem of the relationship between"words"and"meaning"has always been the philosophical proposition.Through analyzing the relationship between"words"and"meaning"and probing into"disoourse","text"and"space-time history",the author thinks that the interpreters'preconceptions,preexistence,pre-structure and different perspectives have resulted in diverse forms of interpretations,and the being of each kind of form has its own rationality.Moreover,the process in which the interpreters interpret"words","discourse","text"and"history"is also the process to interpret themselves.Nevertheless,being able to say and being unable to say are soaked in the whole process of interpretation,and,as a result,human beings will always be confronted with a kind of"say and cannot say"embarrassment.