Funding: We acknowledge the National Natural Science Foundation of China (Grant No. 71673143) and the National Social Science Foundation of China (Grant No. 19BTQ062) for their financial support.
Abstract:
Purpose: This study aims to explore the trend and status of international collaboration in the field of artificial intelligence (AI) and to understand the hot topics, core groups, and major collaboration patterns in global AI research.
Design/methodology/approach: We selected 38,224 papers in the field of AI from 1985 to 2019 in the core collection database of Web of Science (WoS) and studied international collaboration from the perspectives of authors, institutions, and countries through bibliometric analysis and social network analysis.
Findings: The bibliometric results show that in the field of AI, the number of published papers is increasing every year, and 84.8% of them are cooperative papers. Collaboration among more than three authors, collaboration between two countries, and collaboration within institutions are the three main levels of collaboration patterns. Through social network analysis, this study found that the US, the UK, France, and Spain led global collaborative research in the field of AI at the country level, while Vietnam, Saudi Arabia, and the United Arab Emirates had a high degree of international participation. Collaboration at the institution level reflects obvious regional and economic characteristics. There is a Developing Countries Institution Collaboration Group led by Iran, China, and Vietnam, as well as a Developed Countries Institution Collaboration Group led by the US, Canada, and the UK. The Chinese Academy of Sciences (China) also plays a pivotal role in connecting these institutional collaboration groups.
Research limitations: First, participant contributions in international collaboration may have varied, but in our research they are treated equally when building collaboration networks. Second, although the edge weight in the collaboration network is considered, it is only used to help reduce the network and does not reflect the strength of collaboration.
Practical implications: The findings help fill the current shortage of research on international collaboration in AI. They will help inform scientists and policy makers about the future of AI research.
Originality/value: This work covers the longest period to date in studying international collaboration in the field of AI. It explores the evolution, future trends, and major collaboration patterns of international collaboration in the field of AI over the past 35 years. It also reveals the leading countries, core groups, and characteristics of collaboration in the field of AI.
Funding: Supported by the Major Projects of the National Social Science Fund (No. 17ZDA291), the Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University) (No. MJUKF201704), and the Qing Lan Project.
Abstract:
Purpose: This study aims to build an automatic survey generation tool, named CitationAS, based on citation content as represented by the set of citing sentences in the original articles.
Design/methodology/approach: First, we apply LDA to analyse the topic distribution of citation content. Second, in CitationAS, we use bisecting K-means, Lingo, and STC to cluster the retrieved citation content. Then Word2Vec, WordNet, and a combination of the two are applied to generate cluster labels. Next, we employ TF-IDF and MMR, together with sentence location information, to extract important sentences, which are used to generate surveys. Finally, we evaluate the generated surveys manually.
Findings: In the experiments, we chose 20 high-frequency phrases as search terms. Results show that Lingo-Word2Vec, STC-WordNet, and bisecting K-means-Word2Vec have better clustering effects. On a 5-point evaluation scale, the survey quality scores obtained by the designed methods are close to 3, indicating that the surveys are within acceptable limits. Taking sentence location information into account improves survey quality. Combinations of Lingo, Word2Vec, and TF-IDF or MMR achieve higher survey quality.
Research limitations: The manual evaluation method may be somewhat subjective. We use a simple linear function to combine Word2Vec and WordNet, which may not bring out their strengths. The generated surveys may miss newly created knowledge in some articles, because such knowledge may be concentrated in sentences that contain no citations.
Practical implications: The CitationAS tool can automatically generate a comprehensive, detailed, and accurate survey according to a user's search terms. It can also help researchers learn about the research status of a given field.
Originality/value: The CitationAS tool is of practical use. It merges cluster labels at the semantic level to improve clustering results. The tool also considers sentence location information when calculating sentence scores by TF-IDF and MMR.
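The sentence-extraction step named in the abstract (TF-IDF combined with MMR) can be illustrated with a minimal sketch. This is not the CitationAS implementation: the whitespace tokenizer, the smoothed IDF formula, the use of the search term as the relevance query, and the names `tfidf_vectors`, `cosine`, `mmr_select`, and `lam` are all assumptions made for illustration, and sentence location information is left out.

```python
import math
from collections import Counter


def tfidf_vectors(texts):
    """Return one sparse TF-IDF vector (dict) per text; whitespace tokens (assumption)."""
    tokenized = [t.lower().split() for t in texts]
    n = len(tokenized)
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mmr_select(sentences, query, k=2, lam=0.5):
    """Greedy MMR: trade relevance to the query against redundancy with chosen sentences."""
    vectors = tfidf_vectors(sentences + [query])
    query_vec, sent_vecs = vectors[-1], vectors[:-1]
    selected = []
    candidates = list(range(len(sentences)))
    while candidates and len(selected) < k:
        def score(i):
            relevance = cosine(sent_vecs[i], query_vec)
            redundancy = max(
                (cosine(sent_vecs[i], sent_vecs[j]) for j in selected),
                default=0.0,
            )
            return lam * relevance - (1 - lam) * redundancy
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return [sentences[i] for i in selected]


# Hypothetical citing sentences, for illustration only.
citing_sentences = [
    "topic models cluster citation sentences",
    "topic models cluster citation sentences",
    "word embeddings label clusters",
]
summary = mmr_select(citing_sentences, "topic models", k=2, lam=0.3)
```

A lower `lam` penalizes redundancy more strongly, so a near-duplicate of an already chosen sentence is less likely to be selected again.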
Funding: National Natural Science Foundation of China (Grant No. 71673131).
Abstract:
Purpose: To reveal the research hotspots of, and the relationships among, three hot research topics in biomedicine, namely CRISPR, iPS (induced Pluripotent Stem) cells, and synthetic biology.
Design/methodology/approach: We set up their keyword co-occurrence networks, using three indicators and information visualization for metric analysis.
Findings: The results reveal that the main research hotspots in the three topics are different, but the overlapping keywords in the three topics indicate that they are integrated and interact with one another.
Research limitations: All analyses use keywords only, without any other forms of data.
Practical implications: We try to find the information distribution and structure of these three hot topics in order to reveal their research status and interactions and to promote biomedical developments.
Originality/value: We chose the core keywords in three hot research topics in biomedicine by using the h-index.
Funding: National Natural Science Foundation of China (Grant No. 71673131); Jiangsu Key Laboratory Fund; support from the International Joint Informatics Laboratory operated cooperatively by the University of Illinois at Urbana-Champaign, USA, and Nanjing University, China.
Abstract:
Purpose: Using the metaphor of "unicorn," we identify the scientific papers and technical patents characterized by the informetric feature of very high citations in the first ten years after publication, which may provide a new pattern for understanding very high-impact works in science and technology.
Design/methodology/approach: We set CT as the total citations of a paper or patent in the first ten years after publication. With CT≥5,000 for a scientific "unicorn" and CT≥500 for a technical "unicorn," we have an absolute standard for identifying scientific and technical "unicorn" publications.
Findings: We identify 165 scientific "unicorns" among 14,301,875 WoS papers and 224 technical "unicorns" among 13,728,950 DII patents during 2001–2012. About 50% of "unicorns" belong to biomedicine, and selected cases from this field are discussed individually. The number of rare "unicorns" increases following a linear model; the fit shows 95% confidence, with an RMSE of 0.2127 for scientific "unicorns" and 0.0923 for technical "unicorns."
Research limitations: A "unicorn" is a purely quantitative designation that does not consider quality, and "potential unicorns," defined as CT≤5,000 for papers and CT≤500 for patents, are left for future studies.
Practical implications: Scientific and technical "unicorns" provide a new pattern for understanding high-impact works in science and technology. The "unicorn" pattern supplies a concise approach to identifying very high-impact scientific papers and technical patents.
Originality/value: The "unicorn" pattern supplies a concise approach to identifying very high-impact scientific papers and technical patents.
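The absolute standard described in this abstract amounts to a threshold test on ten-year citation counts, which can be sketched as follows. The thresholds (5,000 for papers, 500 for patents) come from the abstract; the `Record` type, its field names, and the sample records are hypothetical and serve only as illustration.

```python
from dataclasses import dataclass

# Thresholds stated in the abstract: CT is the total number of citations
# received in the first ten years after publication.
UNICORN_THRESHOLDS = {"paper": 5000, "patent": 500}


@dataclass
class Record:
    title: str  # hypothetical field
    kind: str   # "paper" or "patent"
    ct: int     # citations in the first ten years after publication


def is_unicorn(record):
    """Return True when the record meets the CT threshold for its kind."""
    return record.ct >= UNICORN_THRESHOLDS[record.kind]


# Made-up records, for illustration only.
records = [
    Record("Paper A", "paper", 5000),
    Record("Paper B", "paper", 4999),
    Record("Patent C", "patent", 512),
]
unicorns = [r.title for r in records if is_unicorn(r)]
```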