Purpose:The goal of this study is a comparative analysis of the relation between funding(a main driver for scientific research)and citations in papers of Nobel Laureates in physics,chemistry and medicine over 2019-202...Purpose:The goal of this study is a comparative analysis of the relation between funding(a main driver for scientific research)and citations in papers of Nobel Laureates in physics,chemistry and medicine over 2019-2020 and the same relation in these research fields as a whole.Design/methodology/approach:This study utilizes a power law model to explore the relationship between research funding and citations of related papers.The study here analyzes 3,539 recorded documents by Nobel Laureates in physics,chemistry and medicine and a broader dataset of 183,016 documents related to the fields of physics,medicine,and chemistry recorded in the Web of Science database.Findings:Results reveal that in chemistry and medicine,funded researches published in papers of Nobel Laureates have higher citations than unfunded studies published in articles;vice versa high citations of Nobel Laureates in physics are for unfunded studies published in papers.Instead,when overall data of publications and citations in physics,chemistry and medicine are analyzed,all papers based on funded researches show higher citations than unfunded ones.Originality/value:Results clarify the driving role of research funding for science diffusion that are systematized in general properties:a)articles concerning funded researches receive more citations than(un)funded studies published in papers of physics,chemistry and medicine sciences,generating a high Matthew effect(a higher growth of citations with the increase in the number of papers);b)research funding increases the citations of articles in fields oriented to applied research(e.g.,chemistry and medicine)more than fields oriented towards basic research(e.g.,physics).Practical implications:The results here explain some characteristics of scientific development and diffusion,highlighting the critical role of research funding in fostering citations and the expansion of scientific knowledge.This finding can support decision-making of policymakers and R&D managers to improve the effectiveness in allocating financial resources in science policies to generate a higher positive scientific and societal impact.展开更多
Purpose:The goal of this study is to analyze the relationship between funded and unfunded papers and their citations in both basic and applied sciences.Design/methodology/approach:A power law model analyzes the relati...Purpose:The goal of this study is to analyze the relationship between funded and unfunded papers and their citations in both basic and applied sciences.Design/methodology/approach:A power law model analyzes the relationship between research funding and citations of papers using 831,337 documents recorded in the Web of Science database.Findings:The original results reveal general characteristics of the diffusion of science in research fields:a)Funded articles receive higher citations compared to unfunded papers in journals;b)Funded articles exhibit a super-linear growth in citations,surpassing the increase seen in unfunded articles.This finding reveals a higher diffusion of scientific knowledge in funded articles.Moreover,c)funded articles in both basic and applied sciences demonstrate a similar expected change in citations,equivalent to about 1.23%,when the number of funded papers increases by 1%in journals.This result suggests,for the first time,that funding effect of scientific research is an invariant driver,irrespective of the nature of the basic or applied sciences.Originality/value:This evidence suggests empirical laws of funding for scientific citations that explain the importance of robust funding mechanisms for achieving impactful research outcomes in science and society.These findings here also highlight that funding for scientific research is a critical driving force in supporting citations and the dissemination of scientific knowledge in recorded documents in both basic and applied sciences.Practical implications:This comprehensive result provides a holistic view of the relationship between funding and citation performance in science to guide policymakers and R&D managers with science policies by directing funding to research in promoting the scientific development and higher diffusion of results for the progress of human society.展开更多
Purpose:We analyzed the structure of a community of authors working in the field of social network analysis(SNA)based on citation indicators:direct citation and bibliographic coupling metrics.We observed patterns at t...Purpose:We analyzed the structure of a community of authors working in the field of social network analysis(SNA)based on citation indicators:direct citation and bibliographic coupling metrics.We observed patterns at the micro,meso,and macro levels of analysis.Design/methodology/approach:We used bibliometric network analysis,including the“temporal quantities”approach proposed to study temporal networks.Using a two-mode network linking publications with authors and a one-mode network of citations between the works,we constructed and analyzed the networks of citation and bibliographic coupling among authors.We used an iterated saturation data collection approach.Findings:At the macro-level,we observed the global structural features of citations between authors,showing that 80%of authors have not more than 15 citations from other works.At the meso-level,we extracted the groups of authors citing each other and similar to each other according to their citation patterns.We have seen a division of authors in SNA into groups of social scientists and physicists,as well as into other groups of authors from different disciplines.We found some examples of brokerage between different groups that maintained the common identity of the field.At the micro-level,we extracted authors with extremely high values of received citations,who can be considered as the most prominent authors in the field.We examined the temporal properties of the most popular authors.Research limitations:The main challenge in this approach is the resolution of the author’s name(synonyms and homonyms).We faced the author disambiguation,or“multiple personalities”(Harzing,2015)problem.To remain consistent and comparable with our previously published articles,we used the same SNA data collected up to 2018.The analysis and conclusions on the activity,productivity,and visibility of the authors are relative only to the field of SNA.Practical implications:The proposed approach can be utilized for similar objectives and identifying key structures and characteristics in other disciplines.This may potentially inspire the application of network approaches in other research areas,creating more authors collaborating in the field of SNA.Originality/value:We identified and applied an innovative approach and methods to study the structure of scientific communities,which allowed us to get the findings going beyond those obtained with other methods.We used a new approach to temporal network analysis,which is an important addition to the analysis as it provides detailed information on different measures for the authors and pairs of authors over time.展开更多
Purpose:To analyze the diversity of citation distributions to publications in different research topics to investigate the accuracy of size-independent,rank-based indicators.The top percentile-based indicators are the...Purpose:To analyze the diversity of citation distributions to publications in different research topics to investigate the accuracy of size-independent,rank-based indicators.The top percentile-based indicators are the most common indicators of this type,and the evaluations of Japan are the most evident misjudgments.Design/methodology/approach:The distributions of citations to publications from countries and journals in several research topics were analyzed along with the corresponding global publications using histograms with logarithmic binning,double rank plots,and normal probability plots of log-transformed numbers of citations.Findings:Size-independent,top percentile-based indicators are accurate when the global ranks of local publications fit a power law,but deviations in the least cited papers are frequent in countries and occur in all journals with high impact factors.In these cases,a single indicator is misleading.Comparisons of the proportions of uncited papers are the best way to predict these deviations.Research limitations:This study is fundamentally analytical,and its results describe mathematical facts that are self-evident.Practical implications:Respectable institutions,such as the OECD,the European Commission,and the U.S.National Science Board,produce research country rankings and individual evaluations using size-independent percentile indicators that are misleading in many countries.These misleading evaluations should be discontinued because they can cause confusion among research policymakers and lead to incorrect research policies.Originality/value:Studies linking the lower tail of citation distribution,including uncited papers,to percentile research indicators have not been performed previously.The present results demonstrate that studies of this type are necessary to find reliable procedures for research assessments.展开更多
Purpose:The disseminating of academic knowledge to nonacademic audiences partly relies on the transition of subsequent citing papers.This study aims to investigate direct and indirect impact on technology and policy o...Purpose:The disseminating of academic knowledge to nonacademic audiences partly relies on the transition of subsequent citing papers.This study aims to investigate direct and indirect impact on technology and policy originating from transformative research based on ego citation network.Design/methodology/approach:Key Nobel Prize-winning publications(NPs)in fields of gene engineering and astrophysics are regarded as a proxy for transformative research.In this contribution,we introduce a network-structural indicator of citing patents to measure technological impact of a target article and use policy citations as a preliminary tool for policy impact.Findings:The results show that the impact on technology and policy of NPs are higher than that of their subsequent citation generations in gene engineering but not in astrophysics.Research limitations:The selection of Nobel Prizes is not balanced and the database used in this study,Dimensions,suffers from incompleteness and inaccuracy of citation links.Practical implications:Our findings provide useful clues to better understand the characteristics of transformative research in technological and policy impact.Originality/value:This study proposes a new framework to explore the direct and indirect impact on technology and policy originating from transformative research.展开更多
Correction to:Nuclear Science and Techniques(2024)35:162 https://doi.org/10.1007/s41365-024-01514-1 In this article,the citation information for Figures 6 and 7 was published incorrectly.The correct citations should r...Correction to:Nuclear Science and Techniques(2024)35:162 https://doi.org/10.1007/s41365-024-01514-1 In this article,the citation information for Figures 6 and 7 was published incorrectly.The correct citations should read as follows:In the sentence beginning‘the curve of Eq.17 is plotted…..’in this article,the text‘as shown in Fig.7a’should have read‘as shown in Fig.6a’.展开更多
This study examines how writer stance is projected in citation from a comparative perspective.The dataset consisted of 120 qualitative and quantitative research articles(RAs)that were authored by Anglophone and Chines...This study examines how writer stance is projected in citation from a comparative perspective.The dataset consisted of 120 qualitative and quantitative research articles(RAs)that were authored by Anglophone and Chinese applied linguists.ANO?VA tests revealed marked cross-language and cross-paradigmatic differences in stance-based dialogic engagement.展开更多
Purpose: (1) To test basic assumptions underlying frequency-weighted citation analysis: (a) Uni-citations correspond to citations that are nonessential to the citing papers; (b) The influence of a cited paper ...Purpose: (1) To test basic assumptions underlying frequency-weighted citation analysis: (a) Uni-citations correspond to citations that are nonessential to the citing papers; (b) The influence of a cited paper on the citing paper increases with the frequency with which it is cited in the citing paper. (2) To explore the degree to which citation location may be used to help identify nonessential citations. Design/methodology/approach: Each of the in-text citations in all research articles published in Issue 1 of the Journal of the Association for Information Science and Technology (JASIST) 2016 was manually classified into one of these five categories: Applied, Contrastive, Supportive, Reviewed, and Perfunctory. The distributions of citations at different in-text frequencies and in different locations in the text by these functions were analyzed. Findings: Filtering out nonessential citations before assigning weight is important for frequency-weighted citation analysis. For this purpose, removing citations by location is more effective than re-citation analysis that simply removes uni-citations. Removing all citation occurrences in the Background and Literature Review sections and uni-citations in the Introduction section appears to provide a good balance between filtration and error rates. Research limitations: This case study suffers from the limitation of scalability and generalizability. We took careful measures to reduce the impact of other limitations of the data collection approach used. Relying on the researcher's judgment to attribute citation functions, this approach is unobtrusive but speculative, and can suffer from a low degree of confidence, thus creating reliability concerns. Practical implications: Weighted citation analysis promises to improve citation analysis for research evaluation, knowledge network analysis, knowledge representation, and information retrieval. The present study showed the importance of filtering out nonessential citations before assigning weight in a weighted citation analysis, which may be a significant step forward to realizing these promises. Originality/value: Weighted citation analysis has long been proposed as a theoretical solution to the problem of citation analysis that treats all citations equally, and has attracted increasing research interest in recent years. The present study showed, for the first time, the importance of filtering out nonessential citations in weighted citation analysis, pointing research in this area in a new direction.展开更多
Purpose: First, to review the state-of-the-art in patent citation analysis, particularly characteristics of patent citations to scientific literature (scientific non-patent references, SNPRs). Second, to present a ...Purpose: First, to review the state-of-the-art in patent citation analysis, particularly characteristics of patent citations to scientific literature (scientific non-patent references, SNPRs). Second, to present a novel mapping approach to identify technology-relevant research based on the papers cited by and referring to the SNPRs. Design/methodology/approach: In the review part we discuss the context of SNPRs such as the time lags between scientific achievements and inventions. Also patent-to-patent citation is addressed particularly because this type of patent citation analysis is a major element in the assessment of the economic value of patents. We also review the research on the role of universities and researchers in technological development, with important issues such as universities as sources of technological knowledge and inventor-author relations. We conclude the review part of this paper with an overview of recent research on mapping and network analysis of the science and technology interface and of technological progress in interaction with science. In the second part we apply new techniques for the direct visualization of the cited and citing relations of SNPRs, the mapping of the landscape around SNPRs by bibliographic coupling and co-citation analysis, and the mapping of the conceptual environment of SNPRs by keyword co-occurrence analysis. Findings: We discuss several properties of SNPRs. Only a small minority of publications covered by the Web of Science or Scopus are cited by patents, about 3%-4%. However, for publications based on university-industry collaboration the number of SNPRs is considerably higher, around 15%. The proposed mapping methodology based on a "second order SNPR approach" enables a better assessment of the technological relevance of research. Research limitations: The main limitation is that a more advanced merging of patent and publication data, in particular unification of author and inventor names, in still a necessity. Practical implications: The proposed mapping methodology enables the creation of a database of technology-relevant papers (TRPs). In a bibliometric assessment the publications of research groups, research programs or institutes can be matched with the TRPs and thus the extent to which the work of groups, programs or institutes are relevant for technological development can be measured. Originality/value: The review part examines a wide range of findings in the research of patent citation analysis. The mapping approach to identify a broad range of technologyrelevant papers is novel and offers new opportunities in research evaluation practices.展开更多
Purpose:This study attempts to disclose the characteristics of knowledge integration in an interdisciplinary field by looking into the content aspect of knowledge.Design/methodology/approach:The eHealth field was chos...Purpose:This study attempts to disclose the characteristics of knowledge integration in an interdisciplinary field by looking into the content aspect of knowledge.Design/methodology/approach:The eHealth field was chosen in the case study.Associated knowledge phrases(AKPs)that are shared between citing papers and their references were extracted from the citation contexts of the eHealth papers by applying a stem-matching method.A classification schema that considers the functions of knowledge in the domain was proposed to categorize the identified AKPs.The source disciplines of each knowledge type were analyzed.Quantitative indicators and a co-occurrence analysis were applied to disclose the integration patterns of different knowledge types.Findings:The annotated AKPs evidence the major disciplines supplying each type of knowledge.Different knowledge types have remarkably different integration patterns in terms of knowledge amount,the breadth of source disciplines,and the integration time lag.We also find several frequent co-occurrence patterns of different knowledge types.Research limitations:The collected articles of the field are limited to the two leading open access journals.The stem-matching method to extract AKPs could not identify those phrases with the same meaning but expressed in words with different stems.The type of Research Subject dominates the recognized AKPs,which calls on an improvement of the classification schema for better knowledge integration analysis on knowledge units.Practical implications:The methodology proposed in this paper sheds new light on knowledge integration characteristics of an interdisciplinary field from the content perspective.The findings have practical implications on the future development of research strategies in eHealth and the policies about interdisciplinary research.Originality/value:This study proposed a new methodology to explore the content characteristics of knowledge integration in an interdisciplinary field.展开更多
Purpose: In the open science era, it is typical to share project-generated scientific data by depositing it in an open and accessible database. Moreover, scientific publications are preserved in a digital library arc...Purpose: In the open science era, it is typical to share project-generated scientific data by depositing it in an open and accessible database. Moreover, scientific publications are preserved in a digital library archive. It is challenging to identify the data usage that is mentioned in literature and associate it with its source. Here, we investigated the data usage of a government-funded cancer genomics project, The Cancer Genome Atlas(TCGA), via a full-text literature analysis.Design/methodology/approach: We focused on identifying articles using the TCGA dataset and constructing linkages between the articles and the specific TCGA dataset. First, we collected 5,372 TCGA-related articles from Pub Med Central(PMC). Second, we constructed a benchmark set with 25 full-text articles that truly used the TCGA data in their studies, and we summarized the key features of the benchmark set. Third, the key features were applied to the remaining PMC full-text articles that were collected from PMC.Findings: The amount of publications that use TCGA data has increased significantly since 2011, although the TCGA project was launched in 2005. Additionally, we found that the critical areas of focus in the studies that use the TCGA data were glioblastoma multiforme, lung cancer, and breast cancer; meanwhile, data from the RNA-sequencing(RNA-seq) platform is the most preferable for use.Research limitations: The current workflow to identify articles that truly used TCGA data is labor-intensive. An automatic method is expected to improve the performance.Practical implications: This study will help cancer genomics researchers determine the latest advancements in cancer molecular therapy, and it will promote data sharing and data-intensive scientific discovery.Originality/value: Few studies have been conducted to investigate data usage by governmentfunded projects/programs since their launch. In this preliminary study, we extracted articles that use TCGA data from PMC, and we created a link between the full-text articles and the source data.展开更多
Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted...Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted extensively,which leads to working on a research problem that others have well addressed.The study focuses on citation recommendation in the related studies section by applying the term function of a citation context,potentially improving the efficiency of writing a literature review.Design/methodology/approach:We present nine term functions with three newly created and six identified from existing literature.Using these term functions as labels,we annotate 531 research papers in three topics to evaluate our proposed recommendation strategy.BM25 and Word2vec with VSM are implemented as the baseline models for the recommendation.Then the term function information is applied to enhance the performance.Findings:The experiments show that the term function-based methods outperform the baseline methods regarding the recall,precision,and F1-score measurement,demonstrating that term functions are useful in identifying valuable citations.Research limitations:The dataset is insufficient due to the complexity of annotating citation functions for paragraphs in the related studies section.More recent deep learning models should be performed to future validate the proposed approach.Practical implications:The citation recommendation strategy can be helpful for valuable citation discovery,semantic scientific retrieval,and automatic literature review generation.Originality/value:The proposed citation function-based citation recommendation can generate intuitive explanations of the results for users,improving the transparency,persuasiveness,and effectiveness of recommender systems.展开更多
Purpose: This study aims to compare the characteristics of citation disciplinary structure between the G7 countries and the BRICS countries.Design/Methodology/Approach: In this contribution, which uses about 1 milli...Purpose: This study aims to compare the characteristics of citation disciplinary structure between the G7 countries and the BRICS countries.Design/Methodology/Approach: In this contribution, which uses about 1 million Web of Science publications and two publications years(1993 and 2013), we compare the G7 countries and the BRICS countries with regard to this type of structure. For the publication year 2013, cosine similarity values regarding the citation disciplinary structures of these countries(and of nine other countries) were used as input to cluster analysis. We also obtained cosine similarity values for a given country and its citation disciplinary structures across the two publication years. Moreover, for the publication year 2013, the within-country JeffreysMatusita distance between publication and citation disciplinary structure was measured. Research limitations: First, the citation disciplinary structures of countries depend on multiple and complex factors. It is therefore difficult to completely explain the formation and change of the citation disciplinary structure of a country. This study suggests some possible causes, whereas detailed explanations might be given by future research. Second, the length of the citation window used in this study is three years. However, scientific disciplines differ in their citation practices. Comparison between citations across disciplines using the same citation window length may affect the citation discipline structure results for some countries.Practical limitations: First, the results of this study are based on the WoS database. However, in this database some fields are covered to a greater extent than others, which may affect the results for the citation discipline structure for some studied countries. In future research, we might repeat this study using another database(like Scopus) and, in that case, we would like to make comparisons between the two outcomes. Second, the use of a constant journal setyielded that a large share of the journals covered by WoS year 2013 is ignored in the study. Thus, disciplinary structure is studied based on a quite restricted set of publications. The three mentioned limitations should be kept in mind when the results of this study are interpreted.Originality/value: Disciplinary structure on country level is a highlighted topic for the S&T policy makers, especially for those come from developing countries. This study observes the disciplinary structure in the view of academic impact, and the result will provide some evidence to make decision for the discipline strategy and funding allocation. Besides, JeffreysMatusita distance is introduced to measure the similarity of citation disciplinary structure and publication disciplinary structure. By applying this measure, some new observations were drawn, for example, "Based on the comparison of publication disciplinary structure and citation disciplinary structure, the paper finds most BRICS counties have less impact with more publications".Findings: The outcome of the cluster analysis indicates that the G7 countries and BRICS countries are quite heterogeneous regarding their citation disciplinary structure. For a majority of the G7 countries, the citation disciplinary structure tend to be more stable compared to BRICS countries with regard to the years 1993 and 2013. Most G7 countries, with United States as an exception, turned out to have lower values on the Jeffreys-Matusita distance than BRICS countries, indicating a higher degree of heterogeneity between the publication and the citation disciplinary structure for the latter countries. In other words, BRICS countries still receive much less citations in most disciplines than their publication output would suggest. G7 countries can still expect more citations than is to be expected based on their publication output, thereby generating relatively more impact than BRICS countries.展开更多
Purpose: The number of citations has been widely used to measure the significance of a paper. However, there is a need in introducing another index to determine superiority or inferiority of papers with the same numbe...Purpose: The number of citations has been widely used to measure the significance of a paper. However, there is a need in introducing another index to determine superiority or inferiority of papers with the same number of citations. We determine superiority or inferiority of papers by using the ranking based on the number of citations and PageRank.Design/methodology/approach: We show the positive linear correlation between Citation Rank(the ranking of the number of citation) and PageRank. On this basis, we identify highquality, prestige, emerging, and popular papers. Findings: We found that the high-quality papers belong to the subjects of biochemistry and molecular biology, chemistry, and multidisciplinary sciences. The prestige papers correspond to the subjects of computer science, engineering, and information science. The emerging papers are related to biochemistry and molecular biology, as well as those published in the journal "Cell." The popular papers belong to the subject of multidisciplinary sciences. Research limitations: We analyze the Science Citation Index Expanded(SCIE) from 1981 to 2015 to calculate Citation Rank and PageRank within a citation network consisting of 34,666,719 papers and 591,321,826 citations.Practical implications: Our method is applicable to forecast emerging fields of research subjects in science and helps policymakers to consider science policy.Originality/value: We calculated PageRank for a giant citation network which is extremely larger than the citation networks investigated by previous researchers.展开更多
Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations ha...Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations have different purposes. What's more, some citations include unreasonable information, such as in case of intentional self-citation. To improve the accuracy of citation network-based academic recommendation and reduce the time complexity, we propose an academic recommendation method for recommending authors and papers. In which, an author-paper bilayer citation network is built, then an enhanced topic model, Author Community Topic Time Model(ACTTM) is proposed to detect high quality author communities in the author layer, and a set of attributes are proposed to comprehensively depict the author/paper nodes in the bilayer citation network. Experimental results prove that the proposed ACTTM can detect high quality author communities and facilitate low time complexity, and the proposed academic recommendation method can effectively improve the recommendation accuracy.展开更多
Purpose:Research dynamics have long been a research interest.It is a macro perspective tool for discovering temporal research trends of a certain discipline or subject.A micro perspective of research dynamics,however,...Purpose:Research dynamics have long been a research interest.It is a macro perspective tool for discovering temporal research trends of a certain discipline or subject.A micro perspective of research dynamics,however,concerning a single researcher or a highly cited paper in terms of their citations and“citations of citations”(forward chaining)remains unexplored.Design/methodology/approach:In this paper,we use a cross-collection topic model to reveal the research dynamics of topic disappearance topic inheritance,and topic innovation in each generation of forward chaining.Findings:For highly cited work,scientific influence exists in indirect citations.Topic modeling can reveal how long this influence exists in forward chaining,as well as its influence.Research limitations:This paper measures scientific influence and indirect scientific influence only if the relevant words or phrases are borrowed or used in direct or indirect citations.Paraphrasing or semantically similar concept may be neglected in this research.Practical implications:This paper demonstrates that a scientific influence exists in indirect citations through its analysis of forward chaining.This can serve as an inspiration on how to adequately evaluate research influence.Originality:The main contributions of this paper are the following three aspects.First,besides research dynamics of topic inheritance and topic innovation,we model topic disappearance by using a cross-collection topic model.Second,we explore the length and character of the research impact through“citations of citations”content analysis.Finally,we analyze the research dynamics of artificial intelligence researcher Geoffrey Hinton’s publications and the topic dynamics of forward chaining.展开更多
Purpose: This study aims to build an automatic survey generation tool, named CitationAS, based on citation content as represented by the set of citing sentences in the original articles.Design/methodology/approach: ...Purpose: This study aims to build an automatic survey generation tool, named CitationAS, based on citation content as represented by the set of citing sentences in the original articles.Design/methodology/approach: Firstly, we apply LDA to analyse topic distribution of citation content. Secondly, in CitationAS, we use bisecting K-means, Lingo and STC to cluster retrieved citation content. Then Word2Vec, Word Net and combination of them are applied to generate cluster labels. Next, we employ TF-IDF, MMR, as well as considering sentence location information, to extract important sentences, which are used to generate surveys. Finally, we adopt manual evaluation for the generated surveys.Findings: In experiments, we choose 20 high-frequency phrases as search terms. Results show that Lingo-Word2Vec, STC-Word Net and bisecting K-means-Word2Vec have better clustering effects. In 5 points evaluation system, survey quality scores obtained by designing methods are close to 3, indicating surveys are within acceptable limits. When considering sentence location information, survey quality will be improved. Combination of Lingo, Word2Vec, TF-IDF or MMR can acquire higher survey quality.Research limitations: The manual evaluation method may have a certain subjectivity. We use a simple linear function to combine Word2Vec and Word Net that may not bring out their strengths. The generated surveys may not contain some newly created knowledge of some articles which may concentrate on sentences with no citing.Practical implications: CitationAS tool can automatically generate a comprehensive, detailed and accurate survey according to user’s search terms. It can also help researchers learn about research status in a certain field.Originality/value: Citaiton AS tool is of practicability. It merges cluster labels from semantic level to improve clustering results. The tool also considers sentence location information when calculating sentence score by TF-IDF and MMR.展开更多
Purpose:We study the proportion of Web of Science(WoS)citation links that are represented in the Crossref Open Citation Index(COCI),with the possible aim of using COCI in research evaluation instead of the WoS,if the ...Purpose:We study the proportion of Web of Science(WoS)citation links that are represented in the Crossref Open Citation Index(COCI),with the possible aim of using COCI in research evaluation instead of the WoS,if the level of coverage was sufficient.Design/methodology/approach:We calculate the proportion on citation links where both publications have a WoS accession number and a DOI simultaneously,and where the cited publications have had at least one author from our institution,the Czech Technical University in Prague.We attempt to look up each such citation link in COCI.Findings:We find that 53.7%of WoS citation links are present in the COCI.The proportion varies largely by discipline.The total figures differ significantly from 40%in the large-scale study by Van Eck,Waltman,Larivière,and Sugimoto(blog 2018,https://www.cwts.nl/blog?article=n-r2s234).Research limitations:The sample does not cover all science areas uniformly;it is heavily focused on Engineering and Technology,and only some disciplines of Natural Sciences are present.However,this reflects the real scientific orientation and publication profile of our institution.Practical implications:The current level of coverage is not sufficient for the WoS to be replaced by COCI for research evaluation.Originality/value:The present study illustrates a COCI vs WoS comparison on the scale of a larger technical university in Central Europe.展开更多
Purpose: Communicating scientific results to the public is essential to inspire future researchers and ensure that discoveries are exploited. News stories about research are a key communication pathway for this and ha...Purpose: Communicating scientific results to the public is essential to inspire future researchers and ensure that discoveries are exploited. News stories about research are a key communication pathway for this and have been manually monitored to assess the extent of press coverage of scholarship.Design/methodology/Approach: To make larger scale studies practical, this paper introduces an automatic method to extract citations from newspaper stories to large sets of academic journals. Curated ProQuest queries were used to search for citations to 9,639 Science and3,412 Social Science Web of Science(WoS) journals from eight UK daily newspapers during2006–2015. False matches were automatically filtered out by a new program, with 94% of the remaining stories meaningfully citing research.Findings: Most Science(95%) and Social Science(94%) journals were never cited by these newspapers. Half of the cited Science journals covered medical or health-related topics,whereas 43% of the Social Sciences journals were related to psychiatry or psychology. From the citing news stories, 60% described research extensively and 53% used multiple sources,but few commented on research quality.Research Limitations: The method has only been tested in English and from the ProQuest Newspapers database.Practical implications: Others can use the new method to systematically harvest press coverage of research.Originality/value: An automatic method was introduced and tested to extract citations from newspaper stories to large sets of academic journals.展开更多
文摘Purpose:The goal of this study is a comparative analysis of the relation between funding(a main driver for scientific research)and citations in papers of Nobel Laureates in physics,chemistry and medicine over 2019-2020 and the same relation in these research fields as a whole.Design/methodology/approach:This study utilizes a power law model to explore the relationship between research funding and citations of related papers.The study here analyzes 3,539 recorded documents by Nobel Laureates in physics,chemistry and medicine and a broader dataset of 183,016 documents related to the fields of physics,medicine,and chemistry recorded in the Web of Science database.Findings:Results reveal that in chemistry and medicine,funded researches published in papers of Nobel Laureates have higher citations than unfunded studies published in articles;vice versa high citations of Nobel Laureates in physics are for unfunded studies published in papers.Instead,when overall data of publications and citations in physics,chemistry and medicine are analyzed,all papers based on funded researches show higher citations than unfunded ones.Originality/value:Results clarify the driving role of research funding for science diffusion that are systematized in general properties:a)articles concerning funded researches receive more citations than(un)funded studies published in papers of physics,chemistry and medicine sciences,generating a high Matthew effect(a higher growth of citations with the increase in the number of papers);b)research funding increases the citations of articles in fields oriented to applied research(e.g.,chemistry and medicine)more than fields oriented towards basic research(e.g.,physics).Practical implications:The results here explain some characteristics of scientific development and diffusion,highlighting the critical role of research funding in fostering citations and the expansion of scientific knowledge.This finding can support decision-making of policymakers and R&D managers to improve the effectiveness in allocating financial resources in science policies to generate a higher positive scientific and societal impact.
文摘Purpose:The goal of this study is to analyze the relationship between funded and unfunded papers and their citations in both basic and applied sciences.Design/methodology/approach:A power law model analyzes the relationship between research funding and citations of papers using 831,337 documents recorded in the Web of Science database.Findings:The original results reveal general characteristics of the diffusion of science in research fields:a)Funded articles receive higher citations compared to unfunded papers in journals;b)Funded articles exhibit a super-linear growth in citations,surpassing the increase seen in unfunded articles.This finding reveals a higher diffusion of scientific knowledge in funded articles.Moreover,c)funded articles in both basic and applied sciences demonstrate a similar expected change in citations,equivalent to about 1.23%,when the number of funded papers increases by 1%in journals.This result suggests,for the first time,that funding effect of scientific research is an invariant driver,irrespective of the nature of the basic or applied sciences.Originality/value:This evidence suggests empirical laws of funding for scientific citations that explain the importance of robust funding mechanisms for achieving impactful research outcomes in science and society.These findings here also highlight that funding for scientific research is a critical driving force in supporting citations and the dissemination of scientific knowledge in recorded documents in both basic and applied sciences.Practical implications:This comprehensive result provides a holistic view of the relationship between funding and citation performance in science to guide policymakers and R&D managers with science policies by directing funding to research in promoting the scientific development and higher diffusion of results for the progress of human society.
基金supported in part by the Slovenian Research Agency(VB,research program P1-0294)(VB,research project J5-2557)+2 种基金(VB,research project J5-4596)COST EU(VB,COST action CA21163(HiTEc)is prepared within the framework of the HSE University Basic Research Program.
文摘Purpose:We analyzed the structure of a community of authors working in the field of social network analysis(SNA)based on citation indicators:direct citation and bibliographic coupling metrics.We observed patterns at the micro,meso,and macro levels of analysis.Design/methodology/approach:We used bibliometric network analysis,including the“temporal quantities”approach proposed to study temporal networks.Using a two-mode network linking publications with authors and a one-mode network of citations between the works,we constructed and analyzed the networks of citation and bibliographic coupling among authors.We used an iterated saturation data collection approach.Findings:At the macro-level,we observed the global structural features of citations between authors,showing that 80%of authors have not more than 15 citations from other works.At the meso-level,we extracted the groups of authors citing each other and similar to each other according to their citation patterns.We have seen a division of authors in SNA into groups of social scientists and physicists,as well as into other groups of authors from different disciplines.We found some examples of brokerage between different groups that maintained the common identity of the field.At the micro-level,we extracted authors with extremely high values of received citations,who can be considered as the most prominent authors in the field.We examined the temporal properties of the most popular authors.Research limitations:The main challenge in this approach is the resolution of the author’s name(synonyms and homonyms).We faced the author disambiguation,or“multiple personalities”(Harzing,2015)problem.To remain consistent and comparable with our previously published articles,we used the same SNA data collected up to 2018.The analysis and conclusions on the activity,productivity,and visibility of the authors are relative only to the field of SNA.Practical implications:The proposed approach can be utilized for similar objectives and identifying key structures and characteristics in other disciplines.This may potentially inspire the application of network approaches in other research areas,creating more authors collaborating in the field of SNA.Originality/value:We identified and applied an innovative approach and methods to study the structure of scientific communities,which allowed us to get the findings going beyond those obtained with other methods.We used a new approach to temporal network analysis,which is an important addition to the analysis as it provides detailed information on different measures for the authors and pairs of authors over time.
文摘Purpose:To analyze the diversity of citation distributions to publications in different research topics to investigate the accuracy of size-independent,rank-based indicators.The top percentile-based indicators are the most common indicators of this type,and the evaluations of Japan are the most evident misjudgments.Design/methodology/approach:The distributions of citations to publications from countries and journals in several research topics were analyzed along with the corresponding global publications using histograms with logarithmic binning,double rank plots,and normal probability plots of log-transformed numbers of citations.Findings:Size-independent,top percentile-based indicators are accurate when the global ranks of local publications fit a power law,but deviations in the least cited papers are frequent in countries and occur in all journals with high impact factors.In these cases,a single indicator is misleading.Comparisons of the proportions of uncited papers are the best way to predict these deviations.Research limitations:This study is fundamentally analytical,and its results describe mathematical facts that are self-evident.Practical implications:Respectable institutions,such as the OECD,the European Commission,and the U.S.National Science Board,produce research country rankings and individual evaluations using size-independent percentile indicators that are misleading in many countries.These misleading evaluations should be discontinued because they can cause confusion among research policymakers and lead to incorrect research policies.Originality/value:Studies linking the lower tail of citation distribution,including uncited papers,to percentile research indicators have not been performed previously.The present results demonstrate that studies of this type are necessary to find reliable procedures for research assessments.
基金supported by the National Natural Science Foundation of China(Grant No.71974167).
文摘Purpose:The disseminating of academic knowledge to nonacademic audiences partly relies on the transition of subsequent citing papers.This study aims to investigate direct and indirect impact on technology and policy originating from transformative research based on ego citation network.Design/methodology/approach:Key Nobel Prize-winning publications(NPs)in fields of gene engineering and astrophysics are regarded as a proxy for transformative research.In this contribution,we introduce a network-structural indicator of citing patents to measure technological impact of a target article and use policy citations as a preliminary tool for policy impact.Findings:The results show that the impact on technology and policy of NPs are higher than that of their subsequent citation generations in gene engineering but not in astrophysics.Research limitations:The selection of Nobel Prizes is not balanced and the database used in this study,Dimensions,suffers from incompleteness and inaccuracy of citation links.Practical implications:Our findings provide useful clues to better understand the characteristics of transformative research in technological and policy impact.Originality/value:This study proposes a new framework to explore the direct and indirect impact on technology and policy originating from transformative research.
文摘Correction to:Nuclear Science and Techniques(2024)35:162 https://doi.org/10.1007/s41365-024-01514-1 In this article,the citation information for Figures 6 and 7 was published incorrectly.The correct citations should read as follows:In the sentence beginning‘the curve of Eq.17 is plotted…..’in this article,the text‘as shown in Fig.7a’should have read‘as shown in Fig.6a’.
文摘This study examines how writer stance is projected in citation from a comparative perspective.The dataset consisted of 120 qualitative and quantitative research articles(RAs)that were authored by Anglophone and Chinese applied linguists.ANO?VA tests revealed marked cross-language and cross-paradigmatic differences in stance-based dialogic engagement.
文摘Purpose: (1) To test basic assumptions underlying frequency-weighted citation analysis: (a) Uni-citations correspond to citations that are nonessential to the citing papers; (b) The influence of a cited paper on the citing paper increases with the frequency with which it is cited in the citing paper. (2) To explore the degree to which citation location may be used to help identify nonessential citations. Design/methodology/approach: Each of the in-text citations in all research articles published in Issue 1 of the Journal of the Association for Information Science and Technology (JASIST) 2016 was manually classified into one of these five categories: Applied, Contrastive, Supportive, Reviewed, and Perfunctory. The distributions of citations at different in-text frequencies and in different locations in the text by these functions were analyzed. Findings: Filtering out nonessential citations before assigning weight is important for frequency-weighted citation analysis. For this purpose, removing citations by location is more effective than re-citation analysis that simply removes uni-citations. Removing all citation occurrences in the Background and Literature Review sections and uni-citations in the Introduction section appears to provide a good balance between filtration and error rates. Research limitations: This case study suffers from the limitation of scalability and generalizability. We took careful measures to reduce the impact of other limitations of the data collection approach used. Relying on the researcher's judgment to attribute citation functions, this approach is unobtrusive but speculative, and can suffer from a low degree of confidence, thus creating reliability concerns. Practical implications: Weighted citation analysis promises to improve citation analysis for research evaluation, knowledge network analysis, knowledge representation, and information retrieval. The present study showed the importance of filtering out nonessential citations before assigning weight in a weighted citation analysis, which may be a significant step forward to realizing these promises. Originality/value: Weighted citation analysis has long been proposed as a theoretical solution to the problem of citation analysis that treats all citations equally, and has attracted increasing research interest in recent years. The present study showed, for the first time, the importance of filtering out nonessential citations in weighted citation analysis, pointing research in this area in a new direction.
文摘Purpose: First, to review the state-of-the-art in patent citation analysis, particularly characteristics of patent citations to scientific literature (scientific non-patent references, SNPRs). Second, to present a novel mapping approach to identify technology-relevant research based on the papers cited by and referring to the SNPRs. Design/methodology/approach: In the review part we discuss the context of SNPRs such as the time lags between scientific achievements and inventions. Also patent-to-patent citation is addressed particularly because this type of patent citation analysis is a major element in the assessment of the economic value of patents. We also review the research on the role of universities and researchers in technological development, with important issues such as universities as sources of technological knowledge and inventor-author relations. We conclude the review part of this paper with an overview of recent research on mapping and network analysis of the science and technology interface and of technological progress in interaction with science. In the second part we apply new techniques for the direct visualization of the cited and citing relations of SNPRs, the mapping of the landscape around SNPRs by bibliographic coupling and co-citation analysis, and the mapping of the conceptual environment of SNPRs by keyword co-occurrence analysis. Findings: We discuss several properties of SNPRs. Only a small minority of publications covered by the Web of Science or Scopus are cited by patents, about 3%-4%. However, for publications based on university-industry collaboration the number of SNPRs is considerably higher, around 15%. The proposed mapping methodology based on a "second order SNPR approach" enables a better assessment of the technological relevance of research. Research limitations: The main limitation is that a more advanced merging of patent and publication data, in particular unification of author and inventor names, in still a necessity. Practical implications: The proposed mapping methodology enables the creation of a database of technology-relevant papers (TRPs). In a bibliometric assessment the publications of research groups, research programs or institutes can be matched with the TRPs and thus the extent to which the work of groups, programs or institutes are relevant for technological development can be measured. Originality/value: The review part examines a wide range of findings in the research of patent citation analysis. The mapping approach to identify a broad range of technologyrelevant papers is novel and offers new opportunities in research evaluation practices.
基金This study was funded by the National Social Science Foundation of China with Grant No.20CTQ024.
文摘Purpose:This study attempts to disclose the characteristics of knowledge integration in an interdisciplinary field by looking into the content aspect of knowledge.Design/methodology/approach:The eHealth field was chosen in the case study.Associated knowledge phrases(AKPs)that are shared between citing papers and their references were extracted from the citation contexts of the eHealth papers by applying a stem-matching method.A classification schema that considers the functions of knowledge in the domain was proposed to categorize the identified AKPs.The source disciplines of each knowledge type were analyzed.Quantitative indicators and a co-occurrence analysis were applied to disclose the integration patterns of different knowledge types.Findings:The annotated AKPs evidence the major disciplines supplying each type of knowledge.Different knowledge types have remarkably different integration patterns in terms of knowledge amount,the breadth of source disciplines,and the integration time lag.We also find several frequent co-occurrence patterns of different knowledge types.Research limitations:The collected articles of the field are limited to the two leading open access journals.The stem-matching method to extract AKPs could not identify those phrases with the same meaning but expressed in words with different stems.The type of Research Subject dominates the recognized AKPs,which calls on an improvement of the classification schema for better knowledge integration analysis on knowledge units.Practical implications:The methodology proposed in this paper sheds new light on knowledge integration characteristics of an interdisciplinary field from the content perspective.The findings have practical implications on the future development of research strategies in eHealth and the policies about interdisciplinary research.Originality/value:This study proposed a new methodology to explore the content characteristics of knowledge integration in an interdisciplinary field.
基金supported by the National Population and Health Scientific Data Sharing Program of Chinathe Knowledge Centre for Engineering Sciences and Technology (Medical Centre)the Fundamental Research Funds for the Central Universities (Grant No.: 13R0101)
文摘Purpose: In the open science era, it is typical to share project-generated scientific data by depositing it in an open and accessible database. Moreover, scientific publications are preserved in a digital library archive. It is challenging to identify the data usage that is mentioned in literature and associate it with its source. Here, we investigated the data usage of a government-funded cancer genomics project, The Cancer Genome Atlas(TCGA), via a full-text literature analysis.Design/methodology/approach: We focused on identifying articles using the TCGA dataset and constructing linkages between the articles and the specific TCGA dataset. First, we collected 5,372 TCGA-related articles from Pub Med Central(PMC). Second, we constructed a benchmark set with 25 full-text articles that truly used the TCGA data in their studies, and we summarized the key features of the benchmark set. Third, the key features were applied to the remaining PMC full-text articles that were collected from PMC.Findings: The amount of publications that use TCGA data has increased significantly since 2011, although the TCGA project was launched in 2005. Additionally, we found that the critical areas of focus in the studies that use the TCGA data were glioblastoma multiforme, lung cancer, and breast cancer; meanwhile, data from the RNA-sequencing(RNA-seq) platform is the most preferable for use.Research limitations: The current workflow to identify articles that truly used TCGA data is labor-intensive. An automatic method is expected to improve the performance.Practical implications: This study will help cancer genomics researchers determine the latest advancements in cancer molecular therapy, and it will promote data sharing and data-intensive scientific discovery.Originality/value: Few studies have been conducted to investigate data usage by governmentfunded projects/programs since their launch. In this preliminary study, we extracted articles that use TCGA data from PMC, and we created a link between the full-text articles and the source data.
基金This work is supported by the National Natural Science Foundation of China(Grant No.7167030644 and 71704137)。
文摘Purpose:Researchers frequently encounter the following problems when writing scientific articles:(1)Selecting appropriate citations to support the research idea is challenging.(2)The literature review is not conducted extensively,which leads to working on a research problem that others have well addressed.The study focuses on citation recommendation in the related studies section by applying the term function of a citation context,potentially improving the efficiency of writing a literature review.Design/methodology/approach:We present nine term functions with three newly created and six identified from existing literature.Using these term functions as labels,we annotate 531 research papers in three topics to evaluate our proposed recommendation strategy.BM25 and Word2vec with VSM are implemented as the baseline models for the recommendation.Then the term function information is applied to enhance the performance.Findings:The experiments show that the term function-based methods outperform the baseline methods regarding the recall,precision,and F1-score measurement,demonstrating that term functions are useful in identifying valuable citations.Research limitations:The dataset is insufficient due to the complexity of annotating citation functions for paragraphs in the related studies section.More recent deep learning models should be performed to future validate the proposed approach.Practical implications:The citation recommendation strategy can be helpful for valuable citation discovery,semantic scientific retrieval,and automatic literature review generation.Originality/value:The proposed citation function-based citation recommendation can generate intuitive explanations of the results for users,improving the transparency,persuasiveness,and effectiveness of recommender systems.
文摘Purpose: This study aims to compare the characteristics of citation disciplinary structure between the G7 countries and the BRICS countries.Design/Methodology/Approach: In this contribution, which uses about 1 million Web of Science publications and two publications years(1993 and 2013), we compare the G7 countries and the BRICS countries with regard to this type of structure. For the publication year 2013, cosine similarity values regarding the citation disciplinary structures of these countries(and of nine other countries) were used as input to cluster analysis. We also obtained cosine similarity values for a given country and its citation disciplinary structures across the two publication years. Moreover, for the publication year 2013, the within-country JeffreysMatusita distance between publication and citation disciplinary structure was measured. Research limitations: First, the citation disciplinary structures of countries depend on multiple and complex factors. It is therefore difficult to completely explain the formation and change of the citation disciplinary structure of a country. This study suggests some possible causes, whereas detailed explanations might be given by future research. Second, the length of the citation window used in this study is three years. However, scientific disciplines differ in their citation practices. Comparison between citations across disciplines using the same citation window length may affect the citation discipline structure results for some countries.Practical limitations: First, the results of this study are based on the WoS database. However, in this database some fields are covered to a greater extent than others, which may affect the results for the citation discipline structure for some studied countries. In future research, we might repeat this study using another database(like Scopus) and, in that case, we would like to make comparisons between the two outcomes. Second, the use of a constant journal setyielded that a large share of the journals covered by WoS year 2013 is ignored in the study. Thus, disciplinary structure is studied based on a quite restricted set of publications. The three mentioned limitations should be kept in mind when the results of this study are interpreted.Originality/value: Disciplinary structure on country level is a highlighted topic for the S&T policy makers, especially for those come from developing countries. This study observes the disciplinary structure in the view of academic impact, and the result will provide some evidence to make decision for the discipline strategy and funding allocation. Besides, JeffreysMatusita distance is introduced to measure the similarity of citation disciplinary structure and publication disciplinary structure. By applying this measure, some new observations were drawn, for example, "Based on the comparison of publication disciplinary structure and citation disciplinary structure, the paper finds most BRICS counties have less impact with more publications".Findings: The outcome of the cluster analysis indicates that the G7 countries and BRICS countries are quite heterogeneous regarding their citation disciplinary structure. For a majority of the G7 countries, the citation disciplinary structure tend to be more stable compared to BRICS countries with regard to the years 1993 and 2013. Most G7 countries, with United States as an exception, turned out to have lower values on the Jeffreys-Matusita distance than BRICS countries, indicating a higher degree of heterogeneity between the publication and the citation disciplinary structure for the latter countries. In other words, BRICS countries still receive much less citations in most disciplines than their publication output would suggest. G7 countries can still expect more citations than is to be expected based on their publication output, thereby generating relatively more impact than BRICS countries.
基金supported by Nihon University College of Science and Technology Grants-in Aid 2012 and 2016。
文摘Purpose: The number of citations has been widely used to measure the significance of a paper. However, there is a need in introducing another index to determine superiority or inferiority of papers with the same number of citations. We determine superiority or inferiority of papers by using the ranking based on the number of citations and PageRank.Design/methodology/approach: We show the positive linear correlation between Citation Rank(the ranking of the number of citation) and PageRank. On this basis, we identify highquality, prestige, emerging, and popular papers. Findings: We found that the high-quality papers belong to the subjects of biochemistry and molecular biology, chemistry, and multidisciplinary sciences. The prestige papers correspond to the subjects of computer science, engineering, and information science. The emerging papers are related to biochemistry and molecular biology, as well as those published in the journal "Cell." The popular papers belong to the subject of multidisciplinary sciences. Research limitations: We analyze the Science Citation Index Expanded(SCIE) from 1981 to 2015 to calculate Citation Rank and PageRank within a citation network consisting of 34,666,719 papers and 591,321,826 citations.Practical implications: Our method is applicable to forecast emerging fields of research subjects in science and helps policymakers to consider science policy.Originality/value: We calculated PageRank for a giant citation network which is extremely larger than the citation networks investigated by previous researchers.
基金supported by the grants from Natural Science Foundation of China (Project No.61471060)
文摘Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations have different purposes. What's more, some citations include unreasonable information, such as in case of intentional self-citation. To improve the accuracy of citation network-based academic recommendation and reduce the time complexity, we propose an academic recommendation method for recommending authors and papers. In which, an author-paper bilayer citation network is built, then an enhanced topic model, Author Community Topic Time Model(ACTTM) is proposed to detect high quality author communities in the author layer, and a set of attributes are proposed to comprehensively depict the author/paper nodes in the bilayer citation network. Experimental results prove that the proposed ACTTM can detect high quality author communities and facilitate low time complexity, and the proposed academic recommendation method can effectively improve the recommendation accuracy.
基金This work is supported by the Programs for the Young Talents of National Science Library,Chinese Academy of Sciences(Grant No.2019QNGR003).
文摘Purpose:Research dynamics have long been a research interest.It is a macro perspective tool for discovering temporal research trends of a certain discipline or subject.A micro perspective of research dynamics,however,concerning a single researcher or a highly cited paper in terms of their citations and“citations of citations”(forward chaining)remains unexplored.Design/methodology/approach:In this paper,we use a cross-collection topic model to reveal the research dynamics of topic disappearance topic inheritance,and topic innovation in each generation of forward chaining.Findings:For highly cited work,scientific influence exists in indirect citations.Topic modeling can reveal how long this influence exists in forward chaining,as well as its influence.Research limitations:This paper measures scientific influence and indirect scientific influence only if the relevant words or phrases are borrowed or used in direct or indirect citations.Paraphrasing or semantically similar concept may be neglected in this research.Practical implications:This paper demonstrates that a scientific influence exists in indirect citations through its analysis of forward chaining.This can serve as an inspiration on how to adequately evaluate research influence.Originality:The main contributions of this paper are the following three aspects.First,besides research dynamics of topic inheritance and topic innovation,we model topic disappearance by using a cross-collection topic model.Second,we explore the length and character of the research impact through“citations of citations”content analysis.Finally,we analyze the research dynamics of artificial intelligence researcher Geoffrey Hinton’s publications and the topic dynamics of forward chaining.
基金supported by Major Projects of National Social Science Fund (No. 17ZDA291)Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University) (No. MJUKF201704)Qing Lan Project
文摘Purpose: This study aims to build an automatic survey generation tool, named CitationAS, based on citation content as represented by the set of citing sentences in the original articles.Design/methodology/approach: Firstly, we apply LDA to analyse topic distribution of citation content. Secondly, in CitationAS, we use bisecting K-means, Lingo and STC to cluster retrieved citation content. Then Word2Vec, Word Net and combination of them are applied to generate cluster labels. Next, we employ TF-IDF, MMR, as well as considering sentence location information, to extract important sentences, which are used to generate surveys. Finally, we adopt manual evaluation for the generated surveys.Findings: In experiments, we choose 20 high-frequency phrases as search terms. Results show that Lingo-Word2Vec, STC-Word Net and bisecting K-means-Word2Vec have better clustering effects. In 5 points evaluation system, survey quality scores obtained by designing methods are close to 3, indicating surveys are within acceptable limits. When considering sentence location information, survey quality will be improved. Combination of Lingo, Word2Vec, TF-IDF or MMR can acquire higher survey quality.Research limitations: The manual evaluation method may have a certain subjectivity. We use a simple linear function to combine Word2Vec and Word Net that may not bring out their strengths. The generated surveys may not contain some newly created knowledge of some articles which may concentrate on sentences with no citing.Practical implications: CitationAS tool can automatically generate a comprehensive, detailed and accurate survey according to user’s search terms. It can also help researchers learn about research status in a certain field.Originality/value: Citaiton AS tool is of practicability. It merges cluster labels from semantic level to improve clustering results. The tool also considers sentence location information when calculating sentence score by TF-IDF and MMR.
文摘Purpose:We study the proportion of Web of Science(WoS)citation links that are represented in the Crossref Open Citation Index(COCI),with the possible aim of using COCI in research evaluation instead of the WoS,if the level of coverage was sufficient.Design/methodology/approach:We calculate the proportion on citation links where both publications have a WoS accession number and a DOI simultaneously,and where the cited publications have had at least one author from our institution,the Czech Technical University in Prague.We attempt to look up each such citation link in COCI.Findings:We find that 53.7%of WoS citation links are present in the COCI.The proportion varies largely by discipline.The total figures differ significantly from 40%in the large-scale study by Van Eck,Waltman,Larivière,and Sugimoto(blog 2018,https://www.cwts.nl/blog?article=n-r2s234).Research limitations:The sample does not cover all science areas uniformly;it is heavily focused on Engineering and Technology,and only some disciplines of Natural Sciences are present.However,this reflects the real scientific orientation and publication profile of our institution.Practical implications:The current level of coverage is not sufficient for the WoS to be replaced by COCI for research evaluation.Originality/value:The present study illustrates a COCI vs WoS comparison on the scale of a larger technical university in Central Europe.
文摘Purpose: Communicating scientific results to the public is essential to inspire future researchers and ensure that discoveries are exploited. News stories about research are a key communication pathway for this and have been manually monitored to assess the extent of press coverage of scholarship.Design/methodology/Approach: To make larger scale studies practical, this paper introduces an automatic method to extract citations from newspaper stories to large sets of academic journals. Curated ProQuest queries were used to search for citations to 9,639 Science and3,412 Social Science Web of Science(WoS) journals from eight UK daily newspapers during2006–2015. False matches were automatically filtered out by a new program, with 94% of the remaining stories meaningfully citing research.Findings: Most Science(95%) and Social Science(94%) journals were never cited by these newspapers. Half of the cited Science journals covered medical or health-related topics,whereas 43% of the Social Sciences journals were related to psychiatry or psychology. From the citing news stories, 60% described research extensively and 53% used multiple sources,but few commented on research quality.Research Limitations: The method has only been tested in English and from the ProQuest Newspapers database.Practical implications: Others can use the new method to systematically harvest press coverage of research.Originality/value: An automatic method was introduced and tested to extract citations from newspaper stories to large sets of academic journals.