Journal of Information Resources Management ›› 2025, Vol. 15 ›› Issue (6): 37-51.doi: 10.13365/j.jirm.2025.06.037

Previous Articles     Next Articles

Early Identification of Breakthrough Papers from the Perspective of Innovation Scenario: An Application in the Biomedical Field

Zhang Shuqian1 Huang Shan1 Mao Jin1,2 Li Gang1, 2   

  1. 1.School of Information Management, Wuhan University, Wuhan, 430072; 
    2.Center for Studies of Information Resources, Wuhan University, Wuhan, 430072
  • Online:2025-11-26 Published:2026-01-06
  • About author:Zhang Shuqian, master candidate, research interests including scientific and technological information; Huang Shan, Ph.D. candidate, research interests including science metrics and technological innovation; Mao Jin, Ph.D., associate professor, Ph.D. supervisor, research interests including scientific and technological information and big data analysis; Li Gang(corresponding author), Ph.D., professor, Ph.D. supervisor, research interests including information resource management,Email: imiswhu@aliyun.com.
  • Supported by:
    This is an outcome of the Major Project "Research on the Innovation and Application of Information Analysis Methods Based on the Integration of Number and Intelligence"(22&ZD326)supported by National Social Science Foundation of China.

Abstract: The timely and accurate identification of breakthrough scientific literature is of crucial strategic significance for the efficient allocation of scientific research resources, seizing the initiative in technological development, and enhancing national core competitiveness. However, existing identification methods have limitations such as single-dimensional indicators and insufficient recognition efficiency. From the contextual perspective of knowledge innovation, this study constructs a characteristic system that distinguishes breakthrough papers from ordinary papers from three aspects: knowledge foundation, research team, and academic attention, and proposes a machine learning-based early identification method for breakthrough papers. Experiments in the field of biomedicine show that the F1-score of the model in this study reaches 0.838, which verifies the effectiveness of the method. Among the factors, early-stage impact, number of references, and Price Index have the most significant impact on the model's recognition results. This study enriches and expands the theoretical framework and methodological system for breakthrough paper identification from the novel perspective of innovation context.

Key words: Breakthrough papers;Machine learning models;Early recognition;Biomedical field, Knowledge innovation

CLC Number: