信息资源管理学报 ›› 2026, Vol. 16 ›› Issue (2): 125-139.doi: 10.13365/j.jirm.2026.02.125

• 研究论文 • 上一篇    下一篇

基于提示学习与多任务学习的学术文献引用意图识别研究

张晓娟1 郭佳润1 王嘉辉1 刘怡均2   

  1. 1.四川大学公共管理学院,成都,610000; 
    2.浙江天猫技术有限公司,杭州,311121
  • 出版日期:2026-03-26 发布日期:2026-06-04
  • 作者简介:张晓娟, 博士, 副教授, 博士生导师, 研究方向为信息检索; 郭佳润, 硕士研究生,研究方向为信息管理方法与技术;王嘉辉, 硕士研究生, 研究方向为信息管理方法与技术; 刘怡均(通讯作者), 产品经理, 研究方向为个性化推荐系统, Email: 645712088@qq.com。
  • 基金资助:
    本文系国家社会科学基金一般项目“ 时间感知的个性化学术文献引文推荐研究”(21BTQ072)的研究成果之一。

Identification of Citation Intent in Academic Literature Based on Prompt Learning with Multi-Task Learning

Zhang Xiaojuan1 Guo Jiarun1 Wang Jiahui1 Liu Yijun2   

  1. 1.School of Public Administration, Sichuan University,Chengdu , 610000; 
    2.Zhejiang Tmall Technology Co.,Ltd. , Hangzhou , 311121
  • Online:2026-03-26 Published:2026-06-04
  • About author:Zhang Xiaojuan, Ph.D, associate professor, doctoral supervisor, research interests including information retrieval; Guo Jiarun, a postgraduate student , research interests including information management methods and technologies; Wang Jiahui, a postgraduate student , research interests including information management methods and technologies; Liu Yijun(corresponding author), product manager, research interests including personalized recommendation system, Email: 645712088@qq.com.
  • Supported by:
    This paper is one of the research outcomes of the General Project of National Social Science Fund of China "Time-Aware Personalized Citation Recommendation for Academic Literature"(21BTQ072).

摘要: 学术文献引文意图的自动识别有助于对学术论文内容的深入理解,促进更公平的新型科研评价的构建。为了提高低资源场景下学术文献引用意图识别准确度与泛化性,本研究提出一种融合提示学习与多任务学习的新框架。首先在训练主任务(引文意图自动识别)的基础上,联合训练引用价值识别和引用章节识别两个辅助任务,这三个任务均基于P-tuning的提示学习框架,通过多层感知机将离散提示词转化为连续提示向量,再将提示模板与输入文本结合为输入序列,使分类任务转化为填空预测任务;其次,通过扩展标签词集并采用加权平均方法将得分最高的标签作为最终预测结果;最后,通过软参数共享机制实现三个任务的协同优化,以实现引文意图的识别。实验结果显示,在ACL-ARC和SciCite两个公开数据集中,本研究模型在不同学习样本量下均显著优于基线及其他提示学习模型,并且两个辅助任务也有效提升了引用意图识别的准确率以及泛化性。

关键词: 引用意图识别, 提示学习, 多任务学习, 文本分类, SciBERT

Abstract: The automatic identification of citation intentions in academic literature contributes to a deeper understanding of academic paper content and facilitates the development of a more equitable framework for scientific research evaluation. To improve the accuracy and generalizability of citation intention recognition in low-resource scenarios, this study proposes a novel framework that integrates prompt learning and multi-task learning, First, the main task of automatic citation intent recognition is jointly trained with two auxiliary tasks: citation value identification and citation section identification. All three tasks are unified within a P-tuning-based prompt learning framework. In this setup, discrete prompt tokens are first transformed into continuous vectors by a multi-layer perceptron. These vectors are then combined with the input text to form the input sequence, effectively converting the classification task into a cloze-style prediction. Furthermore, the label word set is expanded, and a weighted averaging method is applied to select the label with the highest score as the final prediction. Finally, a soft parameter sharing mechanism is employed to achieve collaborative optimization across the three tasks for citation intent recognition. Experimental results on two publicly available datasets, ACL-ARC and SciCite, demonstrate that the proposed model significantly outperforms baseline and other prompt learning models under varying sample sizes. Additionally, the two auxiliary tasks effectively improve the accuracy and generalizability of citation intent recognition.

Key words: Citation intent identification, Prompt learning, Multi-task learning, Text classification, SciBERT

中图分类号: