Journal of Information Resources Management ›› 2024, Vol. 14 ›› Issue (6): 131-142.doi: 10.13365/j.jirm.2024.06.131

Previous Articles     Next Articles

Linking Allusion Words in Ancient Poetry from the Perspective of Knowledge Reorganization: A Method of Integrating the Fine-grained Co-citation Relationships and the Semantic Features

Li Xiaomin1,2 Wang Hao1,2 Bu Wenru1,2 Zhou Shu1,2   

  1. 1.School of Information Management, Nanjing University, Nanjing, 210023; 
    2.Key Laboratory of Data Engineering and Knowledge Services in Jiangsu Provincial Universities(Nanjing University), Nanjing, 210093
  • Online:2024-11-26 Published:2024-12-20
  • About author:Li Xiaomin, Ph.D. candidate, with research interests in text mining and knowledge organization; Wang Hao(corresponding author), Ph.D., professor, with research interests in natural language processing, data mining applications and ontology learning, Email: ywhaowang@nju.edu.cn; Bu Wenru, postgraduate, with research interests in data analysis and mining; Zhou Shu, Ph.D. candidate, with research interests in natural language processing.
  • Supported by:
    This article is an outcome of the project "Research on Semantic Parsing and Humanities Computing of Chinese Intangible Cultural Heritage Text Driven by Linked Data"(72074108) supported by National Natural Science Foundation of China and the project "The Semantic Analysis and Knowledge Graph Research of Local Chronicle Text Oriented to Humanistic Computing"(010814370113) supported by the Fundamental Research Funds for the Central Universities.

Abstract: Guided by theories and technologies related to knowledge reorganization, this study conducts semantic mining and organization of allusion cultural resources to promote the inheritance and utilization of allusion culture. A model is proposed that integrates fine-grained co-reference relations and semantic features to link allusion terms. First, a co-reference network is constructed based on the citation relationships between ancient poems and allusion terms, and fine-grained co-reference relations, including positional co-reference and emotional co-reference, are added to build a fine-grained co-reference network. Then, Doc2vec is employed to extract the semantic features of each allusion term, and these features are integrated to reconstruct the co-reference network. Finally, a link prediction algorithm is applied to traverse the fine-grained co-reference network, achieving semantic association and organization of allusion terms. The association results are further analyzed from a path-based perspective, uncovering some regular patterns in domain knowledge. The constructed co-reference network consists of 5,869 nodes and 27,032 edges. The proposed method, which incorporates positional and emotional co-references as well as semantic features, achieves an accuracy of 0.963 in the task of linking allusion terms. Moreover, the analysis reveals that the shortest path order is negatively correlated with both the number of allusion term pairs and their similarity.

Key words: Knowledge reorganization, Allusion words, Co-citation network, Semantic features, Link prediction

CLC Number: