Journal of Information Resources Management ›› 2026, Vol. 16 ›› Issue (2): 55-68.doi: 10.13365/j.jirm.2026.02.055

Previous Articles     Next Articles

Constructing High-Quality Enterprise Data Sets: Connotations, Characteristics, Logical Framework and Future Prospects

Zhang Borui1 Chen Tao2 Hu Jie2 Xia Yikun1,3   

  1. 1.Research Institute for Data Management & Innovation, Nanjing University, Suzhou, 215163; 
    2.Nanjing University-China Mobile Joint Research Institute, Nanjing, 210029; 
    3. Laboratory of Data Intelligence and Interdisciplinary Innovation, Nanjing University, Nanjing, 210046
  • Online:2026-03-26 Published:2026-06-04
  • Supported by:
    This paper is one of the research outcomes of the project of Nanjing University-China Mobile Joint Research Institute "Research on the Layout and Path of Data Elements of Jiangsu Mobile"(NJ20250045).

Abstract: Analyzing the connotation, characteristics and logical framework of enterprise high-quality dataset construction is essential to unlocking data value and enabling firms' digital-intelligent transformation. This paper conceptualises enterprise high-quality datasets, identifies six core attributes, establishes a three-dimensional framework (value, scenario, technology) based on socio-technical systems theory to provide theoretical support for the construction of high-quality corporate datasets under the "AI Plus" background; subsequently, from the analytical perspectives of the data value chain and data life cycle theories, it constructs a five-dimensional model encompassing scenario demand perception, data resource weaving, knowledge resource extraction, data-knowledge integration and refinement, and a digital-intelligence service ecosystem; finally, in response to the practical dilemmas currently faced by enterprises, the study proposes targeted optimization strategies, suggesting that future efforts should focus synergistically on data scenario shaping, data aggregation and governance, data labeling optimization, and data security assurance to overcome the bottlenecks of high-quality dataset construction, enhance the value-enabling efficiency of data elements, and provide robust support for the digital-intelligence transformation and high-quality development of enterprises.

Key words: Artificial intelligence plus, Enterprise data, High-quality datasets, Data elements, Data governance

CLC Number: