Journal of Information Resources Management ›› 2015, Vol. 5 ›› Issue (4): 24-31, 46. doi: 10.13365/j.jirm.2015.04.024


Text Classification Algorithms Using the LDA Model: On the Comparison of the Applications on Webpages and eTexts Including Books and Journals

Li Xiangdong, Pan Lian

  • Received: 2014-09-11 Online: 2015-10-26 Published: 2015-10-26

Abstract:

This study examines bibliographic information and other major digital texts (webpages, books, and journals) in information resource management. Based on the LDA model, the paper studies the characteristics of automatic text classification in digital resource management and analyzes the differing effects of three common classification algorithms: KNN, SVM, and Rocchio. The experiments show that the accuracy of all three algorithms is roughly 80%, and that in most cases SVM achieves classification accuracy 0.7% to 22% higher than the other two. These conclusions may provide a basis for choosing an appropriate classification algorithm when the LDA model is used in a digital information classification system.
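As a rough illustration of the comparison described in the abstract, the sketch below classifies documents by their topic distributions using two of the three algorithms named, Rocchio (nearest centroid) and KNN. It is not the authors' implementation: the topic vectors, the class labels, the choice of cosine similarity, and all function names are assumptions made for the example, and the LDA step is replaced by hand-written vectors. SVM is omitted because it would need an external library.

```python
import math
from collections import Counter

# Hypothetical document-topic vectors standing in for LDA output
# (three made-up topics; each vector sums to 1). Labels are illustrative.
TRAIN = [
    ([0.80, 0.10, 0.10], "webpage"),
    ([0.70, 0.20, 0.10], "webpage"),
    ([0.10, 0.80, 0.10], "book"),
    ([0.20, 0.70, 0.10], "book"),
    ([0.10, 0.10, 0.80], "journal"),
    ([0.10, 0.20, 0.70], "journal"),
]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rocchio_fit(samples):
    """Compute one centroid (mean topic vector) per class."""
    sums, counts = {}, Counter()
    for vec, label in samples:
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, value in enumerate(vec):
            acc[i] += value
        counts[label] += 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def rocchio_predict(centroids, vec):
    """Assign the class whose centroid is most similar to the document."""
    return max(centroids, key=lambda label: cosine(centroids[label], vec))


def knn_predict(samples, vec, k=3):
    """Majority vote among the k most similar training documents."""
    nearest = sorted(samples, key=lambda s: cosine(s[0], vec), reverse=True)[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def accuracy(predict, test_set):
    """Fraction of test documents whose predicted label matches the true one."""
    hits = sum(1 for vec, label in test_set if predict(vec) == label)
    return hits / len(test_set)
```

With a held-out list of `(vector, label)` pairs, `accuracy(lambda v: knn_predict(TRAIN, v), held_out)` and the Rocchio equivalent give per-algorithm accuracy figures that can be compared side by side, which is the kind of comparison the abstract reports.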
