信息资源管理学报 ›› 2017, Vol. 7 ›› Issue (4): 38-43.doi: 10.13365/j.jirm.2017.04.038

• 专题-医疗健康大数据挖掘研究 • 上一篇    下一篇

基于电子病历数据挖掘的疾病危重度动态预测研究

李季 丁凤一 李翔宇   

  • 收稿日期:2017-06-23 出版日期:2017-10-26 发布日期:2017-10-26
  • 作者简介:李季,男,研究生,研究方向为数据挖掘与信息系统;丁凤一,女,研究生,研究方向为数据挖掘与信息分析;李翔宇(通讯作者),男,研究生,研究方向为数据挖掘,Email:2692865166@qq.com。
  • 基金资助:

    本文系教育部人文社会科学重点研究基地重大项目“大数据资源的挖掘与服务研究——面向医疗健康领域”的研究成果之一。

Study of Dynamic Forecasting Model Based on EMR Data Mining for Disease Severity

Li Ji Ding Fengyi Li Xiangyu   

  • Received:2017-06-23 Online:2017-10-26 Published:2017-10-26

摘要:

本文基于覆盖多病种的电子病历大数据,构建了适用于各种常见严重疾病的危重度动态预测模型。具体依据朴素贝叶斯理论、相关性分析和信息增益法等数据挖掘方法,建立模型框架,通过真实电子病历大数据集MIMIC-Ⅲ上的挖掘实验,筛选疾病危重度预测的主要区分特征,验证模型的动态预测效果。针对本模型的大数据实验证明了其进行多病种危重度动态预测的有效性,并筛选出了对疾病危重度具有高分辨性的38项区分特征,揭示了模型短期预测准确度高的特性。

关键词: 电子病历, 疾病危重度, 动态预测, 特征挖掘, 朴素贝叶斯, 数据挖掘

Abstract:

This article has built a dynamic forecast model suitable for all kinds of common serious diseases on the basis of the electronic medical record (EMR) big data. The dynamic forecast model was built according to the data mining methods, including nave Bayes theory, corrrelation analysis, and information gain. Based on the experiments in the database of EMR also known as MIMIC III, the effect of the dynamic forecast model was tested, and the main distinguishing characteristics of the severity of diease were screened. Data mining experiment results show that the 38 distinguishing characteristics discovered in this research is discernible to severity of dieases. Meanwhile, this dynamic forecast model is effective for forecasting the severity of various common severe dieases, and the short-term forecasting accuracy is especially high.

Key words: Electronic medical record, Severity of disease, Dynamic forecasting, Feature mining, Naive Bayes, Data mining

中图分类号: