Computerized sorting of electronic health care records using text mining

Tanuja, S and Acharya, Dinesh U and Shailesh, K R (2011) Computerized sorting of electronic health care records using text mining. Journal of Pharmaceutical and Biomedical Sciences, 5 (5). pp. 1-5. ISSN 2230-7885

Data mining is widely used in the healthcare domain, in areas such as diagnosis and patient management. One of the main concerns in healthcare is measuring the flow of patients through hospitals and other health care facilities. For instance, if inpatient length of stay (LOS) can be predicted efficiently, the planning and management of hospital resources can be greatly enhanced. Inpatient LOS is frequently used as a proxy for the consumption of hospital resources, so it is essential to develop accurate models for predicting it. In this paper we apply text mining techniques to pre-process the text of electronic discharge summaries available in HTML format. We then apply traditional data mining techniques such as k-means clustering to the extracted textual information, in order to classify discharge summaries by disease and by the patient's length of stay in the hospital.
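The pipeline outlined in the abstract (strip HTML, extract term features, cluster with k-means) can be illustrated with a minimal sketch. This is not the authors' implementation: the sample summaries, the function names (html_to_tokens, vectorize, kmeans), the regular expressions, and the choice of raw term counts as features are all assumptions made for illustration. The initial centroids are fixed by hand so that the result is repeatable, and the sketch groups summaries by disease terms only; it does not model length of stay.

```python
import html
import re
from collections import Counter

TAG_RE = re.compile(r"<[^>]+>")    # assumed pattern: any HTML tag
TOKEN_RE = re.compile(r"[a-z]+")   # assumed pattern: alphabetic tokens only


def html_to_tokens(doc):
    """Strip tags, decode entities, lowercase, and split into tokens."""
    text = html.unescape(TAG_RE.sub(" ", doc))
    return TOKEN_RE.findall(text.lower())


def vectorize(token_lists):
    """Build term-count vectors over a shared, sorted vocabulary."""
    vocab = sorted({t for tokens in token_lists for t in tokens})
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        vectors.append([float(counts[t]) for t in vocab])
    return vocab, vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, init_indices, max_iter=100):
    """Lloyd's algorithm; initial centroids are the vectors at init_indices."""
    centroids = [list(vectors[i]) for i in init_indices]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each vector goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical discharge summaries, invented for this example.
summaries = [
    "<html><body><p>Diagnosis: pneumonia. Cough and fever.</p></body></html>",
    "<html><body><p>Diagnosis: pneumonia. Fever, cough, chest pain.</p></body></html>",
    "<html><body><p>Diagnosis: fracture. Femur fracture after fall.</p></body></html>",
    "<html><body><p>Diagnosis: fracture. Wrist fracture, cast applied.</p></body></html>",
]

tokens = [html_to_tokens(s) for s in summaries]
vocab, vectors = vectorize(tokens)
labels = kmeans(vectors, k=2, init_indices=[0, 2])
print(labels)
```

With these four toy documents the two pneumonia summaries fall into one cluster and the two fracture summaries into the other. On real discharge summaries one would likely also remove stop words, weight terms (for example with TF-IDF), and choose the number of clusters empirically.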

Item Type: Article
Uncontrolled Keywords: Text Mining; Discharge summary; regular expression; feature extraction; pattern recognition; length of stay; K-means
Subjects: Engineering > MIT Manipal > Computer Science and Engineering
Engineering > MIT Manipal > Electrical and Electronics
Date Deposited: 22 Jun 2011 06:41
Last Modified: 22 Jun 2011 06:52