Medical data mining in sentiment analysis based on optimized swarm search feature selection
Zeng, Daohui1; Peng, Jidong2; Fong, Simon3; Qiu, Yining4; Wong, Raymond4
AbstractIn this paper, we propose a novel technique termed as optimized swarm search-based feature selection (OS-FS), which is a swarm-type of searching function that selects an ideal subset of features for enhanced classification accuracy. In terms of gaining insights from unstructured medical based texts, sentiment prediction is becoming an increasingly crucial machine learning technique. In fact, due to its robustness and accuracy, it recently gained popularity in the medical industries. Medical text mining is well known as a fundamental data analytic for sentiment prediction. To form a high-dimensional sparse matrix, a popular preprocessing step in text mining is employed to transform medical text strings to word vectors. However, such a sparse matrix poses problems to the induction of accurate sentiment prediction model. The swarm search in our proposed OS-FS can be optimized by a new feature evaluation technique called clustering-by-coefficient-of-variation. In order to find a subset of features from all the original features from the sparse matrix, this type of feature selection has been a commonly utilized dimensionality reduction technique, and has the capability to improve accuracy of the prediction model. We implement this method based on a case scenario where 279 medical articles related to meaningful use functionalities on health care quality, safety, and efficiency' from a systematic review of previous medical IT literature. For this medical text mining, a multi-class of sentiments, positive, mixed-positive, neutral and negative is recognized from the document contents. Our experimental results demonstrate the superiority of OS-FS over traditional feature selection methods in literature.
KeywordMedical text mining Optimized swarm search-based feature selection Sentiment prediction Clustering-by-coefficient-of-variation
URLView the original
Indexed BySCI
WOS Research AreaEngineering
WOS SubjectEngineering, Biomedical
WOS IDWOS:000451676000029
Fulltext Access
Citation statistics
Cited Times [WOS]:1   [WOS Record]     [Related Records in WOS]
Document TypeJournal article
CollectionUniversity of Macau
Affiliation1.Guangzhou Univ TCM, Affiliated Hosp 1, Guangzhou, Guangdong, Peoples R China;
2.Ganzhou Peoples Hosp, Ganzhou, Jiangxi, Peoples R China;
3.Univ Macau, Dept Comp & Informat Sci, Taipa, Macau, Peoples R China;
4.Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
Recommended Citation
GB/T 7714
Zeng, Daohui,Peng, Jidong,Fong, Simon,et al. Medical data mining in sentiment analysis based on optimized swarm search feature selection[J]. AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE,2018,41(4):1087-1100.
APA Zeng, Daohui,Peng, Jidong,Fong, Simon,Qiu, Yining,&Wong, Raymond.(2018).Medical data mining in sentiment analysis based on optimized swarm search feature selection.AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE,41(4),1087-1100.
MLA Zeng, Daohui,et al."Medical data mining in sentiment analysis based on optimized swarm search feature selection".AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE 41.4(2018):1087-1100.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Zeng, Daohui]'s Articles
[Peng, Jidong]'s Articles
[Fong, Simon]'s Articles
Baidu academic
Similar articles in Baidu academic
[Zeng, Daohui]'s Articles
[Peng, Jidong]'s Articles
[Fong, Simon]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zeng, Daohui]'s Articles
[Peng, Jidong]'s Articles
[Fong, Simon]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.