Document Type
Article
Publication Date
1-30-2012
Abstract
This paper presents our solution for the i2b2 sentiment classification challenge. Our hybrid system consists of machine learning and rule-based classifiers. For the machine learning classifier, we investigate a variety of lexical, syntactic and knowledge-based features, and show how much these features contribute to the performance of the classifier through experiments. For the rule-based classifier, we propose an algorithm to automatically extract effective syntactic and lexical patterns from training examples. The experimental results show that the rule-based classifier outperforms the baseline machine learning classifier using unigram features. By combining the machine learning classifier and the rule-based classifier, the hybrid system gains a better trade-off between precision and recall, and yields the highest micro-averaged F-measure (0.5038), which is better than the mean (0.4875) and median (0.5027) micro-average F-measures among all participating teams.
Repository Citation
Wang, W.,
Chen, L.,
Tan, M.,
Wang, S.,
& Sheth, A. P.
(2012). Discovering Fine-Grained Sentiment in Suicide Notes. Biomedical Informatics Insights, 5, 137-145.
https://corescholar.libraries.wright.edu/knoesis/607
DOI
10.4137/BII.S8963
Included in
Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, Science and Technology Studies Commons