Publication Date
2018
Document Type
Thesis
Committee Members
Tanvi Banerjee (Advisor), Michelle Cheatham (Committee Member), Krishnaprasad Thirunarayan (Committee Member)
Degree Name
Master of Science (MS)
Abstract
In recent times, social media platforms like Twitter have become more popular and people have become more interactive and responsive than before. People often react to every news in real-time and within no-time, the information spreads rapidly. Even with viral diseases like Zika, people tend to share their opinions and concerns on social media. This can be leveraged by the health officials to track the disease in real-time thereby reducing the time lag due to traditional surveys. A faster and accurate detection of the disease can allow health officials to understand people's opinion of the disease and take necessary precautions to prevent the misinformation from spreading at a faster pace. The purpose of this study was to analyze the tweets to understand the public opinion on Zika virus. With the help of machine learning and natural language processing, we classify the tweets into four disease characteristics namely, Symptom, Prevention, Transmission, and Treatment. Once the tweets were classified, topic modelling was performed using Latent Dirichlet Allocation (LDA) to generate underlying patterns within each disease characteristics. Such analysis can help to gain a deeper understanding of the content of tweets pertaining to Zika.
Page Count
71
Department or Program
Department of Computer Science and Engineering
Year Degree Awarded
2018
Copyright
Copyright 2018, some rights reserved. My ETD may be copied and distributed only for non-commercial purposes and may not be modified. All use must give me credit as the original author.
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License.
ORCID ID
0000-0003-2111-750X