TY - GEN
T1 - Semantic properties of customer sentiment in tweets
AU - Ko, Eun Hee
AU - Klabjan, Diego
PY - 2014
Y1 - 2014
N2 - An increasing number of people are using online social networking services (SNSs), and a significant amount of information related to experiences in consumption is shared in this new media form. Text mining is an emerging technique for mining useful information from the web. We aim at discovering in particular tweets semantic patterns in consumers' discussions on social media. Specifically, the purposes of this study are twofold: 1) finding similarity and dissimilarity between two sets of textual documents that include consumers' sentiment polarities, two forms of positive vs. negative opinions and 2) driving actual content from the textual data that has a semantic trend. The considered tweets include consumers' opinions on US retail companies (e.g., Amazon, Walmart). Cosine similarity and K-means clustering methods are used to achieve the former goal, and Latent Dirichlet Allocation (LDA), a popular topic modeling algorithm, is used for the latter purpose. This is the first study which discover semantic properties of textual data in consumption context beyond sentiment analysis. In addition to major findings, we apply LDA (Latent Dirichlet Allocations) to the same data and drew latent topics that represent consumers' positive opinions and negative opinions on social media.
AB - An increasing number of people are using online social networking services (SNSs), and a significant amount of information related to experiences in consumption is shared in this new media form. Text mining is an emerging technique for mining useful information from the web. We aim at discovering in particular tweets semantic patterns in consumers' discussions on social media. Specifically, the purposes of this study are twofold: 1) finding similarity and dissimilarity between two sets of textual documents that include consumers' sentiment polarities, two forms of positive vs. negative opinions and 2) driving actual content from the textual data that has a semantic trend. The considered tweets include consumers' opinions on US retail companies (e.g., Amazon, Walmart). Cosine similarity and K-means clustering methods are used to achieve the former goal, and Latent Dirichlet Allocation (LDA), a popular topic modeling algorithm, is used for the latter purpose. This is the first study which discover semantic properties of textual data in consumption context beyond sentiment analysis. In addition to major findings, we apply LDA (Latent Dirichlet Allocations) to the same data and drew latent topics that represent consumers' positive opinions and negative opinions on social media.
KW - text analytics; tweet analysis; document similarity; clustering; topic modeling; part-of-speech tagging
UR - http://www.scopus.com/inward/record.url?scp=84904480031&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84904480031&partnerID=8YFLogxK
U2 - 10.1109/WAINA.2014.151
DO - 10.1109/WAINA.2014.151
M3 - Conference contribution
AN - SCOPUS:84904480031
SN - 9781479926527
T3 - Proceedings - 2014 IEEE 28th International Conference on Advanced Information Networking and Applications Workshops, IEEE WAINA 2014
SP - 657
EP - 663
BT - Proceedings - 2014 IEEE 28th International Conference on Advanced Information Networking and Applications Workshops, IEEE WAINA 2014
PB - IEEE Computer Society
T2 - 28th IEEE International Conference on Advanced Information Networking and Applications Workshops, IEEE WAINA 2014
Y2 - 13 May 2014 through 16 May 2014
ER -