TY - GEN
T1 - Exploiting social relations for sentiment analysis in microblogging
AU - Hu, Xia
AU - Tang, Lei
AU - Tang, Jiliang
AU - Liu, Huan
PY - 2013
Y1 - 2013
N2 - Microblogging, like Twitter and Sina Weibo, has become a popular platform of human expressions, through which users can easily produce content on breaking news, public events, or products. The massive amount of microblogging data is a useful and timely source that carries mass sentiment and opinions on various topics. Existing sentiment analysis approaches often assume that texts are independent and identically distributed (i.i.d.), usually focusing on building a sophisticated feature space to handle noisy and short texts, without taking advantage of the fact that the microblogs are networked data. Inspired by the social sciences findings that sentiment consistency and emotional contagion are observed in social networks, we investigate whether social relations can help sentiment analysis by proposing a Sociological Approach to handling Noisy and short Texts (SANT) for sentiment classification. In particular, we present a mathematical optimization formulation that incorporates the sentiment consistency and emotional contagion theories into the supervised learning process; and utilize sparse learning to tackle noisy texts in microblogging. An empirical study of two real-world Twitter datasets shows the superior performance of our framework in handling noisy and short tweets.
AB - Microblogging, like Twitter and Sina Weibo, has become a popular platform of human expressions, through which users can easily produce content on breaking news, public events, or products. The massive amount of microblogging data is a useful and timely source that carries mass sentiment and opinions on various topics. Existing sentiment analysis approaches often assume that texts are independent and identically distributed (i.i.d.), usually focusing on building a sophisticated feature space to handle noisy and short texts, without taking advantage of the fact that the microblogs are networked data. Inspired by the social sciences findings that sentiment consistency and emotional contagion are observed in social networks, we investigate whether social relations can help sentiment analysis by proposing a Sociological Approach to handling Noisy and short Texts (SANT) for sentiment classification. In particular, we present a mathematical optimization formulation that incorporates the sentiment consistency and emotional contagion theories into the supervised learning process; and utilize sparse learning to tackle noisy texts in microblogging. An empirical study of two real-world Twitter datasets shows the superior performance of our framework in handling noisy and short tweets.
KW - microblogging
KW - noisy text
KW - sentiment analysis
KW - sentiment classification
KW - short text
KW - social context
KW - social correlation
KW - twitter
UR - http://www.scopus.com/inward/record.url?scp=84874258367&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84874258367&partnerID=8YFLogxK
U2 - 10.1145/2433396.2433465
DO - 10.1145/2433396.2433465
M3 - Conference contribution
SN - 9781450318693
T3 - WSDM 2013 - Proceedings of the 6th ACM International Conference on Web Search and Data Mining
SP - 537
EP - 546
BT - WSDM 2013 - Proceedings of the 6th ACM International Conference on Web Search and Data Mining
T2 - 6th ACM International Conference on Web Search and Data Mining, WSDM 2013
Y2 - 4 February 2013 through 8 February 2013
ER -