File(s) under permanent embargo
Discriminative cues for different stages of smoking cessation in online community
conference contribution
posted on 2016-01-01, 00:00 authored by Thin NguyenThin Nguyen, R Borland, John YearwoodJohn Yearwood, Hua YongHua Yong, Svetha VenkateshSvetha Venkatesh, Quoc-Dinh PhungSmoking is one of the leading causes of preventable death, being responsible for about six million deaths annually worldwide. Most smokers want to quit, but many find quitting difficult. The Internet enables people interested in quitting smoking to connect with others via online communities; however, the characteristics of these discussions are not well understood. This work aims to explore the textual cues of an online community interested in quitting smoking: www.reddit.com/r/ stopsmoking – “a place for redditors to motivate each other to quit smoking”. A total of approximately 5, 000 posts were randomly selected from the community. Four subgroups of posts based on the cessation days of abstainers were defined: S0: within the first week, S1: within the first month (excluding cohort S0), S2: from second month to one year, and S3: beyond one year. Psycho-linguistic features and content topics were extracted from the posts and analysed. Machine learning techniques were used to discriminate the online conversations in the first week S0 from the other subgroups. Topics and psycho-linguistic features were found to be highly valid predictors of the subgroups, possibly providing an important step in understanding social media and its use in studies of smoking and other addictions in online settings.