Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Using Natural Language Processing to Explore "Dry January" Posts on Twitter: Longitudinal Infodemiology Study.

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Additional Information
    • Source:
      Publisher: JMIR Publications Country of Publication: Canada NLM ID: 100959882 Publication Model: Electronic Cited Medium: Internet ISSN: 1438-8871 (Electronic) Linking ISSN: 14388871 NLM ISO Abbreviation: J Med Internet Res Subsets: MEDLINE
    • Publication Information:
      Publication: <2011- > : Toronto : JMIR Publications
      Original Publication: [Pittsburgh, PA? : s.n., 1999-
    • Subject Terms:
    • Abstract:
      Background: Dry January, a temporary alcohol abstinence campaign, encourages individuals to reflect on their relationship with alcohol by temporarily abstaining from consumption during the month of January. Though Dry January has become a global phenomenon, there has been limited investigation into Dry January participants' experiences. One means through which to gain insights into individuals' Dry January-related experiences is by leveraging large-scale social media data (eg, Twitter chatter) to explore and characterize public discourse concerning Dry January.
      Objective: We sought to answer the following questions: (1) What themes are present within a corpus of tweets about Dry January, and is there consistency in the language used to discuss Dry January across multiple years of tweets (2020-2022)? (2) Do unique themes or patterns emerge in Dry January 2021 tweets after the onset of the COVID-19 pandemic? and (3) What is the association with tweet composition (ie, sentiment and human-authored vs bot-authored) and engagement with Dry January tweets?
      Methods: We applied natural language processing techniques to a large sample of tweets (n=222,917) containing the term "dry january" or "dryjanuary" posted from December 15 to February 15 across three separate years of participation (2020-2022). Term frequency inverse document frequency, k-means clustering, and principal component analysis were used for data visualization to identify the optimal number of clusters per year. Once data were visualized, we ran interpretation models to afford within-year (or within-cluster) comparisons. Latent Dirichlet allocation topic modeling was used to examine content within each cluster per given year. Valence Aware Dictionary and Sentiment Reasoner sentiment analysis was used to examine affect per cluster per year. The Botometer automated account check was used to determine average bot score per cluster per year. Last, to assess user engagement with Dry January content, we took the average number of likes and retweets per cluster and ran correlations with other outcome variables of interest.
      Results: We observed several similar topics per year (eg, Dry January resources, Dry January health benefits, updates related to Dry January progress), suggesting relative consistency in Dry January content over time. Although there was overlap in themes across multiple years of tweets, unique themes related to individuals' experiences with alcohol during the midst of the COVID-19 global pandemic were detected in the corpus of tweets from 2021. Also, tweet composition was associated with engagement, including number of likes, retweets, and quote-tweets per post. Bot-dominant clusters had fewer likes, retweets, or quote tweets compared with human-authored clusters.
      Conclusions: The findings underscore the utility for using large-scale social media, such as discussions on Twitter, to study drinking reduction attempts and to monitor the ongoing dynamic needs of persons contemplating, preparing for, or actively pursuing attempts to quit or cut down on their drinking.
      (©Alex M Russell, Danny Valdez, Shawn C Chiang, Ben N Montemayor, Adam E Barry, Hsien-Chang Lin, Philip M Massey. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 18.11.2022.)
    • References:
      J Med Internet Res. 2009 Mar 27;11(1):e11. (PMID: 19329408)
      J Sch Health. 2020 Feb;90(2):135-142. (PMID: 31828791)
      J Health Commun. 2020 Feb 1;25(2):150-158. (PMID: 31986999)
      Drug Alcohol Depend. 2021 Oct 1;227:108938. (PMID: 34392050)
      Harm Reduct J. 2022 Mar 4;19(1):24. (PMID: 35246148)
      Alcohol Clin Exp Res. 2019 Aug;43(8):1769-1776. (PMID: 31373703)
      Alcohol Alcohol. 2020 Jun 25;55(4):433-438. (PMID: 32391879)
      Addiction. 2022 Feb;117(2):331-340. (PMID: 34159674)
      Philos Trans A Math Phys Eng Sci. 2016 Apr 13;374(2065):20150202. (PMID: 26953178)
      Drug Alcohol Depend. 2021 May 1;222:108672. (PMID: 33757708)
      J Comput Soc Sci. 2022;5(2):1511-1528. (PMID: 36035522)
      Lancet Digit Health. 2021 Feb;3(2):e72-e75. (PMID: 33509386)
      Health Promot Pract. 2021 May;22(3):309-312. (PMID: 33759597)
      Am J Public Health. 2020 Mar;110(3):357-362. (PMID: 31855475)
      J Med Internet Res. 2018 Nov 19;20(11):e11669. (PMID: 30455162)
      Addict Behav. 2020 Nov;110:106532. (PMID: 32652385)
      J Ment Health. 2020 Apr;29(2):191-199. (PMID: 31694433)
      Drug Alcohol Depend. 2019 Nov 1;204:107500. (PMID: 31499242)
      Tob Control. 2017 Mar;26(2):226-229. (PMID: 26956467)
      J Stud Alcohol Drugs. 2021 Sep;82(5):615-622. (PMID: 34546908)
      JMIR Public Health Surveill. 2021 Nov 17;7(11):e30642. (PMID: 34653016)
      J Stud Alcohol Drugs. 2015 Jul;76(4):635-43. (PMID: 26098041)
      Tob Control. 2017 Dec;26(e2):e112-e116. (PMID: 28341768)
      Psychol Health. 2020 Nov;35(11):1293-1305. (PMID: 32216557)
      J Med Internet Res. 2014 Oct 16;16(10):e238. (PMID: 25320863)
      Addict Behav. 2017 Feb;65:289-295. (PMID: 27568339)
      Drug Alcohol Rev. 2022 Mar;41(3):546-549. (PMID: 34370881)
      Am J Prev Med. 2020 Jan;58(1):79-88. (PMID: 31806270)
      J Med Internet Res. 2020 Apr 28;22(4):e16206. (PMID: 32310818)
      Am J Prev Med. 2011 May;40(5 Suppl 2):S154-8. (PMID: 21521589)
      PLoS One. 2021 Jul 8;16(7):e0254114. (PMID: 34237087)
      Nicotine Tob Res. 2022 Jul 13;24(8):1193-1200. (PMID: 34562100)
      J Med Internet Res. 2020 Dec 14;22(12):e21418. (PMID: 33284783)
      Stat Med. 2013 Feb 20;32(4):556-77. (PMID: 22711416)
      Health Psychol. 2016 Mar;35(3):281-9. (PMID: 26690637)
      Addict Behav Rep. 2020 May 19;12:100286. (PMID: 32637562)
      Eur J Public Health. 2017 Oct 1;27(5):929-931. (PMID: 28957493)
      J Med Internet Res. 2016 Dec 05;18(12):e318. (PMID: 27919863)
      Drug Alcohol Rev. 2021 Feb;40(2):192-195. (PMID: 33020967)
    • Grant Information:
      K01 AA030614 United States AA NIAAA NIH HHS; R01 CA229324 United States CA NCI NIH HHS; R01 DA049154 United States DA NIDA NIH HHS
    • Contributed Indexing:
      Keywords: Dry January; Twitter; alcohol; drinking; infodemiology; infoveillance; natural language processing; social media
    • Accession Number:
      3K9958V90M (Ethanol)
    • Publication Date:
      Date Created: 20221107 Date Completed: 20221122 Latest Revision: 20240214
    • Publication Date:
      20240214
    • Accession Number:
      PMC9719059
    • Accession Number:
      10.2196/40160
    • Accession Number:
      36343184