Author: "Lyle Ungar" / Language: undetermined - Searchworks@Jio Institute Digital Library Search Results

1. Characterizing empathy and compassion using computational linguistic analysis

Author: David B. Yaden, Salvatore Giorgi, Matthew Jordan, Anneke Buffone, Johannes C. Eichstaedt, H. Andrew Schwartz, Lyle Ungar, and Paul Bloom
Subjects: General Psychology
Published: 2023

2. AI-based analysis of social media language predicts addiction treatment dropout at 90 days

Author: Brenda Curtis, Salvatore Giorgi, Lyle Ungar, Huy Vu, David Yaden, Tingting Liu, Kenna Yadeta, and H. Andrew Schwartz
Subjects: Pharmacology, Psychiatry and Mental health
Published: 2023

3. Emma: An Email Multimodal Architecture with Social Influence Factors for Email Reply Prediction

Author: Harsh Shah, Kokil Jaidka, Lyle Ungar, Jesse Fagan, and Travis Grosser
Published: 2023

4. Getting 'clean' from nonsuicidal self-injury: Experiences of addiction on the subreddit r/selfharm

Author: McKenzie Himelein-Wachowiak, Salvatore Giorgi, Amy Kwarteng, Destiny Schriefer, Chase Smitterberg, Kenna Yadeta, Elise Bragard, Amanda Devoto, Lyle Ungar, and Brenda Curtis
Subjects: Behavior, Addictive, Psychiatry and Mental health, Clinical Psychology, Substance-Related Disorders, mental disorders, Humans, Medicine (miscellaneous), General Medicine, Self-Injurious Behavior, Suicidal Ideation
Abstract: Background & AimsPrevious studies have shown that nonsuicidal self-injury (NSSI) has addictive features, and an addiction model of NSSI has been considered. Addictive features have been associated with severity of NSSI and adverse psychological experiences. Yet, there is debate over the extent to which NSSI and substance use disorders (SUDs) are similar experientially.MethodsTo evaluate the extent that people who self-injure experience NSSI like an addiction, we coded the posts of users of the subreddit r/selfharm (n= 500) for each of 11 DSM-5 SUD criteria adapted to NSSI.ResultsA majority (76.8%) of users endorsed at least two adapted SUD criteria in their posts, indicative of mild, moderate, or severe addiction. The most frequently endorsed criteria were urges or cravings (67.6%), escalating severity or tolerance (46.7%), and NSSI that is particularly hazardous. User-level addictive features positively predicted number of methods used for NSSI, number of psychiatric disorders, and particularly hazardous NSSI, but not suicidality. We also observed frequent use of language and concepts common in SUD recovery circles like Alcoholics Anonymous.Discussion & ConclusionOur findings support previous work describing the addiction potential of NSSI and associating addictive features with clinical severity. These results suggest that NSSI and SUD may share experiential similarities, which has implications for the treatment of NSSI. We also contribute to a growing body of work that uses social media as a window into the subjective experiences of stigmatized populations.
Published: 2022

5. Effect of Integrating Patient-Generated Digital Data Into Mental Health Therapy: A Randomized Controlled Trial

Author: Raina M. Merchant, Lauren Southwick, Rinad S. Beidas, David S. Mandell, Sharath Chandra Guntuku, Art Pelullo, Lin Yang, Nandita Mitra, Brenda Curtis, Lyle Ungar, and David A. Asch
Subjects: Psychiatry and Mental health
Abstract: The authors sought to determine whether providing summaries of patients' social media and other digital data to patients and their clinicians improves patients' health-related quality of life (HRQoL) measured by the RAND 36-Item Short Form Health Survey (SF-36).The authors randomly assigned 115 adults receiving outpatient mental health therapy to usual care or to periodic sharing of summaries of their digital data with their clinician providing psychosocial therapy. The study was conducted October 2020-December 2021.Patients' mean±SD age was 31.3±10.5 years, and 82% were women. At 60 days after enrollment, no statistically significant change was detected in SF-36 scores for patients randomly allocated to the intervention (mean difference=-0.39, 95% CI=-4.17, 3.39) or to usual care (mean difference=-1.98, 95% CI=-5.74, 1.77), and no significant between-arm difference was observed (between-arm difference=1.60, 95% CI=-3.67, 6.86).Collecting and summarizing digital data for use in mental health treatment was feasible for patients but did not significantly improve their HRQoL or other measures of mental health.
Published: 2022

6. Analyzing Personality through Social Media Profile Picture Choice

Author: Leqi Liu, Daniel Preotiuc-Pietro, Zahra Riahi Samani, Mohsen E. Moghaddam, and Lyle Ungar
Abstract: The content of images users post to their social media is driven in part by personality. In this study, we analyze how Twitter profile images vary with the personality of the users posting them. In our main analysis, we use profile images from over 66,000 users whose personality we estimate based on their tweets. To facilitate interpretability, we focus our analysis on aesthetic and facial features and control for demographic variation in image features and personality. Our results show significant differences in profile picture choice between personality traits, and that these can be harnessed to predict personality traits with robust accuracy. For example, agreeable and conscientious users display more positive emotions in their profile pictures, while users high in openness prefer more aesthetic photos.
Published: 2021

7. Characterizing Geographic Variation in Well-Being Using Tweets

Author: Hansen Schwartz, Johannes Eichstaedt, Margaret Kern, Lukasz Dziurzynski, Richard Lucas, Megha Agrawal, Gregory Park, Shrinidhi Lakshmikanth, Sneha Jha, Martin Seligman, and Lyle Ungar
Abstract: The language used in tweets from 1,300 different US counties was found to be predictive of the subjective well-being of people living in those counties as measured by representative surveys. Topics, sets of co-occurring words derived from the tweets using LDA, improved accuracy in predicting life satisfaction over and above standard demographic and socio-economic controls (age, gender, ethnicity, income, and education). The LDA topics provide a greater behavioural and conceptual resolution into life satisfaction than the broad socio-economic and demographic variables. For example, tied in with the psychological literature, words relating to outdoor activities, spiritual meaning, exercise, and good jobs correlate with increased life satisfaction, while words signifying disengagement like ’bored’ and ’tired’ show a negative association.
Published: 2021

8. CHANGE IN SOCIAL INTERACTION AND MENTAL HEALTH AMONG OLDER AMERICANS DURING COVID-19 PANDEMIC

Author: Krystyna Keller, Jielu Lin, Melissa Zajdel, Fiona Gilpin Macfoy, Philip Shaw, Brenda Curtis, Lyle Ungar, and Laura Koehly
Subjects: Health (social science), Life-span and Life-course Studies, Health Professions (miscellaneous)
Abstract: Recent research has shown the mental health consequence of social distancing during the COVID-19 pandemic, but longitudinal data are relatively scarce. It is unclear whether the pattern of isolation and elevated stress seen at the beginning of the pandemic persists over time. This study evaluates change in social interaction over six months and its mental health impact among older adults. We drew data from a panel study with six repeated assessments of social interaction and mental health conducted monthly May through October 2020. The sample included a total of 380 White, Black and Hispanic participants aged 50 and over, of whom 33% had low income, who residing in fourteen U.S. states with active stay-at-home orders in May 2020. The analysis examined how change in living arrangement, in-person interaction outside the household, quality of relationship with family and friends, and perceived social support affected trajectories of isolation stress, COVID worry and sadness. While their living arrangements and relationship quality remained stable, older adults experienced fluctuations in perceived social support and increases in in-person conversations outside the household. Living with a spouse/partner stabilized isolation stress and COVID worry over time. Individuals with better relationship quality with friends became happier over time. Changes in social support were associated with greater fluctuations in isolation stress and COVID worry. During the pandemic, social interactions are protective and lack of stability in feeling supported makes older adults vulnerable to stress. Efforts should focus on (re)building and maintaining companionship and support to mitigate the pandemic’s negative impact.
Published: 2022

9. Mining for Equitable Health: Assessing the Impact of Missing Data in Electronic Health Records

Author: Emily Getzen, Lyle Ungar, Danielle Mowery, Xiaoqian Jiang, and Qi Long
Subjects: Health Informatics, Computer Science Applications
Abstract: Electronic health records (EHR) are collected as a routine part of healthcare delivery, and have great potential to be utilized to improve patient health outcomes. They contain multiple years of health information to be leveraged for risk prediction, disease detection, and treatment evaluation. However, they do not have a consistent, standardized format across institutions, particularly in the United States, and can present significant analytical challenges- they contain multi-scale data from heterogeneous domains and include both structured and unstructured data. Data for individual patients are collected at irregular time intervals and with varying frequencies. In addition to the analytical challenges, EHR can reflect inequity- patients belonging to different groups will have differing amounts of data in their health records. Many of these issues can contribute to biased data collection. The consequence is that the data for under-served groups may be less informative partly due to more fragmented care, which can be viewed as a type of missing data problem. For EHR data in this complex form, there is currently no framework for introducing realistic missing values. There has also been little to no work in assessing the impact of missing data in EHR. In this work, we first introduce a terminology to define three levels of EHR data and then propose a novel framework for simulating realistic missing data scenarios in EHR to adequately assess their impact on predictive modeling. We incorporate the use of a medical knowledge graph to capture dependencies between medical events to create a more realistic missing data framework. In an intensive care unit setting, we found that missing data have greater negative impact on the performance of disease prediction models in groups that tend to have less access to healthcare, or seek less healthcare. We also found that the impact of missing data on disease prediction models is stronger when using the knowledge graph framework to introduce realistic missing values as opposed to random event removal.
Published: 2022

10. Reddit language indicates changes associated with diet, physical activity, substance use, and smoking during COVID-19

Author: Karan Wanchoo, Matthew Abrams, Raina M. Merchant, Lyle Ungar, and Sharath Chandra Guntuku
Subjects: Multidisciplinary
Abstract: COVID-19 has adversely impacted the health behaviors of billions of people across the globe, modifying their former trends in health and lifestyle. In this paper, we compare the psychosocial language markers associated with diet, physical activity, substance use, and smoking before and after the onset of COVID-19 pandemic. We leverage the popular social media platform Reddit to analyze 1 million posts between January 6, 2019, to January 5, 2021, from 22 different communities (i.e., subreddits) that belong to four broader groups—diet, physical activity, substance use, and smoking. We identified that before the COVID-19 pandemic, posts involved sharing information about vacation, international travel, work, family, consumption of illicit substances, vaping, and alcohol, whereas during the pandemic, posts contained emotional content associated with quarantine, withdrawal symptoms, anxiety, attempts to quit smoking, cravings, weight loss, and physical fitness. Prevalent topic analysis showed that the pandemic was associated with discussions about nutrition, physical fitness, and outdoor activities such as backpacking and biking, suggesting users’ focus shifted toward their physical health during the pandemic. Starting from the week of March 23, 2020, when several stay-at-home policies were enacted, users wrote more about coping with stress and anxiety, alcohol misuse and abuse, and harm-reduction strategies like switching from hard liquor to beer/wine after people were socially isolated. In addition, posts related to use of substances such as benzodiazepines (valium, xanax, clonazepam), nootropics (kratom, phenibut), and opioids peaked around March 23, 2020, followed by a decline. Of note, unlike the general decline observed, the volume of posts related to alternatives to heroin (e.g., fentanyl) increased during the COVID-19 pandemic. Posts about quitting smoking gained momentum after late March 2020, and there was a sharp decline in posts about craving to smoke. This study highlights the significance of studying social media discussions on platforms like Reddit which are a rich ecological source of human experiences and provide insights to inform targeted messaging and mitigation strategies, and further complement ongoing traditional primary data collection methods.
Published: 2023

11. Mining for Health: A Comparison of Word Embedding Methods for Analysis of EHRs Data

Author: Emily Getzen, Yucheng Ruan, Lyle Ungar, and Qi Long
Abstract: Electronic health records (EHRs), routinely collected as part of healthcare delivery, offer great promise for advancing precision health. At the same time, they present significant analytical challenges. In EHRs, data for individual patients are collected at irregular time intervals and with varying frequencies; they include both structured and unstructured data. Advanced statistical and machine learning methods have been developed to tackle these challenges, for example, for predicting diagnoses earlier and more accurately. One powerful tool for extracting useful information from EHRs data is word embedding algorithms, which represent words as vectors of real numbers that capture the words’ semantic and syntactic similarities. Learning embeddings can be viewed as automated feature engineering, producing features that can be used for predictive modeling of medical events. Methods such as Word2Vec, BERT, FastText, ELMo, and GloVe have been developed for word embedding, but there has been little work on re-purposing these algorithms for the analysis of structured medical data. Our work seeks to fill this important gap. We extended word embedding methods to embed (structured) medical codes from a patient’s entire medical history, and used the resultant embeddings to build prediction models for diseases. We assessed the performance of multiple embedding methods in terms of predictive accuracy and computation time using the Medical Information Mart for Intensive Care (MIMIC) database. We found that using Word2Vec, Fast-Text, and GloVe algorithms yield comparable models, while more recent contextual embeddings provide marginal further improvement. Our results provide insights and guidance to practitioners regarding the use of word embedding methods for the analysis of EHR data.
Published: 2022

12. Life under stay-at-home orders: a panel study of change in social interaction and emotional wellbeing among older Americans during COVID-19 pandemic

Author: Jielu Lin, Melissa Zajdel, Krystyna R. Keller, Fiona O. Gilpin Macfoy, Philip Shaw, Brenda Curtis, Lyle Ungar, and Laura Koehly
Subjects: Emotions, Public Health, Environmental and Occupational Health, Social Interaction, COVID-19, Humans, Social Support, Middle Aged, Pandemics, United States, Aged
Abstract: Background Recent research has shown the mental health consequence of social distancing during the COVID-19 pandemic, but longitudinal data are relatively scarce. It is unclear whether the pattern of isolation and elevated stress seen at the beginning of the pandemic persists over time. This study evaluates change in social interaction over six months and its impact on emotional wellbeing among older adults. Methods We drew data from a panel study with six repeated assessments of social interaction and emotional wellbeing conducted monthly May through October 2020. The sample included a total of 380 White, Black and Hispanic participants aged 50 and over, of whom 33% had low income, who residing in fourteen U.S. states with active stay-at-home orders in May 2020. The analysis examined how change in living arrangement, in-person interaction outside the household, quality of relationship with family and friends, and perceived social support affected trajectories of isolation stress, COVID worry and sadness. Results While their living arrangements (Odds Ratio [OR] = 0.95, 95% Confidence Interval [CI] = 0.87, 1.03) and relationship quality (OR = 0.94, 95% CI = 0.82, 1.01) remained stable, older adults experienced fluctuations in perceived social support (linear Slope b = -1.42, s.e. = 0.16, p p p p p p p p Conclusions During the pandemic, social interactions are protective and lack of stability in feeling supported makes older adults vulnerable to stress. Efforts should focus on (re)building and maintaining companionship and support to mitigate the pandemic’s negative impact.
Published: 2022

13. A 680,000-person megastudy of nudges to encourage vaccination in pharmacies

Author: Katherine L. Milkman, Linnea Gandhi, Mitesh S. Patel, Heather N. Graci, Dena M. Gromet, Hung Ho, Joseph S. Kay, Timothy W. Lee, Jake Rothschild, Jonathan E. Bogard, Ilana Brody, Christopher F. Chabris, Edward Chang, Gretchen B. Chapman, Jennifer E. Dannals, Noah J. Goldstein, Amir Goren, Hal Hershfield, Alex Hirsch, Jillian Hmurovic, Samantha Horn, Dean S. Karlan, Ariella S. Kristal, Cait Lamberton, Michelle N. Meyer, Allison H. Oakes, Maurice E. Schweitzer, Maheen Shermohammed, Joachim Talloen, Caleb Warren, Ashley Whillans, Kuldeep N. Yadav, Julian J. Zlatev, Ron Berman, Chalanda N. Evans, Rahul Ladhania, Jens Ludwig, Nina Mazar, Sendhil Mullainathan, Christopher K. Snider, Jann Spiess, Eli Tsukayama, Lyle Ungar, Christophe Van den Bulte, Kevin G. Volpp, and Angela L. Duckworth
Subjects: Male, Pharmacies, Text Messaging, Multidisciplinary, Immunization Programs, Reminder Systems, Vaccination, education, COVID-19, Middle Aged, Influenza Vaccines, Influenza, Human, Humans, Female, psychological phenomena and processes, Aged
Abstract: Encouraging vaccination is a pressing policy problem. To assess whether text-based reminders can encourage pharmacy vaccination and what kinds of messages work best, we conducted a megastudy. We randomly assigned 689,693 Walmart pharmacy patients to receive one of 22 different text reminders using a variety of different behavioral science principles to nudge flu vaccination or to a business-as-usual control condition that received no messages. We found that the reminder texts that we tested increased pharmacy vaccination rates by an average of 2.0 percentage points, or 6.8%, over a 3-mo follow-up period. The most-effective messages reminded patients that a flu shot was waiting for them and delivered reminders on multiple days. The top-performing intervention included two texts delivered 3 d apart and communicated to patients that a vaccine was "waiting for you." Neither experts nor lay people anticipated that this would be the best-performing treatment, underscoring the value of simultaneously testing many different nudges in a highly powered megastudy.
Published: 2022

14. Nonsuicidal Self-Injury and Substance Use Disorders: A Shared Language of Addiction

Author: Salvatore Giorgi, Mckenzie Himelein-wachowiak, Daniel Habib, Lyle Ungar, and Brenda Curtis
Published: 2022

15. Measuring the Language of Self-Disclosure across Corpora

Author: Ann-Katrin Reuel, Sebastian Peralta, João Sedoc, Garrick Sherman, and Lyle Ungar
Published: 2022

16. Beyond Positive Emotion: Deconstructing Happy Moments Based on Writing Prompts

Author: Kokil Jaidka, Niyati Chhaya, Saran Mumick, Matthew Killingsworth, Alon Halevy, and Lyle Ungar
Abstract: This study reports experiments with the newly-released CL-Aff HappyDB dataset, which looks beyond positive emotion in modeling descriptions of happy moments collected through writing prompts. The widespread adoption of social media has improved researchers' access to unsolicited expressions and behaviors. However, most of the approaches to analyzing these expressions involve a keyword search and focuses on predicting sentiment or emotional content rather than understanding a deeper psychological state, such as happiness. The CL-Aff HappyDB dataset is the first effort to distinguish the personal agency and social interaction in writings about happiness, which do not yet have an exact equivalent concept in existing text-based approaches. We report that state of the art approaches for emotion detection have different topical characteristics, and do not generalize well to detect happiness in the CL-Aff HappyDB dataset. Language models trained on the dataset, on the other hand, generalize to social media writing and are a valid approach for downstream tasks, such as predicting life satisfaction from social media posts.
Published: 2020

17. Dynamics of sadness by race, ethnicity, and income following George Floyd's death

Author: Jielu, Lin, Philip, Shaw, Brenda, Curtis, Lyle, Ungar, and Laura, Koehly
Published: 2022

18. 15. Automated Machine Learning for Risk Prediction of Incisional Hernia in Abdominal Surgery Patients

Author: Ankoor A. Talwar, Abhishek A. Desai, Phoebe B. McAuliffe, Tony Liu, Vivek James, Ivona Percec, Robyn B. Broach, Lyle Ungar, and John P. Fischer
Subjects: Surgery
Published: 2022

19. Bots and misinformation spread on social media: A mixed scoping review with implications for COVID-19 (Preprint)

Author: McKenzie Himelein-Wachowiak, Salvatore Giorgi, Amanda Devoto, Muhammad Rahman, Lyle Ungar, H. Andrew Schwartz, David H. Epstein, Lorenzo Leggio, and Brenda Curtis
Abstract: UNSTRUCTURED As of December 2020, the SARS-CoV-2 virus has been responsible for over 78 million cases of COVID-19 worldwide, resulting in over 1.7 million deaths. In the United States in particular, protective measures against the COVID-19 pandemic have been hampered by political polarization and discrepancies among federal, state, and local policies. As a result, a huge amount of information surrounding COVID-19, some of it contradictory or blatantly false, has proliferated on social media. In this mixed scoping review, we survey the role of automated accounts, or “bots,” in spreading misinformation during past epidemics, natural disasters, and politically polarizing events through the lens of the COVID-19 pandemic. We also review strategies used by bots to spread (mis)information and machine learning methods for detecting bot activity. We conclude by conducting and presenting a secondary analysis of known bots, finding that up to 66% of bots are discussing COVID-19. The proliferation of COVID-19 (mis)information by bots, coupled with human susceptibility to believing and sharing misinformation, may well impact the course of the pandemic.
Published: 2021

20. Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica

Author: Shirley Anugrah Hayati, Dongyeop Kang, and Lyle Ungar
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)
Abstract: People convey their intention and attitude through linguistic styles of the text that they write. In this study, we investigate lexicon usages across styles throughout two lenses: human perception and machine word importance, since words differ in the strength of the stylistic cues that they provide. To collect labels of human perception, we curate a new dataset, Hummingbird, on top of benchmarking style datasets. We have crowd workers highlight the representative words in the text that makes them think the text has the following styles: politeness, sentiment, offensiveness, and five emotion types. We then compare these human word labels with word importance derived from a popular fine-tuned style classifier like BERT. Our results show that the BERT often finds content words not relevant to the target style as important words used in style prediction, but humans do not perceive the same way even though for some styles (e.g., positive sentiment and joy) human- and machine-identified words share significant overlap for some styles., Comment: Accepted at EMNLP 2021 Main Conference, updated typos and Appendix
Published: 2021
Full Text: View/download PDF

21. Recognizing Pathogenic Empathy in Social Media

Author: Muhammad Abdul-Mageed, Anneke Buffone, Hao Peng, Johannes Eichstaedt, and Lyle Ungar
Abstract: Empathy is an integral part of human social life, as people care about and for others who experience adversity. However, a specific “pathogenic” form of empathy, marked by automatic contagion of negative emotions, can lead to stress and burnout. This is particularly detrimental for individuals in caregiving professions who experience empathic states more frequently, because it can result in illness and high costs for health systems. Automatically recognizing pathogenic empathy from text is potentially valuable to identify at-risk individuals and monitor burnout risk in caregiving populations. We build a model to predict this type of empathy from social media language on a data set we collected of users’ Facebook posts and their answers to a new questionnaire measuring empathy. We obtain promising results in identifying individuals' empathetic states from their social media (Pearson r = 0.252, p
Published: 2017

22. Facebook versus Twitter: Differences in Self-Disclosure and Trait Prediction

Author: Kokil Jaidka, Sharath Guntuku, and Lyle Ungar
Abstract: This study compares self-disclosure on Facebook and Twitter through the lens of demographic and psychological traits. Predictive evaluation reveals that language models trained on Facebook posts are more accurate at predicting age, gender, stress, and empathy than those trained on Twitter posts. Qualitative analyses of the underlying linguistic and demographic differences reveal that users are significantly more likely to disclose information about their family, personal concerns, and emotions and provide a more `honest' self-representation on Facebook. On the other hand, the same users significantly preferred to disclose their needs, drives, and ambitions on Twitter. The higher predictive performance of Facebook is also partly due to the greater volume of language on Facebook than Twitter -- Facebook and Twitter are equally good at predicting user traits when the same-sized language samples are used to train language models. We explore the implications of these differences in cross-platform user trait prediction.
Published: 2018

23. Content Analysis of Metaphors About Hypertension and Diabetes on Twitter: Exploratory Mixed-Methods Study (Preprint)

Author: Lauren Sinnenberg, Christina Mancheno, Frances K Barg, David A Asch, Christy Lee Rivard, Emma Horst-Martz, Alison Buttenheim, Lyle Ungar, and Raina Merchant
Subjects: humanities
Abstract: BACKGROUND Widespread metaphors contribute to the public’s understanding of health. Prior work has characterized the metaphors used to describe cancer and AIDS. Less is known about the metaphors characterizing cardiovascular disease. OBJECTIVE The objective of our study was to characterize the metaphors that Twitter users employ in discussing hypertension and diabetes. METHODS We filtered approximately 10 billion tweets for keywords related to diabetes and hypertension. We coded a random subset of 5000 tweets for the presence of metaphor and the type of metaphor employed. RESULTS Among the 5000 tweets, we identified 797 (15.9%) about hypertension or diabetes that employed metaphors. When discussing the development of heart disease, Twitter users described the disease as a journey (n=202), as transmittable (n=116), as an object (n=49), or as being person-like (n=15). In discussing the experience of these diseases, some Twitter users employed war metaphors (n=101). Other users described the challenge to control their disease (n=34), the disease as an agent (n=58), or their bodies as machines (n=205). CONCLUSIONS Metaphors are used frequently by Twitter users in their discussion of hypertension and diabetes. These metaphors can help to guide communication between patients and providers to improve public health.
Published: 2018

24. Characterizing Tweet Volume and Content About Common Health Conditions Across Pennsylvania: Retrospective Analysis (Preprint)

Author: Christopher Tufts, Daniel Polsky, Kevin G Volpp, Peter W Groeneveld, Lyle Ungar, Raina M Merchant, and Arthur P Pelullo
Abstract: BACKGROUND Tweets can provide broad, real-time perspectives about health and medical diagnoses that can inform disease surveillance in geographic regions. Less is known, however, about how much individuals post about common health conditions or what they post about. OBJECTIVE We sought to collect and analyze tweets from 1 state about high prevalence health conditions and characterize the tweet volume and content. METHODS We collected 408,296,620 tweets originating in Pennsylvania from 2012-2015 and compared the prevalence of 14 common diseases to the frequency of disease mentions on Twitter. We identified and corrected bias induced due to variance in disease term specificity and used the machine learning approach of differential language analysis to determine the content (words and themes) most highly correlated with each disease. RESULTS Common disease terms were included in 226,802 tweets (174,381 tweets after disease term correction). Posts about breast cancer (39,156/174,381 messages, 22.45%; 306,127/12,702,379 prevalence, 2.41%) and diabetes (40,217/174,381 messages, 23.06%; 2,189,890/12,702,379 prevalence, 17.24%) were overrepresented on Twitter relative to disease prevalence, whereas hypertension (17,245/174,381 messages, 9.89%; 4,614,776/12,702,379 prevalence, 36.33%), chronic obstructive pulmonary disease (1648/174,381 messages, 0.95%; 1,083,627/12,702,379 prevalence, 8.53%), and heart disease (13,669/174,381 messages, 7.84%; 2,461,721/12,702,379 prevalence, 19.38%) were underrepresented. The content of messages also varied by disease. Personal experience messages accounted for 12.88% (578/4487) of prostate cancer tweets and 24.17% (4046/16,742) of asthma tweets. Awareness-themed tweets were more often about breast cancer (9139/39,156 messages, 23.34%) than asthma (1040/16,742 messages, 6.21%). Tweets about risk factors were more often about heart disease (1375/13,669 messages, 10.06%) than lymphoma (105/4927 messages, 2.13%). CONCLUSIONS Twitter provides a window into the Web-based visibility of diseases and how the volume of Web-based content about diseases varies by condition. Further, the potential value in tweets is in the rich content they provide about individuals’ perspectives about diseases (eg, personal experiences, awareness, and risk factors) that are not otherwise easily captured through traditional surveys or administrative data.
Published: 2018

25. Proper Proxy Scoring Rules

Author: Jens Witkowski, Pavel Atanasov, Lyle Ungar, and Andreas Krause
Subjects: General Medicine
Abstract: Proper scoring rules can be used to incentivize a forecaster to truthfully report her private beliefs about the probabilities of future events and to evaluate the relative accuracy of forecasters. While standard scoring rules can score forecasts only once the associated events have been resolved, many applications would benefit from instant access to proper scores. In forecast aggregation, for example, it is known that using weighted averages, where more weight is put on more accurate forecasters, outperforms simple averaging of forecasts. We introduce proxy scoring rules, which generalize proper scoring rules and, given access to an appropriate proxy, allow for immediate scoring of probabilistic forecasts. In particular, we suggest a proxy-scoring generalization of the popular quadratic scoring rule, and characterize its incentive and accuracy evaluation properties theoretically. Moreover, we thoroughly evaluate it experimentally using data from a large real world geopolitical forecasting tournament, and show that it is competitive with proper scoring rules when the number of questions is small.
Published: 2017

26. Enhanced estimations of post-stroke aphasia severity using stacked multimodal predictions

Author: Dorian, Pustina, Harry Branch, Coslett, Lyle, Ungar, Olufunsho K, Faseyitan, John D, Medaglia, Brian, Avants, and Myrna F, Schwartz
Subjects: Male, Language Tests, Rest, Brain, Middle Aged, Magnetic Resonance Imaging, Multimodal Imaging, Severity of Illness Index, Article, Oxygen, Stroke, Nonlinear Dynamics, Cerebrovascular Circulation, Chronic Disease, Multivariate Analysis, Aphasia, Connectome, Linear Models, Humans, Female
Abstract: The severity of post-stroke aphasia and the potential for recovery are highly variable and difficult to predict. Evidence suggests that optimal estimation of aphasia severity requires the integration of multiple neuroimaging modalities and the adoption of new methods that can detect multivariate brain-behavior relationships. We created and tested a multimodal framework that relies on three information sources (lesion maps, structural connectivity, and functional connectivity) to create an array of unimodal predictions which are then fed into a final model that creates "stacked multimodal predictions" (STAMP). Crossvalidated predictions of four aphasia scores (picture naming, sentence repetition, sentence comprehension, and overall aphasia severity) were obtained from 53 left hemispheric chronic stroke patients (age: 57.1 ± 12.3 yrs, post-stroke interval: 20 months, 25 female). Results showed accurate predictions for all four aphasia scores (correlation true vs. predicted: r = 0.79-0.88). The accuracy was slightly smaller but yet significant (r = 0.66) in a full split crossvalidation with each patient considered as new. Critically, multimodal predictions produced more accurate results that any single modality alone. Topological maps of the brain regions involved in the prediction were recovered and compared with traditional voxel-based lesion-to-symptom maps, revealing high spatial congruency. These results suggest that neuroimaging modalities carry complementary information potentially useful for the prediction of aphasia scores. More broadly, this study shows that the translation of neuroimaging findings into clinically useful tools calls for a shift in perspective from unimodal to multimodal neuroimaging, from univariate to multivariate methods, from linear to nonlinear models, and, conceptually, from inferential to predictive brain mapping. Hum Brain Mapp 38:5603-5615, 2017. © 2017 Wiley Periodicals, Inc.
Published: 2017

27. Discovering User Attribute Stylistic Differences via Paraphrasing

Author: Daniel Preotiuc-Pietro, Wei Xu, and Lyle Ungar
Subjects: General Medicine
Abstract: User attribute prediction from social media text has proven successful and useful for downstream tasks. In previous studies, differences in user trait language use have been limited primarily to the presence or absence of words that indicate topical preferences. In this study, we aim to find linguistic style distinctions across three different user attributes: gender, age and occupational class. By combining paraphrases with a simple yet effective method, we capture a wide set of stylistic differences that are exempt from topic bias. We show their predictive power in user profiling, conformity with human perception and psycholinguistic hypotheses, and potential use in generating natural language tailored to specific user traits.
Published: 2016

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

27 results on '"Lyle Ungar"'

1. Characterizing empathy and compassion using computational linguistic analysis

2. AI-based analysis of social media language predicts addiction treatment dropout at 90 days

3. Emma: An Email Multimodal Architecture with Social Influence Factors for Email Reply Prediction

4. Getting 'clean' from nonsuicidal self-injury: Experiences of addiction on the subreddit r/selfharm

5. Effect of Integrating Patient-Generated Digital Data Into Mental Health Therapy: A Randomized Controlled Trial

6. Analyzing Personality through Social Media Profile Picture Choice

7. Characterizing Geographic Variation in Well-Being Using Tweets

8. CHANGE IN SOCIAL INTERACTION AND MENTAL HEALTH AMONG OLDER AMERICANS DURING COVID-19 PANDEMIC

9. Mining for Equitable Health: Assessing the Impact of Missing Data in Electronic Health Records

10. Reddit language indicates changes associated with diet, physical activity, substance use, and smoking during COVID-19

11. Mining for Health: A Comparison of Word Embedding Methods for Analysis of EHRs Data

12. Life under stay-at-home orders: a panel study of change in social interaction and emotional wellbeing among older Americans during COVID-19 pandemic

13. A 680,000-person megastudy of nudges to encourage vaccination in pharmacies

14. Nonsuicidal Self-Injury and Substance Use Disorders: A Shared Language of Addiction

15. Measuring the Language of Self-Disclosure across Corpora

16. Beyond Positive Emotion: Deconstructing Happy Moments Based on Writing Prompts

17. Dynamics of sadness by race, ethnicity, and income following George Floyd's death

18. 15. Automated Machine Learning for Risk Prediction of Incisional Hernia in Abdominal Surgery Patients

19. Bots and misinformation spread on social media: A mixed scoping review with implications for COVID-19 (Preprint)

20. Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica

21. Recognizing Pathogenic Empathy in Social Media

22. Facebook versus Twitter: Differences in Self-Disclosure and Trait Prediction

23. Content Analysis of Metaphors About Hypertension and Diabetes on Twitter: Exploratory Mixed-Methods Study (Preprint)

24. Characterizing Tweet Volume and Content About Common Health Conditions Across Pennsylvania: Retrospective Analysis (Preprint)

25. Proper Proxy Scoring Rules

26. Enhanced estimations of post-stroke aphasia severity using stacked multimodal predictions

27. Discovering User Attribute Stylistic Differences via Paraphrasing

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Journal

Database

Publisher

27 results on '"Lyle Ungar"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources