Predicting Linguistically Sophisticated Social Determinants of Health Disparities with Neural Networks: The Case of LGBTQ+ Minority Stress

Cory J. Cascalheira, Santosh Chapagain, Ryan E. Flinn, Yuxuan Zhao, Soukaina Filali Boubrahimi, Dannie Klooster, Alejandra Gonzalez, Emily M. Lund, Danica Laprade, Jillian R. Scheer, Shah Muhammad Hamdi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

LGBTQ+ minority stress is a pervasive form of anti-LGBTQ+ adverse events and psychological strain that drives health inequities among LGBTQ+ people. Minority stress is also linguistically sophisticated (e.g., composed of cultural idioms, psycholinguistic permutations, and lexical density). Because minority stress is a linguistically sophisticated social determinant of health disparities, it is challenging to detect using natural language processing (NLP). Using 5,789 human-annotated Reddit posts from the LGBTQ+ Minority Stress on Social Media (MiSSoM+) Dataset, we investigated and compared the performance of four neural networks and two traditional machine learning architectures in modeling minority stress at both the factor (i.e., separate components of minority stress) and composite level. A novel hybrid model combining Bidirectional Encoder Representations from Transformers and convolutional neural network (BERT-CNN) improved the prediction of composite minority stress (F1 = 0.84). Our experiments on separate factors of minority stress are the first to demonstrate that hybrid neural network models can detect semantically complex expressions of prejudiced events (F1 = 0.87), expected rejection (F1 = 0.92), internalized stigma (F1 = 0.91), identity concealment (F1 = 0.92), and minority coping (F1 = 0.84). We also substantially improved the prediction of gender dysphoria (F1 = 0.94) - a conceptually new candidate component of minority stress. Big data analytics may not be a panacea for the problem of minority stress, but our work joins a growing literature base to show that deep learning models are remarkable in detecting linguistically sophisticated social determinants of health disparities in big data, thus providing evidence in support of the potential benefit from the innovative use of such technology in eliminating group-specific health inequities.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE International Conference on Big Data, BigData 2023
EditorsJingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou, Dominik Slezak, Wei Wang, Aleksandra Gruca, Jerry Chun-Wei Lin, Rakesh Agrawal
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1314-1321
Number of pages8
ISBN (Electronic)9798350324457
DOIs
StatePublished - 2023
Event2023 IEEE International Conference on Big Data, BigData 2023 - Sorrento, Italy
Duration: 15 Dec 202318 Dec 2023

Publication series

NameProceedings - 2023 IEEE International Conference on Big Data, BigData 2023

Conference

Conference2023 IEEE International Conference on Big Data, BigData 2023
Country/TerritoryItaly
CitySorrento
Period15/12/2318/12/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • bidirectional encoder representation of transformers (BERT)
  • convolutional neural network (CNN)
  • deep learning
  • multi-label text classification
  • sexual and gender minority
  • word embedding

Fingerprint

Dive into the research topics of 'Predicting Linguistically Sophisticated Social Determinants of Health Disparities with Neural Networks: The Case of LGBTQ+ Minority Stress'. Together they form a unique fingerprint.

Cite this