Detection and Prediction of Future Mental Disorder from Social Media Data Using Machine Learning Ensemble Learning and Large Language Models

Authors

  • Cheni Sruneethi Assistant Professor Department of MCA, Annamacharya Institute of Technology and Sciences (AITS), Karakambadi, Tirupati, Andhra Pradesh, India Author
  • Kondarajula Sharmila Post Graduate, Department of MCA, Annamacharya Institute of Technology and Sciences (AITS), Karakambadi, Tirupati, Andhra Pradesh, India Author

DOI:

https://doi.org/10.32628/IJSRST2512338

Keywords:

social media, large language models (LLMs), machine learning

Abstract

With the exponential growth of user-generated content on social media, researchers are exploring new ways to extract meaningful patterns to understand public health trends—particularly mental health conditions. This paper investigates the detection and prediction of future mental disorders through social media analysis using a combination of machine learning, ensemble learning methods, and large language models (LLMs). The approach aims to identify behavioral and linguistic indicators of mental distress before clinical diagnosis or self-reporting. Ensemble methods such as Random Forests and Gradient Boosting are integrated with deep learning language models like BERT and RoBERTa to improve prediction accuracy. A diverse set of features including sentiment polarity, temporal posting patterns, and linguistic markers are extracted from social posts to train the models. The proposed system achieves high accuracy in predicting early warning signs of mental disorders, such as depression, anxiety, and PTSD, with explainability incorporated through SHAP values. This research offers a scalable, data-driven solution to assist clinicians and policymakers in early mental health intervention strategies.

Downloads

Download data is not yet available.

References

Ji Ho Park et al. (2020), “Detection and Classification of Mental Illnesses on Social Media Using RoBERTa”, arXiv:2011.11226.

Emadeldeen Eldakrouri et al. (2023), “Harnessing Hugging Face Transformers for Predicting Mental Health Disorders in Social Networks”, arXiv:2306.16891.

Elham J. F. et al. (2021), “MentalBERT: Publicly Available Pretrained Language Models for Mental Healthcare”, arXiv:2110.15621.

Lin Y., Fan C., & Xu J. (2017), “Mining Online Social Data for Detecting Social Network Mental Disorders”, arXiv:1702.03872.

S. Saha & A. Ghosh (2024), “Explainable AI for Mental Disorder Detection via Social Media”, arXiv:2406.05984.

De Choudhury, M. et al. (2013), “Predicting Depression via Social Media”, ICWSM.

Resnik, P. et al. (2015), “Beyond Labeled Data: Using Social Media to Detect Mental Health Conditions”, Journal of Biomedical Informatics.

Benton, A. et al. (2017), “Multitask Learning for Mental Health Conditions with Social Media Text”, EACL.

Chancellor, S. & De Choudhury, M. (2020), “Methods in Predictive Techniques for Mental Health

Downloads

Published

19-05-2025

Issue

Section

Research Articles