Mining Social Media Data: A Practical Approach with Weka

Authors

  • Pardeep Arora, Department of IT, Kanya Maha Vidyalaya, Jalandhar, Punjab, India
  • Jass Kaur, Department of IT, Kanya Maha Vidyalaya, Jalandhar, Punjab, India
  • Anshu, Department of IT, Kanya Maha Vidyalaya, Jalandhar, Punjab, India

DOI:

https://doi.org/10.32628/IJSRST25123108

Keywords:

Data Mining, Social Media, Analysis, Data sets, Machine learning algorithms

Abstract

The explosive growth of social media produces massive amounts of data, which makes analysis and insight extraction increasingly difficult. This study addresses machine learning classification problems on four social media datasets: "Time-Waster on Social Media," "Instagram Profile," "Instagram Photos," and "Viral Trends." Four classifiers were applied: Bayes Network, Random Forest, Decision Tree (J48), and Naive Bayes. On the "Time-Waster" and "Instagram Photos" datasets, Random Forest outperformed the others. The classifiers were evaluated using F-Measure, Precision, Recall, and ROC AUC. Future research may investigate multimedia content to gain a better understanding of user trends and behavior.
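
The four evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Weka workflow: the labels, scores, and the 0.5 decision threshold below are hypothetical values chosen only to show how the metrics are computed for a binary problem, and the function names are illustrative.

```python
from typing import Sequence, Tuple


def precision_recall_f_measure(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float, float]:
    """Precision, recall and F-measure for the positive class (label 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_measure


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC as the fraction of (positive, negative) pairs in which the
    positive example receives the higher score; ties count as half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical ground-truth labels and classifier scores (not from the paper).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2, 0.7, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

precision, recall, f_measure = precision_recall_f_measure(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(precision, recall, f_measure, auc)  # 0.75 0.75 0.75 0.9375
```

Precision, recall, and F-measure depend on the chosen threshold, whereas ROC AUC is computed from the ranking of scores alone, which is one reason the two kinds of metric are usually reported together.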

Published

03-06-2025

Section

Research Articles