Unveiling Key Drivers of Customer Satisfaction in Women’s Clothing: A Topic Modeling Analysis of Online Reviews

Authors

DOI:

https://doi.org/10.52756/ijeim.2025.v01.i02.001

Keywords:

Customer satisfaction, Material quality, Online reviews, Topic modeling, Women’s clothing

Abstract

This study explores customer reviews of women’s clothing to find out the factors influencing customer satisfaction using STM. The model provides a deep-level insight which can be used at the industry level. By analyzing textual data, we identified key themes such as fit, comfort, material quality, and design preferences, which significantly influence consumer perceptions and buying decisions. The STM model was trained with five topics, and insights were drawn by analyzing the most probable words, FREX terms, and lift scores for each topic. Visualizations, including coherence scores and topic prevalence plots, were employed to validate the model’s reliability. The findings reveal that customers prioritize aspects like fabric quality, size accuracy, and style, while also highlighting concerns about product durability and fit inconsistencies. Regression analysis further showcases the influence of customer ratings and departmental categories on topic occurrence. This comprehensive analysis offers actionable insights for retailers and manufacturers to improve product distribution and customer satisfaction, thereby enhancing competitiveness in the women’s clothing industry.

Downloads

Download data is not yet available.

References

Açikgöz, F. Y., Kayakuş, M., Zăbavă, B.-, & Kabas, O. (2024). Brand reputation and trust: The impact on customer satisfaction and loyalty for the hewlett-packard brand. Sustainability, 16(22). https://www.mdpi.com/2071-1050/16/22/9681.

Bag, S., Golder, R., Sarkar, S., & Maity, S. (2023). Sene: A novel manifold learning approach for distracted driving analysis with spatio-temporal and driver praxeological features. Engineering Applications of Artificial Intelligence, 123, 106332. https://doi.org/10.1016/j.engappai.2023.106332.

Bhattacharyya, S., Sarkar, S., Sarkar, B., & Manatkar, R. (2023). Risk modeling framework for strategic and operational intervention to enhance the effectiveness of a closed-loop supply chain. IEEE Transactions on Engineering Management, 71, 7015–7028. https://doi.org/10.1109/TEM.2023.3261323.

Bhattacherjee, A. (2001). Understanding information systems continuance: An expectation-confirmation model. MIS Quarterly, 25(3), 351–370. http://www.jstor.org/stable/3250921.

Budescu, D. (1993). Dominance analysis: A new approach to the problem of relative importance of predictors in multiple regression. Psychological Bulletin, 114, 542–551. https://doi.org/10.1037/0033-2909.114.3.542.

Chaudhuri, A., Sarkar, S., & Bala, P. K. (2024). Thematic exploration and analysis of cybersecurity policies of businesses: an nlp-based approach. Journal of Organizational Computing and Electronic Commerce, 35(2),157–187. https://doi.org/10.1080/10919392.2024.2435115.

Chiu, C.-M., Wang, E. T. G., Fang, Y.-H., & Huang, H.-Y. (2014). Understanding customers’ repeat purchase intentions in b2c e-commerce: the roles of utilitarian value, hedonic value and perceived risk. Information Systems Journal, 24(1), 85–114. https://doi.org/10.1111/j.1365-2575.2012.00407.x.

Cyr, D., Head, M., & Ivanov, A. (2006). Design aesthetics leading to m-loyalty in mobile commerce. Information Management, 43, 950–963. https://doi.org/10.1016/j.im.2006.08.009.

Dwivedi, R., Lohmor Choudhary, S., Dixit, R., Sahiba, Z., & Naik, S. (2024). The customer loyalty vs. customer retention: The impact of customer relationship management on customer satisfaction. Web Intelligence, 22(3),425–442. https://doi.org/10.3233/WEB-230098.

Gefen, D., Karahanna, E., & Straub, D. (2003). Trust and tam in online shopping: An integrated model. MIS Quarterly, 27, 51–90. https://doi.org/10.2307/30036519.

Gundla, A. & Otari, M. (2015). A review on sentiment analysis and visualization of customer reviews.International Journal Of Engineering And Computer Science, 4(12), 435–446. https://www.ijsr.net/archive/v4i9/SUB158501.pdf.

Ilieva, G., Yankova, T., Klisarova-Belcheva, S., & Dzhabarova, Y. (2022). Customer satisfaction in e-commerce during the covid-19 pandemic. Systems, 10, 213. https://doi.org/10.3390/systems10060213.

Joshi, Y. & Rahman, Z. (2015). Factors affecting green purchase behaviour and future research directions. International Strategic Management Review, 3(1), 128–143. https://doi.org/10.1016/j.ism.2015.04.001.

Kapoor, K., Tamilmani, K., Rana, N., Patil, P., Dwivedi, Y., & Nerur, S. (2018). Advances in social media research: Past, present and future. Information Systems Frontiers, 20, 531–558. https://doi.org/10.1007/s10796-017-9810-y.

Khuri, A. (2013). Introduction to linear regression analysis. International Statistical Review, 81. https://doi.org/10.1111/insr.12020_10.

Maier, D., Waldherr, A., Miltner, P., Wiedemann, G., Niekler, A., Keinert, A., Pfetsch, B., Heyer, G., Reber,U., Häussler, T., Schmid-Petri, H., & Adam, S. (2018). Applying lda topic modeling in communication research: Toward a valid and reliable methodology. Communication Methods and Measures, 12, 1–26. https://doi.org/10.1080/19312458.2018.1430754.

Maity, S., Rastogi, A., Djeddi, C., Sarkar, S., & Maiti, J. (2022). A novel optimized method for feature selection using non-linear kernel-free twin quadratic surface support vector machine. In Pattern Recognition and Artificial Intelligence, volume 1543 (pp. 339–353). Springer Nature. https://doi.org/10.1007/978-3-031-04112-9_26.

Parasuraman, A., Zeithaml, V. A., & Berry, L. L. (1988). Servqual: A multiple-item scale for measuring consumer perc. Journal of retailing, 64(1), 12. https://www.proquest.com/openview/7d007e04d78261295e5524f15bef6837/1?cbl=41988&pq-origsite=gscholar.

Pramanik, A., Nande, V., Pradhan, A. S., Sarkar, S., & Maiti, J. (2021a). Dynamic functional bandwidth kernel-based svm: An efficient approach for functional data analysis. In Emerging Technologies in Data Mining and Information Security, volume 1286 (pp. 673–681). Springer, Singapore. https://doi.org/10.1007/978-981-15-9927-9_65.

Pramanik, A., Sarkar, S., Djeddi, C., & Maiti, J. (2021b). Real-time detection of traffic anomalies near roundabouts. In Mediterranean Conference on Pattern Recognition and Artificial Intelligence (pp. 253–264).:Springer. https://doi.org/10.1007/978-3-031-04112-9_19.

Pramanik, A., Sarkar, S., & Pal, S. K. (2023). Video surveillance-based fall detection system using object-level feature thresholding and z- numbers. Knowledge-Based Systems, 280, 110992. https://doi.org/10.1016/j.knosys.2023.110992.

Pramanik, A., Sarkar, S., Sai Siddharth, V., Maiti, J., & Mitra, P. (2021c). Semi-automated ontology creation and upgradation for rail-road incidents: a case of a steel plant in india. In Emerging Technologies in Data Mining and Information Security: Proceedings of IEMIS 2020, Volume 3 (pp. 285–294).: Springer. https://doi.org/10.1007/978-981-15-9774-9_28.

Pramanik, A., Sarker, S., Sarkar, S., & Bose, I. (2024). Fgi-cogvit: Fuzzy granule-based interpretable cognitive vision transformer for early detection of alzheimer’s disease using mri scan images. Information Systems Frontiers, (pp. 1–35). https://doi.org/10.1007/s10796-024-10541-7.

Pramanik, A., Sarker, S., Sarkar, S., & Pal, S. K. (2025). Real-time fall detection on road using transfer learningbased granulated bi-lstm. Knowledge-Based Systems, (pp. 113038). https://doi.org/10.1016/j.knosys.2025.113038.

Ramon, S. & Calif (2024). New survey reveals the strong influence of online reviews on consumer decision making. https://www.businesswire.com/news/home/20241009362692/en/New-Survey-Reveals-the-Strong-Influence-of-Online-Reviews-on-Consumer-Decision-Making.

Rao, A., Sarkar, S., Pramanik, A., & Maiti, J. (2022). Predicting and analysing pedestrian injury severity: A machine learning-based approach. In Proceedings of the Seventh International Conference on Mathematics and Computing: ICMC 2021 (pp. 485–497).: Springer. https://doi.org/10.1007/978-981-16-6890-6_36.

Roberts, M. E., Stewart, B. M., Tingley, D., Lucas, C., Leder-Luis, J., Gadarian, S. K., Albertson, B., & Rand,D. G. (2014). Structural topic models for open-ended survey responses. American Journal of Political Science, 58 (4), 1064–1082. https://doi.org/10.1111/ajps.12103.

Sabharwal, M., Meena, H., Sarkar, S., Damani, M., & Sharma, N. (2024). Real-time detection of coke particles in blast furnace operations using machine learning: Case of a steel plant in india. Ironmaking & Steelmaking,(pp. 03019233241276285). https://doi.org/10.1177/03019233241276285.

Saha, S., Das, M., Mondal, B. S., Sarkar, S., & Maiti, J. (2021). DiPSVM: A polynomial kernel-free support vector machine. In 2021 International Conference on Data Analytics for Business and Industry (ICDABI),IEEE. https://doi.org/10.1109/ICDABI53623.2021.9655976.

Sarkar, S., Ejaz, N., Maiti, J., & Pramanik, A. (2022). An integrated approach using growing self-organizing map-based genetic k-means clustering and tolerance rough set in occupational risk analysis. Neural Computing and Applications, 34, 9661–9687. https://doi.org/10.1007/s00521-022-06956-5.

Sarkar, S., Gaine, S., Deshmukh, A., Khatedi, N., & Maiti, J. (2020a). A structural topic modeling-based machine learning approach for pattern extraction from accident data. In Data Engineering and Communication Technology: Proceedings of 3rd ICDECT-2K19 (pp. 555–564).: Springer. https://doi.org/10.1007/978-981-15-1097-7_46.

Sarkar, S., Pramanik, A., Khatedi, N., Balu, A., & Maiti, J. (2020b). GSEL: A genetic stacking-based ensemble learning approach for incident classification. In Proceedings of ICETIT 2019: Emerging Trends in Information Technology (pp. 719–730).: Springer. https://doi.org/10.1007/978-3-030-30577-2_64.

Sarkar, S., Sammangi, V., Raj, R., Maiti, J., & Mitra, P. (2019). Application of optimized machine learning techniques for prediction of occupational accidents. Computers & Operations Research, 106, 210–224. https://doi.org/10.1016/j.cor.2018.02.021.

Szymanski, D. M. & Hise, R. T. (2000). E-satisfaction: an initial examination. Journal of Retailing, 76(3),309–322. https://doi.org/10.1016/S0022-4359(00)00035-X.

Tandu, C., Kosuri, M., Sarkar, S., & Maiti, J. (2022). A two-fold multi-objective multi-verse optimization based time series forecasting. In Proceedings of the Seventh International Conference on Mathematics and Computing: ICMC 2021 (pp. 743–754).: Springer. https://doi.org/10.1007/978-981-16-6890-6_55.

Zeithaml, V., Parasuraman, A. P., & Malhotra, A. (2002). Service quality delivery through web sites: A critical review of extant knowledge. Journal of the Academy of Marketing Science, 30, 362–375. https://doi.org/10.1177/009207002236911.

Zygiaris, S., Hameed, Z., Ayidh Alsubaie, M., & Ur Rehman, S. (2022). Service quality and customer satisfaction in the post pandemic world: A study of saudi auto care industry. Frontiers in Psychology, 13, 842141. https://www.frontiersin.org/journals/psychology/articles/10.3389/fpsyg.2022.842141.

Published

15-04-2025

Issue

Section

Articles

How to Cite

Bhootra, P. . (2025). Unveiling Key Drivers of Customer Satisfaction in Women’s Clothing: A Topic Modeling Analysis of Online Reviews. International Journal of Engineering and Information Management , 1(2), 1-18. https://doi.org/10.52756/ijeim.2025.v01.i02.001