IJCAIT Vol 12, Issue 2, 2020



An Integrated Approach for Amazon Product Reviews Classification Using Sentiment Analysis
Authors: Deepika Kumar, Ritik Aggarwal, Siddhant Raghuvanshi, Shourya Chand, India


Sentiment Analysis plays a huge role in business analytics and situations in which text needs to be analyzed. It is used in anticipating market progression based on different news, online blogs and social media opinions. Essential part of information-gathering for market research is to find the opinion of people about the product. Many business enterprises are utilizing these opinions to perform better in the market. In this paper, the analysis is done on the Amazon product?s reviews dataset. The data is organized through preprocessing and after cleaning through various techniques, some useful features are selected and sentiment analysis is done to generate a sentiment polarity. Various different learning techniques like Naïve Bayes, Linear Support Vector Machine and Logistic Regression classifiers are applied on the preprocessed data and comparison analysis is done to find the best classifier fit for the reviews data through the detailed analysis and generation of the Receiver operator characteristics curve and the comparison analysis through AUC value of different classifiers testing the precision or the accuracy. A term-doc incidence matrix is built using term frequency and inverse document frequency ponderation. The results obtained from the integrated approach of combining the sentiment polarity with the ratings to recommend a particular Amazon product proves invaluable for both the customer and seller.


Keywords: Data Preprocessing, NLP, Sentiment Analysis, Classification

References
1. Gurmeet kaur and Abhinash Singla, “Sentimental Analysis of Flipkart reviews using Naïve Bayes and Decision Tree algorithm”, International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) Volume 5 Issue 1, January 2016.
2.Bo Pang and Lillian Lee, “Thumbs up? Sentiment Classification using Machine Learning Techniques”, Proceedings of EMNLP2002.
3. Yongfeng Zhangy and GuokunLaiy, “ Explicit Factor Models for Explainable Recommendation based on Phrase-level Sentiment Analysis”, SIGIR’14 Proceedings of the 37th international ACM SIGIR conference on Research & development in informational retrieval Pages83-92
4. Konstantin Bauman, Bing Liu and Alexander Tuzhilin, “Aspect Based Recommendations: Recommending Items with the Most Valuable Aspects Based on User Reviews”, KDD ’17, August 13-17, 2017, Halifax, NS,Canada
5. Xing Fang and Justin Zhan, “Sentiment Analysis using Product Review Data”, Journal of Big Data 2015
6. Minqing Hu and Bing Liu,”Mining Opinion Features in Customer Reviews”, American Association for Artificial Intelligence2005
7. Alvaro Ortigosa, José M. Martín, Rosa M. Carro, “Sentiment Analysis in Facebook and itsapplication to e-learning”, Department of Computer Science, Universidad Autónomade Madrid, Francisco Tomás y Valiente11, 28049 Madrid, Spain
8. M Mittal, LM Goyal, J K Sethi and D J Hemanth, (2018), “Monitoring the Impact of Economic Crisis on Crime in India Using Machine Learning”, Computational Economics, Springer. vol 53(4), pp. 1467-1485
9. M. Mittal, I. Kaur, S. C. Pandey, A. Verma, L M Goyal (2019) “Opinion Mining for the Tweets in Healthcare Sector using Fuzzy Association Rule”, EAI Endorsed Transactions on Pervasive Health and Technology. 4.16:1-10
10. M. Shastri, S. Roy, M. Mittal, (2019) “Stock Price Prediction using Artificial Neural Model: An Application of Big Data”, EAI Endorsed Transactions on Scalable Information Systems, VoL 06(20), pp. 1-8
11. Jasleen Kaur Sethi&Mamta Mittal (2019) “Ambient Air Quality Estimation using Supervised Learning Techniques”, EAI Endorsed Transactions on Scalable Information Systems, VoL 06(22), pp. 1-10
12. Mittal M., Arora M., Pandey T. (2020) Emoticon Prediction on Textual Data Using Stacked LSTM Model. In: Bansal J., Gupta M., Sharma H., Agarwal B. (eds) Communication and Intelligent Systems. ICCIS 2019. Lecture Notes in Networks and Systems, vol 120. Springer, Singapore
13. Kumar L., Mitra A., Mittal M., Sanghvi V., Roy S., Setua S.K. (2020) Music Tagging and Similarity Analysis for Recommendation System. In: Das A., Nayak J., Naik B., Pati S., Pelusi D. (eds) Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol 999. Springer, Singapore
14. Yejin Choi and Claire Cardie, “Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis”, Department of Computer Science Cornell University Ithaca, NY 14853
15. Soleymani, M., Garcia, D., Jou, B., Schuller, B., Chang, S. F., &Pantic, M. (2017). A survey of multimodal sentiment analysis. Image and Vision Computing, 65, 3-14.
16. Batra, R., &Daudpota, S. M. (2018, March). Integrating Stock Twits with sentiment analysis for better prediction of stock price movement. In 2018 International Conference on Computing, Mathematics and Engineering Technologies (iCoMET) (pp. 1-5). IEEE.
17. Sharma, M., Singh, G., & Singh, R. (2017). Stark assessment of lifestyle based human disorders using data mining based learning techniques. IRBM, 38(6), 305-324.
18. Kaur, P., & Sharma, M. (2019). Diagnosis of Human Psychological Disorders using Supervised Learning and Nature-Inspired Computing Techniques: A Meta-Analysis. Journal of medical systems, 43(7), 204.
19. Khan, M. A., Khan, M. A., Ahmed, F., Mittal, M., Goyal, L. M., Hemanth, D. J., &Satapathy, S. C. (2020). Gastrointestinal diseases segmentation and classification based on duo-deep architectures. Pattern Recognition Letters, 131, 193-204.
20. Sharma, M., Singh, G., & Singh, R. (2019). Design of GA and Ontology based NLP Frameworks for Online Opinion Mining. Recent Patents on Engineering, 13(2), 159-165.


© 2012 onwards International Journal of Computer Applications & Information Technology

International Journal of Computer Applications & Information Technology
All rights reserved are reserved to Chief Editor IJCAIT.

For any Technical Support editorijcait@gmail.com