Customer Retention Booster

Keep them coming,...

Scroll

Increase your customer retention rate

WHEN Nov 2018
Tools Used Python - Web Scrapper, R, Shiny, ML-Classification
Visit GitHub

As people started providing product reviews in abundance, it is difficult to find a middle ground to evaluate the reviews and determine the reason behind their reviews.

we will be analyzing the reviews provided on the Amazon website using Multinomial Naïve Bayes and Support Vector Machine algorithms to predict the sentiment of the users towards the product and perform topic modelling using Latent Dirichlet Allocation algorithm to determine the topics discussed or in simple terms, the reason behind such ratings/reviews.

Classifying data is what we will be doing in this project and Support Vector Machines (SVM) are the frontier in classifying the data with the n-dimensional hyperplane, n being the number of features. Alongside SVM, we will also be using another classifier called Multinomial Naïve Bayes (MNB) classifier which is a specific instance of a Naïve Bayes Classifier which uses a multinomial distribution for each of the features.

Latent Dirichlet Allocation (LDA) is a generative probabilistic model of a collection of documents made up of words and/or n-grams (set of words) which we will be using for performing topic modelling.

We will be applying the above-mentioned algorithms to the data and build a model that will predict the sentiment of the provided comment/review and cluster the word according to the topics that were discussed.

Finally, we will be presenting our findings through an R-Shiny app with complete descriptive anylsis presented through the approproate visualization techniques.