This post presents a detailed analysis for a start-up that wants to make a data-driven decision on the marketing strategy that maximizes sales and conversion, and to understand the factors driving them.
This post describes an end-to-end project on predicting the sentiment of customer reviews using Natural Language Processing and Deep Learning with PyTorch.
One-hot encoding is one of the techniques used to represent text for natural language processing tasks. This post discusses one-hot encoding as Part I of a series on text representation in NLP.
This post demonstrates the process of predicting the number of clicks, with a focus on the procedures and logic behind the modeling process. It details the exploratory analysis undertaken to inform the modeling, the implementation of non-parametric machine learning models such as decision trees and ensemble models, and the use of hyperparameter optimization, bagging, and boosting techniques to improve model performance. The post also provides insights into the influence of various features on the model and, more importantly, whether the modeling exercise was worth it, by comparing the results to a benchmark model. Not all techniques were implemented, including some that could reduce the error further. Nonetheless, a non-exhaustive outline of how to improve the model is provided.
This post demonstrates how to build customized interactive visualizations from time series data.
This post analyzes how the various growth strategies employed by a business influence its sales. The data-driven approach aims to assess how the firm should move forward. To gain such actionable insights, this post demonstrates how to undertake a linear regression with a focus on interpretable results.
This post discusses identifying and analyzing KPIs for products, as well as undertaking A/B testing to make recommendations that are relevant to product teams. The post is an end-to-end project that demonstrates how to support product development with statistical analysis.
Clustering analysis is a popular unsupervised machine learning technique that categorizes data into groups of similar observations.
This post demonstrates the use of ggplot for data visualization.
Datasiast exists for enthusiastic sharing and practice of data science knowledge.