December 18, 2020

How to determine epsilon and MinPts parameters of DBSCAN clustering

Every data mining task has the problem of parameters. Every parameter influences the algorithm in specific ways. For DBSCAN, the […]
November 9, 2020

Restricted Boltzmann Machines (RBMs) Simply Explained

Contents: Definition & Structure; Reconstructions; Probability Distributions; Code Sample: Stacked RBMs; Parameters & k; Continuous RBMs; Next Steps; Other Resources […]
February 21, 2020

Walkthrough of an exploratory analysis for classification problems

In this post I outline how to perform an exploratory analysis for a binary classification problem. I am going to […]
February 5, 2020

Dealing with Imbalanced Data

https://towardsdatascience.com/methods-for-dealing-with-imbalanced-data-5b761be45a18 Imbalanced classes are a common problem in machine learning classification where there is a disproportionate ratio of observations […]
February 3, 2020

Exploratory Data Analysis

https://towardsdatascience.com/exploratory-data-analysis-8fc1cb20fd15 https://medium.com/omarelgabrys-blog/statistics-probability-exploratory-data-analysis-714f361b43d1 https://www.kaggle.com/ekami66/detailed-exploratory-data-analysis-with-python https://www.kaggle.com/dvigneshwer/kernele7f4dbb964/notebook Visualizing the distribution of a dataset (seaborn 0.10.0 documentation): https://seaborn.pydata.org/tutorial/distributions.html https://www.kaggle.com/kashnitsky/topic-1-exploratory-data-analysis-with-pandas https://iq.opengenus.org/exploratory-data-analysis-python/ Plotting with categorical data […]