December 18, 2020

How to determine epsilon and MinPts parameters of DBSCAN clustering

Every data mining task has the problem of parameters. Every parameter influences the algorithm in specific ways. For DBSCAN, the […]
November 9, 2020

Restricted Boltzmann Machines (RBMs) Simply Explained

Contents Definition & Structure Reconstructions Probability Distributions Code Sample: Stacked RBMS Parameters & k Continuous RBMs Next Steps Other Resources […]
February 21, 2020

Walkthrough of an exploratory analysis for classification problems

In this post I outline how to perform an exploratory analysis for a binary classification problem. I am going to […]
February 5, 2020

Dealing with Imbalanced Data Imbalanced classes are a common problem in machine learning classification where there are a disproportionate ratio of observations […]
February 3, 2020

Exploratory Data Analysis Visualizing the distribution of a dataset — seaborn 0.10.0 documentation Plotting with categorical data […]
February 3, 2020

Data Levels of Measurement

There are four measurement scales: nominal, ordinal, interval and ratio. These are simply ways to categorize different types of variables […]
September 8, 2017

L1 and L2 as Loss Function and Regularization

While practicing machine learning, you may have come upon a choice of the mysterious L1 vs L2. Usually the two […]
September 8, 2017

Missing data methods   Missing Completely at Random Missing completely at random (MCAR) is the only missing data mechanism that can actually […]