2021-09-10

Double DQN and Dueling DQN in Reinforcement Learning

9 mins read In this article, we will see two algorithms that improve upon DQN. These are named Double DQN and Dueling DQN. But first, let’s […]
2021-09-09

SumTree data structure for Prioritized Experience Replay (PER) explained with Python Code

14 mins read Weighted sampling from a list-like collection is an important activity in many applications. Weighted sampling involves selecting samples randomly from […]
2021-08-22

Understanding Attention Mechanism in Sequence 2 Sequence Machine Translation

39 mins read Introduction Recurrent Neural Networks (or more precisely LSTM/GRU) have been found to be very effective in solving complex sequence-related problems […]
2021-08-07

Principal Component Analysis (PCA) Explained

14 mins read What is PCA? Let’s say that you want to predict what the gross domestic product (GDP) of the United States will be […]
2021-08-06

What are eigenvectors and eigenvalues?

16 mins read Introduction Eigenvectors and eigenvalues have many important applications in computer vision and machine learning in general. Well-known examples are PCA (Principal […]
2021-08-06

Automatic Differentiation Explained

8 mins read Introduction There are several methods to calculate gradients in computer programs: (1) Manual differentiation; (2) Symbolic differentiation; (3) Finite differences […]
2021-08-03

Understanding Model Calibration and Brier Score

12 mins read Do you ever encounter a storm when the probability of rain in your weather app is below 10%? Well, this […]
2021-07-10

Predicting Customer Churn with Machine Learning: From EDA to Classification

27 mins read Table of Contents: Introduction, Objective, Libraries, Parameters and Variables, Functions, A Quick Look at our Data, Creating a Test Set […]
2021-07-09

Performance evaluation metrics for binary classification with Python code

30 mins read Classification metrics let you assess the performance of machine learning models but there are so many of them, each one has its […]
2021-07-08

What is Word2vec word embedding?

24 mins read I find the concept of embeddings to be one of the most fascinating ideas in machine learning. If you’ve ever […]
2021-07-04

Feature Scaling with Scikit-Learn

9 mins read 1 Introduction 2 Loading the libraries 3 Scaling methods 3.1 Standard Scaler 3.2 Min-Max Scaler 3.3 Robust Scaler 3.4 Comparison […]
2021-07-03

Understanding and discovering multicollinearity in regression analysis with Python code

9 mins read In this post, I will explain the concept of collinearity and multicollinearity and why it is important to understand them […]
2021-06-27

Resampling time series in Pandas: resample and asfreq methods

23 mins read This article is an introductory dive into the technical aspects of resampling methods in pandas. 1. Resampling: Resampling is necessary […]
2021-06-26

Time series analysis with Pandas: Power consumption case study

24 mins read Originally developed for financial time series such as daily stock market prices, the robust and flexible data structures in pandas […]
2021-06-26

Labeling financial data for Machine Learning

24 mins read In this article, we’ll be looking at one method for labeling our data and getting it ready for our model. By the […]
2021-06-24

A complete guide on Pandas Hierarchical Indexing (MultiIndex)

31 mins read Pandas is the go-to library for data analysis when working with tabular datasets. It is the best solution available for […]
2021-06-24

Data selection (indexing and slicing) in Pandas MultiIndex DataFrames

6 mins read A MultiIndex (also known as a hierarchical index) DataFrame allows you to have multiple columns acting as a row identifier and multiple […]
2021-06-21

Data Science and Machine Learning Cheat Sheets

5 mins read Click on the links to get the high-resolution cheat sheets: Algebra, Linear Algebra, Calculus, Probability, Statistics, Python, R, Machine Learning […]
2021-06-12

Introduction to advanced candlesticks in finance: tick bars, dollar bars, volume bars, and imbalance bars

56 mins read In this article, we will explore why traditional time-based candlesticks are an inefficient method to aggregate price data, especially under […]
2021-05-26

5 steps to start becoming a Machine Learning Engineer

16 mins read Step 1: Adjusting Your Mindset Whenever I lead my workshops I always get a lot of questions afterward from developers […]