2022-04-09

Finding an optimized portfolio of machine learning models using Sklearn, LazyPredict, and Precise Packages

10 mins read In this post, I will provide an example of the use of the precise Python package (and PyPortfolioOpt) to create a diversified portfolio of […]
2022-04-09

Delving into GPT-2 and GPT-3 Language Models

32 mins read This year, we saw a dazzling application of machine learning. The OpenAI GPT-2 exhibited an impressive ability to write coherent and passionate […]
2022-03-28

A tutorial on data science project experimentation with Jupyter, Papermill, and MLflow

7 mins read Your company (e.g., an e-commerce platform across several countries) is starting a new project on fraud detection. You begin by […]
2022-03-27

Interpreting coefficients of Dummy Variables in a Linear Regression Model

5 mins read Linear regression is a method we can use to quantify the relationship between one or more predictor variables and a response variable. […]
2022-03-26

Styling Pandas dataframes using Styler

7 mins read What is styling and why care? The basic idea behind styling is that a user will want to modify the way […]
2022-03-24

Feature Importance calculation using Random Forest

5 mins read The feature importance (variable importance) describes which features are relevant. It can help with a better understanding of the solved […]
2022-03-24

When to avoid using Random Forest Regression?

8 mins read In this article, we’ll look at a major problem with using Random Forest for regression: extrapolation. Random Forest Regression […]
2022-03-24

Mel Spectrogram Explained with Python Code

6 mins read Signals: A signal is a variation in a certain quantity over time. For audio, the quantity that varies is air pressure. How […]
2022-03-23

A comprehensive tutorial on Transformers Architecture

43 mins read We’ve been hearing a lot about Transformers and with good reason. They have taken the world of NLP by storm […]
2022-03-22

Categorical data type in Pandas

8 mins read You may have categorical data in your dataset. Categorical data is a type of data with two or more categories. If […]
2022-03-22

NumPy Broadcasting tutorial

13 mins read In operations between NumPy arrays (ndarray), each shape is automatically converted to be the same by broadcasting. This article describes the following […]
2022-03-22

PySpark equivalent methods for Pandas dataframes

8 mins read Pandas is the go-to library for every data scientist. It is essential for every person who wishes to manipulate data […]
2022-03-11

A tutorial on Bayesian Statistics and Bayesian Machine Learning basics with Python Code

31 mins read Introduction: Conditional probability and Bayes’ theorem are fundamental ideas in statistics that even laymen have heard of. Bayes’ theorem also […]
2022-03-08

Review of intuitions behind the recent advances in NLP: From RNNs to Transformers and BERT

48 mins read Few areas of AI are more exciting than NLP right now. In recent years language models (LM), which can perform […]
2022-02-27

Classical Time Series Forecasting Models in Python

11 mins read Machine learning methods can be used for the classification and forecasting of time series problems. Before exploring machine learning methods for time […]
2022-02-25

Autocorrelation and Partial Autocorrelation explained with Python code

10 mins read What is correlation? In statistics, correlation or dependence refers to any statistical association between two random variables or bivariate data, whether causal […]
2022-02-24

Understanding 1D, 2D, and 3D convolutional layers in deep neural networks

21 mins read In deep learning, convolutional layers have been major building blocks in many deep neural networks. The design was inspired by […]
2022-02-23

Understanding Attention Mechanism with example

14 mins read For decades, Statistical Machine Translation was the dominant translation model, until the birth of Neural Machine Translation (NMT). NMT is an […]
2022-02-22

Hyperparameter optimization techniques in machine learning with Python code

10 mins read In every Machine Learning project, it is possible and recommended to search the hyperparameter space to get the best performance […]
2022-02-20

Bayesian view of linear regression – Maximum Likelihood Estimation (MLE) and Maximum A Posteriori (MAP)

16 mins read Linear regression is commonly the first machine learning problem that people encounter in the field. For […]