2022-05-22

Understanding np.newaxis and np.expand_dims in NumPy

9 mins read To add new dimensions (increase dimensions) to the NumPy array ndarray, you can use np.newaxis, np.expand_dims(), and np.reshape() (or reshape() method of ndarray). Indexing — NumPy v1.17 Manual Constants […]
2022-05-11

23 Useful but less used Pandas Functions

11 mins read Pandas is so vast and deep that it enables you to execute virtually any tabular manipulation you can think of. […]
2022-05-08

Encoding categorical features using the category_encoders package

11 mins read There are loads of different ways to convert categorical variables into numeric features so they can be used within machine […]
2022-05-04

Understand different feature scaling techniques with Python code

19 mins read In many machine learning algorithms, to bring all features in the same standing, we need to do scaling so that […]
2022-05-01

Speed up Python code using Numba

27 mins read Python is an interpreter-based language hence it’s slow compared to other compiler-based languages like C/C++. Due to this python was […]
2022-04-27

A tutorial on TensorFlow 2 and Keras: From basics to deep neural networks

49 mins read Predictive modeling with deep learning is a skill that modern developers need to know. TensorFlow is the premier open-source deep […]
2022-04-24

Implementing Transformers step-by-step in PyTorch from scratch

14 mins read Doing away with clunky for-loops, the transformer instead finds a way to allow whole sentences to simultaneously enter the network […]
2022-04-17

Profile Memory Usage in Python using memory_profiler

14 mins read With the rise in the primary memory of computer systems, we generally do not run out of memory. This is […]
2022-04-10

Understanding ROC and Precision-Recall curves

25 mins read It can be more flexible to predict probabilities of an observation belonging to each class in a classification problem rather […]
2022-04-09

Finding an optimized portfolio of machine learning models using Sklearn, LazyPredict, and Precise Packages

10 mins read In this post, I will provide an example of the use of the precise Python package (and PyPortfolioOpt) to create a diversified portfolio of […]
2022-03-28

Bulk Boto3 (bulkboto3): Python package for fast and parallel transferring a bulk of files to S3 based on boto3!

5 mins read Table of Contents: Introduction About bulkboto3 Getting Started Prerequisites Installation Usage Contributing Conclusion Introduction “How to transfer a bulk of […]
2022-03-28

A tutorial on data science project experimentation with Jupyter, Papermill, and MLflow

7 mins read Your company (e.g., an e-commerce platform across several countries) is starting a new project on fraud detection. You begin by […]
2022-03-26

Styling Pandas dataframes using Styler

7 mins read What is styling and why care? The basic idea behind styling is that a user will want to modify the way […]
2022-03-25

Different Python package import patterns using __init__.py file

10 mins read I have had a few conversations lately about Python packaging, particularly around structuring the import statements to access the various modules of […]
2022-03-24

Feature Importance calculation using Random Forest

5 mins read The feature importance (variable importance) describes which features are relevant. It can help with a better understanding of the solved […]
2022-03-24

Mel Spectrogram Explained with Python Code

6 mins read Signals A signal is a variation in a certain quantity over time. For audio, the quantity that varies is air pressure. How […]
2022-03-22

Categorical data type in Pandas

8 mins read You may have categorical data in your dataset. A categorical data is a type with two or more categories. If […]
2022-03-22

NumPy Broadcasting tutorial

13 mins read In operations between NumPy arrays (ndarray), each shape is automatically converted to be the same by broadcasting. This article describes the following […]
2022-03-22

PySpark equivalent methods for Pandas dataframes

8 mins read Pandas is the go-to library for every data scientist. It is essential for every person who wishes to manipulate data […]
2022-03-11

A tutorial on Bayesian Statistics and Bayesian Machine Learning basics with Python Code

31 mins read Introduction Conditional probability and Bayes’ theorem are fundamental ideas in statistics that even laymen have heard of. Bayes’ theorem also […]