32mins read This year, we saw a dazzling application of machine learning. The OpenAI GPT-2 exhibited an impressive ability to write coherent and passionate […]
19mins read Table of Contents An Introduction To PyTorch Dataset and DataLoaderWhy Write Good Data Loaders and Datasets?The Basic PyTorch Dataset StructureImplementing […]
14mins read For decades, Statistical Machine Translation has been the dominant translation model, until the birth of Neural Machine Translation (NMT). NMT is an […]
22mins read In this article, I will be covering the main concepts behind Attention, including the implementation of a sequence-to-sequence Attention model, […]
11mins read We often read almost everywhere that Lasso regression encourages zero coefficient and hence provides a great tool for variable selection as well but it […]
11mins read Conventional encoder-decoder architectures for machine translation encoded every source sentence into a fixed-length vector, irrespective of its length, from which […]
29mins read Sequence-to-sequence models are deep learning models that have achieved a lot of success in tasks like machine translation, text summarization, […]
39mins read Introduction Recurrent Neural Networks (or more precisely LSTM/GRU) have been found to be very effective in solving complex sequence-related problems […]
8mins read Introduction There are several methods to calculate gradients in computer programs: (1) Manual differentiation; (2) Symbolic differentiation; (3) Finite differences […]