Timeline: 6th December, 2024 - Present
Description: This documentation site serves as a personal repository of notes and practical implementations from my journey of learning and building GPT models, inspired by Andrej Karpathy's Neural Networks: Zero to Hero series. It is organized into two sections: Set-1 focuses on foundational concepts like backpropagation and language modeling, while Set-2 explores advanced topics such as transformer architectures, tokenizers, and GPT-2 reproduction. Designed as a resource for both revision and inspiration, it is also open for others to reference and learn from.
Timeline: 11th - 21st February, 2025
Description: This project is a ground-up implementation of a GPT-style transformer, following Andrej Karpathy’s tutorial. It begins with a naive bigram language model and gradually evolves into a full transformer architecture with multi-head self-attention, feedforward layers, residual connections, and layer normalization. The implementation provides hands-on insight into self-attention, positional encodings, and the key building blocks of modern large language models. By the end, the model can generate text based on learned patterns, demonstrating the power of transformers in natural language processing.
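For a flavour of what gets built, here is a minimal sketch of one causal self-attention head in PyTorch. The class and parameter names (Head, n_embd, head_size, block_size) follow common conventions around this tutorial and are illustrative, not a verbatim excerpt of the project.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Head(nn.Module):
    """One head of causal self-attention (illustrative sketch)."""
    def __init__(self, n_embd, head_size, block_size):
        super().__init__()
        self.key = nn.Linear(n_embd, head_size, bias=False)
        self.query = nn.Linear(n_embd, head_size, bias=False)
        self.value = nn.Linear(n_embd, head_size, bias=False)
        # lower-triangular mask so each position attends only to the past
        self.register_buffer('tril', torch.tril(torch.ones(block_size, block_size)))

    def forward(self, x):
        B, T, C = x.shape
        k = self.key(x)    # (B, T, head_size)
        q = self.query(x)  # (B, T, head_size)
        # scaled dot-product attention scores
        wei = q @ k.transpose(-2, -1) * k.shape[-1] ** -0.5  # (B, T, T)
        wei = wei.masked_fill(self.tril[:T, :T] == 0, float('-inf'))
        wei = F.softmax(wei, dim=-1)
        v = self.value(x)  # (B, T, head_size)
        return wei @ v     # (B, T, head_size)
```

In the full model, several such heads run in parallel and their outputs are concatenated, with feedforward layers, residual connections, and layer normalization wrapped around each block.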
Timeline: 8th - 9th February, 2025
Description: In this phase, we transform a basic 2-layer MLP into a deeper, tree-like architecture inspired by DeepMind's WaveNet (2016), using a convolution-like structure to capture hierarchical patterns in the data. The implementation shifts from plain fully connected layers to a more structured network, delving into the inner workings of torch.nn in PyTorch and highlighting the iterative deep learning development process.
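A rough sketch of the tree-like idea, assuming a FlattenConsecutive-style layer that merges consecutive embeddings so each linear layer fuses two neighbours at a time; all names and sizes here are illustrative.

```python
import torch
import torch.nn as nn

class FlattenConsecutive(nn.Module):
    """Merge n consecutive embeddings so the next Linear fuses them together."""
    def __init__(self, n):
        super().__init__()
        self.n = n
    def forward(self, x):
        B, T, C = x.shape
        x = x.view(B, T // self.n, C * self.n)
        return x.squeeze(1) if x.shape[1] == 1 else x

# Illustrative tree-like model: 8 context characters fused 2 at a time,
# giving a hierarchy of depth 3 instead of one wide fully connected layer.
n_embd, n_hidden, vocab_size = 24, 128, 27  # assumed sizes
model = nn.Sequential(
    nn.Embedding(vocab_size, n_embd),
    FlattenConsecutive(2), nn.Linear(n_embd * 2, n_hidden), nn.Tanh(),
    FlattenConsecutive(2), nn.Linear(n_hidden * 2, n_hidden), nn.Tanh(),
    FlattenConsecutive(2), nn.Linear(n_hidden * 2, n_hidden), nn.Tanh(),
    nn.Linear(n_hidden, vocab_size),
)
logits = model(torch.randint(0, vocab_size, (4, 8)))  # (4, 27)
```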
Timeline: 15th January - 6th February, 2025
Description: In this project, we take the MLP built in the Language Model-3 project and backpropagate through it manually, without using PyTorch autograd's loss.backward(). The aim is to build a strong intuitive understanding of how gradients flow backwards through the compute graph, at the level of efficient tensors rather than the individual scalars of the Micrograd project.
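A tiny sketch of the core exercise, assuming a single tanh layer with made-up shapes: derive the gradients by hand with the chain rule, then verify them against what autograd computes.

```python
import torch

# forward pass through a single tanh layer (illustrative shapes)
x = torch.randn(32, 10)
W = torch.randn(10, 20, requires_grad=True)
b = torch.randn(20, requires_grad=True)
h = torch.tanh(x @ W + b)
loss = h.sum()
loss.backward()  # PyTorch's gradients, used here only for verification

# manual backward pass over tensors, mirroring the chain rule
dh = torch.ones_like(h)   # dloss/dh for a sum() loss
dpre = dh * (1 - h ** 2)  # tanh'(z) = 1 - tanh(z)^2
dW = x.T @ dpre           # gradient w.r.t. the weights
db = dpre.sum(0)          # bias gradient, summed over the broadcast batch dim

print(torch.allclose(dW, W.grad), torch.allclose(db, b.grad))  # True True
```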
Timeline: 6th - 14th January, 2025
Description: Focused on implementing Batch Normalization within a neural network framework, emphasizing its role in stabilizing activations and gradients during training. Covered techniques like Kaiming initialization to scale weights properly and prevent saturation of activation functions, and analysed the effects of Batch Normalization on convergence speed and overall model performance. Visualizations were also used to monitor activations and gradients, providing valuable insights into the training dynamics.
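A condensed sketch of the two ideas together, with illustrative sizes: Kaiming-scaled weights for a tanh layer, followed by the training-mode core of Batch Normalization.

```python
import torch

fan_in, fan_out = 200, 100
# Kaiming initialization for a tanh layer: gain 5/3 over sqrt(fan_in)
# keeps the pre-activation standard deviation near 1 and avoids saturation
W = torch.randn(fan_in, fan_out) * (5 / 3) / fan_in ** 0.5

x = torch.randn(32, fan_in)
hpre = x @ W

# Batch Normalization (training-mode core): standardize each unit over
# the batch, then rescale and shift with a learnable gain and bias
bngain = torch.ones(1, fan_out)
bnbias = torch.zeros(1, fan_out)
mean = hpre.mean(0, keepdim=True)
var = hpre.var(0, keepdim=True)
h = torch.tanh(bngain * (hpre - mean) / torch.sqrt(var + 1e-5) + bnbias)
print(h.std().item())  # activations stay well-scaled
```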
Timeline: 26th November - 11th December, 2024
Description: Implemented an MLP language model from the Bengio et al. 2003 research paper, but for character-level prediction, following Andrej Karpathy's approach, and even slightly improved on the final loss value.
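A minimal sketch of the Bengio-style architecture at character level, with illustrative sizes (a 27-character vocabulary and a context of 3 characters): an embedding lookup, a tanh hidden layer over the concatenated context, and cross-entropy over the output logits.

```python
import torch
import torch.nn.functional as F

# illustrative sizes for the character-level setup
vocab_size, block_size, n_embd, n_hidden = 27, 3, 10, 200
g = torch.Generator().manual_seed(42)
C  = torch.randn((vocab_size, n_embd), generator=g)  # embedding table
W1 = torch.randn((block_size * n_embd, n_hidden), generator=g)
b1 = torch.randn(n_hidden, generator=g)
W2 = torch.randn((n_hidden, vocab_size), generator=g)
b2 = torch.randn(vocab_size, generator=g)

X = torch.randint(0, vocab_size, (32, block_size))  # batch of 3-char contexts
Y = torch.randint(0, vocab_size, (32,))             # next-character targets

emb = C[X]                                  # (32, 3, 10) embedding lookup
h = torch.tanh(emb.view(32, -1) @ W1 + b1)  # hidden layer over the context
logits = h @ W2 + b2
loss = F.cross_entropy(logits, Y)           # negative log likelihood
```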
Timeline: 4th - 22nd November, 2024
Description: Worked on implementing a character-level bigram language model from scratch to generate text, exploring key concepts in natural language processing such as normalization, probability distributions, sampling new words, and evaluating the model by its negative log likelihood. Also recast the same bigram problem as a neural network that produces similar output, this time using gradient-based optimization to tune the parameters of the network, following Andrej Karpathy's methodology.
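A compact sketch of the counting approach, using a tiny stand-in word list instead of the real dataset: count bigrams, normalize each row into a probability distribution, and score the model by average negative log likelihood.

```python
import torch

words = ["emma", "olivia", "ava"]  # illustrative stand-in for the dataset
chars = sorted(set("".join(words)))
stoi = {s: i + 1 for i, s in enumerate(chars)}
stoi['.'] = 0  # '.' marks both the start and the end of a word

# count every bigram, then normalize rows into probability distributions
N = torch.zeros((len(stoi), len(stoi)), dtype=torch.int32)
for w in words:
    seq = ['.'] + list(w) + ['.']
    for a, b in zip(seq, seq[1:]):
        N[stoi[a], stoi[b]] += 1
P = (N + 1).float()  # add-one smoothing avoids log(0)
P /= P.sum(1, keepdim=True)

# evaluate with the average negative log likelihood
nll, n = 0.0, 0
for w in words:
    seq = ['.'] + list(w) + ['.']
    for a, b in zip(seq, seq[1:]):
        nll -= torch.log(P[stoi[a], stoi[b]])
        n += 1
print((nll / n).item())  # lower is better
```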
Timeline: 2nd - 27th October, 2024
Description: Built a neural network from scratch by developing a micrograd library, implementing core concepts like backpropagation and gradient descent, following Andrej Karpathy's methodology.
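A condensed sketch of the central idea: a scalar Value object that records the compute graph and applies the chain rule in reverse. Only add and mul are shown here; the real library covers more operations.

```python
class Value:
    """Scalar with autograd, in the spirit of micrograd (condensed sketch)."""
    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._backward = lambda: None
        self._prev = set(_children)

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad   # d(a+b)/da = 1
            other.grad += out.grad
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad  # d(a*b)/da = b
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def backward(self):
        # topologically sort the graph, then apply the chain rule in reverse
        topo, visited = [], set()
        def build(v):
            if v not in visited:
                visited.add(v)
                for c in v._prev:
                    build(c)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()

a, b = Value(2.0), Value(-3.0)
L = a * b + a
L.backward()
print(a.grad, b.grad)  # -2.0 2.0
```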
Timeline: 8th - 24th September, 2024
Description: Developed a portfolio site using Flask, HTML, CSS, and JavaScript, along with a functioning chatbot built using RASA.
Timeline: 21st August, 2024 - Present
Description: A project outlet for everything I am learning from DeepLearning.AI's short courses.
Timeline: 2nd - 18th August, 2024
Description: PrivateGPT is an open-source project available on the internet. The purpose here was to set up, run, and use my own private AI without worrying about my data being leaked; after setup, it can even be used without an internet connection.
Timeline: 7th - 24th July, 2024
Description: A project outlet for the Angular Specialization Course (temporarily archived).
Timeline: Nov 2022 - June 2023
Description: Group project done as part of my Final Year Project during my UG course. We developed an intelligent solution for creating custom question papers using custom-designed machine learning algorithms; the website is designed to make generating question papers quick, efficient, and hassle-free.
Timeline: Sept 2022 - Oct 2022
Description: Developed a Stack Overflow clone with an alternative design theme, built as a practice project while learning the MERN stack. MongoDB Atlas was used as the cloud database.
Timeline: Dec 2021 - Feb 2022
Description: Developed a video streaming platform (similar to Netflix) as part of a subject-based project during my UG course. The site has a high-quality, responsive design along with custom-made posters, and was developed using HTML, CSS, and JavaScript, with PHP for server-side scripting.
Timeline: June 2022 - Aug 2022
Description: Developed an app in Java using Android Studio as part of a mini-project in the pre-final year of my UG course. The app detects and lists any audio files on the device; the user can play/pause and change songs with an active SeekBar (implemented using threads) for an interactive experience.