Muhammed Shah

AI Software Engineer

About Me

Hello, I'm Muhammed Muzzammil Shah.


I use AI architectures and techniques to design and implement practical AI solutions, such as AI-powered chatbots, agentic systems, and intelligent capabilities for existing software. I also have a solid foundation in full-stack development, which lets me bring end-to-end systems to life.


What you will find on this site

This website is my technical profile. You can explore my Work Experience, Projects, Certifications, and more. Each section has its own dedicated page if you’d like a deeper look.


Muhammed Shah
Bangalore, India

It all started back in 2014 when I first saw my uncle working his magic with lines of code (lowkey a Tony Stark vibes moment). Since then, I've been hooked on programming!


Fast forward to today, and I'm proud to be working in the dynamic field of AI. Through it all, I still keep my uncle's favorite Steve Jobs quote close to heart: "If you're afraid of failing you won't get very far."


Alongside my journey in tech, I’ve always enjoyed graphic design and website design (something I picked up in college and never really stopped doing). I even run a separate site just to showcase some of my design projects.

Muzzammil Studio →

My Space

Personal Branding

Weekend Tech Cafe

Private GPT on Linux/WSL

AI as a Blackbox?

The Journey of Optimization

Work Experience

AI Software Engineer

Sept 2025 - Present

Leading a focused AI team to design and deploy scalable AI solutions across multiple projects, while also creating and delivering training programs to help the company build AI skills and drive adoption.

Junior AI Software Engineer

Sept 2024 - Sept 2025

Building and optimizing applications using LLMs or SLMs for different departments, and integrating AI architectures for secure, scalable deployment.

AI Trainee Engineer

Sept 2023 - Sept 2024

Collaborated directly with the Global CIO and Founder on AI-driven initiatives, focusing on open source AI R&D, pilot implementations and cybersecurity operations.

AI Intern

Mar 2023 - Jun 2023

Developed a pilot for interactive data insight conversations and researched AI observability, ML monitoring, root cause analysis, anomaly detection, and capacity forecasting.

Work Projects

Private Enterprise GPT

Designed, configured, and deployed a secure in-house Private GPT system on Ubuntu with an Nginx reverse proxy, SSL, and custom domain mapping. Integrated Microsoft OAuth authentication to restrict access to company users, automated backend services for resilience, and enabled GPU-powered remote access via VNC. Migrated all AI interactions in-house, ensuring full data privacy and operational reliability.

AI Search

Developed an AI-powered search widget with an “Ask AI” feature for instant, accurate querying of company salary and HR data, leveraging a RAG system with a FAISS vector index, LangChain orchestration, and Llama 3.x models. Optimized embedding accuracy, reduced hallucinations, and introduced caching for speed, deploying the solution as a JavaScript widget via IIS for seamless internal integration.

Intelligence Framework

Engineered an all-in-one intelligence framework for HR and Talent Acquisition, featuring an LLM-driven resume reviewer, job spec generator, chatbot, and resume formatter with advanced UI and batch processing. Utilized local LLMs via Ollama, LangChain pipelines, and MongoDB for fast, secure, and accurate candidate evaluation, automation, and internal data generation.

Chatbot Development

Led the creation of an advanced company chatbot widget, powered by a proprietary SLM trained on internal data and later enhanced with LLM (Ollama) fallbacks. Delivered features like file downloads, speech-to-text, custom UI, and robust conversation management. Migrated from Rasa Open Source to Rasa Pro CALM for advanced context switching and ensured scalable, low-latency responses for dynamic business needs.

Skills

Programming Languages:

Python

ML & DL Frameworks:

PyTorch

HuggingFace

LangChain

LangGraph

AI SDK

AI Architectures:

Neural networks

LLMs

AI Techniques:

RAG

Tool calling

NLU/NLP

Prompt engineering

AI Applications:

Chatbots (Conversational AI)

Private GPT

AI Agents

Vector Databases:

FAISS

Qdrant

Data Engineering & Analysis:

NumPy

Pandas

Matplotlib

Jupyter Notebook

AI Deployment & MLOps:

Linux Server (Ubuntu)

Ollama

Docker

AI Tools & Platforms:

Rasa

Open WebUI

Certifications

Agentic AI

Issued by DeepLearning.AI - Nov 2025

MCP: Build Rich-Context AI Apps

Issued by DeepLearning.AI - Sept 2025

Open Source Models with Hugging Face

Issued by DeepLearning.AI - Aug 2024

Prompt Engineering for Developers

Issued by DeepLearning.AI - Aug 2024

Build LLM Apps with LangChain.js

Issued by DeepLearning.AI - July 2024

Generative AI with Large Language Models

Issued by Coursera - Feb 2024

Career Essentials in Generative AI

Issued by Microsoft and LinkedIn - Jul 2023

Personal Projects

Chat AI

Built and deployed a React-based AI assistant on my personal website using the AI SDK, enabling multi-step reasoning, tool use, and real-time responses with seamless UI/UX and dynamic knowledge updates.

Road to AI

Developed a comprehensive documentation site covering practical neural network and GPT model implementations, foundational ML concepts, and advanced topics like transformers and tokenization, using Python and PyTorch.

Transformer Model: GPT

Created a GPT-style transformer in PyTorch from scratch, progressing from a simple bigram model to a full transformer with multi-head attention, feedforward layers, and text generation capabilities.

Neural Networks: Makemore

Built a series of neural network language models, progressing from a basic bigram model and an MLP-based character-level predictor to deeper architectures with batch normalization, manual backpropagation, and a WaveNet-inspired convolutional network.

Neural Networks: Micrograd

Built a neural network library from the ground up in Python, implementing backpropagation, gradient descent, and autograd features to understand neural network fundamentals.

Education

MVJ College of Engineering (Affiliated with VTU), 2023 Graduate

BTech - Computer Science

Grade: 9.32 CGPA

Achievements: Top 10 University Rank Holder in CSE Department, Director of Design at TEDxMVJCE, Vice President at Saahitya Literature Club

Primus PU College, 2019 Graduate

PUC - Physics, Chemistry, Math and Computer Science

Grade: 86.5%, Distinction

St. Peter's School, 2017 Graduate

I.C.S.E - Science

Grade: 92.4%

Achievements: Top Rank Holder in Computer Applications.