Collection of my favorite works.

Projects

Learning is constant, and it happens faster when you code, break, and build stuff.

PaliGemma: Multimodal Vision-Language Model from Scratch
Recent, AI/ML, Generative AI, Highlights
05/2025 - 05/2025
Recreated PaliGemma from scratch to deeply understand vision-language model components like SigLIP, Gemma, KV cache, attention types, contrastive learning, rotary embeddings, and multimodal inference architecture.
Key Skills: Vision-Language Models, Contrastive Learning, Rotary Embeddings, KV Cache, Multimodal Inference, PyTorch, Transformer Architectures, SigLIP, Gemma
Magnecruit: AI-Powered Productive Recruitment Workspace Application
Recent, AI+Full Stack, Generative AI, Database, Highlights
04/2025 - 05/2025
Built an AI-powered recruitment platform with real-time chat, dynamic workspaces, and agentic backend logic using Flask and Gemini API, enhancing recruiter productivity and natural language interaction.
Key Skills: Python, Flask, SQLAlchemy, Google Gemini API, React, TypeScript, Prompt Engineering, Flask-SocketIO, PostgreSQL, Socket.IO
Resonique: Multimodal Music Recommendation App
Recent, AI/ML, Generative AI, Streamlit, Highlights
02/2025 - 03/2025
Developed an emotion-aware music retrieval system that interprets user context using Gemini Flash, transforming multi-modal inputs into MPNet embeddings for semantic song matching via Pinecone and Spotify.
Key Skills: Multi-modal Search, Generative AI, LLM, Vector Search, MPNet, CLAP, Pinecone, Supabase, API Integration, Streamlit
Omni: Truly Personal AGI
Recent, AI/ML, Generative AI, Highlights
11/2024 - 02/2025
Designed Omni, a context-aware multimodal AI assistant that recalls memories, assists tasks, navigates spaces, supports learning, and enhances wellbeing through adaptive modes and intelligent sensory integration.
Key Skills: Multimodal AI, Vector Databases, Speech Recognition, RAG, Depth Estimation, Context Awareness, Task Automation, Gemini Integration
Adaptive Driver Assistance: Context-based Approach to Pedestrian Safety
Recent, AI/ML, Research, Computer Vision, Highlights
06/2024 - 12/2024
Optimized Vision Transformer training for pedestrian intention recognition using LoRA adapters, achieving 90% accuracy with only 0.68% trainable parameters. Used YOLOv8 to extract regions of interest (ROIs) from JAAD video clips and a Vision Transformer to classify them.
Key Skills: Vision Transformer, Parameter Efficient Fine-Tuning, LoRA, YOLOv8, Object Detection, Low-Rank Adaptation, Deep Learning, Data Augmentation
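As a rough illustration of why LoRA leaves such a small share of parameters trainable, here is a minimal sketch of the parameter arithmetic. The dimensions, rank, number of adapted matrices, and base parameter count below are all assumptions chosen for illustration (a ViT-Base-like shape), not the project's actual configuration.

```python
def lora_param_count(d_in: int, d_out: int, rank: int, num_adapted_matrices: int) -> int:
    """Trainable parameters added by LoRA.

    Each adapted weight matrix of shape (d_out, d_in) gets two low-rank
    factors: A with shape (rank, d_in) and B with shape (d_out, rank).
    """
    return rank * (d_in + d_out) * num_adapted_matrices


def trainable_fraction(base_params: int, lora_params: int) -> float:
    """Share of all parameters that are trainable when the base model is frozen."""
    return lora_params / (base_params + lora_params)


# Hypothetical setup: hidden size 768, 12 layers, LoRA on the query and
# value projections only (2 matrices per layer), rank 16.
hidden_size = 768
num_layers = 12
adapted_per_layer = 2
rank = 16

# Assumed round figure for the frozen backbone size, not a measured value.
assumed_base_params = 86_000_000

lora_params = lora_param_count(hidden_size, hidden_size, rank, num_layers * adapted_per_layer)
fraction = trainable_fraction(assumed_base_params, lora_params)

print(f"LoRA parameters: {lora_params:,}")
print(f"Trainable share: {fraction:.2%}")
```

Changing the rank or the set of adapted projections scales the trainable share linearly, which is the main knob when trading accuracy against training cost.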
RxRovers: Roaming for Rapid Relief
AI/ML, Highlights
02/2024 - 05/2024
Simulated dynamic medicine delivery using Deep RL, implementing and comparing six algorithms for path optimization and obstacle avoidance, including PPO, A2C, and Deep Q-Networks in custom Gym environments.
Key Skills: Deep Reinforcement Learning, Q-Learning, Deep Q Networks, A2C, PPO, Actor-Critic, Path Planning, Obstacle Avoidance, Gymnasium, Python
F.E.A.S.T: Food & Ingredient AI Suggestion Technology
AI/ML, Generative AI, NLP, Streamlit, Highlights
01/2024 - 05/2024
Engineered a smart meal planning pipeline using object detection on food images and recipe tokenization, enabling personalized, nutrition-rich suggestions powered by BART and Hugging Face models.
Key Skills: Multimodal AI, Text Generation, Tokenization, Deep Learning, Hugging Face, Grounding DINO, BART, T5, BERT-FDA, Recipe Generation, Named Entity Recognition
Mapping Crime Dynamics: Integrating Textual, Spatial, and Temporal Perspectives
AI/ML, Research, NLP, Highlights
01/2024 - 06/2024
Integrated multiple data modalities to predict crime types and trends using Random Forest, XGBoost, LSTM, and LDA topic modeling, enabling strategic identification of high-risk areas.
Key Skills: Machine Learning, LSTM, Random Forest, XGBoost, Bagging, Boosting, Topic Modeling, LDA, Clustering, BIRCH, Mini Batch KMeans, Time-Series Analysis
Attention based Discrimination of Mycoplasma Pneumonia
AI/ML, Research, Computer Vision, Highlights
06/2021 - 02/2022
Applied attention mechanisms and UGTN transformer models for high-dimensional feature extraction, enhancing pneumonia classification accuracy using COVID-19 Radiography images with TensorFlow and PyTorch frameworks.
Key Skills: Attention Mechanisms, Transformers, UGTN, TensorFlow, PyTorch, Keras, OpenCV, Python, Medical Imaging, Deep Learning
Gear Shift Genius: Master of Formula 1 Data Management
Database
03/2024 - 05/2024
Created a high-performance PostgreSQL database system for Formula 1 analytics, focusing on query cost optimization and indexing to support complex race strategy queries.
Key Skills: SQL, PostgreSQL, Query Optimization, Indexing, Data Management, Python, HTML, CSS, Database Design, Performance Tuning
Crimson Eye: Data Driven Approach to Crime Analysis
AI+Full Stack, AI/ML, NLP
08/2023 - 12/2023
Designed a predictive policing tool using 14 ML models, LDA topic modeling, and clustering to analyze crime patterns and assist law enforcement with strategic resource planning.
Key Skills: Machine Learning, Neural Networks, LDA, Clustering, Flask, JavaScript, HTML, CSS, Data Visualization, Predictive Modeling, Crime Analysis
System And Method To Extract And Analyse Textual Features From An Image
Patents
06/2023 - 12/2023
Created an image analysis method combining GLCM and self-updating fuzzy clustering for detailed text feature extraction and accurate key-element identification.
Key Skills: Gray-Level Co-occurrence Matrix (GLCM), Fuzzy Clustering, Textual Feature Extraction, Image Analysis, Machine Learning, Patent
Automated Monitoring System for Healthier Aquaculture Farming
AI/ML, Research, Computer Vision
01/2023 - 07/2023
Deployed UAV-based monitoring using YOLOv5 and night-vision cameras to detect dead fish with 88% accuracy, enabling real-time alerts and enhancing aquaculture farm health under challenging conditions.
Key Skills: UAV Surveillance, Deep Learning, IoT, Real-Time Monitoring, Night Vision, Alert Systems, Computer Vision, Drone Technology
A Traffic Control System
Patents
06/2022 - 12/2022
Created an AI-driven traffic control solution that monitors traffic density and environmental factors in real-time to improve flow and prioritize emergency response.
Key Skills: Deep Learning, Real-Time Video Analysis, Traffic Management, Signal Optimization, Emergency Vehicle Detection, Weather Adaptation, AI, Workflow Efficiency, Patent
Immigration Reforms Sentiment Analysis with SenticNet APIs
AI/ML, Research, NLP
01/2022 - 06/2022
Conducted concept-based sentiment analysis on immigration-related tweets using SenticNet APIs and Word2Vec embeddings, benchmarking TextBlob, VADER, and Gensim models under NTU research guidance.
Key Skills: Word2Vec, Semantic Similarity, TextBlob, VADER, Gensim, Scikit-learn, Unsupervised Learning, Sentiment Analysis, Concept Parsing
Journey of Letters to Vectors through Neural Networks
AI/ML, Research, NLP
02/2021 - 06/2021
Surveyed the evolution of image captioning from early methods to advanced deep learning, highlighting LSTMs, CNNs, attention mechanisms, and transfer learning applied to the Flickr8K dataset.
Key Skills: Image Captioning, Neural Networks, LSTM, CNN, Attention Mechanism, Transfer Learning, NLP, Deep Learning, Flickr8K Dataset
Snake Detector and Alerting Gadget for Rural India Using Yolo
AI/ML, Patents, Computer Vision
01/2021 - 08/2021
Created a low-cost, real-time snake detection and alert system integrating YOLOv5 with IoT devices, tailored for rural Indian environments and challenging lighting.
Key Skills: Object Detection, IoT Integration, Data Augmentation, Real-Time Detection, Embedded Systems, Alert Mechanisms, Low-Light Vision, Patent
Python Based Motion Sensing Digital Writing Pad
Patents, Computer Vision
01/2021 - 12/2021
Designed and patented a non-touch digital writing pad using motion detection and OpenCV, offering user-friendly writing tools and export options for education in low-resource settings.
Key Skills: Python, OpenCV, Motion Sensing, Digital Writing Pad, Autocorrect, Spell Check, File Export, Accessibility Technology, Patent

I'm an AI/ML engineer and researcher constantly exploring what AI can do. Thanks for checking out my portfolio!
