Hey there! I'm

Charvi Kusuma

I graduated from University at Buffalo with Masters degree in Computer Science and Engineering. During my undergraduate studies I was awarded the Chancellor's Gold Medal for the Best Outgoing Student at VIT, India.


About Me

My introduction

Full-stack AI Engineer, adept in both AI/ML Engineering and Software Development, ensuring AI models are accurate, scalable, and integrated into real-world applications. Strong academic theory and practical exposure from delivering results in fast-paced environments at companies like Amazon and JPMC. Actively exploring advancements in AI space with hands-on projects and end-to-end development.

4 Internships and
Research Experience
10+ Imapctful
AI/ML and Core CSE Projects
9+ Novel Publications
and Patents

Qualifications

My Career Path
Experience
Education

MS in Computer Science and Engineering

University at Buffalo

Transcript
Aug 2023 - Jan 2025
3.96/4.0 (AI/ML Track)

BTech in Computer Science and Engineering

Vellore Institute of Technology

Transcript
2019 - 2023
9.58/10.0 (Gold Medalist)

Research Assistant,

Graduate Student Assistant

University at Buffalo

Jan 2024 - Present (> 1year),
Aug 2024 - Dec 2024 (6 months)
Learn More

Research Assistant

  • Best MS Research Project
  • Awarded during CSE Demo Days, for Adaptive Driver Assistance System: A Context-Based Approach Pedestrian Safety. More details in project section.

  • Led 3 novel research projects and developed end-to-end Machine Learning application with 21 models incorporating LDA for topic modeling and LSTM for temporal analysis.

  • Collaborated with 200+ students, providing resources and mentorship in Data Intensive Computing projects on machine learning and big data frameworks (Hadoop, MapReduce, Spark) for Fall 2024.

Software Development and Operations Intern

Amazon

Jan 2023 - June 2023 (6 months)
Learn More

Amazon Internship

  • Amazon Internship Certificate
  • As part of Amazon Pay Core DevOps Engineering Team, I led initiatives to enhance code integrity and operational workflows by rectifying 120+ integration test cases. I worked closely with cross-functional teams throughout the Beta and Gamma stage for Amazon Pay's crucial pipeline ensuring CI/CD substantially reducing 12 hours of manual efforts per week.

  • Mitigated 20+ important security risks like breach of confidential data, lack of encryption standards, and automated the process with Python and Java skills for any future instances. This was a major contribution to the team's yearly goal, reducing 10+ different pending actions items.

  • I tackled the complex task of onboarding a legacy material set for access permissions onto AWS Secrets Manager with S3, IAM, EC2 instances, and Lambda functions. Given the scale of the Amazon Pay application, completely eliminating the dependency on the prior permission handler posed significant challenges, but I successfully managed the transition.

Software Engineering Program(SEP) Intern

JP Morgan Chase & Co.

June 2022 - July 2022 (2 months)
Learn More

JP Morgan Chase & Co. Internship

  • JP Morgan Chase Internship Certificate
  • As part of Consumer & Community Banking(CCB), contributed the team with seamless POC for migration from Splunk to Grafana dashboards, showcasing a 90% effective remodelling strategy.

  • I worked backened with an established core JAVA application based on Spring Framework that handles all bank account opening requests for CCB. Redirected requests from Splunk leveraging SoapUI web services, Apache ActiveMQ message requests, and Prometheus metric data.

  • On the frontend, illustrated 5 different data visualizations on Grafana to successfully validate the migration by the fetching of previous requests generated on the app.

Research Intern

Nanyang Technological University

Jan 2022 - June 2022 (6 months)
Learn More

Nanyang Technological University

  • NTU@India Connect - Internship Completion Certificate
  • Under the Guidance of Prof. Erik Cambria leading SenticNet Computing Team at NTU, I conducted an extensive research to analyze the errors of sentiment analysis APIs for the company.

  • With the topic "Immigration Reforms Sentiment and Error Analysis" and using the SenticNet framework, I analyzed 25K+ sentiment trends and identifying areas for API policy improvements.

  • Enhanced the performance of 5 different SenticNet APIs, including concept parsing, polarity classification, and subject detection, by comparing against established tools such as Gensim, Vader, and TextBlob.

  • Refer my project section for more details and github link to the research work.

Skills

My technical acumen

Programming

Python
Java
JavaScript
HTML/CSS
SQL
R

Relevant Courses

Machine Learning
Deep Learning
Computer Vision
Big-Data Analysis
Data Structures
OS, DBMS

AI/ML Expertise

PyTorch, TensorFlow, Scikit Learn, Keras
Multimodal Processing (NLP, CV, Speech)
Model Training & Finetuning
Agentic Systems, LLMs, RAG
Evaluation of ML models
Recommendation Systems

Software Development

RDBMS (MySQL, PostgreSQL)
Version Control (Git/Github)
Big Data (Hadoop, PySpark)
Flask & Flutter Framework
React, TypeScript, Tailwind, Node.js
Docker, Cloud (AWS), MongoDB, Kafka

Achievements

My Awards

Best MS Research Project

University at Buffalo

Certificate
Dec 2024
Selected among 90 different research projects at CSE Demo Day

Chancellor's Gold Medalist

Vellore Institute of Technology

Certificate
July 2023
Honored as the Best Outgoing Student among 1500+ Graduating Students

Academic Excellence Award

Vellore Institute of Technology

Certificate
July 2022 & 2021
Consecutively secured 2nd rank among BTech CSE students

Contributions

My Projects, Research Publications and Patents

PROJECTS

Magnecruit Preview
  1. Magnecruit: AI-Powered Productive Recruitment Workspace Application

    2025 Github

    Architected and built a comprehensive recruitment tool featuring an AI assistant (MagnecAI) to enhance recruiter productivity.

    AI & Backend: Developed the backend using Python/Flask, integrating Google Gemini API for natural language interaction. Implemented agentic logic to orchestrate LLM calls, parse responses (JSON extraction via regex/parsing), manage conversation state, and interact with the PostgreSQL database via SQLAlchemy ORM. Utilized Flask-SocketIO for real-time bidirectional communication. Designed RESTful endpoints for user authentication and conversation management.

    Frontend: Created a responsive user interface with React and TypeScript, featuring distinct panels for navigation, AI chat, and a dynamic workspace. Implemented real-time updates in the workspace component using Socket.IO, allowing users to see AI-generated sequences appear live. Managed component state effectively using React Hooks.

    Key Skills: Python, Flask, Flask-SocketIO, SQLAlchemy, Google Gemini API, Prompt Engineering, JSON Parsing, React, TypeScript, Socket.IO Client, REST APIs, PostgreSQL, Git, HTML, CSS (Tailwind).


Resonique Workflow
  1. Resonique - Multimodal Music Recommendation

    2025 Github

    Integrated Gemini Flash to semantically describe user surroundings (text/image/audio) and Spotify song metadata, transforming them into high-fidelity vector embeddings using MPNet for emotion-based music retrieval.

    It leverages

    1. Pinecone for vector similarity search
    2. Google Gemini AI for understanding user queries
    3. Supabase for cloud audio storage
    4. Spotify API for real-world song recommendations

    Key Skills: Multi-modal Search, Generative AI, LLM, Vector Search, MPNet, CLAP, Pinecone, Supabase, API Integration, Streamlit.


ADAS Simulation
  1. Adaptive Driver Assistance: Context-based Approach to Pedestrian Safety

    2024 Best MS Research Project Award

    Ongoing research publication focuses on utilizing Parameter Efficient Fine-Tuning on ViT Transformer for Pedestrian Behavior & Scene Context Classification

    Achieved a significant reduction in trainable parameters to 0.68% of the Vision Transformer (ViT) backbone while maintaining an impressive 90% classification accuracy across four intention adapters.

    1. Processed 346 HD video clips from the JAAD (Joint Attention for Autonomous Driving) Dataset, employing YOLOv8 as the pedestrian detector to extract Regions of Interest (ROIs). This approach ensures precise identification of pedestrians in various scenarios, enhancing the model's ability to predict intentions accurately.
    2. Implemented a robust pipeline for data preprocessing and model training, including data augmentation techniques to improve generalization and effective performance with Low Rank Adaptation (LoRA) for fine-tuning.
    3. Deep Learning: Optimized Vision Transformer, Object Detection, YOLOv8, Deployable Adapters

F.E.A.S.T Project Workflow
  1. F.E.A.S.T - Food & Ingredient AI Suggestion Technology

    2024 Github

    With an aim to reduce food waste and provide optimized meal planning, we performed extensive data augmentation and custon object detection for Ingredients images.

    Tokenized 2.2 million recipes for high-quality model inputs for personalized recipe generation and nutritional value provision.

    Showcased Project in CSE Demo Day Spring 2024 as "Eyes on Eats: From Image to Formula".

    1. Large Language Models, YOLO models (versions 7 and 9), Grounding DINO, BART tokenizer
    2. Trained BART transformer model and implemented ChefTransformerT5 and BertFDA-Nutrition-ner model from Hugging Face.
    3. Deep Learning: Object Detection, Text Generation, Named Entity Recognition

RxRover Simulation
  1. RxRovers - Roaming for Rapid Relief

    2024 Github

    Path Optimization and Dynamic Obstacle avoidance strategy simulating efficient delivery of medicines.

    Implemented 6 Deep RL algorithms including Value-based and Actor Critic with comphrehensive comparison.

    1. Q Learning, Deep Q Networks, A2C, Proximal Policy Optimization (PPO) and improved versions.
    2. Reinforcement Learning, Gymnasium Library and Python
    3. Beginner's Repo : Defining and Solving RL Environments for Stock Trading.

SenticNet Workflow
  1. Immigration Reforms Sentiment Analysis with SenticNet APIs

    2022 Github

    Worked with 25K tweets for concept parsing, polarity sentiment and word embeddings with Gensim's Word2Vec models to analyze similarities.

    Compared multiple NLP tools including TextBlob, VADER, and SenticNet APIs for detailed error analysis of polarity classification and concept parsing.

    Implemented Semantic Similarity Analysis (SSA) using unsupervised learning to predict sentiments.

    1. Natural Language Processing: TextBlob and VADER, Gensim, Scikit-learn, Tweepy
    2. This Project was successfully completed as part of my internship at NTU Research under the guidance of Prof. Erik Cambria

View More
  • More Projects

  • CrimsonEye Workflow
    1. Crimson Eye - Data Driven Approach to Crime Analysis

      2023 Github

      Predictive policing tools for strategic allocation of law enforcement resources, enhancing community safety.

      Applied 14 different ML classifiers, 7 clustering algorithms, Neural Networks and Latent Dirichlet Allocation(LDA) for topic modeling.

      1. Machine Learning, Flask, HTML, CSS, JavaScript
      2. Presented the work in CSE Demo Day Fall 2023

    GearShift DB Workflow
    1. Gear Shift Genius: Master of Formula 1 Data Management

      2024 Github

      Developed a centralized database system that integrates various datasets related to Formula 1, including race results, driver standings, lap times, and more.

      Dealt with advanced queries for race stategy optimization, analyzed query cost estimates and optimized with indexing concepts.

      1. Advanced SQL, PostgreSQL
      2. HTML, CSS, Python

PUBLICATIONS

Adaptive Driver Assistance Workflow
  1. Adaptive Driver Assistance: Context-based Approach to Pedestrian Safety

    Techrxiv (Pending - IEEE Transactions Journal)

    Applied task-specific tuning to retain base model knowledge, training 8 adapters that reduced parameters to 0.68% of the Vision Transformer backbone while maintaining 90% accuracy in context and behavior classification.


Mapping Crime Dynamics Visualization
  1. Mapping Crime Dynamics: Integrating Textual, Spatial, and Temporal Perspectives

    UEMCON 2024

    Developed classification models using algorithms like bagging, boosting, Random Forest, and XGBoost to predict crime types based on various features. Achieved classification accuracies up to 73%, offering a strong foundation for identifying high-risk areas.

    Leveraged LSTM (Long Short-Term Memory) networks for time-series analysis, successfully predicting future crime trends by analyzing historical crime data.

    Conducted topic modeling using LDA on crime descriptions to uncover latent crime themes, followed by applying clustering algorithms like BIRCH and Mini Batch KMeans to spatially analyze these topics.

    1. Machine Learning, LSTM, Topic Modelling

Aqualculture Workflow
  1. Automated Monitoring System for Healthier Aquaculture Farming

    ACCAI 2023

    Achieved 88% dead fish detection accuracy using UAVs and DL models for real-time monitoring of aquaculture facilities.

    Addressed challenges of inclement weather and large pond coverage using drones equipped with night vision cameras.

    1. YOLOv5 variants, Alerting Mechanism, IoT Devices
    2. UAV surveillance system, tested in actual shrimp and fish farming facilities.

Pneumonia Workflow
  1. Attention based Discrimination of Mycoplasma Pneumonia

    ICCIDE 2021

    Implemented attention-based feature extraction to enhance classification of pneumonia, achieving high-dimensional feature entanglement.

    Unsupervised Generative Transfusion Network (UGTN) and transformers with 8-layer encoders and decoders.

    1. TensorFlow, PyTorch, OpenCV
    2. Keras, Python
    3. Dataset: COVID-19 Radiography Database
View More
  • More Publications

  • Letters to Vectors Visualization
    1. Journey of Letters to Vectors through Neural Networks

      ICDAM 2021

      A comprehensive survey of the evolution of Image Captioning from early models to advanced deep learning techniques.

      1. Neural Networks, Image Captioning, LSTM, CNN
      2. Transfer Learning, Attention mechanism, NLP
      3. Dataset: Image Captioning FLIKR 8K

PATENTS

Snake Detector Working
  1. Snake Detector and Alerting Gadget for Rural India Using Yolo

    Deployed an Autonomous Snake Detection Device capable of real-time visual recognition for rural India.

    Demonstrated expertise in integrating IoT devices with advanced object detection frameworks to design a cost-effective, reliable solution for use in low-light conditions.

    1. YOLOv5 variants, Alerting Mechanism
    2. Data Augmentation, Object Detection, IoT Integration

PyDigital Working
  1. Python Based Motion Sensing Digital Writing Pad

    Developed a cost-effective, non-touchscreen digital writing pad using motion-sensing techniques, aimed at providing accessible technology to students and educators in economically deprived regions.

    Autocorrect, spell check, customizable pen colors, eraser, and save options in JPEG or PDF format

    1. Python, OpenCV
View More
  • More Patents

  • Traffic Control Workflow
    1. A Traffic Control System

      An intelligent traffic control system that monitors and analyzes real-time traffic conditions.

      Takes into consideration emergency vehicle types, and weather conditions to optimize traffic signal timings and reduce congestion.

      Deep learning modules for real-time video analysis and traffic density calculation.

      1. Real Time Video Analysis, Efficient Workflow
      2. Idea Patent

  • System And Method To Extract And Analyse Textual Features From An Image
    1. System And Method To Extract And Analyse Textual Features From An Image

      Applied Gray-level Co-occurrence Matrix (GLCM) for detailed analysis of textual features from an image.

      Implemented self-update fuzzy clustering analysis technique for precise clustering and key-element extraction.

      1. Machine Learning: Grey Level Co-occurrence Matrix

Medium Articles

See some of my published articles. Writer@Teendifferent

> Send me an email or drop a message below to get to know me better!

Connect Now!

Contact Me

Get in touch

Email Me

kcharvi01@gmail.com

Leave a Message

Project Image