Skip to content
View dhruvsh1997's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report dhruvsh1997

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dhruvsh1997/README.md

Myself Dhruv Sharma, Welcome to my Profile πŸ‘‹

πŸš€ Senior AI/ML Engineer | LLM & Agentic AI Specialist | MLOps | Deep Learning | Computer Vision | Fine-Tuning | CI/CD

πŸ“ Senior AI Researcher & Developer β€” 5+ years architecting production-grade AI systems across healthcare, EdTech, legal, finance, and sales intelligence domains. Currently leading 8 concurrent enterprise AI/ML engagements at Chetu Inc.

Profile Views


πŸ“Š Dashboard Overview


πŸ”Ή About Me

  • πŸŽ“ Senior AI/ML Engineer with deep expertise in Agentic AI, LLM/SLM Fine-Tuning, RAG Pipelines, and MLOps
  • πŸ† Promoted twice in 18 months at Chetu Inc. β€” Team Member β†’ Senior TM β†’ Technical Team Lead
  • πŸ’» Skilled in LangChain, LangGraph, Google ADK, A2A, MCP, Unsloth QLoRA, LoRA, vLLM, RAGAS, LangSmith, Ollama
  • πŸ€– Experienced in On-Prem LLM Deployment (LLaMA 3.2 70B, 94% GPT-4o parity), Multi-Agent Orchestration, and Context Engineering
  • πŸš€ Built production AI systems across healthcare, EdTech, legal, finance, recruitment, and equestrian domains
  • πŸ“ˆ Expert in MLOps β€” MLflow, Docker, DVC, AWS EKS, GitHub Actions, GitLab CI/CD
  • πŸ›  Passionate about intelligent system design, enterprise AI integration, and measurable AI outcomes
  • πŸ“š Always learning & building scalable AI pipelines with real-world impact

πŸ† Key AI Projects & Contributions

1️⃣ LangGraph Multi-Agent Ticket Concierge (Enterprise β€” Chetu Inc.)

  • Built a GPT-4o + LangGraph multi-agent system integrated with Automatiq B2B API for intelligent ticket routing.
  • Designed an intent-routing state machine with a regex JSON sanitizer achieving 98% routing accuracy across 320+ test cases including adversarial and E2E journeys.
  • Stack: LangGraph, GPT-4o, Automatiq API, Python

2️⃣ AI Tutor with Animated Avatars (Enterprise β€” EdTech)

  • Developed a real-time AI tutoring system with Whisper STT, OpenAI TTS-1-HD, and Weaviate Hybrid RAG (BM25 + Vector search).
  • Implemented grade-aware LLM-as-Judge guardrails with WebRTC sub-300ms latency.
  • Deployed on Amazon EKS with multi-stage Docker builds.
  • Stack: Whisper, OpenAI TTS, Weaviate, WebRTC, AWS EKS, Docker

3️⃣ IRS Section 125 Benefits Compliance Agent (Enterprise β€” Legal/Finance)

  • Architected a 3-node LangGraph pipeline: Supervisor β†’ DB Extraction β†’ Self-RAG Vault with dual-layer guardrails (keyword + LLM).
  • Achieved 98% routing accuracy and 100% guardrail block rate across 500+ validation tests.
  • Stack: LangGraph, Self-RAG, Guardrails AI, Python

4️⃣ Equestrian AI Companion v2 (Enterprise)

  • Built a 7-node LangGraph StateGraph with Qdrant hierarchical memory (7 streams, MD5 dedup) and a 4-layer retrieval cascade.
  • Integrated an LLM quality judge (0.75 threshold), improving retry accuracy by 30%.
  • Stack: LangGraph, Qdrant, GPT-4o, Python

5️⃣ Agentic AI-Based Self-RAG Chatbot with Role-Based Authentication

  • Developed an Advanced Self-RAG chatbot using LangChain, LangGraph, MLflow, and Django.
  • Supports multi-modal query handling (text, images, tables, flowcharts) extracted from PDFs.
  • Achieved 87% answer accuracy on 100 human-evaluated Q&A pairs; BGE-M3 embeddings (nDCG 0.91); RBAC with 100% enforcement.
  • Integrated Web Scraping with Selenium for real-time referenced data.
  • Stack: LangChain, LangGraph, BGE-M3, ChromaDB, Django, MLflow

6️⃣ InvoiceIQ β€” PageIndex Vectorless RAG (PoC)

  • Eliminated vector DB overhead entirely using JSON/in-memory cache (PageIndex approach).
  • Built a dual-LLM pipeline (GPT-4o Vision + GPT-4o Text) with Gmail/IMAP integration for automated invoice extraction.
  • Stack: Flask, SQLite, LangChain, GPT-4o Vision, PageIndex

7️⃣ Agentic PPT Architect (PoC)

  • Designed a LangGraph multi-agent PPTX generator: Master Architect β†’ Quantitative Analyst (Matplotlib charts) β†’ Image Node (DALL-E 3) β†’ Layout Reconciliation.
  • Implemented AABB collision detection and dynamic font-scaling for polished slide layouts.
  • Stack: LangGraph, GPT-4o, DALL-E 3, python-pptx

8️⃣ Google ADK A2A Multi-Agent Pipeline (PoC)

  • Built a SequentialAgent pipeline: FintechAnalyst (Gemini 2.5 Flash) β†’ LogisticsExpert (LLaMA 3.3/Groq) using Agent-to-Agent (A2A) protocol.
  • Developed a Kafka KRaft FinBERT trading pipeline with BUY/SELL rules and a WebSocket real-time dashboard.
  • Stack: Google ADK, A2A, Gemini 2.5 Flash, LLaMA 3.3, Groq, Kafka, FinBERT, WebSocket

9️⃣ On-Prem LLM Infrastructure β€” LLaMA 3.2 70B (Enterprise β€” Chetu Inc.)

  • Deployed LLaMA 3.2 70B with 4-bit NF4 (bitsandbytes + Flash Attention 2) on 80GB GPU; achieved 94% GPT-4o parity while eliminating API costs.
  • Built vLLM/Ollama OpenAI-compatible wrapper with GitHub Actions CI/CD for hot-swappable model iterations.
  • Fine-tuned SLM (Llama 3.2 3B) via Unsloth QLoRA β€” 60% GPU memory reduction, 2Γ— training speed; improved nDCG 73%β†’82%, MRR 68β†’71.
  • Stack: LLaMA 3.2 70B, vLLM, Ollama, Unsloth, QLoRA, bitsandbytes, Flash Attention 2, GitHub Actions

πŸ”Ÿ Multi-Tenant Veterinary SaaS (Enterprise)

  • Built a Fusion RAG system (Pinecone + FAISS/SerpAPI) with MMR and cross-encoder reranking (ms-marco-MiniLM).
  • Reduced report generation time by 60% (270s β†’ 100s) via asyncio.gather; successfully stress-tested for 50 concurrent users.
  • Integrated Stripe Connect for multi-tenant billing.
  • Stack: Pinecone, FAISS, MiniLM, Django, Celery, Stripe Connect

1️⃣1️⃣ Computer Vision & Generative AI Models

  • Built Text-to-Image generation models with Stable Diffusion, Pix2Pix, and Transformer-based architectures.
  • Spatio-temporal crowd forecasting using YOLOv8 + CNN-LSTM, Gaussian density maps (ShanghaiTech dataset).
  • AI-driven Image Captioning using BLIP2 and vision-language models; real-time object detection with Detectron2.
  • U-Net + DLNN brain tumor segmentation/classification web app; ResNet-V3 plant disease detection REST API.
  • Stack: YOLO, Detectron2, Stable Diffusion, BLIP2, U-Net, ResNet, PyTorch, OpenCV

1️⃣2️⃣ Meeting Video Processing & AI-Based Report Generation

  • AI-powered meeting analytics system using YOLO (face detection), WhisperX (transcription, WER <8%), T5 (summarization, 92% grammar correction), Llama 3.3 (context analysis).
  • Implemented Emotion and Sentiment Detection; automated Report Generation via Google Cloud API.
  • Achieved 99% task reliability across 100 pipeline runs using Django Celery + RabbitMQ.
  • Stack: YOLO, WhisperX, T5, Llama 3.3, Django, Celery, RabbitMQ, Streamlit

1️⃣3️⃣ Web Scraping & Data Intelligence

  • Developed comprehensive web scraping frameworks using Selenium, BeautifulSoup, and Scrapy.
  • Created intelligent data collection pipelines with cleaning and transformation techniques.
  • Implemented ethical scraping practices (robots.txt compliance).

1️⃣4️⃣ NLP Research & Deep Learning Models

  • BERT Autoencoder for query-based data extraction (ontology + PCA + WaOA clustering).
  • NL-to-SQL pipeline using Seq2SQL + PSO for natural language database querying.
  • Neo4j intent classification for graph-based reasoning; improved DL model accuracy 80% β†’ 90%.
  • Stack: BERT, RoBERTa, T5, Seq2SQL, Neo4j, PyTorch, Scikit-learn

πŸ›  Tech Stack & Tools

Category Technologies
Languages Python JavaScript SQL
Agentic AI & Orchestration LangChain LangGraph Google ADK MCP A2A
LLMs & Generative AI GPT-4o Gemini LLaMA Groq Ollama vLLM DALL-E 3
Fine-Tuning & Embeddings Unsloth LoRA HuggingFace BGE-M3 bitsandbytes
RAG & Vector Stores Pinecone Qdrant Weaviate FAISS ChromaDB Redis Vector
Computer Vision YOLO Detectron2 OpenCV Stable Diffusion BLIP2
ML & Deep Learning PyTorch Scikit-learn T5 FinBERT
Web Frameworks Django FastAPI Flask Streamlit
Messaging & Streaming Kafka RabbitMQ Celery WebSocket WebRTC
Databases PostgreSQL Redis Neo4j SQLite
Data Processing Pandas NumPy PySpark Polars
MLOps & Deployment MLflow Docker DVC GitHub Actions GitLab CI/CD
Cloud AWS EKS Azure GCP

πŸ’Ό Professional Experience

Role Company Period
Technical Team Lead β€” AI/ML Chetu Inc. Sep 2025 – Present
Sr. Software Developer β€” AI/ML Chetu Inc. Mar 2025 – Sep 2025
Software Developer β€” AI/ML Chetu Inc. Mar 2024 – Mar 2025
ML/AI Engineer & Data Scientist Research Developers Sep 2021 – Mar 2024

πŸ“Š GitHub Contribution Activity

Dhruv's Activity Graph


πŸ“« Connect with Me


🎯 "Transforming Industries with Intelligent AI Solutions" πŸš€

Popular repositories Loading

  1. DeepLearning-Approach-of-Intrusion_Detection_against_IoT_Attacks-using-Optimized-DeepLearningModel DeepLearning-Approach-of-Intrusion_Detection_against_IoT_Attacks-using-Optimized-DeepLearningModel Public

    Implementation is enhancing security against cyberattacks by utilizing hybrid optimization (ABC+SCA) to optimize a CNLSTM neural network for intrusion detection. Strengthen IoT device protection an…

    Jupyter Notebook 3

  2. LangchainExperiments LangchainExperiments Public

    This repository contains hands-on, modular notebooks designed to explore and demonstrate key components of the LangChain framework β€” from document loaders and splitters to embedding models, vector …

    Jupyter Notebook 3

  3. Polar_Code_Decoding Polar_Code_Decoding Public

    Decoding using Polar Codes in Python

    Jupyter Notebook 1

  4. Advance-DBScan_Clustering_4_DrugRecommedationSystem-Using-Deep-Learning---Collaborative_Filtering Advance-DBScan_Clustering_4_DrugRecommedationSystem-Using-Deep-Learning---Collaborative_Filtering Public

    This project develops a drug recommendation system using sentiment analysis of reviews. It employs Collaborative Filtering, Novel ADBScan Clustering, and a Deep Learning approach of BiLSTM with GWO…

    Jupyter Notebook 1

  5. RAG_MultiDoc_Chatbot RAG_MultiDoc_Chatbot Public

    A Langchain based Multi Document Chatbot created in Django, where you can upload number of document and ask questions from any of them.

    Python 1

  6. PromptEngineering_UsingOpenAI_GPT4o PromptEngineering_UsingOpenAI_GPT4o Public

    Each section will feature markdown cells for descriptions and code cells for execution, ensuring an interpretive approach where inputs and outputs are clearly displayed and explained.

    Jupyter Notebook 1