AI / ML Engineer β’ Computer Vision β’ Generative AI
Building practical AI systems, one focused iteration at a time.
Building intelligent systems that see, understand, and create.
I am a Machine Learning Engineer focused on Computer Vision and Agentic AI, with a strong foundation in scalable backend systems. My engineering philosophy revolves around translating complex research papers into optimized, production-ready code.
- π― Focus: Bypassing computational bottlenecks in high-resolution (4K) object detection using Explainable AI (XAI).
- π€ AI Engineering: Building local LLM agents that seamlessly interact with third-party ecosystems (Google APIs, etc.).
- βοΈ Infrastructure: Architecting robust database migrations and building backend profilers.
- π‘ Goal: I build systems that are not just intelligent, but fast, scalable, and resilient.
β‘ PixelQueue
A sleek, dark-themed control panel designed for decoupled ML microservices and robust task queues, eliminating UX bottlenecks with pure speed and instantaneous rendering. Key Innovations:
|
A novel coarse-to-fine computer vision pipeline designed for efficient small object detection in high-resolution (2K/4K) aerial imagery. Tackles the critical trade-off between resolution and latency in drone forensics. Key Innovations:
|
π¨ Neural Canvas
A fast neural style transfer implementation that generates stylized images using a feed-forward CNN trained with perceptual loss. Performs instant stylization in a single forward pass. Key Features:
|
|
π§ pygog (Google CLI Agent) |
π Depth Estimation + Semantic Seg. |
I actively contribute to the broader developer ecosystem, with recent merged work spanning agent frameworks, AI infrastructure, developer tooling, and performance-focused ML apps:
- π€ SynapseKit/SynapseKit: Shipped 22 merged PRs covering native observability, VoiceAgent audio pipelines, graph-builder tooling, benchmark suites, CronTrigger scheduling, self-healing cost-aware agents, persistent agent memory, multimodal RAG ingestion, and new cloud/data loaders plus local/self-hosted model integrations.
- β‘ DhruvGarg111/PixelQueue: Landed 10 merged PRs focused on annotation-platform performance, including Zustand history optimizations, React memoization boundaries, callback stabilization, faster YOLO export formatting, and database/Celery fixes that remove N+1 insert bottlenecks.
- π¨ DhruvGarg111/Neural-Style-Transfer: Merged 5 improvements that sharpen both UX and inference efficiency, including clearer style-selection flows, in-place tensor operations, reflect-padding convolution simplifications, and modern Pillow compatibility fixes.
- π§ DhruvGarg111/py-goog-cli: Added safer and more reliable Google Workspace CLI workflows with a Drive query-injection fix,
--dry-runsupport for destructive actions, stronger config/output test coverage, and targeted cleanup. - π lancedb/lancedb: Updated LanceDBβs Python Gemini embedding provider to the newer
google-genaiSDK, keeping vector-search integrations aligned with Googleβs latest API stack.



