I am an AI/ML Engineer and Computer Science undergrad at FAST-NUCES. I specialize in the full lifecycle of AI systemsβfrom training custom models to deploying them on constrained hardware.
- π Iβm currently working at Cowlar Design Studio (YC W17) optimizing industrial vision systems.
- π± Iβm currently researching Mobile Edge Inference and Quantization (TFLite/NNAPI) for my FYP, PokeVision.
- βοΈ I love GPU Optimization, writing custom CUDA kernels, and minimizing inference latency.
- π¬ Ask me about TensorRT, Triton Inference Server, FastAPI, and Mobile AI.
| Domain | Technologies |
|---|---|
| π§ AI & ML | |
| π Inference & Ops | |
| π± Edge & Mobile | |
| π₯οΈ Backend | |
| β‘ Languages |
π PokeVision (FYP)
Real-time Intelligent Mobile Dashcam System
- Tech: TFLite, NNAPI, Python, Flutter, FastAPI
- Impact: Achieved real-time lane detection (UFLDv2) on mobile via INT8 quantization and hybrid cloud/edge architecture.
Low-level GPU Optimization Implementation
- Tech: C++, CUDA, Matrix Math
- Impact: Wrote custom kernels for forward/backward propagation on MNIST, achieving 40x speedup over CPU.



