Skip to content
View Kas-sim's full-sized avatar
:dependabot:
Developing
:dependabot:
Developing

Highlights

  • Pro

Block or report Kas-sim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Kas-sim/README.md

Muhammad Qasim

Applied AI Engineer Β· Retrieval Systems (Search & RAG)

Typing SVG



LinkedIn Email


⚑ Engineering Philosophy

I engineer retrieval systems, not demos.

My work focuses on how information is:

  • ingested,
  • indexed,
  • retrieved,
  • ranked,
  • and safely exposed to downstream models.

I specialize in bridging classical Information Retrieval (IR) with modern Retrieval-Augmented Generation (RAG) β€” designing systems that remain correct under real-world constraints.

Core belief:

Language models are only as reliable as the retrieval systems feeding them.


πŸš€ Flagship Systems

MQNotebook β€” Enterprise RAG System

πŸ”— https://github.com/Kas-sim/MQNotebook
🌐 https://mqnotebook.streamlit.app/

A production-oriented RAG system designed for the messy reality of enterprise documents β€” not clean PDFs.

What it demonstrates

  • OCR-first ingestion (scanned PDFs, flattened text)
  • Structured parsing (PPTX speaker notes, Excel tables)
  • Hybrid retrieval (Vector search + Cross-Encoder reranking)
  • Local-first, BYOK security model
  • OS-level robustness (Windows file-lock mitigation)

This system exists to solve retrieval correctness, not prompt cleverness.


πŸ” DevShelf β€” Search Engine from First Principles

πŸ”— https://github.com/Kas-sim/DevShelf
πŸ“– https://kas-sim.github.io/systems/devshelf/

A classical vertical search engine for Computer Science literature, built without Lucene or ElasticSearch.

What it demonstrates

  • Positional inverted indices
  • Offline indexing vs online querying
  • TF-IDF and behavioral re-ranking
  • Deterministic, explainable retrieval pipelines

DevShelf forms the theoretical foundation behind my RAG work.


πŸ§ͺ Systems & Research Work

Project Focus Stack
BabyGPT Character-level language modeling from scratch Python TensorFlow LSTM
Sentiment Filter NLP edge cases (negation paradox) Python Scikit-Learn
MQ Banking Core Low-level transactional system C++ File I/O
Digital Eye CNN-based vision system TensorFlow Keras

These projects support my core specialization in retrieval and applied AI systems.


πŸ›οΈ Technical Arsenal

Retrieval & AI

Python RAG ChromaDB Transformers

Systems & Performance

Java C++ Bash

Environment

Linux Neovim Git


GitHub Streak

Pinned Loading

  1. DevShelf DevShelf Public

    A High-Performance Distributed Search Engine for CS Books built in Java. Features O(1) Inverted Indexing, Vector Space Ranking, and Cloud Streaming. No Lucene/ElasticSearch dependency.

    Java 4

  2. MQNotebook MQNotebook Public

    Enterprise-grade RAG and document search system for extracting reliable insights from real-world data.

    Python 1

  3. ML_Projects ML_Projects Public

    Jupyter Notebook 1

  4. Scripting Scripting Public

    Shell 16