RAG Chatbot - Modular Architecture

A modular RAG (Retrieval-Augmented Generation) chatbot with FastAPI backend and Streamlit frontend. Built with a plugin architecture that makes it easy to swap LLMs, embeddings, vector stores, and document loaders.

Architecture

The system is built with a modular, factory-based architecture:

Backend (FastAPI): RESTful API handling all business logic
Frontend (Streamlit): User-friendly chat interface
Core Interfaces: Abstract base classes for all components
Implementations: Pluggable providers for LLMs, embeddings, vector stores, and loaders

Project Structure

rag_chatbot/
├── backend/
│   ├── core/                    # Abstract interfaces
│   ├── implementations/         # Concrete implementations
│   │   ├── loaders/            # Document loaders (PDF, DOCX, CSV)
│   │   ├── embeddings/         # Embedding providers (Ollama, OpenAI, HF)
│   │   ├── vector_stores/      # Vector stores (FAISS, Chroma, Pinecone)
│   │   └── llms/               # LLM providers (Ollama, OpenAI, Anthropic)
│   ├── api/                    # API routes and models
│   ├── utils/                  # Utilities
│   ├── config.py              # Configuration
│   ├── main.py                # FastAPI app
│   ├── run.py                 # Server entry point
│   └── requirements.txt       # Backend dependencies
├── frontend/
│   ├── components/            # UI components
│   ├── services/              # Communication with backend
│   └── app.py                 # Streamlit app
├── .env.example
├── .gitignore
├── LICENSE
└── README.md

Quick Start

1. Installation

# Clone the repository
git clone <repository_url>
cd rag_chatbot

# Install dependencies
pip install -r backend/requirements.txt

# Copy environment variables
# Copy environment variables
cp .env.example .env

2. Configure Environment

Edit .env (in the project root) with your settings:

For local LLMs: Ensure Ollama is running
For cloud LLMs: Add your API keys (OpenAI, Anthropic, etc.)

3. Start Backend

# Using uvicorn from project root (recommended)
uvicorn backend.main:app --reload --host 127.0.0.1 --port 8000


# Or via python module
python -m backend.main

# Or via run script
python backend/run.py

4. Start Frontend

# In a new terminal
streamlit run frontend/app.py

Adding New Components

Adding a New LLM Provider

Create a new file in backend/implementations/llms/:

from ...core.llm import LLMProvider
from typing import Generator

class MyLLM(LLMProvider):
    def __init__(self, model: str, **kwargs):
        self.model = model
        # Your initialization
    
    def generate(self, prompt: str, **kwargs) -> str:
        # Your implementation
        pass
    
    def stream(self, prompt: str, **kwargs) -> Generator[str, None, None]:
        # Your streaming implementation
        pass
    
    def get_model_name(self) -> str:
        return self.model

Register it in backend/main.py:

from .implementations.llms.my_llm import MyLLM
LLMFactory.register_provider("my_provider", MyLLM)

Adding a New Document Loader

Create a new file in backend/implementations/loaders/:

from ...core.document_processor import DocumentLoader, Document
from typing import List, BinaryIO

class MyLoader(DocumentLoader):
    supported_extensions = ['.xyz']
    
    def load(self, file: BinaryIO, filename: str) -> List[Document]:
        # Your loading logic
        pass
    
    def supports_file_type(self, filename: str) -> bool:
        return any(filename.lower().endswith(ext) for ext in self.supported_extensions)

Register it in backend/main.py:

from .implementations.loaders.my_loader import MyLoader
DocumentProcessorFactory.register_loader(MyLoader())

Adding a New Embedding Provider

Create a new file in backend/implementations/embeddings/:

from ...core.embeddings import EmbeddingProvider
from typing import List

class MyEmbeddings(EmbeddingProvider):
    def __init__(self, model: str, **kwargs):
        self.model = model
    
    def embed_documents(self, texts: List[str]) -> List[List[float]]:
        # Batch embedding logic
        pass
    
    def embed_query(self, text: str) -> List[float]:
        # Single embedding logic
        pass
    
    def get_dimension(self) -> int:
        # Return embedding dimension
        pass

Register it in backend/main.py:

from .implementations.embeddings.my_embeddings import MyEmbeddings
EmbeddingFactory.register_provider("my_embeddings", MyEmbeddings)

Adding a New Vector Store

Create a new file in backend/implementations/vector_stores/:

from ...core.vector_store import VectorStore
from ...core.document_processor import Document
from typing import List, Tuple

class MyVectorStore(VectorStore):
    def __init__(self, dimension: int):
        self.dimension = dimension
    
    def add_documents(self, documents: List[Document], embeddings: List[List[float]]):
        pass
    
    def similarity_search(self, query_embedding: List[float], k: int = 4) -> List[Tuple[Document, float]]:
        pass
    
    def clear(self):
        pass
    
    def get_count(self) -> int:
        pass

Register it in backend/main.py:

from .implementations.vector_stores.my_store import MyVectorStore
VectorStoreFactory.register_store("my_store", MyVectorStore)

Features

Modular Architecture: Easy to extend and customize
Multiple LLM Support: Ollama, OpenAI, Anthropic (easily extensible)
Multiple Embedding Providers: Ollama, OpenAI, HuggingFace
Vector Store Options: FAISS, Chroma, Pinecone
Document Loaders: PDF (easily add DOCX, CSV, TXT, etc.)
Streaming Responses: Real-time chat experience
Session Management: Multiple chat sessions
RAG Toggle: Switch between RAG and normal chat
Configurable Chunking: Adjust chunk size and overlap
RESTful API: Well-documented endpoints

API Endpoints

POST /api/sessions - Create a new chat session
POST /api/sessions/{session_id}/upload - Upload a document
POST /api/sessions/{session_id}/process - Process document into vector store
POST /api/chat - Send a message (with streaming support)
GET /api/config - Get current configuration
PUT /api/config - Update configuration

Configuration Options

You can configure the system through:

Environment Variables (.env file)
Runtime API Calls (PUT /api/config)
Streamlit UI (Configuration sidebar)

Available settings:

LLM provider and model
Embedding provider and model
Vector store type
Chunk size and overlap
API keys for cloud providers

Security Notes

Never commit your .env file
Use environment variables for API keys
Consider authentication for production deployments
Validate and sanitize file uploads

Contributing

To add support for new providers:

Implement the appropriate interface from backend/core/
Add your implementation to backend/implementations/
Register it in backend/main.py
Update this README

License

MIT License - feel free to use in your projects!

Troubleshooting

Backend won't start:

Ensure all dependencies are installed
Check that ports 8000 is available
Verify Ollama is running (if using local LLMs)

Streamlit can't connect:

Ensure backend is running (http://localhost:8000/health should return status)
Check CORS settings in backend/main.py

Document processing fails:

Verify the document format is supported
Check chunk size settings
Ensure embedding provider is configured correctly

Next Steps

Consider adding:

SQL database support for document metadata
More document formats (DOCX, TXT, CSV, JSON)
Persistent storage for vector databases
User authentication
Multi-user support
Document management UI
Advanced RAG techniques (hybrid search, re-ranking)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Chatbot - Modular Architecture

Architecture

Project Structure

Quick Start

1. Installation

2. Configure Environment

3. Start Backend

4. Start Frontend

Adding New Components

Adding a New LLM Provider

Adding a New Document Loader

Adding a New Embedding Provider

Adding a New Vector Store

Features

API Endpoints

Configuration Options

Security Notes

Contributing

License

Troubleshooting

Next Steps

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
backend		backend
frontend		frontend
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

RAG Chatbot - Modular Architecture

Architecture

Project Structure

Quick Start

1. Installation

2. Configure Environment

3. Start Backend

4. Start Frontend

Adding New Components

Adding a New LLM Provider

Adding a New Document Loader

Adding a New Embedding Provider

Adding a New Vector Store

Features

API Endpoints

Configuration Options

Security Notes

Contributing

License

Troubleshooting

Next Steps

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages