Description:
Integrate a local LLaMA 3 model as the chatbot's main brain. Request flow: user message → backend → session context → LLaMA 3 → response → stored in MongoDB → returned to UI.
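The flow above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's actual code: `InMemoryStore` stands in for the MongoDB collection, `echo_model` stands in for the local LLaMA 3 runtime, and every name here (`build_prompt`, `handle_chat_message`, `max_turns`, the plain `role: content` prompt layout) is hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical type: any callable that maps a prompt string to generated text.
# In the real service this would wrap the local LLaMA 3 runtime.
GenerateFn = Callable[[str], str]


@dataclass
class InMemoryStore:
    """Stand-in for the MongoDB collection: session_id -> list of turns."""
    sessions: Dict[str, List[dict]] = field(default_factory=dict)

    def history(self, session_id: str) -> List[dict]:
        # Create an empty history the first time a session is seen.
        return self.sessions.setdefault(session_id, [])

    def append(self, session_id: str, role: str, content: str) -> None:
        self.history(session_id).append({"role": role, "content": content})


def build_prompt(history: List[dict], user_message: str, max_turns: int = 6) -> str:
    """Inject the most recent turns as context ahead of the new message.

    The "role: content" layout is an assumed minimal format, not the
    official LLaMA 3 chat template.
    """
    lines = [f"{turn['role']}: {turn['content']}" for turn in history[-max_turns:]]
    lines.append(f"user: {user_message}")
    lines.append("assistant:")
    return "\n".join(lines)


def handle_chat_message(
    store: InMemoryStore,
    generate: GenerateFn,
    session_id: str,
    user_message: str,
) -> str:
    """User message -> session context -> model -> stored -> returned."""
    history = store.history(session_id)
    prompt = build_prompt(history, user_message)
    reply = generate(prompt).strip()
    # Persist both sides of the exchange so the next request sees them.
    store.append(session_id, "user", user_message)
    store.append(session_id, "assistant", reply)
    return reply


def echo_model(prompt: str) -> str:
    """Placeholder model: reports how many prompt lines it received."""
    return f"(stub reply to {len(prompt.splitlines())} prompt lines)"


if __name__ == "__main__":
    store = InMemoryStore()
    print(handle_chat_message(store, echo_model, "demo-session", "Hello"))
    print(handle_chat_message(store, echo_model, "demo-session", "And again"))
```

Passing the model in as a plain callable keeps the pipeline testable without loading any weights; swapping `echo_model` for a real inference function and `InMemoryStore` for a MongoDB-backed store should not require changes to `handle_chat_message`.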
User Story
Given a user sends a chat message
When the backend processes the request
Then LLaMA 3 should generate a contextual response
Tasks
Model Setup
Install Model Runtime
Load LLaMA 3
Backend Integration
Create Model Service Layer
/app/services/model_service.py
Define Inference Function
Prompt Engineering (Minimal)
Build Prompt Format
Context Injection
Chat Pipeline Integration
Connect with Message Router
/chat/message
Return Model Response
Session Awareness
Use Session Context
Update Session After Response
Performance Optimization
Streaming Response (Optional Upgrade)
Error Handling
Logging & Debugging
Postman Testing 🧪
/chat/message endpoint
Frontend Integration
Acceptance Criteria
Testing Steps
Definition of Done