Skip to content

[BUG]Β #922

@adarshmadrecha

Description

@adarshmadrecha

πŸ› Bug Description

A clear and concise description of what the bug is.

πŸ”„ Steps to Reproduce

  1. Go to '...'
  2. Click on '...'
  3. Scroll down to '...'
  4. See error

βœ… Expected Behavior

A clear and concise description of what you expected to happen.

❌ Actual Behavior

When chatting with knowledge base, I get below error in the browser console.

STREAM EVENT: error 
Object { error: "'ascii' codec can't encode character '\\U0001f527' in position 0: ordinal not in range(128)" }
​
error: "'ascii' codec can't encode character '\\U0001f527' in position 0: ordinal not in range(128)"
​
<prototype>: Object { … }
session-chat.tsx:257:21

πŸ“Έ Screenshots

Image

πŸ–₯️ Environment Information

Desktop/Server:

  • OS: Ubuntu 24.04,
  • Python Version: 3.11.7
  • Node.js Version: 24.3
  • Ollama Version: 11.0
  • Docker Version: NA

Browser (if web interface issue):

  • Browser: Firefox

πŸ“‹ System Health Check

(base) msml@msml:/media/msml/ssd1/adarshtest/localGPT$ python system_health_check.py
πŸ₯ RAG System Health Check
==================================================
πŸ” Testing basic imports...
βœ… Basic imports successful
πŸ” Checking configurations...
πŸ“Š External Models: {'embedding_model': 'Qwen/Qwen3-Embedding-0.6B', 'reranker_model': 'answerdotai/answerai-colbert-small-v1', 'vision_model': 'Qwen/Qwen-VL-Chat', 'fallback_reranker': 'BAAI/bge-reranker-base'}
πŸ“Š Ollama Config: {'host': 'http://localhost:11434', 'generation_model': 'qwen3:8b', 'enrichment_model': 'qwen3:0.6b'}
πŸ“Š Pipeline Configs: {'default': {'description': 'Production-ready pipeline with hybrid search, AI reranking, and verification', 'storage': {'lancedb_uri': './lancedb', 'text_table_name': 'text_pages_v3', 'image_table_name': 'image_pages_v3', 'bm25_path': './index_store/bm25', 'graph_path': './index_store/graph/knowledge_graph.gml'}, 'retrieval': {'retriever': 'multivector', 'search_type': 'hybrid', 'late_chunking': {'enabled': True, 'table_suffix': '_lc_v3'}, 'dense': {'enabled': True, 'weight': 0.7}, 'bm25': {'enabled': True, 'index_name': 'rag_bm25_index'}, 'graph': {'enabled': False, 'graph_path': './index_store/graph/knowledge_graph.gml'}}, 'embedding_model_name': 'Qwen/Qwen3-Embedding-0.6B', 'vision_model_name': 'Qwen/Qwen-VL-Chat', 'reranker': {'enabled': True, 'type': 'ai', 'strategy': 'rerankers-lib', 'model_name': 'answerdotai/answerai-colbert-small-v1', 'top_k': 10}, 'query_decomposition': {'enabled': True, 'max_sub_queries': 3, 'compose_from_sub_answers': True}, 'verification': {'enabled': True}, 'retrieval_k': 20, 'context_window_size': 0, 'semantic_cache_threshold': 0.98, 'cache_scope': 'global', 'contextual_enricher': {'enabled': True, 'window_size': 1}, 'indexing': {'embedding_batch_size': 50, 'enrichment_batch_size': 10, 'enable_progress_tracking': True}}, 'fast': {'description': 'Speed-optimized pipeline with minimal overhead', 'storage': {'lancedb_uri': './lancedb', 'text_table_name': 'text_pages_v3', 'image_table_name': 'image_pages_v3', 'bm25_path': './index_store/bm25'}, 'retrieval': {'retriever': 'multivector', 'search_type': 'vector_only', 'late_chunking': {'enabled': False}, 'dense': {'enabled': True}}, 'embedding_model_name': 'Qwen/Qwen3-Embedding-0.6B', 'reranker': {'enabled': False}, 'query_decomposition': {'enabled': False}, 'verification': {'enabled': False}, 'retrieval_k': 10, 'context_window_size': 0, 'contextual_enricher': {'enabled': False, 'window_size': 1}, 'indexing': {'embedding_batch_size': 100, 'enrichment_batch_size': 50, 'enable_progress_tracking': False}}, 'bm25': {'enabled': True, 'index_name': 'rag_bm25_index'}, 'graph_rag': {'enabled': False}}
πŸ” Embedding model: Qwen/Qwen3-Embedding-0.6B (1024 dims) - Check data compatibility!
βœ… Configuration check completed
πŸ” Testing database access...
βœ… LanceDB connected - 2 tables available
πŸ“‹ Available tables:
   - text_pages_2eb70151-f2df-4d1d-b594-681979c8a5f6
   - text_pages_2eb70151-f2df-4d1d-b594-681979c8a5f6_lc
πŸ” Testing agent initialization...
Initialized Verifier with Ollama model 'qwen3:8b'.
Agent initialized (GraphRAG disabled).
βœ… Agent initialization successful
πŸ” Testing embedding model...
Initializing HF Embedder with model 'Qwen/Qwen3-Embedding-0.6B' on device 'cuda'. (first load)
QwenEmbedder weights loaded and cached for Qwen/Qwen3-Embedding-0.6B.
Generating 1 embeddings with Qwen/Qwen3-Embedding-0.6B model...
βœ… Embedding model: Qwen/Qwen3-Embedding-0.6B
βœ… Vector dimension: 1024
πŸ” Using 1024-dim embeddings (Qwen3 compatible) - Ensure data compatibility!
πŸ” Testing sample query...
πŸ” Testing query on table: text_pages_2eb70151-f2df-4d1d-b594-681979c8a5f6
πŸ” ROUTING DEBUG: Starting triage for query: 'what is this document about?...'
πŸ“– ROUTING DEBUG: Attempting overview-based routing...
πŸ“– ROUTING DEBUG: No document overviews available, returning None
❌ ROUTING DEBUG: Overview routing returned None, falling back to LLM triage
πŸ€– ROUTING DEBUG: No history, using LLM fallback triage...
πŸ€– ROUTING DEBUG: LLM fallback triage decided: 'rag_query'
🎯 ROUTING DEBUG: Final triage decision: 'rag_query'
Agent Triage Decision: 'rag_query'
Generating 1 embeddings with Qwen/Qwen3-Embedding-0.6B model...
βœ… ROUTING DEBUG: Executing RAG_QUERY path (query_type='rag_query')

--- Query Decomposition Enabled ---
Query Decomposition Reasoning: Single information need; no pronouns or ambiguous references.
Original query: 'what is this document about?' (Contextual: 'what is this document about?')
Decomposed into 1 sub-queries: ['what is this document about?']
--- Only one sub-query after decomposition; using direct retrieval path ---
LanceDB connection established at: ./lancedb

--- Performing Retrieval for query: 'what is this document about?' on table 'text_pages_2eb70151-f2df-4d1d-b594-681979c8a5f6' ---
Generating 1 embeddings with Qwen/Qwen3-Embedding-0.6B model...
2025-09-06 14:09:40,405 | INFO     | rag-system | Top 20 results:
2025-09-06 14:09:40,405 | INFO     | rag-system | chunk_id       score   preview
2025-09-06 14:09:40,405 | INFO     | rag-system | ------------------------------
2025-09-06 14:09:40,405 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The context highlights that the form for the final…
2025-09-06 14:09:40,405 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The context highlights the GOST compensation…
2025-09-06 14:09:40,406 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The context summarizes the Finance Act 2024's…
2025-09-06 14:09:40,406 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The chunk outlines the proper officer's role in…
2025-09-06 14:09:40,406 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The local context highlights digital signature…
2025-09-06 14:09:40,406 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The chunk discusses registration periods,…
2025-09-06 14:09:40,406 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The context summary is about the Authority for…
2025-09-06 14:09:40,406 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The context summary is: The chunk discusses the…
2025-09-06 14:09:40,407 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The context summary is about sharing details under…
2025-09-06 14:09:40,407 | INFO     | rag-system | a4e76ee3-6c3 0.000   Context: The chunk discusses regulations for handling…
2025-09-06 14:09:40,407 | INFO     | rag-system | f660d12c-80f 7056.611 Context: The context summarizes the specific goods listed…
2025-09-06 14:09:40,407 | INFO     | rag-system | f660d12c-80f 7536.421 Context: The context summarizes that electronic…
2025-09-06 14:09:40,407 | INFO     | rag-system | a4e76ee3-6c3 7635.182 Context: The specific chunk discusses the Screening…
2025-09-06 14:09:40,407 | INFO     | rag-system | f660d12c-80f 7964.705 Context: The context summary is: A company transferring…
2025-09-06 14:09:40,407 | INFO     | rag-system | f660d12c-80f 8244.507 Context: The context summarizes that the composition tax…
2025-09-06 14:09:40,408 | INFO     | rag-system | a4e76ee3-6c3 8336.688 Context: The context outlines the structure and composition…
2025-09-06 14:09:40,408 | INFO     | rag-system | a4e76ee3-6c3 8365.038 Context: The chunk discusses rectifying tax details under…
2025-09-06 14:09:40,408 | INFO     | rag-system | f660d12c-80f 8530.757 Context: The context summary is about refund procedures for…
2025-09-06 14:09:40,408 | INFO     | rag-system | a4e76ee3-6c3 8545.204 Context: The chunk clarifies that the aggregate value of…
2025-09-06 14:09:40,408 | INFO     | rag-system | a4e76ee3-6c3 8654.028 Context: The chunk discusses the Central Government's…
Retrieved 20 documents.
πŸ”§ Initialising Answer.AI ColBERT reranker (answerdotai/answerai-colbert-small-v1) via rerankers lib…
Loading ColBERTRanker model answerdotai/answerai-colbert-small-v1 (this message can be suppressed by setting verbose=0)
No device set
Using device cuda
No dtype set
Using dtype torch.float32
Loading model answerdotai/answerai-colbert-small-v1, this might take a while...
tokenizer_config.json: 1.24kB [00:00, 11.3MB/s]
vocab.txt: 232kB [00:00, 19.1MB/s]
tokenizer.json: 711kB [00:00, 103MB/s]
special_tokens_map.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 695/695 [00:00<00:00, 15.6MB/s]
config.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 702/702 [00:00<00:00, 13.6MB/s]
model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 134M/134M [00:09<00:00, 13.7MB/s]
Linear Dim set to: 96 for downcasting
βœ… AI reranker initialized successfully.

--- Reranking top 20 docs with AI model... ---
❌ Sample query failed: CUDA out of memory. Tried to allocate 16.00 MiB. GPU 0 has a total capacity of 23.65 GiB of which 12.50 MiB is free. Process 38427 has 17.54 GiB memory in use. Including non-PyTorch memory, this process has 1.75 GiB memory in use. Process 128744 has 4.33 GiB memory in use. Of the allocated memory 1.27 GiB is allocated by PyTorch, and 27.98 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation.  See documentation for Memory Management  (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables)

==================================================
πŸ₯ Health Check Complete: 5/6 checks passed
πŸ” System mostly healthy with minor issues

πŸ”§ Configuration

  • Deployment method: Direct Python
  • Models used: qwen3:0.6b, qwen3:8b
  • Document types: PDF

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions