Get update from wip into llm/api-tool-calling#165
Get update from wip into llm/api-tool-calling#165nuwangeek wants to merge 19 commits intollm/api-tool-callingfrom
Conversation
… ckb_integration_for_data_sync
* remove unwanted file * updated changes * fixed requested changes * fixed issue * service workflow implementation without calling service endpoints * fixed requested changes * fixed issues * protocol related requested changes * fixed requested changes * update time tracking * added time tracking and reloacate input guardrail before toolclassifiier * fixed issue * fixed issue * added hybrid search for the service detection * update tool classifier * fixing merge conflicts * fixed issue * optimize first user query response generation time * fixed pr reviewed issues * service integration * context based response generation flow * fixed pr review suggested issues * removed service project layer * fixed issues * delete unnessary files * added requested changes * fix issue in prompt config toggle * standalone agentic loop module * fixed requested changes * fixed ruff format issues --------- Co-authored-by: Thiru Dinesh <56014038+Thirunayan22@users.noreply.github.com>
Sync wip branches
…ol classifier routing (buerokratt#411) * remove unwanted file * updated changes * fixed requested changes * fixed issue * service workflow implementation without calling service endpoints * fixed requested changes * fixed issues * protocol related requested changes * fixed requested changes * update time tracking * added time tracking and reloacate input guardrail before toolclassifiier * fixed issue * fixed issue * added hybrid search for the service detection * update tool classifier * fixing merge conflicts * fixed issue * optimize first user query response generation time * fixed pr reviewed issues * service integration * context based response generation flow * fixed pr review suggested issues * removed service project layer * fixed issues * delete unnessary files * added requested changes * fix issue in prompt config toggle * standalone agentic loop module * fixed requested changes * fixed ruff format issues * complete API semantic searcher with ambiguous result handling and tool classifier routing --------- Co-authored-by: Thiru Dinesh <56014038+Thirunayan22@users.noreply.github.com>
Sync wip branches
…egy (buerokratt#419) * remove unwanted file * updated changes * fixed requested changes * fixed issue * service workflow implementation without calling service endpoints * fixed requested changes * fixed issues * protocol related requested changes * fixed requested changes * update time tracking * added time tracking and reloacate input guardrail before toolclassifiier * fixed issue * fixed issue * added hybrid search for the service detection * update tool classifier * fixing merge conflicts * fixed issue * optimize first user query response generation time * fixed pr reviewed issues * service integration * context based response generation flow * fixed pr review suggested issues * removed service project layer * fixed issues * delete unnessary files * added requested changes * fix issue in prompt config toggle * standalone agentic loop module * fixed requested changes * fixed ruff format issues * complete API semantic searcher with ambiguous result handling and tool classifier routing * complete semantic searcher evaluation and update to multi point indexing strategy --------- Co-authored-by: Thiru Dinesh <56014038+Thirunayan22@users.noreply.github.com>
Sync wip branches
…tt#420) * remove unwanted file * updated changes * fixed requested changes * fixed issue * service workflow implementation without calling service endpoints * fixed requested changes * fixed issues * protocol related requested changes * fixed requested changes * update time tracking * added time tracking and reloacate input guardrail before toolclassifiier * fixed issue * fixed issue * added hybrid search for the service detection * update tool classifier * fixing merge conflicts * fixed issue * optimize first user query response generation time * fixed pr reviewed issues * service integration * context based response generation flow * fixed pr review suggested issues * removed service project layer * fixed issues * delete unnessary files * added requested changes * fix issue in prompt config toggle * standalone agentic loop module * fixed requested changes * fixed ruff format issues * complete API semantic searcher with ambiguous result handling and tool classifier routing * complete semantic searcher evaluation and update to multi point indexing strategy * competed integration of agentic loop with semantic searcher and streaming --------- Co-authored-by: Thiru Dinesh <56014038+Thirunayan22@users.noreply.github.com>
Integrate agentic loop with semantic searcher and streaming (buerokratt#420)
* remove unwanted file * updated changes * fixed requested changes * fixed issue * service workflow implementation without calling service endpoints * fixed requested changes * fixed issues * protocol related requested changes * fixed requested changes * update time tracking * added time tracking and reloacate input guardrail before toolclassifiier * fixed issue * fixed issue * added hybrid search for the service detection * update tool classifier * fixing merge conflicts * fixed issue * optimize first user query response generation time * fixed pr reviewed issues * service integration * context based response generation flow * fixed pr review suggested issues * removed service project layer * fixed issues * delete unnessary files * added requested changes * fix issue in prompt config toggle * standalone agentic loop module * fixed requested changes * fixed ruff format issues * complete API semantic searcher with ambiguous result handling and tool classifier routing * complete semantic searcher evaluation and update to multi point indexing strategy * competed integration of agentic loop with semantic searcher and streaming * Implemented the API caller module --------- Co-authored-by: Thiru Dinesh <56014038+Thirunayan22@users.noreply.github.com>
Implemented the API caller module (buerokratt#421)
…inal response (buerokratt#423) * remove unwanted file * updated changes * fixed requested changes * fixed issue * service workflow implementation without calling service endpoints * fixed requested changes * fixed issues * protocol related requested changes * fixed requested changes * update time tracking * added time tracking and reloacate input guardrail before toolclassifiier * fixed issue * fixed issue * added hybrid search for the service detection * update tool classifier * fixing merge conflicts * fixed issue * optimize first user query response generation time * fixed pr reviewed issues * service integration * context based response generation flow * fixed pr review suggested issues * removed service project layer * fixed issues * delete unnessary files * added requested changes * fix issue in prompt config toggle * standalone agentic loop module * fixed requested changes * fixed ruff format issues * complete API semantic searcher with ambiguous result handling and tool classifier routing * complete semantic searcher evaluation and update to multi point indexing strategy * competed integration of agentic loop with semantic searcher and streaming * Implemented the API caller module * completede integration of API caller, response formatter, with update streaming and final response --------- Co-authored-by: Thiru Dinesh <56014038+Thirunayan22@users.noreply.github.com>
Sync wip branches
RAG System Security Assessment ReportRed Team Testing with DeepTeam Framework Executive SummarySystem Security Status: VULNERABLE Overall Pass Rate: 0.0% Risk Level: HIGH Attack Vector Analysis
Only tested attack categories are shown above. Vulnerability Assessment
Multilingual Security Analysis
Failed Security Tests Analysis
(2 additional failures not shown) Security RecommendationsPriority Actions RequiredCritical Vulnerabilities (Immediate Action Required):
Attack Vector Improvements:
Specific Technical Recommendations:
General Security Enhancements:
Testing MethodologyThis security assessment used DeepTeam, an advanced AI red teaming framework that simulates real-world adversarial attacks. Test Execution Process
Attack Categories TestedSingle-Turn Attacks:
Multi-Turn Attacks:
Vulnerabilities Assessed
Language SupportTests were conducted across multiple languages:
Pass/Fail Criteria
Report generated on 2026-05-06 10:25:01 by DeepTeam automated red teaming pipeline |
RAG System Evaluation ReportDeepEval Test Results Summary
Total Tests: 20 | Passed: 0 | Failed: 20 Detailed Test Results| Test | Language | Category | CP | CR | CRel | AR | Faith | Status | Legend: CP = Contextual Precision, CR = Contextual Recall, CRel = Contextual Relevancy, AR = Answer Relevancy, Faith = Faithfulness Failed Test Analysis
(90 additional failures not shown) RecommendationsContextual Precision (Score: 0.000): Consider improving your reranking model or adjusting reranking parameters to better prioritize relevant documents. Contextual Recall (Score: 0.000): Review your embedding model choice and vector search parameters. Consider domain-specific embeddings. Contextual Relevancy (Score: 0.000): Optimize chunk size and top-K retrieval parameters to reduce noise in retrieved contexts. Answer Relevancy (Score: 0.000): Review your prompt template and LLM parameters to improve response relevance to the input query. Faithfulness (Score: 0.000): Strengthen hallucination detection and ensure the LLM stays grounded in the provided context. Report generated on 2026-05-06 10:25:16 by DeepEval automated testing pipeline |
No description provided.