Custom prompt config handling in API tool calling #431

Open

nuwangeek wants to merge 87 commits into buerokratt:wip from
Conversation
Get update from wip into llm-316
Get update from llm-316
Intent enrichment pipeline (buerokratt#319)
get update from wip into llm-304
Service layer validation in tool classifier (buerokratt#321)
Get update from wip
Pulling changes from BYK wip to LLM-Module WIP
Get update from wip into optimization/data-enrichment
…mance improvement
Get update from optimization/data-enrichment into optimization/vector-indexer
Get update from llm-394 into llm-345-dev
…l classifier routing
Get update from llm-394 into llm-403
Get update from llm-345-dev into llm-403
Get update from llm-403 into llm-408
Get update from llm-408 into llm-348
Sync wip branches
Sync wip branches
Sync wip branches
Integrate agentic loop with semantic searcher and streaming (buerokratt#420)
Implemented the API caller module (buerokratt#421)
Sync wip branches
CKB API integration for agency data sync (buerokratt#392)
Integrate CKB and RAG changelogs with schema updates for RAG (buerokratt#422)
Fixed Ruff lint issues (buerokratt#426)
Pull request overview
This PR extends the API tool-calling workflow to respect organization-specific prompt configuration (“custom instructions”) during both parameter collection and API response formatting, and adds schema sanitization to reduce format-hint leakage into clarifying questions.
Changes:
- Fetch custom prompt instructions from the orchestration service's `prompt_config_loader` and pass them into `ParamExtractionModule` and `APIResponseFormatterModule`.
- Sanitize parameter schema descriptions before sending them to the LLM to avoid propagating format hints (e.g., `YYYY-MM-DD`) into user-facing questions.
- Expand streaming support and tests for the `stream_forward()`/`stream_run_turn()` paths, and adjust tests to reflect "new extraction overrides prior value" semantics.
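The schema-description sanitization described above can be sketched as a small regex pass. This is a hypothetical illustration, not the PR's actual implementation (which lives in `param_extractor.py`); the function name and regex are assumptions:

```python
import re

def sanitize_description(description: str) -> str:
    """Hypothetical sketch: strip "in the format ..." hints (e.g. YYYY-MM-DD)
    from a parameter schema description so they do not leak into clarifying
    questions generated by the LLM."""
    # Drop an optional leading comma/space and an optionally parenthesised
    # "in the format <pattern>" phrase.
    cleaned = re.sub(
        r"[,\s]*\(?in the format [^).,]+\)?",
        "",
        description,
        flags=re.IGNORECASE,
    )
    return cleaned.strip()

print(sanitize_description("Start date, in the format YYYY-MM-DD"))  # Start date
print(sanitize_description("City name"))  # City name
```

Per the review comment quoted below the table, the sanitized description would only be used for question generation, while the original (hints intact) is kept for extraction context.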
Reviewed changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| `src/tool_classifier/workflows/api_tool_workflow.py` | Loads custom instructions via the prompt config loader; applies them to the extractor/formatter and derives an effective session language from the instructions. |
| `src/tool_classifier/param_extractor.py` | Adds a `custom_instructions` input, schema-description sanitization, streaming cleanup, and same-type required-param reassignment logic. |
| `src/tool_classifier/api_response_formatter.py` | Propagates the `custom_instructions` input to the formatter predictor (blocking and streaming) and adds stream cleanup. |
| `src/tool_classifier/agentic_loop.py` | Adds an optional `continuation_language` override for the hardcoded continuation question (run and stream). |
| `tests/test_param_extractor.py` | Updates the override-behavior expectation and adds tests for custom instructions, schema sanitization, and streaming extraction. |
| `tests/test_api_response_formatter.py` | Adds tests for custom instructions and streaming behavior (including stream token yielding and fallback paths). |
| `tests/test_agentic_loop.py` | Switches imports to `src.tool_classifier...`, updates the override expectation, and adds streaming-path tests for `stream_run_turn()`. |
Comment on lines +38 to +40

> ``in the format YYYY-MM-DD`` phrases. The sanitised description is used
> only for LLM question generation; the original description (with format
> hints intact) is still used for extraction context.
Comment on lines +537 to +561

```python
# SINGLE-VALUE REASSIGNMENT: if the LLM assigned a value to a later same-type
# param while an earlier same-type param is still missing, move the value forward.
# This fixes the common case where a lone date like "2026-04-01" is extracted as
# endDate when startDate is still missing.
combined_after_extraction = {**already_collected, **validated_params}
required_schema_order = [
    p for p in params_schema if isinstance(p, dict) and p.get("required", False)
]
for idx, missing_entry in enumerate(required_schema_order):
    m_name = missing_entry["name"]
    m_type = missing_entry.get("type", "string")
    if m_name in combined_after_extraction:
        continue  # already satisfied
    # Find the first later param with the same type that was just extracted
    for later_entry in required_schema_order[idx + 1 :]:
        l_name = later_entry["name"]
        l_type = later_entry.get("type", "string")
        if l_type == m_type and l_name in validated_params:
            logger.debug(
                f"ParamExtractor: reassigning '{l_name}' → '{m_name}' "
                f"(single {m_type} value assigned to wrong param by LLM)"
            )
            validated_params[m_name] = validated_params.pop(l_name)
            break
```
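The reassignment heuristic quoted above can be exercised in isolation. This is a standalone sketch that mirrors the quoted logic (the function name and the self-contained packaging are assumptions for illustration; the PR's code runs inline in the extractor):

```python
def reassign_same_type_params(params_schema, already_collected, validated_params):
    """Sketch of the single-value reassignment heuristic: if the LLM put a
    value on a later required param of the same type while an earlier one is
    still missing, move the value forward to the earlier param."""
    combined = {**already_collected, **validated_params}
    required = [p for p in params_schema if p.get("required", False)]
    for idx, entry in enumerate(required):
        name, typ = entry["name"], entry.get("type", "string")
        if name in combined:
            continue  # earlier param already satisfied
        for later in required[idx + 1 :]:
            if later.get("type", "string") == typ and later["name"] in validated_params:
                # Move the lone value forward to the earlier missing param.
                validated_params[name] = validated_params.pop(later["name"])
                break
    return validated_params

schema = [
    {"name": "startDate", "type": "date", "required": True},
    {"name": "endDate", "type": "date", "required": True},
]
# A lone date extracted as endDate moves forward to the missing startDate.
print(reassign_same_type_params(schema, {}, {"endDate": "2026-04-01"}))
```

When both same-type params already carry values, the loop skips them and nothing moves, which is the behavior the quoted comment describes.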
Comment on lines +392 to +394

```python
custom_instructions = await self._get_custom_instructions()
loop = self._build_agentic_loop(session_store, custom_instructions)  # type: ignore[arg-type]
```
Comment on lines +423 to +433

```python
def _make_async_iter(*chunks: Any) -> AsyncMock:
    """Return an async context manager that yields the given chunks then closes cleanly."""

    async def _gen() -> AsyncGenerator[Any, None]:
        for chunk in chunks:
            yield chunk

    mock_stream = AsyncMock()
    mock_stream.__aiter__ = lambda self: _gen()
    mock_stream.aclose = AsyncMock()
    return mock_stream
```
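The test helper quoted above can be run standalone to show how the streaming-path tests consume it. This self-contained version (the `collect` consumer is an assumption added for illustration) async-iterates the mock and then closes it, as `stream_forward()`/`stream_run_turn()` code paths would:

```python
import asyncio
from typing import Any, AsyncGenerator
from unittest.mock import AsyncMock

def make_async_iter(*chunks: Any) -> AsyncMock:
    """Standalone copy of the quoted helper: an AsyncMock that yields the
    given chunks under `async for` and exposes an awaitable aclose()."""
    async def _gen() -> AsyncGenerator[Any, None]:
        for chunk in chunks:
            yield chunk

    mock_stream = AsyncMock()
    # unittest.mock lets supported dunders be assigned on the instance;
    # the function receives the mock as `self`.
    mock_stream.__aiter__ = lambda self: _gen()
    mock_stream.aclose = AsyncMock()
    return mock_stream

async def collect(stream) -> list:
    # Hypothetical consumer mimicking the streaming code path: drain, then close.
    out = [chunk async for chunk in stream]
    await stream.aclose()
    return out

tokens = asyncio.run(collect(make_async_iter("a", "b", "c")))
print(tokens)  # ['a', 'b', 'c']
```

Because `aclose` is itself an `AsyncMock`, the tests can also assert that the stream was closed exactly once on the cleanup paths.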
Comment on lines
+8
to
+10
| from src.tool_classifier.agentic_loop import AgenticLoop | ||
| from src.tool_classifier.enums import AgenticLoopStatus | ||
| from src.tool_classifier.param_extractor import ParamExtractionResult |
No description provided.