
feat: add Data Fabric tool support#726

Open
milind-jain-uipath wants to merge 2 commits intomainfrom
feat-df-agent-integrations-v0

Conversation


@milind-jain-uipath milind-jain-uipath commented Mar 25, 2026

Summary

Adds Data Fabric tool support. Enables agents to query Data Fabric entities using SQL by injecting entity schemas into the system prompt at INIT time and providing a generic query_datafabric tool.

What it does: When an agent has Data Fabric entity contexts configured in Studio Web, the INIT node fetches entity metadata (table names, column names, types) from the Data Fabric API at startup. This metadata is formatted as markdown with SQL guidelines and constraints, then appended to the system prompt. A generic query_datafabric tool is registered that accepts raw SQL and dispatches it to the Data Fabric API.
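The schema-to-markdown step can be sketched roughly as follows. `format_schemas_for_context` is the function name used in the PR, but the `EntityField`/`EntitySchema` shapes below are illustrative assumptions (the PR uses its own Pydantic models), so treat this as a sketch rather than the actual implementation:

```python
from dataclasses import dataclass


@dataclass
class EntityField:
    name: str
    sql_type: str


@dataclass
class EntitySchema:
    table: str
    fields: list[EntityField]


def format_schemas_for_context(entities: list[EntitySchema]) -> str:
    """Render entity schemas as a markdown block for system-prompt injection."""
    lines = ["## Data Fabric Entities"]
    for entity in entities:
        lines.append(f"### {entity.table}")
        lines.append("| Column | Type |")
        lines.append("| --- | --- |")
        for field in entity.fields:
            lines.append(f"| {field.name} | {field.sql_type} |")
    return "\n".join(lines)
```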

Changes

  • react/agent.py - Add resources param, pass to create_init_node()
  • react/init_node.py - Fetch DF schemas at INIT, inject into system prompt via additional_context
  • react/types.py - Define AgentResources type alias
  • tools/datafabric_tool/ - New module: schema fetching, formatting, entity detection, query_datafabric tool, SQL guidelines and constraints
  • tools/tool_factory.py - Register DF tool, skip DF contexts in generic resource loop
  • tools/context_tool.py - Guard: raise error if DF context is accidentally routed here

Companion PR

https://github.com/UiPath/uipath-agents-python/pull/383

Testing

  • Tested the integration with a local robot run. Please ignore the error thrown from the tool; that is the folder-level integration we are doing in FQS.
Screen.Recording.2026-03-24.at.4.48.53.PM.mov

@milind-jain-uipath milind-jain-uipath force-pushed the feat-df-agent-integrations-v0 branch from 0ec64ac to 4f94efc Compare March 25, 2026 05:22
@milind-jain-uipath milind-jain-uipath marked this pull request as ready for review March 25, 2026 05:53
@UIPath-Harshit

#721

Please address the comments from the above PR on this one.


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4f94efcd44


resolved_messages = list(messages(state))
if schema_context:
    resolved_messages = list(
        messages(state, additional_context=schema_context)
    )


P1: Preserve one-arg message callables when injecting DF context

When Data Fabric schemas are available, INIT invokes the message factory as messages(state, additional_context=...). Existing callable message generators commonly take only one positional parameter (e.g., create_messages(state)), so this raises a TypeError at startup for agents with enabled Data Fabric contexts. This is a runtime-breaking regression for previously valid message callables and should be guarded by signature detection or a fallback call without the extra keyword.

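The fallback the review asks for can be done with signature inspection. A minimal sketch (`resolve_messages` is a hypothetical helper name, not from the PR): pass `additional_context` only when the callable declares that keyword or accepts `**kwargs`, otherwise fall back to the legacy one-argument call.

```python
import inspect


def resolve_messages(messages, state, additional_context=None):
    """Call a message factory, passing additional_context only when supported."""
    params = inspect.signature(messages).parameters
    accepts_var_kw = any(
        p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()
    )
    if additional_context and ("additional_context" in params or accepts_var_kw):
        return list(messages(state, additional_context=additional_context))
    # Fallback: legacy one-argument message generators keep working unchanged.
    return list(messages(state))
```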

    else:
        resolved_messages = list(messages(state))
else:
    resolved_messages = list(messages)


P2: Inject schema context for static message definitions

If messages is provided as a static sequence (which create_agent supports), the INIT node ignores schema_context and returns list(messages) unchanged. In that path, Data Fabric schemas are fetched but never reach the system prompt, so query_datafabric runs without table/column guidance and the feature silently underperforms or fails to generate valid SQL. The static-message branch should also append/update a system message with the generated schema context.


@milind-jain-uipath
Author

Addressed the comments @UIPath-Harshit .

Changes addressing review feedback:

  • Init context registry
  • Concurrent schema fetch
  • Remove FK reference
  • Pydantic models for schema
  • Tool initialisation moved out from tool factory & dedup added

Couldn't find refined SQL tool though.

@milind-jain-uipath milind-jain-uipath force-pushed the feat-df-agent-integrations-v0 branch 2 times, most recently from 91edca2 to 9bfc30c Compare March 25, 2026 21:21
from uipath.agent.models.agent import BaseAgentResourceConfig

logger = logging.getLogger(__name__)

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Redundant whitespace. Can you run ruff with a whitespace check if there is a rule for that? I don't like these spaces agents generate.
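For reference, ruff does ship pycodestyle whitespace rules, notably W291 (trailing whitespace) and W293 (whitespace on an otherwise blank line). A minimal pyproject.toml fragment to enable them could look like this (depending on the ruff version, some W rules may require preview mode):

```toml
# pyproject.toml - enable pycodestyle whitespace rules in ruff
[tool.ruff.lint]
extend-select = ["W291", "W293"]
```

Both rules are auto-fixable, so `ruff check --fix` would strip the stray whitespace.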


Copilot AI left a comment


Pull request overview

Adds Data Fabric (DF) tool support to the ReAct agent runtime by registering a generic query_datafabric tool and injecting DF entity schema/context into the system prompt at INIT via a pluggable init-context registry.

Changes:

  • Introduces a datafabric_tool module (schema fetching, prompt/context formatting, and query_datafabric tool).
  • Adds init-time context registry + wires INIT node to gather provider context (and plumbs resources into agent creation).
  • Updates context/tool factory behavior and adjusts tests for new required context fields + async init node.

Reviewed changes

Copilot reviewed 18 out of 18 changed files in this pull request and generated 6 comments.

Show a summary per file
File Description
tests/agent/tools/test_tool_factory.py Updates test context resource config to include context_type.
tests/agent/tools/test_context_tool.py Updates helper to include context_type in context resources.
tests/agent/react/test_init_node.py Updates INIT node tests for async init-node execution and callable signature change.
tests/agent/react/test_create_agent.py Updates expectations for create_init_node(..., resources_for_init=None).
src/uipath_langchain/agent/tools/tool_factory.py Adds tool deduping behavior when building from resources.
src/uipath_langchain/agent/tools/datafabric_tool/system_prompt.txt Adds SQL generation guidelines for DF.
src/uipath_langchain/agent/tools/datafabric_tool/sql_constraints.txt Adds explicit SQL constraints reference text.
src/uipath_langchain/agent/tools/datafabric_tool/schema_context.py Builds/derives DF schema + query pattern context for system prompt injection.
src/uipath_langchain/agent/tools/datafabric_tool/models.py Adds Pydantic models for schema/context + tool input.
src/uipath_langchain/agent/tools/datafabric_tool/datafabric_tool.py Implements DF schema fetch + query_datafabric tool creation.
src/uipath_langchain/agent/tools/datafabric_tool/__init__.py Registers DF init-context provider at import time.
src/uipath_langchain/agent/tools/context_tool.py Routes DF context resources to the query_datafabric tool.
src/uipath_langchain/agent/tools/__init__.py Exposes DF helpers and ensures DF module import (provider registration).
src/uipath_langchain/agent/react/types.py Adds AgentResources type alias.
src/uipath_langchain/agent/react/init_node.py Makes INIT node async and gathers init-context from registered providers.
src/uipath_langchain/agent/react/init_context_registry.py Adds registry + gather_init_context mechanism.
src/uipath_langchain/agent/react/agent.py Plumbs resources into INIT node creation.
src/uipath_langchain/agent/react/__init__.py Exports AgentResources.

Comment on lines +34 to +50
async def _datafabric_init_context_provider(
    resources: Sequence[BaseAgentResourceConfig],
) -> str | None:
    """Fetch and format DataFabric entity schemas for system prompt injection."""
    entity_identifiers = get_datafabric_entity_identifiers_from_resources(resources)
    if not entity_identifiers:
        return None

    _logger.info(
        "Fetching Data Fabric schemas for %d identifier(s)",
        len(entity_identifiers),
    )
    entities = await fetch_entity_schemas(entity_identifiers)
    return format_schemas_for_context(entities)


register_init_context_provider("datafabric", _datafabric_init_context_provider)

Copilot AI Mar 27, 2026


New Data Fabric init-time context registration and schema formatting logic is introduced here, but there are no unit tests covering (a) provider registration + gather_init_context integration, and (b) that fetched entity schemas are formatted and injected as expected. Given the repo’s existing comprehensive tool/init tests, adding focused tests for the provider behavior would help prevent regressions.

Copilot uses AI. Check for mistakes.
    else:
        resolved_messages = list(messages(state))
else:
    resolved_messages = list(messages)

Copilot AI Mar 27, 2026


additional_context is computed from resources_for_init, but it is only passed into a callable messages generator. When messages is a static list/sequence, the additional context is silently ignored, so Data Fabric schemas (and any other provider output) won’t be injected into the system prompt in that common usage. Consider appending additional_context to the SystemMessage content (or inserting a new SystemMessage) when messages is a sequence, so init-time context is applied consistently.

Suggested change:

resolved_messages = list(messages)
# When using a static sequence of messages, inject any init-time context
# into the system prompt so provider output (e.g., Data Fabric schemas)
# is not silently ignored.
if additional_context:
    # Try to append the additional context to the first SystemMessage.
    system_msg_index = next(
        (i for i, m in enumerate(resolved_messages) if isinstance(m, SystemMessage)),
        None,
    )
    if system_msg_index is not None:
        system_msg = resolved_messages[system_msg_index]
        # Safely append to existing content, assuming string content.
        existing_content = str(system_msg.content)
        updated_content = f"{existing_content}\n\n{additional_context}"
        resolved_messages[system_msg_index] = SystemMessage(
            content=updated_content, additional_kwargs=system_msg.additional_kwargs
        )
    else:
        # No SystemMessage present; prepend a new one with the additional context.
        resolved_messages.insert(0, SystemMessage(content=additional_context))

Comment on lines 36 to +42
if callable(messages):
    if additional_context:
        resolved_messages = list(
            messages(state, additional_context=additional_context)
        )
    else:
        resolved_messages = list(messages(state))

Copilot AI Mar 27, 2026


When additional_context is present, the init node calls messages(state, additional_context=...). This will raise TypeError for existing message generator callables that only accept a single positional state argument (no **kwargs), making Data Fabric/resources injection a breaking change for those users. Consider detecting whether the callable accepts additional_context (or **kwargs) and falling back to messages(state) if not.

Comment on lines +82 to +96
group_field = text_field or (field_names[0] if field_names else "Category")
agg_field = numeric_field or (field_names[1] if len(field_names) > 1 else "Amount")
filter_field = text_field or (field_names[0] if field_names else "Name")
fields_sample = ", ".join(field_names[:5]) if field_names else "*"
count_col = field_names[0] if field_names else "id"

query_patterns = [
    QueryPattern(
        intent="Show all",
        sql=f"SELECT {fields_sample} FROM {table} LIMIT 100",
    ),
    QueryPattern(
        intent="Find by X",
        sql=f"SELECT {fields_sample} FROM {table} WHERE {filter_field} = 'value' LIMIT 100",
    ),

Copilot AI Mar 27, 2026


fields_sample falls back to "*" when no non-hidden fields are present. This contradicts the documented constraints (NO SELECT *) and will produce example patterns that are guaranteed to violate the tool’s own SQL rules. Prefer omitting the “Show all / Find by X / Top N” patterns when there are no fields, or choose a safe default like the primary key field when available.
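The safe fallback suggested here could look like this (the helper name and the idea of returning None to mean "omit the example patterns" are hypothetical, not from the PR):

```python
def example_select_columns(field_names, primary_key=None, max_cols=5):
    """Pick columns for example query patterns without ever emitting SELECT *.

    Returns None when no safe column exists, signalling the caller to skip
    the "Show all" / "Find by X" / "Top N" patterns entirely.
    """
    if field_names:
        return ", ".join(field_names[:max_cols])
    if primary_key:
        # Degenerate but valid: sample only the primary key column.
        return primary_key
    return None
```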

Comment on lines +97 to +108
QueryPattern(
intent="Top N by Y",
sql=f"SELECT {fields_sample} FROM {table} ORDER BY {agg_field} DESC LIMIT N",
),
QueryPattern(
intent="Count by X",
sql=f"SELECT {group_field}, COUNT({count_col}) as count FROM {table} GROUP BY {group_field}",
),
QueryPattern(
intent="Top N segments",
sql=f"SELECT {group_field}, COUNT({count_col}) as count FROM {table} GROUP BY {group_field} ORDER BY count DESC LIMIT N",
),

Copilot AI Mar 27, 2026


The query pattern examples use LIMIT N, which is not valid SQLite syntax and conflicts with the prompt rule “Ensure the query is syntactically correct”. Since these examples are intended for LLM copying, they should be executable SQL (e.g., LIMIT 10) or use a clearly marked comment-style placeholder that won’t be emitted verbatim.

Comment on lines +116 to +121
async def _query_datafabric(sql_query: str) -> dict[str, Any]:
    from uipath.platform import UiPath

    logger.debug(f"query_datafabric called with SQL: {sql_query}")

    sdk = UiPath()

Copilot AI Mar 27, 2026


create_datafabric_query_tool() is cached as a singleton, but _query_datafabric creates a new UiPath() SDK instance on every call. This adds avoidable overhead on each SQL query and can impact latency. Consider instantiating the SDK once at tool-creation time (outside the inner coroutine) so calls reuse the same client/config.

Suggested change:

from uipath.platform import UiPath

# Instantiate the SDK once per tool instance to avoid per-call overhead.
sdk = UiPath()


async def _query_datafabric(sql_query: str) -> dict[str, Any]:
    logger.debug(f"query_datafabric called with SQL: {sql_query}")

            )
        except Exception:
            logger.exception("Init context provider '%s' failed; skipping", name)
    return "\n\n".join(parts) if parts else None


Make it structured rather than free-form text. Expose it as a Pydantic model to be consumed by the caller.
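A minimal sketch of what that structured result could look like (`InitContextPart`/`InitContext` are hypothetical names, not from the PR; the caller would flatten to text only at the prompt-injection boundary):

```python
from pydantic import BaseModel, Field


class InitContextPart(BaseModel):
    """Output of one init-context provider (e.g. 'datafabric')."""
    provider: str
    content: str


class InitContext(BaseModel):
    """Structured aggregate a gather step could return instead of a joined string."""
    parts: list[InitContextPart] = Field(default_factory=list)

    def as_prompt_text(self) -> str:
        """Flatten to the string form used for system-prompt injection."""
        return "\n\n".join(part.content for part in self.parts)
```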


@UIPath-Harshit UIPath-Harshit left a comment


Left a few comments for @cristian-groza to chime in on. Others, please address.

# --- Init-time context self-registration ---


async def _datafabric_init_context_provider(


Move this to datafabric_tool rather than exposing it as a module method.


# --- Generic Tool Creation ---

_MAX_RECORDS_IN_RESPONSE = 50


We can remove this check now that queries are free-form and carry a LIMIT clause.

def display_type(self) -> str:
    """Type string with PK/required modifiers for markdown display."""
    modifiers = []
    if self.is_primary_key:


The concept of a PK will go away with the VDOs integration. What you need is is_required and other attributes.

query_patterns: list[QueryPattern]


class SQLContext(BaseModel):


EntitySQLContext

@lru_cache(maxsize=1)
def _load_system_prompt() -> str:
    """Load SQL generation strategy from system_prompt.txt."""
    prompt_path = _PROMPTS_DIR / "system_prompt.txt"


@cristian-groza this needs a discussion.


return create_datafabric_query_tool()

assert resource.settings is not None


Put this behind the isIndexTool flag.

async def create_tools_from_resources(
    agent: LowCodeAgentDefinition, llm: BaseChatModel
) -> list[BaseTool]:
    """Create tools from agent resources including Data Fabric tools.


Let's remove this comment. It doesn't make sense from the overall agent perspective.

messages,
None, # input schema
True, # is_conversational
resources_for_init=None,


Can we add a test for resources_for_init as well? Also, this involves an async network call. @cristian-groza do we see any implication of this when using the "convert to coded agent" flow?

