UN-3266 [FEAT] Async Executor Backend for Prompt Studio by harini-venkataraman · Pull Request #1849 · Zipstack/unstract

harini-venkataraman · 2026-03-11T10:51:50Z

What

Introduces a pluggable executor system that replaces Docker-container-based tool execution with Celery worker tasks, and migrates the Prompt Studio IDE to an async execution model using Socket.IO for result delivery. Gated behind the async_prompt_execution feature flag for safe rollout.

Why

The existing architecture has several limitations:

Prompt Studio IDE executions block HTTP connections — Django workers are tied up waiting for LLM responses (up to minutes per prompt), limiting concurrency and causing timeouts
Docker-container-based tool execution requires spinning up containers per workflow step, adding overhead and complicating deployments
No real-time feedback — the frontend polls for results, wasting resources and providing poor UX
Tight coupling between prompt-service HTTP calls and the Django backend makes it hard to scale execution independently

How

Backend (65 files)

Async Prompt Studio views: index_document, fetch_response, single_pass_extraction now return HTTP 202 (accepted) with a task_id instead of blocking. Gated by async_prompt_execution feature flag — old sync path preserved as fallback
Celery callback tasks (backend/prompt_studio/prompt_studio_core_v2/tasks.py): ide_index_complete, ide_prompt_complete, ide_prompt_error etc. run on prompt_studio_callback queue, perform ORM writes via OutputManagerHelper, and emit prompt_studio_result Socket.IO events
Worker dispatch Celery app (backend/backend/worker_celery.py): A second Celery app instance that coexists with Django's Celery app, configured to route tasks to executor workers
prompt_studio_helper.py rewrite: Removed PromptTool HTTP calls entirely. New build_index_payload(), build_fetch_response_payload(), build_single_pass_payload() methods construct ExecutionContext objects with all ORM data pre-loaded
Removed: backend/backend/workers/, file_execution_tasks.py, celery_task.py (old in-process workers)

Workers (70 files, ~19,500 new lines)

Executor Worker (workers/executor/): New WorkerType.EXECUTOR Celery worker with LegacyExecutor handling all operations: extract, index, answer_prompt, single_pass_extraction, summarize, agentic_extraction, structure_pipeline
Pluggable Executor Framework: BaseExecutor → ExecutorRegistry (class-decorator self-registration) → ExecutionOrchestrator → ExecutionDispatcher (Celery send_task)
ExecutorToolShim: Lightweight stand-in for BaseTool that satisfies SDK1 adapter interfaces without Docker context
Structure tool task (workers/file_processing/structure_tool_task.py): Celery-native replacement for Docker-based StructureTool.run() with profile overrides, smart table detection, and output file management
26 test files (~10,000+ lines): Comprehensive coverage from unit tests through full Celery eager-mode integration tests

SDK1 (22 files)

Execution framework (unstract/sdk1/src/unstract/sdk1/execution/): ExecutionContext, ExecutionResult (serializable DTOs for Celery JSON transport), ExecutionDispatcher (dispatch() + dispatch_with_callback()), BaseExecutor, ExecutorRegistry

Frontend (275 files)

Async prompt execution: usePromptStudioSocket hook listens for prompt_studio_result Socket.IO events. usePromptRun rewritten from polling to fire-and-forget. PromptRun.jsx conditionally renders async or sync path based on feature flag
CRA → Vite migration: Build tooling migrated to Vite + Bun with Biome linter replacing ESLint
Dashboard metrics UI: New metrics dashboard with charts, LLM usage table, and recent activity
Card-based layouts: New card grid views for pipelines and API deployments

Docker / Infrastructure

Added: worker-executor-v2, worker-prompt-studio-callback, worker-metrics
Promoted: All workers-v2 services from opt-in (profiles: [workers-v2]) to default

Architecture Change

BEFORE:  FE → Django (blocks) → PromptTool HTTP → prompt-service → LLM
AFTER:   FE → Django (HTTP 202) → ExecutionDispatcher → Executor Worker → LLM
              ↑ Socket.IO result    (Celery send_task)    (LegacyExecutor)

Can this PR break any existing features? If yes, please list possible items. If no, please explain why.

Yes, potential breaking changes — mitigated by feature flag:
Prompt Studio IDE async path — gated by async_prompt_execution feature flag. When flag is OFF (default), all 3 endpoints (index_document, fetch_response, single_pass_extraction) use the old sync path returning HTTP 200. No behavior change for existing users.

Review Guidelines

This PR touches 441 files across backend, frontend, workers, and SDK1. Below is a structured review path to navigate it efficiently.

Code Structure Overview

unstract/sdk1/src/unstract/sdk1/execution/   ← Core abstractions (review FIRST)
    context.py          ExecutionContext dataclass (the universal payload)
    result.py           ExecutionResult dataclass (success/failure container)
    executor.py         BaseExecutor ABC (the executor contract)
    registry.py         ExecutorRegistry (class-decorator self-registration)
    dispatcher.py       ExecutionDispatcher (Celery send_task, 3 dispatch modes)
    orchestrator.py     ExecutionOrchestrator (worker-side: find executor → execute)

workers/executor/                            ← Executor worker (review SECOND)
    worker.py           Celery app entry point
    tasks.py            Single task: execute_extraction (deserialize → orchestrate → return)
    executor_tool_shim.py   BaseTool substitute for worker context
    executors/
        legacy_executor.py  Main executor: 7 operations via _OPERATION_MAP strategy pattern
        answer_prompt.py    Prompt answering pipeline (retrieve → LLM → postprocess)
        index.py            Document indexing (vectorDB operations)
        retrieval.py        RetrievalService + 7 retriever strategies
        variable_replacement.py, postprocessor.py, json_repair_helper.py, usage.py

backend/prompt_studio/prompt_studio_core_v2/ ← Django async wiring (review THIRD)
    views.py            3 endpoints return HTTP 202 (gated by feature flag)
    prompt_studio_helper.py   build_*_payload() methods construct ExecutionContext
    tasks.py            Celery callbacks: ORM writes + Socket.IO emission

frontend/src/                                ← Frontend async path (review FOURTH)
    hooks/usePromptRun.js           Fire-and-forget POST + 5-min timeout safety net
    hooks/usePromptStudioSocket.js  Socket.IO listener for prompt_studio_result
    components/.../PromptRun.jsx    Headless queue manager (dequeues + calls runPrompt)

Recommended Review Order

Review in dependency order — each layer builds on the previous:

Step	Area	Key Files	What to Look For
1	SDK1 Execution Framework	`execution/context.py`, `result.py`, `dispatcher.py`, `registry.py`	Contract stability: are `to_dict()`/`from_dict()` round-trips correct? Is the `Operation` enum complete? Queue naming (`celery_executor_{name}`).
2	Executor Worker Entry	`executor/tasks.py`, `executor/worker.py`	Single entry point `execute_extraction`: retry policy, error handling, log correlation.
3	LegacyExecutor Core	`executors/legacy_executor.py` (focus on `_OPERATION_MAP` + `execute()`)	Strategy pattern routing. Unsupported operation handling. Error wrapping.
4	LegacyExecutor Handlers	`answer_prompt.py`, `index.py`, `retrieval.py`	Parameter contracts: do the keys in `executor_params` match what `build_*_payload()` sends? Lazy import pattern (`_get_prompt_deps()`, `_get_indexing_deps()`).
5	Backend Views (async path)	`views.py` lines 351–583	Feature flag gating. 202 vs 200 response. `dispatch_with_callback` usage with correct callback task names and queue.
6	Backend Payload Builders	`prompt_studio_helper.py` (`build_index_payload`, `build_fetch_response_payload`, `build_single_pass_payload`)	ORM data loading. Are all required params packed into `executor_params`? Key compatibility with executor handlers.
7	Backend Callbacks	`tasks.py` (callback tasks)	`ide_prompt_complete`: ORM writes via `OutputManagerHelper`. Socket.IO emission shape. Error callback cleanup. State store setup/teardown.
8	Frontend	`usePromptRun.js`, `usePromptStudioSocket.js`, `PromptRun.jsx`	Socket event shape matches backend `_emit_result()`. Timeout handling. Status cleanup on failure.
9	Docker/Infra	`docker/docker-compose.yaml`	New services: `worker-executor-v2`, `worker-prompt-studio-callback`. Removed old workers. Queue bindings.
10	Tests	`workers/tests/test_sanity_phase*.py`	Integration tests validate end-to-end Celery chains in eager mode.

Data Flow (End-to-End)

User clicks "Run" in Prompt Studio IDE
  │
  ▼
[Frontend] PromptRun.jsx dequeues → usePromptRun.runPromptApi()
  │  POST /fetch_response/{tool_id}  (fire-and-forget)
  ▼
[Django View] views.py:fetch_response()
  │  if feature_flag ON → build_fetch_response_payload() → dispatch_with_callback()
  │  Returns HTTP 202 {task_id, run_id, status: "accepted"}
  ▼
[RabbitMQ] → celery_executor_legacy queue
  ▼
[Executor Worker] tasks.py:execute_extraction()
  │  ExecutionOrchestrator → ExecutorRegistry.get("legacy") → LegacyExecutor
  │  → _handle_answer_prompt() → RetrievalService → LLM call → postprocess
  │  Returns ExecutionResult.to_dict()
  ▼
[Celery link callback] → prompt_studio_callback queue
  ▼
[Django Callback Worker] tasks.py:ide_prompt_complete()
  │  OutputManagerHelper.handle_prompt_output_update() (ORM write)
  │  _emit_result() → Socket.IO "prompt_studio_result" event
  ▼
[Frontend] usePromptStudioSocket.onResult()
  │  handleCompleted("fetch_response", result)
  │  → updatePromptOutputState(data) → clears spinner
  ▼
User sees result in UI

Known Code Duplication

Where	What's Duplicated	Severity	Notes
`views.py` — 3 view actions	Dispatch pattern: `build_payload → get_dispatcher → dispatch_with_callback → return 202`	Low	Each view has different ORM/param resolution before the common dispatch. Could be a helper but manageable at 3 instances.
`tasks.py` — callback tasks	`ide_index_complete` and `ide_prompt_complete` follow same structure: extract kwargs → setup state → check result → ORM work → emit → cleanup	Low	Different ORM logic per callback type. Acceptable for 2 callbacks; monitor if more are added.
`tasks.py` — legacy tasks	`run_index_document`, `run_fetch_response`, `run_single_pass_extraction` kept alongside new callback tasks	Intentional	Legacy tasks retained for backward compatibility during feature flag rollout. Can be removed once flag is permanently ON.

Files Safe to Skim

workers/tests/ — 24 test files, ~10,000 lines. Well-structured but high volume. Focus on test_sanity_phase2.py (full Celery chain) and test_sanity_phase4.py (IDE payload compatibility) as representative examples.
workers/executor/executors/retrievers/ — 7 retriever implementations. All follow the same pattern. Reviewing one (simple.py) covers the pattern.
Architecture docs at repo root (architecture-*.md, phase*.md) — Reference material, not code.

Relevant Docs

Architecture: architecture-executor-system.md, architecture-flow-diagram.md, architecture-sequence-diagrams.md in repo root
Migration phases: architecture-migration-phases.md
Rollout: rollout-plan.md

Related Issues or PRs

Async Prompt Studio Execution epic

Dependencies Versions / Env Variables

New env variables:

Variable	Purpose	Default
`FLIPT_SERVICE_AVAILABLE`	Enable Flipt feature flag service	`false`

Notes on Testing

Workers: cd workers && uv run pytest -v — 490+ tests (444 in workers/tests/ + extras)
SDK1: cd unstract/sdk1 && uv run pytest -v — 146+ tests
Backend callbacks: cd backend && python -m pytest prompt_studio/prompt_studio_core_v2/test_tasks.py -v
Manual testing: Enable flag in Flipt (async_prompt_execution=true), trigger prompt runs in IDE, verify Socket.IO events deliver results via Network → WS → Messages tab
Feature flag OFF: Verify all sync paths still work identically to main branch

Screenshots

N/A (primarily backend/worker architecture change; frontend UX unchanged when feature flag is off)

Checklist

I have read and understood the Contribution Guidelines.

Conflicts resolved: - docker-compose.yaml: Use main's dedicated dashboard_metric_events queue for worker-metrics - PromptCard.jsx: Keep tool_id matching condition from our async socket feature - PromptRun.jsx: Merge useEffect import from main with our branch - ToolIde.jsx: Keep fire-and-forget socket approach (spinner waits for socket event) - SocketMessages.js: Keep both session-store and socket-custom-tool imports + updateCusToolMessages dep - SocketContext.js: Keep simpler path-based socket connection approach - usePromptRun.js: Keep Celery fire-and-forget with socket delivery over polling - setupProxy.js: Accept main's deletion (migrated to Vite)

…on-backend

for more information, see https://pre-commit.ci

…on-backend

… into feat/execution-backend

for more information, see https://pre-commit.ci

… into feat/execution-backend

for more information, see https://pre-commit.ci

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Rename PascalCase local variables to snake_case to comply with S117: - legacy_executor.py: rename tuple-unpacked _get_prompt_deps() results (AnswerPromptService→answer_prompt_svc, RetrievalService→retrieval_svc, VariableReplacementService→variable_replacement_svc, LLM→llm_cls, EmbeddingCompat→embedding_compat_cls, VectorDB→vector_db_cls) and update all downstream usages including _apply_type_conversion and _handle_summarize - test_phase1_log_streaming.py: rename Mock* local variables to mock_* snake_case equivalents - test_sanity_phase3.py: rename MockDispatcher→mock_dispatcher_cls and MockShim→mock_shim_cls across all 10 test methods - test_sanity_phase5.py: rename MockShim→mock_shim, MockX2Text→mock_x2text in 6 test methods; MockDispatcher→mock_dispatcher_cls in dispatch test; fix LLM_cls→llm_cls, EmbeddingCompat→embedding_compat_cls, VectorDB→vector_db_cls in _mock_prompt_deps helper Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

for more information, see https://pre-commit.ci

- test_sanity_phase2/4.py, test_answer_prompt.py: rename PascalCase local variables in _mock_prompt_deps/_mock_deps to snake_case (RetrievalService→retrieval_svc, VariableReplacementService→ variable_replacement_svc, Index→index_cls, LLM_cls→llm_cls, EmbeddingCompat→embedding_compat_cls, VectorDB→vector_db_cls, AnswerPromptService→answer_prompt_svc_cls) — fixes S117 - test_sanity_phase3.py: remove unused local variable "result" — fixes S1481 - structure_tool_task.py: remove redundant json.JSONDecodeError from except clause (subclass of ValueError) — fixes S5713 - shared/workflow/execution/service.py: replace generic Exception with RuntimeError for structure tool failure — fixes S112 - run-worker-docker.sh: define EXECUTOR_WORKER_TYPE constant and replace 10 literal "executor" occurrences — fixes S1192 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…iolations - Reduce cognitive complexity in answer_prompt.py: - Extract _build_grammar_notes, _run_webhook_postprocess helpers - _is_safe_public_url: extracted _resolve_host_addresses helper - handle_json: early-return pattern eliminates nesting - construct_prompt: delegates grammar loop to _build_grammar_notes - Reduce cognitive complexity in legacy_executor.py: - Extract _execute_single_prompt, _run_table_extraction helpers - Extract _run_challenge_if_enabled, _run_evaluation_if_enabled - Extract _inject_table_settings, _finalize_pipeline_result - Extract _convert_number_answer, _convert_scalar_answer - Extract _sanitize_dict_values helper - _handle_answer_prompt CC reduced from 50 to ~7 - Reduce CC in structure_tool_task.py: guard-clause refactor - Reduce CC in backend: dto.py, deployment_helper.py, api_deployment_views.py, prompt_studio_helper.py - Fix S117: rename PascalCase local vars in test_answer_prompt.py - Fix S1192: extract EXECUTOR_WORKER_TYPE constant in run-worker.sh - Fix S1172: remove unused params from structure_tool_task.py - Fix S5713: remove redundant JSONDecodeError in json_repair_helper.py - Fix S112/S5727 in test_execution.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…er_prompt Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

for more information, see https://pre-commit.ci

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

@staticmethod

…002192 - Add @staticmethod to _sanitize_null_values (fixes S2325 missing self) - Reduce _execute_single_prompt params from 25 to 11 (S107) by grouping services as deps tuple and extracting exec params from context.executor_params - Add NOSONAR suppression for raise exc in test helper (S112) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

for more information, see https://pre-commit.ci

execution_id, file_hash, log_events_id, custom_data are now extracted inside _execute_single_prompt from context.executor_params. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Signed-off-by: harini-venkataraman <115449948+harini-venkataraman@users.noreply.github.com>

coderabbitai

Actionable comments posted: 4

Note

Due to the large number of review comments, Critical severity comments were prioritized as inline comments.

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py (1)
1717-1720: ⚠️ Potential issue | 🟠 Major

Broaden the cleanup around dynamic_indexer() setup.

The indexing flag is set before platform-key lookup, child-context creation, and dispatch, but the cleanup only runs for (IndexingError, IndexingAPIError, SdkError). A local setup failure outside that tuple leaves the document permanently marked as indexing.

Also applies to: 1740-1779
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py` around
lines 1717 - 1720, The code sets
DocumentIndexingService.set_document_indexing(...) before doing
dynamic_indexer() platform-key lookup, child-context creation, and dispatch but
only clears the flag on (IndexingError, IndexingAPIError, SdkError), which can
leave a document permanently marked as indexing if any other local setup error
occurs; wrap the entire setup+dispatch sequence that begins with
DocumentIndexingService.set_document_indexing and includes dynamic_indexer(),
platform key lookup and child context creation in a try/finally (or move the
set_document_indexing call to after successful setup) so that the cleanup call
that clears the indexing flag always runs regardless of exception type, and
apply the same change to the analogous block referenced at the later section
(around lines 1740-1779) to ensure consistent behavior.

🟠 Major comments (33)

workers/shared/enums/task_enums.py-36-37 (1)

36-37: ⚠️ Potential issue | 🟠 Major

Add a file-processing route for execute_structure_tool.

TaskName.EXECUTE_STRUCTURE_TOOL is introduced here, but workers/shared/infrastructure/config/registry.py still does not route that task to QueueName.FILE_PROCESSING. If this task is queued by name without an explicit queue, it can fall back to the default queue and never reach the intended worker.

Suggested follow-up in workers/shared/infrastructure/config/registry.py

         WorkerType.FILE_PROCESSING: WorkerTaskRouting(
             worker_type=WorkerType.FILE_PROCESSING,
             routes=[
                 TaskRoute("process_file_batch", QueueName.FILE_PROCESSING),
                 TaskRoute("process_file_batch_api", QueueName.FILE_PROCESSING_API),
+                TaskRoute("execute_structure_tool", QueueName.FILE_PROCESSING),
             ],
         ),

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/shared/enums/task_enums.py` around lines 36 - 37,
TaskName.EXECUTE_STRUCTURE_TOOL was added but not routed to the file-processing
queue; update the task routing in the registry so this task maps to
QueueName.FILE_PROCESSING. In workers/shared/infrastructure/config/registry.py
locate the routing map (the dict or function that assigns TaskName values to
queues) and add an entry for TaskName.EXECUTE_STRUCTURE_TOOL ->
QueueName.FILE_PROCESSING (or include it in whatever list/group is used for
file-processing tasks), ensuring any default/fallback logic won’t send it to the
default queue.

workers/tests/conftest.py-13-14 (1)

13-14: ⚠️ Potential issue | 🟠 Major

Force .env.test to override ambient env vars.

load_dotenv() keeps existing variables by default (override=False), so developer or CI values for INTERNAL_API_BASE_URL / INTERNAL_SERVICE_API_KEY will take precedence over .env.test. This makes the test suite environment-dependent and can send tests to the wrong backend.
Suggested fix
 _env_test = Path(__file__).resolve().parent.parent / ".env.test"
-load_dotenv(_env_test)
+load_dotenv(_env_test, override=True)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@workers/tests/conftest.py` around lines 13 - 14, The call to load_dotenv
currently preserves existing environment variables, causing ambient CI/dev
values to override .env.test; update the call in workers/tests/conftest.py to
force loading .env.test by passing override=True (e.g., change
load_dotenv(_env_test) to load_dotenv(_env_test, override=True)) so _env_test
always replaces existing vars like INTERNAL_API_BASE_URL and
INTERNAL_SERVICE_API_KEY during tests.

docker/sample.compose.override.yaml-323-327 (1)

323-327: ⚠️ Potential issue | 🟠 Major

Add the new executor worker to the dev override too.

This PR adds worker-executor-v2 in docker/docker-compose.yaml, but the sample override only wires up the callback worker. In local dev, Compose will still try to pull unstract/worker-unified:${VERSION} for the executor, and even if that image exists it won’t include local workers/ changes, so the new async flow can’t be exercised reliably.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docker/sample.compose.override.yaml` around lines 323 - 327, The dev override
is missing the new service definition for worker-executor-v2 so Compose will
pull the remote image instead of building the local worker; add a service block
named worker-executor-v2 to the sample override (matching how
worker-prompt-studio-callback is defined) that points to the local build
(dockerfile: docker/dockerfiles/backend.Dockerfile and context: ..), and include
the same volumes, environment and any depends_on or networks used by other local
worker services so the executor is built from local workers/ sources and
participates in the local async flow.

frontend/src/components/custom-tools/tool-ide/ToolIde.jsx-267-275 (1)

267-275: ⚠️ Potential issue | 🟠 Major

Clear the indexing state on non-async success too.

docId now gets removed from indexDocs only when the POST fails. If async_prompt_execution is off, or the backend returns a successful response without a follow-up socket event, the document stays permanently “indexing” and future retries are blocked.

Suggested change

   pushIndexDoc(docId);
-  return axiosPrivate(requestOptions).catch((err) => {
-    // Only clear spinner on POST network failure (not 2xx).
-    // On success the spinner stays until a socket event arrives.
-    deleteIndexDoc(docId);
-    setAlertDetails(
-      handleException(err, `${doc?.document_name} - Failed to index`),
-    );
-  });
+  return axiosPrivate(requestOptions)
+    .then((res) => {
+      if (res.status !== 202 || !res.data?.task_id) {
+        deleteIndexDoc(docId);
+      }
+      return res;
+    })
+    .catch((err) => {
+      deleteIndexDoc(docId);
+      setAlertDetails(
+        handleException(err, `${doc?.document_name} - Failed to index`),
+      );
+    });

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@frontend/src/components/custom-tools/tool-ide/ToolIde.jsx` around lines 267 -
275, The code only calls deleteIndexDoc(docId) inside the .catch branch so the
doc remains "indexing" when the POST succeeds without a socket event; change the
axiosPrivate call to ensure deleteIndexDoc(docId) runs for successful non-async
responses as well (e.g., call deleteIndexDoc in a .finally or in a .then branch
after inspecting the response), keep setAlertDetails(handleException(...)) in
the .catch branch, and preserve pushIndexDoc(docId) before the request; update
the axiosPrivate(requestOptions) call around pushIndexDoc/deleteIndexDoc to call
deleteIndexDoc(docId) on both success and failure (use the existing
pushIndexDoc, deleteIndexDoc, axiosPrivate, setAlertDetails, and handleException
symbols to locate and modify the code).

workers/executor/executors/retrievers/base_retriever.py-10-35 (1)

10-35: ⚠️ Potential issue | 🟠 Major

Remove @staticmethod and raise NotImplementedError to enforce the base contract.

The current implementation silently returns an empty set, which masks incomplete implementations. All 7 concrete retrievers properly override this method, but the base class should fail fast if called directly. Ideally, promote BaseRetriever to an abstract base class with @abstractmethod to prevent instantiation entirely—this pattern already exists elsewhere in the codebase (BaseTool, FileStorageInterface).
Suggested change
 class BaseRetriever:
     def __init__(
         self,
@@ -30,6 +30,6 @@ class BaseRetriever:
         self.top_k = top_k
         self.llm = llm if llm else None
 
-    `@staticmethod`
-    def retrieve() -> set[str]:
-        return set()
+    def retrieve(self) -> set[str]:
+        raise NotImplementedError
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@workers/executor/executors/retrievers/base_retriever.py` around lines 10 -
35, The BaseRetriever class currently defines retrieve as a `@staticmethod`
returning an empty set; change this to an abstract instance method that enforces
the contract: remove the `@staticmethod` decorator on BaseRetriever.retrieve, make
BaseRetriever inherit from abc.ABC (or otherwise mark it abstract), and
implement retrieve(self, ...) to raise NotImplementedError (or use
`@abstractmethod`) so calling the base method fails fast; reference the
BaseRetriever class and its retrieve method when making this change so concrete
retrievers continue to override the instance method.

workers/executor/worker.py-36-48 (1)

36-48: ⚠️ Potential issue | 🟠 Major

Don't report a worker with zero executors as healthy.

ExecutorRegistry.list_executors() can return an empty list when import-time registration breaks, but both the registered health check and the task-level healthcheck still report success. That makes a worker that cannot execute anything look ready to orchestration and monitoring. Please degrade when no executors are registered, and have the task reuse that computed status/details instead of hardcoding "healthy".

Also applies to: 67-75
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@workers/executor/worker.py` around lines 36 - 48, The health check currently
always returns HealthStatus.HEALTHY and hardcodes message/details even when
ExecutorRegistry.list_executors() returns an empty list; change logic in the
health check (the function that builds the HealthCheckResult using
ExecutorRegistry.list_executors()) to detect an empty executors list and set
status to HealthStatus.UNHEALTHY (or a degraded status), adjust the message and
details appropriately (e.g., note "no executors registered"), and then reuse
that same computed HealthCheckResult/details in the task-level healthcheck
(instead of hardcoding "healthy"/queues) so both the registered health check and
the task-level healthcheck reflect the computed status and details. Ensure
references: ExecutorRegistry.list_executors(), HealthCheckResult, HealthStatus,
and the task-level healthcheck function are updated.

frontend/src/hooks/usePromptStudioSocket.js-45-72 (1)

45-72: ⚠️ Potential issue | 🟠 Major

Completed events need the same cleanup fallback as failed events.

handleCompleted() only clears local state from result. If the callback succeeds with [] or with a payload that omits the IDs you need, the pending prompt/index status never gets removed and the UI spinner stays stuck. Pass extra into the success path and reuse the metadata-based fallback clearing you already have in handleFailed().

Also applies to: 132-135
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@frontend/src/hooks/usePromptStudioSocket.js` around lines 45 - 72,
handleCompleted currently only clears state using result payload and can leave
pending statuses if result is [] or missing IDs; update handleCompleted (and the
similar block around the other occurrence at lines ~132-135) to accept the extra
metadata argument and call the same metadata-based fallback clearing used in
handleFailed: pass extra into handleCompleted, and inside each success branch
(fetch_response, single_pass_extraction, index_document) call
clearResultStatuses(result, extra) or invoke the metadata-based cleanup logic
used by handleFailed (using prompt/index IDs from extra when result lacks them)
so updatePromptOutputState, updateCustomTool, deleteIndexDoc, and
setAlertDetails remain but pending statuses are always cleared via the extra
fallback.

frontend/src/hooks/usePromptRun.js-51-67 (1)

51-67: ⚠️ Potential issue | 🟠 Major

Timeout callback is not cleaned up on unmount - potential memory leak.

The setTimeout created in the .then() callback is not tracked or cleared. If the component unmounts before the 5-minute timeout fires, the callback will still execute, potentially causing:

State updates on unmounted component
Stale closure issues with store references

Since this hook is used in PromptRun.jsx which may unmount when navigating away, this could cause issues.

🐛 Proposed fix - track and clear timeouts

+import { useRef, useEffect } from "react";
+
 const usePromptRun = () => {
+  const timeoutRefs = useRef(new Map());
+
+  // Cleanup timeouts on unmount
+  useEffect(() => {
+    return () => {
+      timeoutRefs.current.forEach((timeoutId) => clearTimeout(timeoutId));
+      timeoutRefs.current.clear();
+    };
+  }, []);
+
   // ... existing code ...

   const runPromptApi = (api) => {
     const [promptId, docId, profileId] = api.split("__");
     const runId = generateUUID();
+    const timeoutKey = `${promptId}__${docId}__${profileId}`;
     // ... body and requestOptions ...

     makeApiRequest(requestOptions)
       .then(() => {
-        setTimeout(() => {
+        const timeoutId = setTimeout(() => {
+          timeoutRefs.current.delete(timeoutKey);
           const statusKey = generateApiRunStatusId(docId, profileId);
           // ... rest of timeout logic
         }, SOCKET_TIMEOUT_MS);
+        timeoutRefs.current.set(timeoutKey, timeoutId);
       })

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@frontend/src/hooks/usePromptRun.js` around lines 51 - 67, The timeout created
after makeApiRequest in the hook is not tracked or cleared, risking state
updates after unmount; modify the hook to store the timeout ID (e.g., in a ref
or local variable tied to the hook) when calling setTimeout (the block that uses
generateApiRunStatusId, usePromptRunStatusStore, removePromptStatus, and
setAlertDetails with SOCKET_TIMEOUT_MS) and ensure you clearTimeout on
cleanup/unmount and/or before scheduling a new timeout (also clear it when the
API request or socket result arrives) so the callback cannot run against an
unmounted component or stale closures.

workers/executor/tasks.py-19-27 (1)

19-27: ⚠️ Potential issue | 🟠 Major

Avoid blanket autoretry for executor operations.

This retries the entire task, not just a transport call. A timeout after an indexing/LLM/vector-store request has already been accepted will rerun the operation and can double-write or double-bill unless every executor path is idempotent end-to-end. Prefer retries inside the specific client calls, or carry an idempotency key/checkpoint through the executor flow before enabling task-level retries.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@workers/executor/tasks.py` around lines 19 - 27, The `@shared_task` decorator
on TaskName.EXECUTE_EXTRACTION currently applies blanket autoretry for the whole
executor task which can cause double-writes/double-billing; change this by
removing or disabling task-level autoretry for the execute_extraction task and
instead implement targeted retries inside the specific client calls (e.g.,
indexing, LLM, vector-store client functions) or propagate an idempotency
key/checkpoint through the executor flow so retries are safe end-to-end; locate
the decorator annotation used on the execute_extraction task and update the
retry strategy accordingly, adding per-client retry logic around transport calls
or adding an idempotency token passed into execute_extraction and checked before
performing side-effecting operations.

workers/executor/executors/retrieval.py-71-87 (1)

71-87: ⚠️ Potential issue | 🟠 Major

Preserve retrieval rank instead of coercing unordered chunks to a list.

The new retrievers currently expose set[str], so return list(context) makes output order hash-dependent rather than relevance-dependent. If callers concatenate chunks in returned order, answer quality becomes nondeterministic. Please move the contract to an ordered sequence and dedupe without dropping rank.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@workers/executor/executors/retrieval.py` around lines 71 - 87, The current
code does list(context) which converts a set to an unordered list and breaks
relevance ranking; instead, treat the retriever result as an ordered sequence
and preserve rank while removing duplicates: iterate over the returned context
from retriever.retrieve() in order (do not call list(context) directly), build a
new list by appending items only if not seen before (use a local seen set keyed
by chunk id or the chunk itself) and return that deduped list; keep the existing
metrics/logging (context_retrieval_metrics, prompt_key, doc_id, retrieval_type,
top_k) unchanged and apply deduping before len(context) and the returned value.

workers/executor/executors/retrievers/router.py-37-59 (1)

37-59: ⚠️ Potential issue | 🟠 Major

keyword_search is still the same semantic engine.

This helper builds the same query-engine path as the base tool and only changes similarity_top_k. The router metadata promises exact-term matching, but the implementation never switches retrieval strategy, so exact-match queries can be routed to a tool that does not actually provide keyword behavior.

🔧 Minimal fallback if a real keyword backend is not ready yet

-                        name="keyword_search",
+                        name="expanded_vector_search",
                         description=(
-                            "Best for finding specific terms, names, numbers, dates, "
-                            "or exact phrases. Use when looking for precise matches."
+                            "Broader semantic search with more candidates."
                         ),

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/executor/executors/retrievers/router.py` around lines 37 - 59, The
current _add_keyword_search_tool creates a semantic engine
(vector_store_index.as_query_engine) and only changes similarity_top_k so it
does not perform true exact-term matching; update _add_keyword_search_tool to
construct a keyword/term-based retriever instead of the default semantic
retriever: try to obtain a keyword retriever from vector_store_index (e.g., a
method like as_retriever(search_type="keyword") or a retriever configured for
exact matching) and pass that retriever into vector_store_index.as_query_engine
(or build a QueryEngine that uses that retriever) before creating the
QueryEngineTool with metadata name="keyword_search"; if the vector store has no
native keyword retriever, fall back to a simple filter/exact-match pass-through
(or a low-level text/term scan) and ensure exceptions are caught and logged from
this new retrieval creation path.

workers/executor/executors/variable_replacement.py-31-35 (1)

31-35: ⚠️ Potential issue | 🟠 Major

Don't treat valid falsey values as “missing.”

Both branches skip replacement on if not output_value, so 0, False, empty strings, and empty collections leave unresolved template tokens in the prompt even when the key exists.

Proposed fix

-        if not output_value:
+        if output_value is None:
             return prompt
@@
-        if not output_value:
+        if output_value is None:
             return prompt

Also applies to: 90-94

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/executor/executors/variable_replacement.py` around lines 31 - 35, The
code treats any falsy output_value as missing, causing valid values like 0,
False, "" or [] to skip replacement; update the checks in
variable_replacement.py (where
VariableReplacementHelper.check_static_variable_run_status is called) to only
treat None as missing (e.g., change "if not output_value" to "if output_value is
None"), and apply the same change to the other identical branch later in the
file so only None (or an explicit sentinel) prevents replacement.

workers/executor/executors/postprocessor.py-34-53 (1)

34-53: ⚠️ Potential issue | 🟠 Major

Validate the webhook response shape before inspecting it.

response.json() can return any JSON value, not just an object. If the webhook answers with a scalar like true or 42, this membership check raises instead of falling back to parsed_data, which turns a bad webhook payload into a hard failure.

Proposed fix

 def _process_successful_response(
-    response_data: dict, parsed_data: dict, highlight_data: list | None
+    response_data: Any, parsed_data: dict, highlight_data: list | None
 ) -> tuple[dict[str, Any], list | None]:
     """Process successful webhook response."""
+    if not isinstance(response_data, dict):
+        logger.warning("Ignoring postprocessing due to invalid webhook response type")
+        return parsed_data, highlight_data
+
     if "structured_output" not in response_data:
         logger.warning("Response missing 'structured_output' key")
         return parsed_data, highlight_data

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/executor/executors/postprocessor.py` around lines 34 - 53, The
function _process_successful_response assumes response_data is a dict and does
"if 'structured_output' not in response_data", which will raise when
response_data is a non-object JSON (e.g., a scalar); guard against that by first
checking isinstance(response_data, dict) (or dict-like) and return parsed_data,
highlight_data if it's not a dict, then proceed with existing logic (use
response_data.get("structured_output")/response_data.get("highlight_data") and
keep calling _validate_structured_output and _validate_highlight_data as
before).

workers/file_processing/structure_tool_task.py-645-651 (1)

645-651: ⚠️ Potential issue | 🟠 Major

Don't reset METADATA.json after a failed read.

If fs.read() or json.loads() fails here, the code falls back to {} and then rewrites the file, which drops prior tool_metadata and total_elapsed_time. Log the read error and abort this update instead of silently overwriting accumulated metadata.

Proposed fix

         if fs.exists(metadata_path):
             try:
                 existing_raw = fs.read(path=metadata_path, mode="r")
                 if existing_raw:
                     existing = json.loads(existing_raw)
-            except Exception:
-                pass
+            except Exception as e:
+                logger.warning("Failed to read existing METADATA.json: %s", e)
+                return

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/file_processing/structure_tool_task.py` around lines 645 - 651, When
reading METADATA.json (metadata_path) you currently swallow exceptions from
fs.read() and json.loads(), then fall back to an empty dict and overwrite
accumulated fields; instead catch the exception, log the error (including the
exception details) and abort this update so you do not reset prior tool_metadata
or total_elapsed_time. Specifically, in the block around
fs.read(path=metadata_path, mode="r") and json.loads(existing_raw) (the
variables existing_raw and existing), replace the bare except: pass with logging
the exception (e.g., logger.exception or process_logger.error with the
exception) and return/raise to skip rewriting METADATA.json.

workers/executor/executors/variable_replacement.py-65-69 (1)

65-69: ⚠️ Potential issue | 🟠 Major

Catch TypeError in the JSON-to-string fallback.

json.dumps() raises TypeError for non-serializable objects. Right now values like datetime, Decimal, or SDK objects will escape this helper and abort prompt rendering instead of falling back to str().

Proposed fix

     def handle_json_and_str_types(value: Any) -> str:
         try:
             formatted_value = json.dumps(value)
-        except ValueError:
+        except (TypeError, ValueError):
             formatted_value = str(value)
         return formatted_value

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/executor/executors/variable_replacement.py` around lines 65 - 69, The
helper handle_json_and_str_types currently only catches ValueError from
json.dumps, but json.dumps raises TypeError for non-serializable objects (e.g.,
datetime, Decimal, SDK objects), causing failures; update the except clause to
catch TypeError as well (or catch both ValueError and TypeError) and then fall
back to str(value) so non-JSON-serializable inputs are safely converted for
prompt rendering.

workers/file_processing/structure_tool_task.py-270-287 (1)

270-287: ⚠️ Potential issue | 🟠 Major

Preserve exported tool defaults when no workflow override is provided.

These get(..., False) calls overwrite the fetched tool’s own settings with False, so summarize/highlight/single-pass/challenge can be disabled just because the instance payload omitted the key. The agentic path already falls back to exported metadata; the regular path should do the same.

Proposed fix

     # ---- Extract settings from tool_metadata ----
     settings = tool_instance_metadata
-    is_challenge_enabled = settings.get(_SK.ENABLE_CHALLENGE, False)
-    is_summarization_enabled = settings.get(_SK.SUMMARIZE_AS_SOURCE, False)
-    is_single_pass_enabled = settings.get(_SK.SINGLE_PASS_EXTRACTION_MODE, False)
-    challenge_llm = settings.get(_SK.CHALLENGE_LLM_ADAPTER_ID, "")
-    is_highlight_enabled = settings.get(_SK.ENABLE_HIGHLIGHT, False)
-    is_word_confidence_enabled = settings.get(_SK.ENABLE_WORD_CONFIDENCE, False)
+    tool_id = tool_metadata[_SK.TOOL_ID]
+    tool_settings = tool_metadata[_SK.TOOL_SETTINGS]
+    outputs = tool_metadata[_SK.OUTPUTS]
+    is_challenge_enabled = settings.get(
+        _SK.ENABLE_CHALLENGE, tool_settings.get(_SK.ENABLE_CHALLENGE, False)
+    )
+    is_summarization_enabled = settings.get(
+        _SK.SUMMARIZE_AS_SOURCE, tool_settings.get(_SK.SUMMARIZE_AS_SOURCE, False)
+    )
+    is_single_pass_enabled = settings.get(
+        _SK.SINGLE_PASS_EXTRACTION_MODE,
+        tool_settings.get(_SK.ENABLE_SINGLE_PASS_EXTRACTION, False),
+    )
+    challenge_llm = settings.get(
+        _SK.CHALLENGE_LLM_ADAPTER_ID, tool_settings.get(_SK.CHALLENGE_LLM, "")
+    )
+    is_highlight_enabled = settings.get(
+        _SK.ENABLE_HIGHLIGHT, tool_settings.get(_SK.ENABLE_HIGHLIGHT, False)
+    )
+    is_word_confidence_enabled = settings.get(
+        _SK.ENABLE_WORD_CONFIDENCE,
+        tool_settings.get(_SK.ENABLE_WORD_CONFIDENCE, False),
+    )
@@
-    tool_id = tool_metadata[_SK.TOOL_ID]
-    tool_settings = tool_metadata[_SK.TOOL_SETTINGS]
-    outputs = tool_metadata[_SK.OUTPUTS]

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/file_processing/structure_tool_task.py` around lines 270 - 287, The
instance-level settings (variable settings = tool_instance_metadata) currently
use settings.get(..., False) which unintentionally overrides exported tool
defaults; update the lookups for is_challenge_enabled, is_summarization_enabled,
is_single_pass_enabled, challenge_llm, is_highlight_enabled, and
is_word_confidence_enabled to fall back to the exported tool defaults from
tool_settings (tool_metadata[_SK.TOOL_SETTINGS]) when the key is missing, e.g.
use settings.get(KEY, tool_settings.get(KEY)) or similar so the tool’s exported
defaults are preserved when the instance payload omits a key; keep using the
same _SK key symbols (e.g., _SK.ENABLE_CHALLENGE, _SK.SUMMARIZE_AS_SOURCE,
_SK.SINGLE_PASS_EXTRACTION_MODE, _SK.CHALLENGE_LLM_ADAPTER_ID,
_SK.ENABLE_HIGHLIGHT, _SK.ENABLE_WORD_CONFIDENCE) to locate and change the
lookups.

frontend/src/components/custom-tools/prompt-card/PromptCard.jsx-75-79 (1)

75-79: ⚠️ Potential issue | 🟠 Major

Don't bind per-card progress to tool-global socket messages.

details?.tool_id is shared by every prompt card in the tool, so this predicate lets any tool-scoped INFO/ERROR message leak into every card's progressMsg. During concurrent runs, one prompt can end up showing another prompt's progress or error state.

Suggested narrowing

         .find(
           (item) =>
             (item?.component?.prompt_id === promptDetailsState?.prompt_id ||
-              item?.component?.prompt_key === promptKey ||
-              item?.component?.tool_id === details?.tool_id) &&
+              item?.component?.prompt_key === promptKey ||
+              (!item?.component?.prompt_id &&
+                !item?.component?.prompt_key &&
+                item?.component?.tool_id === details?.tool_id)) &&
             (item?.level === "INFO" || item?.level === "ERROR"),
         );

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx` around lines
75 - 79, Predicate in PromptCard.jsx is too broad because checking
details?.tool_id allows tool-global INFO/ERROR socket messages to apply to every
card; update the filter used for progressMsg so it only accepts messages
explicitly targeted to this prompt instance by requiring a match on a unique
prompt identifier (e.g., item?.component?.prompt_id ===
promptDetailsState?.prompt_id or item?.component?.prompt_key === promptKey) and
not just details?.tool_id, or if messages include a run/instance id use that
(e.g., item?.component?.run_id or instance_id) combined with
prompt_id/prompt_key to correlate; change the predicate that references
item?.component?.prompt_id, promptDetailsState?.prompt_id, promptKey and
details?.tool_id so it narrows scope to the specific prompt instance rather than
any message with the same tool_id.

workers/executor/executors/dto.py-26-31 (1)

26-31: ⚠️ Potential issue | 🟠 Major

Validate the full chunking invariant here.

Only rejecting chunk_size == 0 still allows negative sizes and chunk_overlap >= chunk_size; that produces a zero/negative stride for chunkers and can break or hang indexing downstream.

Suggested fix

     def __post_init__(self) -> None:
-        if self.chunk_size == 0:
-            raise ValueError(
-                "Indexing cannot be done for zero chunks."
-                "Please provide a valid chunk_size."
-            )
+        if self.chunk_size <= 0:
+            raise ValueError("chunk_size must be greater than 0")
+        if self.chunk_overlap < 0:
+            raise ValueError("chunk_overlap cannot be negative")
+        if self.chunk_overlap >= self.chunk_size:
+            raise ValueError(
+                "chunk_overlap must be smaller than chunk_size"
+            )

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/executor/executors/dto.py` around lines 26 - 31, In __post_init__
validate the full chunking invariant: ensure chunk_size is > 0, chunk_overlap is
>= 0, and chunk_overlap < chunk_size (so stride = chunk_size - chunk_overlap is
positive); if any check fails raise a ValueError with a clear message mentioning
chunk_size and chunk_overlap; update the validation logic in the __post_init__
method (which currently only checks chunk_size == 0) to perform these three
checks.

workers/tests/test_legacy_executor_scaffold.py-252-268 (1)

252-268: ⚠️ Potential issue | 🟠 Major

This Flask check is session-global, not module-specific.

Scanning all flask* entries in sys.modules makes the test pass or fail based on unrelated imports from the rest of the test run, not on what executor.executors.exceptions did. Please assert on imports triggered by this module itself instead of inspecting global interpreter state afterward.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@workers/tests/test_legacy_executor_scaffold.py` around lines 252 - 268, The
test_no_flask_import currently inspects global sys.modules, which is
session-global; change it to check only imports caused by importing
executor.executors.exceptions by recording sys.modules before and after the
import inside the test_no_flask_import function and asserting that no new keys
starting with "flask" were added; specifically, capture a snapshot (e.g.,
pre_modules = set(sys.modules)), import or reload the module under test
(executor.executors.exceptions), compute added = set(sys.modules) - pre_modules,
and then assert that no name in added startswith "flask" to ensure the module
itself didn't pull in Flask.

backend/prompt_studio/prompt_studio_core_v2/views.py-389-423 (1)

389-423: ⚠️ Potential issue | 🟠 Major

Keep the async rollout behind async_prompt_execution.

These branches now always dispatch to Celery and return 202 Accepted. That removes the synchronous fallback instead of guarding it behind the rollout flag, so backend behavior changes even when the async feature is supposed to be off.

Also applies to: 464-504, 561-595

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/views.py` around lines 389 - 423,
The current code in PromptStudio views (around the dispatch block that calls
PromptStudioHelper._get_dispatcher() and dispatch_with_callback, currently
pre-generating executor_task_id and returning HTTP_202_ACCEPTED) always uses
Celery; wrap this async dispatch and the 202 response behind the async rollout
flag (async_prompt_execution) and preserve the synchronous fallback when the
flag is false. Concretely, in the view(s) that build context via
PromptStudioHelper.build_index_payload, fetch the rollout flag
(async_prompt_execution), and only call PromptStudioHelper._get_dispatcher(),
generate executor_task_id, call dispatcher.dispatch_with_callback(...) and
return Response(..., status=HTTP_202_ACCEPTED) when the flag is true; otherwise
keep the existing synchronous execution path (no dispatcher, no task id) and
return the original synchronous result. Apply the same guard to the other
similar dispatch blocks referenced (lines around 464-504 and 561-595) so all
async Celery dispatches are behind async_prompt_execution.

backend/prompt_studio/prompt_studio_core_v2/views.py-452-453 (1)

452-453: ⚠️ Potential issue | 🟠 Major

Scope prompt/document lookups to the current tool.

These raw objects.get(pk=...) calls let callers mix IDs across projects. A user who can access one tool can submit a prompt or document UUID from another tool and have this endpoint operate on it. Resolve both through the current tool and return 404 when they do not belong to it.

Also applies to: 461-462, 536-537
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/views.py` around lines 452 - 453,
Scope all prompt and document lookups to the current tool: replace raw
ToolStudioPrompt.objects.get(pk=prompt_id) (and the similar lookups at the other
occurrences) with a scoped lookup that includes the current tool, e.g. use
get_object_or_404(ToolStudioPrompt, pk=prompt_id, tool=current_tool) or
ToolStudioPrompt.objects.get(pk=prompt_id, tool=current_tool) wrapped to raise
Http404; do the same for the document model lookups (e.g. ToolDocument or the
document class used at the other locations) so requests for IDs from other tools
return 404.

backend/api_v2/deployment_helper.py-279-284 (1)

279-284: ⚠️ Potential issue | 🟠 Major

include_metrics=True currently keeps full inner metadata.

Both branches only call remove_inner_result_metadata() when both flags are false. After usage enrichment, the include_metrics and not include_metadata path still returns the entire result.metadata object, not just usage/metrics, which broadens the response contract unexpectedly.

Also applies to: 483-488
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@backend/api_v2/deployment_helper.py` around lines 279 - 284, The current
logic enriches usage metadata via cls._enrich_result_with_usage_metadata(result)
but only calls result.remove_inner_result_metadata() when both include_metadata
and include_metrics are false, leaving full inner metadata when
include_metrics=True and include_metadata=False; change the branching so that
after enrichment you unconditionally call result.remove_inner_result_metadata()
whenever include_metadata is False (i.e., if not include_metadata:
result.remove_inner_result_metadata()), and still call
result.remove_result_metrics() when not include_metrics; apply this fix to both
occurrences around the
cls._enrich_result_with_usage_metadata/result.remove_inner_result_metadata
blocks (the block shown and the similar block at lines 483-488) so metrics-only
responses no longer contain full inner metadata.

backend/prompt_studio/prompt_studio_core_v2/test_tasks.py-289-336 (1)

289-336: ⚠️ Potential issue | 🟠 Major

These tests are pinned to deleted implementation details.

views.py in this PR no longer calls run_* .apply_async or reads StateStore directly; it builds an ExecutionContext and dispatches through the new dispatcher. The Phase 8 assertions will fail against the current code, and the Phase 9 cases only grep source, so they do not verify task_status() at runtime.

Also applies to: 361-394

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/test_tasks.py` around lines 289 -
336, The tests currently assert old implementation details (run_*.apply_async
and direct StateStore.get) which no longer exist; update tests for
PromptStudioCoreView.index_document, PromptStudioCoreView.fetch_response, and
PromptStudioCoreView.single_pass_extraction to (1) assert the view dispatches
via the new dispatcher (e.g., check for dispatcher.dispatch or
dispatcher.async_dispatch usage in the source or mock the dispatcher and assert
it was called), (2) assert an ExecutionContext (or the helper that builds it) is
constructed/passed and includes captures for Common.LOG_EVENTS_ID and
Common.REQUEST_ID, and (3) replace the Phase 9 greps with a runtime assertion
that the dispatcher call results in the expected task status behavior (e.g.,
mock dispatcher.dispatch/async_dispatch and assert task_status()/response code
is HTTP_202_ACCEPTED). Use the unique symbols
PromptStudioCoreView.index_document, PromptStudioCoreView.fetch_response,
PromptStudioCoreView.single_pass_extraction, ExecutionContext,
dispatcher.dispatch/async_dispatch, and Common.LOG_EVENTS_ID/Common.REQUEST_ID
to locate the code to change.

backend/prompt_studio/prompt_studio_core_v2/views.py-597-629 (1)

597-629: ⚠️ Potential issue | 🟠 Major

Validate task ownership before returning Celery results.

This detail action never resolves the CustomTool, so any object-level permission logic on IsOwnerOrSharedUserOrSharedToOrg is skipped, and pk is not tied to task_id at all. A caller who learns another task ID can query it through any tool they can access unless you check that the task belongs to this tool/run first.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/views.py` around lines 597 - 629,
The task_status action skips resolving the CustomTool and thus bypasses
object-level permission checks; fix it by calling self.get_object() at the start
of task_status to enforce IsOwnerOrSharedUserOrSharedToOrg, then after obtaining
the AsyncResult (using AsyncResult and get_worker_celery_app()) verify that the
Celery task actually belongs to that tool/run (e.g., compare the tool's PK to an
identifier stored on the task result/meta/payload) and return HTTP 403 if it
does not match before returning task status/result.

unstract/sdk1/src/unstract/sdk1/execution/dispatcher.py-133-164 (1)

133-164: ⚠️ Potential issue | 🟠 Major

Keep dispatch() failure-safe around broker send and result decoding.

send_task() and ExecutionResult.from_dict() are outside the try, so a broker outage or malformed worker payload will raise instead of returning ExecutionResult.failure(...). That breaks the public contract of this synchronous API.

Suggested fix

-        async_result = self._app.send_task(
-            _TASK_NAME,
-            args=[context.to_dict()],
-            queue=queue,
-        )
-        logger.info(
-            "Task sent: celery_task_id=%s, waiting for result...",
-            async_result.id,
-        )
-
         try:
+            async_result = self._app.send_task(
+                _TASK_NAME,
+                args=[context.to_dict()],
+                queue=queue,
+            )
+            logger.info(
+                "Task sent: celery_task_id=%s, waiting for result...",
+                async_result.id,
+            )
             # disable_sync_subtasks=False: safe because the executor task
             # runs on a separate worker pool (worker-v2) — no deadlock
             # risk even when dispatch() is called from inside a Django
             # Celery task.
             result_dict = async_result.get(
                 timeout=timeout,
                 disable_sync_subtasks=False,
             )
+            return ExecutionResult.from_dict(result_dict)
         except Exception as exc:
             logger.error(
                 "Dispatch failed: executor=%s operation=%s run_id=%s error=%s",
                 context.executor_name,
                 context.operation,
                 context.run_id,
                 exc,
             )
             return ExecutionResult.failure(
                 error=f"{type(exc).__name__}: {exc}",
             )
-
-        return ExecutionResult.from_dict(result_dict)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@unstract/sdk1/src/unstract/sdk1/execution/dispatcher.py` around lines 133 -
164, The dispatch() method currently only wraps async_result.get(...) in
try/except, leaving self._app.send_task(...) and ExecutionResult.from_dict(...)
able to raise; update dispatch() to make the entire send/get/decode sequence
failure-safe by expanding the try block (or adding an outer try) to include the
call to self._app.send_task(...) and the call to ExecutionResult.from_dict(...),
catch Exception as exc, log the error with context.executor_name,
context.operation, context.run_id and the exception, and return
ExecutionResult.failure(error=f"{type(exc).__name__}: {exc}") so broker send
failures and malformed worker payloads return a failure result instead of
raising.

backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py-390-393 (1)

390-393: ⚠️ Potential issue | 🟠 Major

Keep build_index_payload() side-effect free.

This helper marks the document as indexing before it even returns the ExecutionContext. If anything fails between this return and the eventual task publish, no worker ever sees the job and the document is left stuck in the pending state.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py` around
lines 390 - 393, build_index_payload currently performs a side-effect by calling
DocumentIndexingService.set_document_indexing(org_id=org_id, user_id=user_id,
doc_id_key=doc_id_key) before returning the ExecutionContext; remove that call
from build_index_payload so the function is side-effect free and only
constructs/returns the payload/ExecutionContext, then ensure the caller (the
code that publishes the indexing task) invokes
DocumentIndexingService.set_document_indexing only after the task
publish/queueing succeeds (use the same org_id, user_id, doc_id_key parameters),
so a document is marked indexing only when a worker will actually receive the
job.

backend/prompt_studio/prompt_studio_core_v2/tasks.py-142-144 (1)

142-144: ⚠️ Potential issue | 🟠 Major

Mirror the indexing cleanup in the generic callback exception path.

Only the explicit executor-failure branch clears the in-progress marker. If post-success bookkeeping raises before mark_document_indexed() finishes, this path emits the websocket error and re-raises, but the document can remain stuck as "being indexed".

Also applies to: 174-183
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/tasks.py` around lines 142 - 144,
The generic callback exception path doesn’t clear the in-progress marker like
the executor-failure branch, so ensure
DocumentIndexingService.remove_document_indexing(org_id=org_id, user_id=user_id,
doc_id_key=doc_id_key) is called on all error paths (including the generic
exception handler that emits the websocket error and re-raises) and when
post-success bookkeeping raises before mark_document_indexed(); move or add the
remove_document_indexing call into a shared cleanup/finally block around
mark_document_indexed()/post-success bookkeeping in the task function so both
the executor-failure branch and the generic callback exception path (and the
post-success error case) always clear the "being indexed" marker.

backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py-186-188 (1)

186-188: ⚠️ Potential issue | 🟠 Major

Don't fail open when created_by is missing.

This early return skips every adapter ownership check. An orphaned or imported ProfileManager can then execute with adapters that are not shared_to_org, which effectively bypasses the permission gate this method is supposed to enforce.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py` around
lines 186 - 188, The early return when profile_manager_owner is None skips
ownership checks; instead remove that early return and enforce ownership
validation: if profile_manager_owner is None, try to derive owner from
profile_manager.created_by, and if created_by is also missing treat the profile
manager as orphaned and require that every adapter in profile_manager.adapters
has shared_to_org == True (otherwise raise an authorization error). Update the
logic in the ownership-checking function that references
profile_manager_owner/profile_manager to perform this fallback and to deny
access when neither owner nor created_by is present unless all adapters are
shared_to_org.

backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py-572-577 (1)

572-577: ⚠️ Potential issue | 🟠 Major

Create the summary artifact before switching the payload path.

Both builders rewrite the payload to .../summarize/<stem>.txt, but neither code path generates that file first. On a first-run document or after cache cleanup, the subsequent hash/read will fail because the summary artifact does not exist yet.

Also applies to: 666-667, 815-821
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py` around
lines 572 - 577, The code changes extract_path to the summarize path before the
summary file is created, causing subsequent reads/hashes to fail; before
assigning extract_path to Path(.../"summarize"/(p.stem+".txt")) ensure the
summary artifact is created/written first (e.g., call the existing
summary-writing routine or write the summarized content to that target path
using the current payload) and only then set profile_manager.chunk_size and
replace extract_path; apply the same fix to the other similar blocks referenced
in the diff (the other summarize branches around the other occurrences).

workers/executor/executors/legacy_executor.py-694-704 (1)

694-704: ⚠️ Potential issue | 🟠 Major

Use each output's adapter tuple when deduplicating and indexing.

This code always reads vector-db, embedding, and x2text_adapter from tool_settings, even though those fields can vary per output. Mixed-output pipelines will currently collapse distinct adapter combinations onto one param_key and may build the child index_ctx with the wrong backend IDs.

Suggested fix

-            vector_db = tool_settings.get("vector-db", "")
-            embedding = tool_settings.get("embedding", "")
-            x2text = tool_settings.get("x2text_adapter", "")
+            vector_db = output.get("vector-db", tool_settings.get("vector-db", ""))
+            embedding = output.get("embedding", tool_settings.get("embedding", ""))
+            x2text = output.get(
+                "x2text_adapter", tool_settings.get("x2text_adapter", "")
+            )

Also applies to: 725-738

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/executor/executors/legacy_executor.py` around lines 694 - 704, The
deduplication/indexing param_key is built using the global tool_settings instead
of each output's own adapter settings, causing different outputs to be
collapsed; update the code that constructs param_key (and the similar block
later around the creation of index_ctx) to read vector-db, embedding, and
x2text_adapter from the specific output's settings (e.g.
output.get("tool_settings", {}) or the per-output metadata) rather than from the
outer tool_settings, so param_key reflects chunk_size, chunk_overlap and the
output-specific adapter tuple and the child index_ctx is created with the
correct backend IDs.

workers/tests/test_sanity_phase4.py-168-351 (1)

168-351: ⚠️ Potential issue | 🟠 Major

These helpers mirror the payload contract instead of exercising it.

_make_ide_prompt() and the _ide_*_ctx() factories hardcode the same shape that prompt_studio_helper.py is supposed to produce, and TestIDEPayloadKeyCompatibility only re-asserts those local literals. If the real helper drifts, this module can still go green because both sides of the “contract” live here. Build the contexts through the production helper path or a shared factory/DTO so these tests actually fail on contract regressions.

Also applies to: 835-899
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@workers/tests/test_sanity_phase4.py` around lines 168 - 351, The tests
currently replicate the exact payload shape instead of using the real builder,
so update the factories (_make_ide_prompt, _ide_extract_ctx, _ide_index_ctx,
_ide_answer_prompt_ctx, _ide_single_pass_ctx) to build their payloads via the
production prompt studio helper (or a shared DTO/factory) rather than hardcoding
literals; replace the direct dict constructions with calls to the real helper
functions in prompt_studio_helper.py (or a new shared factory used by both prod
and tests) and pass overrides through that helper so tests fail if the real
contract changes.

workers/tests/test_sanity_phase4.py-810-829 (1)

810-829: ⚠️ Potential issue | 🟠 Major

This test never proves the IDE-specific variable-replacement path ran.

result.success only shows the request completed. Also, if is_variables_present() gates replacement, returning False here bypasses the very branch the docstring says this test covers. Drive the IDE path and assert on the mocked variable-replacement call/kwargs that consume the IDE flag.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@workers/tests/test_sanity_phase4.py` around lines 810 - 829, The test
currently only checks result.success and sets
var_service.is_variables_present.return_value = False which prevents the
variable-replacement branch; change the mock so the VariableReplacementService
will run the replacement (set is_variables_present.return_value = True or
otherwise allow the replacement to be invoked), then after calling
_run_task(eager_app, ctx.to_dict()) assert that the variable-replacement method
on var_service (e.g., replace_variables / perform_replacement) was called and
that its call/kwargs include is_ide=True, using the existing mocks (var_service,
_ide_answer_prompt_ctx, _run_task, ExecutionResult) to prove the IDE-specific
path executed.

workers/tests/test_sanity_phase4.py-797-805 (1)

797-805: ⚠️ Potential issue | 🟠 Major

Assert the no-write behavior you describe.

This test says IDE mode skips metadata writes, but there is no assertion on mock_fs. As written, it still passes if _update_exec_metadata calls dump_json() in IDE mode.

✅ Add the missing assertion

         result_dict = _run_task(eager_app, ctx.to_dict())
         result = ExecutionResult.from_dict(result_dict)
         assert result.success is True
-
-        # For IDE source, _update_exec_metadata should NOT write
-        # (it only writes for execution_source="tool")
-        # This is verified by the fact that no dump_json was called
-        # on the fs mock. In IDE mode, whisper_hash metadata is skipped.
+        mock_fs.dump_json.assert_not_called()

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@workers/tests/test_sanity_phase4.py` around lines 797 - 805, The test lacks
an assertion that mock_fs.dump_json was not called for IDE execution_source;
update the test_sanity_phase4.py test (the block invoking _run_task(eager_app,
ctx.to_dict()) and creating result via ExecutionResult.from_dict) to assert that
the filesystem mock's dump_json (mock_fs.dump_json or the specific mock variable
used in the test) was not called—i.e., add an assertion like assert
mock_fs.dump_json.call_count == 0 or assert not mock_fs.dump_json.called after
verifying result.success, so _update_exec_metadata's no-write behavior in IDE
mode is actually enforced.

coderabbitai · 2026-03-12T08:38:18Z

unstract/sdk1/src/unstract/sdk1/utils/indexing.py

 from unstract.sdk1.file_storage import FileStorage, FileStorageProvider
 from unstract.sdk1.platform import PlatformHelper
-from unstract.sdk1.tool.base import BaseTool
+from unstract.sdk1.tool.stream import StreamMixin


⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash # Description: Verify StreamMixin interface and callers of generate_index_key echo "=== 1. Check StreamMixin class definition ===" ast-grep --pattern $'class StreamMixin$_: $$$ ' echo "" echo "=== 2. Check PlatformHelper.get_adapter_config signature and usage of first param ===" ast-grep --pattern $'def get_adapter_config($_, $_): $$$ ' echo "" echo "=== 3. Find all callers of generate_index_key ===" rg -n -B2 -A5 'generate_index_key\(' --type py echo "" echo "=== 4. Check if BaseTool inherits from or includes StreamMixin ===" rg -n 'class BaseTool' --type py -A10

Repository: Zipstack/unstract

Length of output: 15172

🏁 Script executed:

head -25 unstract/sdk1/src/unstract/sdk1/utils/indexing.py

Repository: Zipstack/unstract

Length of output: 846

🏁 Script executed:

cat unstract/sdk1/src/unstract/sdk1/utils/indexing.py

Repository: Zipstack/unstract

Length of output: 2355

Critical breaking change: tool parameter is now required but no callers pass it.

The tool: StreamMixin parameter (line 18) is required without a default value, and the function uses it in three places (lines 50-52). However, all ~15 existing callers in the codebase do not pass this parameter, which will cause TypeError: missing 1 required positional argument: 'tool' at runtime.

Either:

All callers must be updated to pass a StreamMixin-compatible object (e.g., a tool instance), or

The parameter should have a default value if it's optional.

Note: The type narrowing from BaseTool to StreamMixin itself is not an issue since BaseTool already extends StreamMixin.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@unstract/sdk1/src/unstract/sdk1/utils/indexing.py` at line 5, The new required parameter tool: StreamMixin causes callers to break; change the function signature to make tool optional (tool: Optional[StreamMixin] = None) and update all uses where tool is invoked (the three places that reference tool around lines 50–52) to guard against None (e.g., if tool is not None: tool.method(...) or use a local no-op fallback) so existing callers work without passing a tool; ensure imports include Optional and keep the StreamMixin type for clarity.

coderabbitai · 2026-03-12T08:38:18Z

workers/executor/executors/legacy_executor.py

+            index_metrics = self._run_pipeline_index(
+                context=context,
+                index_template=index_template,
+                answer_params=answer_params,
+                extracted_text=extracted_text,
+            )


⚠️ Potential issue | 🔴 Critical

Abort structure_pipeline when the index step fails.

_run_pipeline_index() currently degrades a failed _handle_index() to a warning, and _handle_structure_pipeline() only gets a metrics dict back. The pipeline then continues into _handle_answer_prompt() against an index that was never created, which hides the real failure behind secondary prompt errors or empty output.

Also applies to: 740-746

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@workers/executor/executors/legacy_executor.py` around lines 493 - 498, The index step failure is being downgraded to a warning in _run_pipeline_index/_handle_index, causing _handle_structure_pipeline to proceed with only a metrics dict and then call _handle_answer_prompt against a non-existent index; update _run_pipeline_index (and underlying _handle_index) to return a clear failure signal (raise a specific exception or return None/False) and modify _handle_structure_pipeline to check that signal and abort/raise immediately instead of continuing to _handle_answer_prompt; make the same change for the other pipeline call site (the block that mirrors this logic around the second call) so any index creation failure short-circuits the pipeline and surfaces the original error.

coderabbitai · 2026-03-12T08:38:18Z

workers/executor/executors/retrievers/keyword_table.py

+            all_retriever = vector_store_index.as_retriever(
+                similarity_top_k=1000,  # Get all nodes
+                filters=MetadataFilters(
+                    filters=[
+                        ExactMatchFilter(key="doc_id", value=self.doc_id),
+                    ],
+                ),
+            )
+
+            # Retrieve all nodes to build keyword index
+            all_nodes = all_retriever.retrieve(" ")


⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🏁 Script executed:

# First, let's see the full context of the file wc -l workers/executor/executors/retrievers/keyword_table.py

Repository: Zipstack/unstract

Length of output: 1885

🏁 Script executed:

# Read the relevant section with context cat -n workers/executor/executors/retrievers/keyword_table.py | head -60

Repository: Zipstack/unstract

Length of output: 1885

🏁 Script executed:

# Search for other patterns of node retrieval or docstore usage in the codebase rg "docstore|doc_store" --type py -A 3 -B 3

Repository: Zipstack/unstract

Length of output: 1885

🏁 Script executed:

# Check for imports and understand what vector_store_index is rg "vector_store_index|VectorStoreIndex" workers/executor/executors/retrievers/ -B 2 -A 2

Repository: Zipstack/unstract

Length of output: 1885

🌐 Web query:

LlamaIndex VectorStoreIndex as_retriever retrieve with MetadataFilters top-k behavior

💡 Result:

In VectorStoreIndex.as_retriever(...), the “top‑k” you set (via similarity_top_k=) is passed into the retriever and then into the underlying VectorStoreQuery together with your MetadataFilters (filters=). The vector store is asked to return up to similarity_top_k results subject to those filters. [1], [2]

What that means in practice

Filters constrain the candidate set; top‑k is then taken within that constrained set.
For example, SimpleVectorStore explicitly prefilters node IDs by metadata (build_metadata_filter_fn(...)) and only then computes similarities and selects get_top_k_* with similarity_top_k=query.similarity_top_k. So top‑k is effectively “top‑k among the filtered nodes.” If fewer than k nodes match the filter, you’ll get fewer than k results. [3]

Some vector stores over-fetch internally (especially for MMR / ANN) but still target returning k final results.
E.g., the Chroma integration calculates a prefetch_k (at least similarity_top_k, often larger) to get enough candidates for MMR-style selection. This affects internal candidate retrieval, not the meaning of similarity_top_k as the requested result count. [4]

Why you might see “less than k” even if your index is large

Your metadata filter matches < k nodes (common / expected). [3]

Your backing vector DB’s filtered-ANN implementation may need overfetch to reliably find k filtered neighbors; in those cases, look for a store-specific “prefetch/overfetch/num_candidates” knob (like Chroma’s prefetch_k) or increase the relevant vector-store kwargs. [4]

Sources:
[1] Vector retriever API / retriever construction showing filters + similarity_top_k wiring (LlamaIndex docs) (docs.llamaindex.ai)
[2] VectorIndexRetriever._build_vector_store_query(...) includes both similarity_top_k and filters (LlamaIndex docs) (docs.llamaindex.ai)
[3] SimpleVectorStore.query(...) prefilters by metadata, then applies similarity_top_k in get_top_k_* (LlamaIndex docs) (docs.llamaindex.ai)
[4] Chroma vector store uses prefetch_k computed from similarity_top_k (LlamaIndex docs) (docs.llamaindex.ai)

Citations:

1: https://docs.llamaindex.ai/en/stable/api_reference/retrievers/vector/

2: https://docs.llamaindex.ai/en/stable/api_reference/retrievers/vector/

3: https://docs.llamaindex.ai/en/stable/api_reference/storage/vector_store/simple/

4: https://docs.llamaindex.ai/en/stable/api_reference/storage/vector_store/chroma/

🌐 Web query:

LlamaIndex load all nodes by doc_id docstore API

💡 Result:

To fetch all nodes for a given document id in LlamaIndex, use the docstore’s ref doc tracking:

# doc_id == ref_doc_id ref_doc_id = "doc_id_0" ref_info = index.docstore.get_ref_doc_info(ref_doc_id) if ref_info is None: nodes = [] else: nodes = index.docstore.get_nodes(ref_info.node_ids)

get_ref_doc_info(ref_doc_id) returns a RefDocInfo (which includes the node_ids created from that document). [1]

get_nodes(node_ids) loads the full BaseNode objects from the docstore. [1]

You can also inspect all tracked documents via index.ref_doc_info (maps doc_id -> RefDocInfo(node_ids=...)). [2]

Or from the docstore directly: get_all_ref_doc_info() returns ref_doc_id -> RefDocInfo for all ingested documents. [1]

Sources:
[1] LlamaIndex docstore API reference (BaseDocumentStore: get_ref_doc_info, get_nodes, get_all_ref_doc_info) (docs.llamaindex.ai)
[2] LlamaIndex “Document Tracking” showing index.ref_doc_info contains node_ids per doc_id (llamaindexxx.readthedocs.io)

Citations:

1: https://docs.llamaindex.ai/en/stable/api_reference/storage/docstore/

2: https://llamaindexxx.readthedocs.io/en/latest/module_guides/indexing/document_management.html

Load all document nodes via docstore instead of vector retrieval.

as_retriever(...).retrieve(" ") performs top-k similarity selection on a whitespace query, which does not guarantee all nodes in the document are returned. Since the keyword index must cover all document nodes to be effective, use the docstore API instead:

ref_info = index.docstore.get_ref_doc_info(self.doc_id) if ref_info: all_nodes = index.docstore.get_nodes(ref_info.node_ids)

This ensures the keyword index is built from the complete set of document nodes regardless of similarity scoring.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@workers/executor/executors/retrievers/keyword_table.py` around lines 30 - 40, Replace the current vector-based retrieval (vector_store_index.as_retriever(...) and all_retriever.retrieve(" ")) with docstore-based loading so you get every node for the document; call index.docstore.get_ref_doc_info(self.doc_id) and, if it returns ref_info, load the complete node list via index.docstore.get_nodes(ref_info.node_ids) and assign that to all_nodes so the keyword index covers all document nodes.

coderabbitai · 2026-03-12T08:38:18Z

workers/executor/executors/variable_replacement.py

+    def fetch_dynamic_variable_value(url: str, data: str) -> Any:
+        """Fetch dynamic variable value from an external URL.
+
+        Ported from prompt-service make_http_request — simplified to direct
+        requests.post since we don't need Flask error classes.
+        """
+        headers = {"Content-Type": "text/plain"}
+        try:
+            response = pyrequests.post(url, data=data, headers=headers, timeout=30)
+            response.raise_for_status()
+            if response.headers.get("content-type") == "application/json":
+                return response.json()
+            return response.text
+        except RequestException as e:
+            logger.error("HTTP request error fetching dynamic variable: %s", e)
+            status_code = None
+            if getattr(e, "response", None) is not None:
+                status_code = getattr(e.response, "status_code", None)
+            raise LegacyExecutorError(
+                message=f"HTTP POST to {url} failed: {e!s}",
+                code=status_code or 500,
+            ) from e


⚠️ Potential issue | 🔴 Critical

Harden dynamic-variable HTTP calls against SSRF.

The target URL comes straight from the prompt template and is POSTed without any allowlist/public-host validation or redirect blocking. That lets a crafted template drive the worker toward internal network targets. Reuse the webhook URL safety check here via a shared helper and send the request with redirects disabled.

🧰 Tools

🪛 Ruff (0.15.5)

[warning] 174-174: Consider moving this statement to an else block

(TRY300)

[warning] 176-176: Use logging.exception instead of logging.error

Replace with exception

(TRY400)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@workers/executor/executors/variable_replacement.py` around lines 162 - 183, The fetch_dynamic_variable_value function must validate the target URL against the shared webhook/webhook-safety helper before calling pyrequests.post and must disable redirects; call the existing URL-safety helper (e.g., is_safe_webhook_url or validate_webhook_url) at the top of fetch_dynamic_variable_value and raise LegacyExecutorError if the URL is disallowed, then invoke pyrequests.post(..., allow_redirects=False) so redirects are blocked; add any necessary import for the shared helper and keep the existing error handling around pyrequests.post and LegacyExecutorError intact.

Auto-fixed 48 lint errors across 56 files: import ordering, block statements, unused variable prefixing, and formatting issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 4

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

frontend/src/components/custom-tools/prompt-card/PromptOutput.jsx (1)
433-446: ⚠️ Potential issue | 🟡 Minor

Scope progress messages per profile before rendering them here.

progressMsg is computed once per prompt card in frontend/src/components/custom-tools/prompt-card/PromptCard.jsx:71-93 without any profile_id filter, but this branch passes that same object into every profile-specific DisplayPromptResult. If two LLM profiles are running at the same time, multiple loading panes can show the latest message from the wrong profile. Please key the progress state by {promptId, profileId} or avoid rendering the shared message in the multi-profile path.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@frontend/src/components/custom-tools/prompt-card/PromptOutput.jsx` around
lines 433 - 446, The shared progressMsg is computed per prompt card and passed
into every DisplayPromptResult, causing cross-profile progress updates; update
the progress state to be keyed by both promptId and profileId (or derive a
profile-scoped message) and pass the profile-scoped message into
DisplayPromptResult instead of the global progressMsg; locate the progress
computation in PromptCard.jsx (lines computing progressMsg) and change its state
shape to use a compound key like {promptId, profileId} or maintain a per-profile
map, then update the prop passed from PromptOutput.jsx (where
DisplayPromptResult is rendered) to use the profile-scoped value so each
profile's loading pane shows only its own messages.
frontend/src/components/custom-tools/prompt-card/PromptCard.jsx (1)
71-93: ⚠️ Potential issue | 🟠 Major

Missing dependencies in useEffect may cause stale closure bugs.

The effect references promptDetailsState, promptKey, and details but only [messages] is in the dependency array. When any of these values change, the effect won't re-run and will filter messages using outdated values, potentially displaying progress for the wrong prompt or tool.
🐛 Proposed fix to add missing dependencies
-  }, [messages]);
+  }, [messages, promptDetailsState?.prompt_id, promptKey, details?.tool_id]);
Alternatively, if you need the full objects for other reasons:
-  }, [messages]);
+  }, [messages, promptDetailsState, promptKey, details]);
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx` around lines
71 - 93, The useEffect that computes msg from messages captures
promptDetailsState, promptKey, and details but only lists [messages] as
dependencies, which can cause stale closures; update the dependency array for
the effect that calls setProgressMsg to include promptDetailsState, promptKey,
and details (so it becomes [messages, promptDetailsState, promptKey, details])
or refactor the filter logic into a memoized callback (e.g., using
useCallback/useMemo) referenced by the effect to ensure it re-runs whenever
those values change.

🧹 Nitpick comments (3)

frontend/src/components/input-output/configure-ds/ConfigureDs.jsx (1)
210-212: Keep analytics non-blocking, but don’t make failures invisible.

These empty catch blocks preserve the user flow, but they also discard the only signal that PostHog instrumentation is broken. A tiny shared helper that logs at debug level or reports to monitoring would keep this non-fatal without swallowing it completely.

Also applies to: 293-295, 313-315
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@frontend/src/components/input-output/configure-ds/ConfigureDs.jsx` around
lines 210 - 212, Replace the empty catch blocks around PostHog calls by invoking
a small shared helper (e.g., logNonBlockingError(error, context)) that logs the
error at debug level and optionally reports it to monitoring; implement
logNonBlockingError in this module (or a shared utils file) to use the app
logger if available or console.debug and to attach a minimal context (source:
"PostHog", component: "ConfigureDs", and the operation name), then call
logNonBlockingError(err, {operation: "setCustomEvent"}) inside the catch blocks
that currently swallow errors for posthog/analytics calls (referencing the
posthog.* calls in ConfigureDs.jsx).
frontend/src/components/deployments/create-api-deployment-from-prompt-studio/CreateApiDeploymentFromPromptStudio.jsx (1)
323-325: Remove redundant try-catch; PostHog error handling should occur at the hook level.

The hook setPostHogCustomEvent already wraps posthog.capture() in a try-catch that silently ignores failures (see usePostHogEvents.js:83-88). The try-catch here is redundant—errors are already swallowed upstream with no logging. If improved observability into PostHog failures is needed, fix it at the source: either add logging to the hook itself or use an existing frontend error-reporting path, then remove this duplicate catch block.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@frontend/src/components/deployments/create-api-deployment-from-prompt-studio/CreateApiDeploymentFromPromptStudio.jsx`
around lines 323 - 325, The catch block that swallows errors around the call to
setPostHogCustomEvent is redundant because setPostHogCustomEvent already handles
and ignores PostHog failures; remove the surrounding try-catch in
CreateApiDeploymentFromPromptStudio (the block catching _err) and rely on the
hook's internal handling, or if you want observability instead, add
logging/error-reporting inside usePostHogEvents (around posthog.capture in
setPostHogCustomEvent) and then remove this outer catch to avoid duplicate
suppression of errors.
frontend/src/components/custom-tools/prompt-card/PromptCard.jsx (1)
262-264: Consider logging silently swallowed exceptions for debuggability.

The intent to not interrupt the main flow for telemetry failures is valid, and the _err convention correctly signals the unused parameter. However, completely swallowing exceptions can make debugging difficult if PostHog integration starts failing silently.
💡 Optional: Add debug-level logging
     } catch (_err) {
-      // If an error occurs while setting custom posthog event, ignore it and continue
+      // PostHog event failures are non-critical; log for debugging but don't interrupt flow
+      console.debug("PostHog event failed:", _err);
     }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx` around lines
262 - 264, The catch block in the PromptCard component is silently swallowing
PostHog errors (catch (_err) { ... }), which makes failures hard to debug;
update that catch to log the error at a debug/trace level (e.g., console.debug
or the app logger) including context like "PromptCard: posthog event failed" and
the caught error (_err) so telemetry failures are non-fatal but visible during
debugging; keep the handler non-throwing so main flow continues.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In
`@frontend/src/components/deployments/create-api-deployment-from-prompt-studio/CreateApiDeploymentFromPromptStudio.jsx`:
- Around line 98-100: When the early return for missing toolDetails?.tool_id is
hit, clear the tool-specific state so stale values don't persist: call the state
setters (e.g. setToolFunctionName('') and setToolSchema(null) — and any other
related setters like setToolInputs([]) or setSelectedTool(null) if present)
immediately before returning from the component logic in
CreateApiDeploymentFromPromptStudio.jsx so step 2 and the tool-creation path use
a fresh, empty state.

In `@frontend/src/components/input-output/configure-ds/ConfigureDs.jsx`:
- Around line 120-125: The effect watching selectedSourceId currently returns
early when metadata is falsy, leaving formData populated with the previous
source's values; update the useEffect (the effect that references
selectedSourceId, metadata, and setFormData) so that when metadata is
null/undefined you explicitly reset the form state (e.g., call setFormData with
an empty/default object) instead of returning early, otherwise keep the existing
behavior of setting form data from metadata when present.

In `@frontend/src/hooks/usePromptRun.js`:
- Line 18: The hook usePromptOutput currently only exposes
generatePromptOutputKey but the socket consumer in usePromptStudioSocket still
imports and calls updatePromptOutputState for fetch_response and
single_pass_extraction completions; restore and export updatePromptOutputState
from usePromptOutput (preserving its existing signature and behavior) so
usePromptStudioSocket's handlers continue to work, or alternatively update
usePromptStudioSocket to stop calling updatePromptOutputState in the same
PR—make sure updatePromptOutputState (the function name) is present and returned
by usePromptOutput so socket handlers do not receive undefined.
- Around line 54-67: The timeout cleanup can clear a newer run's status because
it only keys by promptId+statusKey; modify the logic so the timeout is tied to
the specific execution (runId) or is cancelable when the socket response
arrives: when scheduling the setTimeout (using SOCKET_TIMEOUT_MS) include/runId
in the generated key (generateApiRunStatusId) or store the returned timer id in
a map keyed by runId in usePromptRunStatusStore, and on the socket event handler
clearTimeout for that runId and remove the stored timer before updating status;
update calls that currently call removePromptStatus(promptId, statusKey) and
setAlertDetails to instead operate only for the matching runId to avoid removing
newer runs.

---

Outside diff comments:
In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx`:
- Around line 71-93: The useEffect that computes msg from messages captures
promptDetailsState, promptKey, and details but only lists [messages] as
dependencies, which can cause stale closures; update the dependency array for
the effect that calls setProgressMsg to include promptDetailsState, promptKey,
and details (so it becomes [messages, promptDetailsState, promptKey, details])
or refactor the filter logic into a memoized callback (e.g., using
useCallback/useMemo) referenced by the effect to ensure it re-runs whenever
those values change.

In `@frontend/src/components/custom-tools/prompt-card/PromptOutput.jsx`:
- Around line 433-446: The shared progressMsg is computed per prompt card and
passed into every DisplayPromptResult, causing cross-profile progress updates;
update the progress state to be keyed by both promptId and profileId (or derive
a profile-scoped message) and pass the profile-scoped message into
DisplayPromptResult instead of the global progressMsg; locate the progress
computation in PromptCard.jsx (lines computing progressMsg) and change its state
shape to use a compound key like {promptId, profileId} or maintain a per-profile
map, then update the prop passed from PromptOutput.jsx (where
DisplayPromptResult is rendered) to use the profile-scoped value so each
profile's loading pane shows only its own messages.

---

Nitpick comments:
In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx`:
- Around line 262-264: The catch block in the PromptCard component is silently
swallowing PostHog errors (catch (_err) { ... }), which makes failures hard to
debug; update that catch to log the error at a debug/trace level (e.g.,
console.debug or the app logger) including context like "PromptCard: posthog
event failed" and the caught error (_err) so telemetry failures are non-fatal
but visible during debugging; keep the handler non-throwing so main flow
continues.

In
`@frontend/src/components/deployments/create-api-deployment-from-prompt-studio/CreateApiDeploymentFromPromptStudio.jsx`:
- Around line 323-325: The catch block that swallows errors around the call to
setPostHogCustomEvent is redundant because setPostHogCustomEvent already handles
and ignores PostHog failures; remove the surrounding try-catch in
CreateApiDeploymentFromPromptStudio (the block catching _err) and rely on the
hook's internal handling, or if you want observability instead, add
logging/error-reporting inside usePostHogEvents (around posthog.capture in
setPostHogCustomEvent) and then remove this outer catch to avoid duplicate
suppression of errors.

In `@frontend/src/components/input-output/configure-ds/ConfigureDs.jsx`:
- Around line 210-212: Replace the empty catch blocks around PostHog calls by
invoking a small shared helper (e.g., logNonBlockingError(error, context)) that
logs the error at debug level and optionally reports it to monitoring; implement
logNonBlockingError in this module (or a shared utils file) to use the app
logger if available or console.debug and to attach a minimal context (source:
"PostHog", component: "ConfigureDs", and the operation name), then call
logNonBlockingError(err, {operation: "setCustomEvent"}) inside the catch blocks
that currently swallow errors for posthog/analytics calls (referencing the
posthog.* calls in ConfigureDs.jsx).

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: a2662be5-cfa0-4cca-8f86-68063a8d6d60

📥 Commits

Reviewing files that changed from the base of the PR and between 3e5ce31 and e3ca0c6.

📒 Files selected for processing (56)

frontend/src/App.jsx
frontend/src/components/agency/agency/Agency.jsx
frontend/src/components/agency/configure-connector-modal/ConfigureConnectorModal.jsx
frontend/src/components/agency/markdown-renderer/MarkdownRenderer.jsx
frontend/src/components/common/PromptStudioModal.jsx
frontend/src/components/custom-tools/add-llm-profile/AddLlmProfile.jsx
frontend/src/components/custom-tools/combined-output/CombinedOutput.jsx
frontend/src/components/custom-tools/custom-data-settings/CustomDataSettings.jsx
frontend/src/components/custom-tools/document-parser/DocumentParser.jsx
frontend/src/components/custom-tools/header/Header.jsx
frontend/src/components/custom-tools/import-tool/ImportTool.jsx
frontend/src/components/custom-tools/list-of-tools/ListOfTools.jsx
frontend/src/components/custom-tools/manage-llm-profiles/ManageLlmProfiles.jsx
frontend/src/components/custom-tools/notes-card/NotesCard.jsx
frontend/src/components/custom-tools/output-analyzer/OutputAnalyzer.jsx
frontend/src/components/custom-tools/output-analyzer/OutputAnalyzerCard.jsx
frontend/src/components/custom-tools/prompt-card/DisplayPromptResult.jsx
frontend/src/components/custom-tools/prompt-card/OutputForIndex.jsx
frontend/src/components/custom-tools/prompt-card/PromptCard.jsx
frontend/src/components/custom-tools/prompt-card/PromptCardItems.jsx
frontend/src/components/custom-tools/prompt-card/PromptOutput.jsx
frontend/src/components/custom-tools/prompt-card/PromptRun.jsx
frontend/src/components/custom-tools/prompts-reorder/DraggablePrompt.jsx
frontend/src/components/custom-tools/prompts-reorder/PromptsReorder.jsx
frontend/src/components/custom-tools/retrieval-strategy-modal/RetrievalStrategyModal.jsx
frontend/src/components/custom-tools/tool-ide/ToolIde.jsx
frontend/src/components/custom-tools/tools-main/ToolsMain.jsx
frontend/src/components/custom-tools/tools-main/ToolsMainActionBtns.jsx
frontend/src/components/deployments/create-api-deployment-from-prompt-studio/CreateApiDeploymentFromPromptStudio.jsx
frontend/src/components/helpers/auth/RequireAuth.js
frontend/src/components/helpers/auth/RequireGuest.js
frontend/src/components/helpers/socket-messages/SocketMessages.js
frontend/src/components/input-output/add-source/AddSource.jsx
frontend/src/components/input-output/configure-ds/ConfigureDs.jsx
frontend/src/components/input-output/data-source-card/DataSourceCard.jsx
frontend/src/components/input-output/list-of-sources/ListOfSources.jsx
frontend/src/components/input-output/manage-files/ManageFiles.jsx
frontend/src/components/logs-and-notifications/DisplayLogsAndNotifications.jsx
frontend/src/components/metrics-dashboard/RecentActivity.jsx
frontend/src/components/pipelines-or-deployments/file-history-modal/FileHistoryModal.jsx
frontend/src/components/set-org/SetOrg.jsx
frontend/src/components/settings/default-triad/DefaultTriad.jsx
frontend/src/components/settings/invite/InviteEditUser.jsx
frontend/src/components/settings/platform/PlatformSettings.jsx
frontend/src/components/settings/users/Users.jsx
frontend/src/components/tool-settings/tool-settings/ToolSettings.jsx
frontend/src/hooks/usePromptOutput.js
frontend/src/hooks/usePromptRun.js
frontend/src/hooks/usePromptStudioSocket.js
frontend/src/hooks/useRequestUrl.js
frontend/src/layouts/rjsf-form-layout/CustomObjectFieldTemplate.jsx
frontend/src/store/alert-store.js
frontend/src/store/prompt-run-queue-store.js
frontend/src/store/prompt-studio-store.js
frontend/src/store/retrieval-strategies-store.js
frontend/src/store/workflow-store.js

✅ Files skipped from review due to trivial changes (23)

frontend/src/components/common/PromptStudioModal.jsx
frontend/src/hooks/usePromptOutput.js
frontend/src/components/logs-and-notifications/DisplayLogsAndNotifications.jsx
frontend/src/components/input-output/list-of-sources/ListOfSources.jsx
frontend/src/components/custom-tools/output-analyzer/OutputAnalyzerCard.jsx
frontend/src/store/prompt-run-queue-store.js
frontend/src/store/alert-store.js
frontend/src/components/custom-tools/prompt-card/OutputForIndex.jsx
frontend/src/components/custom-tools/retrieval-strategy-modal/RetrievalStrategyModal.jsx
frontend/src/components/input-output/manage-files/ManageFiles.jsx
frontend/src/components/custom-tools/custom-data-settings/CustomDataSettings.jsx
frontend/src/components/input-output/add-source/AddSource.jsx
frontend/src/components/agency/configure-connector-modal/ConfigureConnectorModal.jsx
frontend/src/components/metrics-dashboard/RecentActivity.jsx
frontend/src/components/custom-tools/document-parser/DocumentParser.jsx
frontend/src/components/custom-tools/prompts-reorder/PromptsReorder.jsx
frontend/src/components/helpers/auth/RequireGuest.js
frontend/src/components/custom-tools/output-analyzer/OutputAnalyzer.jsx
frontend/src/layouts/rjsf-form-layout/CustomObjectFieldTemplate.jsx
frontend/src/components/agency/markdown-renderer/MarkdownRenderer.jsx
frontend/src/components/custom-tools/notes-card/NotesCard.jsx
frontend/src/store/prompt-studio-store.js
frontend/src/store/workflow-store.js

🚧 Files skipped from review as they are similar to previous changes (3)

frontend/src/components/helpers/socket-messages/SocketMessages.js
frontend/src/components/custom-tools/tool-ide/ToolIde.jsx
frontend/src/hooks/usePromptStudioSocket.js

coderabbitai · 2026-03-12T09:03:18Z

...deployments/create-api-deployment-from-prompt-studio/CreateApiDeploymentFromPromptStudio.jsx

+    if (!toolDetails?.tool_id) {
+      return;
+    }


⚠️ Potential issue | 🟠 Major

Reset tool-specific state before returning here.

If the modal is reopened for a tool without tool_id, this exits with the previous toolFunctionName and toolSchema still in state. That can show stale settings in step 2 and let Line 379 create the new tool instance with the last tool's function name instead of the current selection.

Proposed fix

if (!toolDetails?.tool_id) { + setToolFunctionName(null); + setToolSchema(null); + setIsSchemaLoading(false); return; }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@frontend/src/components/deployments/create-api-deployment-from-prompt-studio/CreateApiDeploymentFromPromptStudio.jsx` around lines 98 - 100, When the early return for missing toolDetails?.tool_id is hit, clear the tool-specific state so stale values don't persist: call the state setters (e.g. setToolFunctionName('') and setToolSchema(null) — and any other related setters like setToolInputs([]) or setSelectedTool(null) if present) immediately before returning from the component logic in CreateApiDeploymentFromPromptStudio.jsx so step 2 and the tool-creation path use a fresh, empty state.

coderabbitai · 2026-03-12T09:03:18Z

frontend/src/components/input-output/configure-ds/ConfigureDs.jsx

  useEffect(() => {
-    if (!metadata) return;
+    if (!metadata) {
+      return;
+    }
    setFormData(metadata);
  }, [selectedSourceId, metadata, setFormData]);


⚠️ Potential issue | 🟠 Major

Reset formData when the new source has no metadata.

The early return on Line 121 means a selectedSourceId change with no metadata keeps the previous source's values in the form. That lets stale connector/adapter settings be submitted against the newly selected source.

Proposed fix

useEffect(() => { if (!metadata) { + setFormData({}); return; } setFormData(metadata); }, [selectedSourceId, metadata, setFormData]);

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@frontend/src/components/input-output/configure-ds/ConfigureDs.jsx` around lines 120 - 125, The effect watching selectedSourceId currently returns early when metadata is falsy, leaving formData populated with the previous source's values; update the useEffect (the effect that references selectedSourceId, metadata, and setFormData) so that when metadata is null/undefined you explicitly reset the form state (e.g., call setFormData with an empty/default object) instead of returning early, otherwise keep the existing behavior of setting form data from metadata when present.

coderabbitai · 2026-03-12T09:03:18Z

frontend/src/hooks/usePromptRun.js

  const { pushPromptRunApi, freeActiveApi } = usePromptRunQueueStore();
-  const { generatePromptOutputKey, updatePromptOutputState } =
-    usePromptOutput();
+  const { generatePromptOutputKey } = usePromptOutput();


⚠️ Potential issue | 🔴 Critical

Restore updatePromptOutputState until the socket consumer is migrated.

frontend/src/hooks/usePromptStudioSocket.js:26-58 still does const { updatePromptOutputState } = usePromptOutput(); and calls it for fetch_response / single_pass_extraction completions. If this PR makes usePromptOutput() expose only generatePromptOutputKey, that handler will get undefined and throw before results are written or RUNNING status is cleared. Either keep updatePromptOutputState in the hook API or update the socket hook in the same change.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@frontend/src/hooks/usePromptRun.js` at line 18, The hook usePromptOutput currently only exposes generatePromptOutputKey but the socket consumer in usePromptStudioSocket still imports and calls updatePromptOutputState for fetch_response and single_pass_extraction completions; restore and export updatePromptOutputState from usePromptOutput (preserving its existing signature and behavior) so usePromptStudioSocket's handlers continue to work, or alternatively update usePromptStudioSocket to stop calling updatePromptOutputState in the same PR—make sure updatePromptOutputState (the function name) is present and returned by usePromptOutput so socket handlers do not receive undefined.

coderabbitai · 2026-03-12T09:03:18Z

frontend/src/hooks/usePromptRun.js

+        // Timeout safety net: clear stale status if socket event never arrives.
+        setTimeout(() => {
+          const statusKey = generateApiRunStatusId(docId, profileId);
+          const current = usePromptRunStatusStore.getState().promptRunStatus;
+          if (
+            current?.[promptId]?.[statusKey] === PROMPT_RUN_API_STATUSES.RUNNING
+          ) {
+            removePromptStatus(promptId, statusKey);
+            setAlertDetails({
+              type: "warning",
+              content: "Prompt execution timed out. Please try again.",
+            });
+          }
+        }, SOCKET_TIMEOUT_MS);


⚠️ Potential issue | 🟠 Major

Key the timeout cleanup to the specific execution.

Line 55 schedules a timer that only checks promptId + statusKey. If the user reruns the same prompt/doc/profile before that 5-minute timer fires, the stale timer can remove the newer run's RUNNING state and show a false timeout. Please tie the timeout to runId (or cancel the timer when the socket result arrives) before clearing status.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@frontend/src/hooks/usePromptRun.js` around lines 54 - 67, The timeout cleanup can clear a newer run's status because it only keys by promptId+statusKey; modify the logic so the timeout is tied to the specific execution (runId) or is cancelable when the socket response arrives: when scheduling the setTimeout (using SOCKET_TIMEOUT_MS) include/runId in the generated key (generateApiRunStatusId) or store the returned timer id in a map keyed by runId in usePromptRunStatusStore, and on the socket event handler clearTimeout for that runId and remove the stored timer before updating status; update calls that currently call removePromptStatus(promptId, statusKey) and setAlertDetails to instead operate only for the matching runId to avoid removing newer runs.

…Workflows Resolves vite build warning about SharePermission.jsx being both dynamically and statically imported across the codebase. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Signed-off-by: harini-venkataraman <115449948+harini-venkataraman@users.noreply.github.com>

- Remove unnecessary try-catch around PostHog event calls - Flip negated condition in PromptOutput.handleTable for clarity Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

frontend/src/components/custom-tools/prompt-card/PromptCard.jsx (1)
71-93: ⚠️ Potential issue | 🟡 Minor

Missing useEffect dependencies and overly broad message matching.

Two concerns here:

The tool_id match (line 79) is very broad—any message for the entire tool will match, potentially causing progress messages from unrelated prompts to appear in this prompt card.

The useEffect uses promptDetailsState?.prompt_id, promptKey, and details?.tool_id but the dependency array only includes [messages]. This can cause stale closure issues where the effect captures outdated values.
Suggested fix
-  }, [messages]);
+  }, [messages, promptDetailsState?.prompt_id, promptKey, details?.tool_id]);
Please also verify that the tool_id matching logic is intentional—consider if it should only match when no prompt_id or prompt_key is present in the message.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx` around lines
71 - 93, The effect in useEffect that finds a matching message uses overly broad
tool-level matching and has an incomplete dependency array; update the matching
logic in the effect (the .find call) so tool-level matching
(item?.component?.tool_id === details?.tool_id) is only considered when the
message does not include a prompt_id or prompt_key (i.e., prefer prompt_id or
prompt_key matches first, fallback to tool_id only when both prompt identifiers
are absent), and expand the dependency array to include messages,
promptDetailsState?.prompt_id, promptKey, and details?.tool_id so the effect
re-runs when any of those values change; keep the setProgressMsg usage unchanged
but ensure you reference the current values inside the effect.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx`:
- Around line 257-261: In handleRun, make the PostHog call fire-and-forget: wrap
the setPostHogCustomEvent(...) call in a try/catch (or call it in a non-blocking
promise) so any thrown errors are swallowed/logged locally and do not propagate;
after guarding the analytics call, continue to call validateInputs(...) and
handlePromptRunRequest(...) normally. Ensure you reference the
setPostHogCustomEvent() invocation inside handleRun and do not change the
subsequent validateInputs and handlePromptRunRequest call flow.

---

Outside diff comments:
In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx`:
- Around line 71-93: The effect in useEffect that finds a matching message uses
overly broad tool-level matching and has an incomplete dependency array; update
the matching logic in the effect (the .find call) so tool-level matching
(item?.component?.tool_id === details?.tool_id) is only considered when the
message does not include a prompt_id or prompt_key (i.e., prefer prompt_id or
prompt_key matches first, fallback to tool_id only when both prompt identifiers
are absent), and expand the dependency array to include messages,
promptDetailsState?.prompt_id, promptKey, and details?.tool_id so the effect
re-runs when any of those values change; keep the setProgressMsg usage unchanged
but ensure you reference the current values inside the effect.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 29a872fa-c10a-4838-a8bd-6c253b1c4ae3

📥 Commits

Reviewing files that changed from the base of the PR and between a62a9fd and b3a90af.

📒 Files selected for processing (4)

frontend/src/components/custom-tools/prompt-card/PromptCard.jsx
frontend/src/components/custom-tools/prompt-card/PromptOutput.jsx
frontend/src/components/deployments/create-api-deployment-from-prompt-studio/CreateApiDeploymentFromPromptStudio.jsx
frontend/src/components/input-output/configure-ds/ConfigureDs.jsx

🚧 Files skipped from review as they are similar to previous changes (1)

frontend/src/components/input-output/configure-ds/ConfigureDs.jsx

coderabbitai · 2026-03-12T12:40:49Z

frontend/src/components/custom-tools/prompt-card/PromptCard.jsx

    const handleRun = (promptRunType, promptId, profileId, documentId) => {
-      try {
-        setPostHogCustomEvent("ps_prompt_run", {
-          info: "Click on 'Run Prompt' button (Multi Pass)",
-        });
-      } catch (err) {
-        // If an error occurs while setting custom posthog event, ignore it and continue
-      }
+      setPostHogCustomEvent("ps_prompt_run", {
+        info: "Click on 'Run Prompt' button (Multi Pass)",
+      });



⚠️ Potential issue | 🟡 Minor

Analytics error could block prompt execution.

The try/catch was removed from the PostHog event call. If setPostHogCustomEvent throws (network failure, library error, etc.), the exception will bubble up and prevent validateInputs and handlePromptRunRequest from executing. Analytics should be fire-and-forget.

Suggested fix

const handleRun = (promptRunType, promptId, profileId, documentId) => { + try { setPostHogCustomEvent("ps_prompt_run", { info: "Click on 'Run Prompt' button (Multi Pass)", }); + } catch { + // Analytics should not block core functionality + } const validateInputs = () => {

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

const handleRun = (promptRunType, promptId, profileId, documentId) => {

try {

setPostHogCustomEvent("ps_prompt_run", {

info: "Click on 'Run Prompt' button (Multi Pass)",

});

} catch (err) {

// If an error occurs while setting custom posthog event, ignore it and continue

}

setPostHogCustomEvent("ps_prompt_run", {

info: "Click on 'Run Prompt' button (Multi Pass)",

});

const handleRun = (promptRunType, promptId, profileId, documentId) => {

try {

setPostHogCustomEvent("ps_prompt_run", {

info: "Click on 'Run Prompt' button (Multi Pass)",

});

} catch {

// Analytics should not block core functionality

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@frontend/src/components/custom-tools/prompt-card/PromptCard.jsx` around lines 257 - 261, In handleRun, make the PostHog call fire-and-forget: wrap the setPostHogCustomEvent(...) call in a try/catch (or call it in a non-blocking promise) so any thrown errors are swallowed/logged locally and do not propagate; after guarding the analytics call, continue to call validateInputs(...) and handlePromptRunRequest(...) normally. Ensure you reference the setPostHogCustomEvent() invocation inside handleRun and do not change the subsequent validateInputs and handlePromptRunRequest call flow.

github-actions · 2026-03-12T15:50:43Z

Frontend Lint Report (Biome)

✅ All checks passed! No linting or formatting issues found.

github-actions · 2026-03-12T15:50:55Z

Test Results

Summary

✅ Runner Tests: 11 passed, 0 failed (11 total)

Runner Tests - Full Report

filepath	function	$$\textcolor{#23d18b}{\tt{passed}}$$	SUBTOTAL
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_logs}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_cleanup}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_cleanup\_skip}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_client\_init}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image\_exists}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_container\_run\_config}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_container\_run\_config\_without\_mount}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_run\_container}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image\_for\_sidecar}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_sidecar\_container}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{TOTAL}}$$		$$\textcolor{#23d18b}{\tt{11}}$$	$$\textcolor{#23d18b}{\tt{11}}$$

sonarqubecloud · 2026-03-12T15:53:08Z

Quality Gate failed

Failed conditions
6.9% Duplication on New Code (required ≤ 3%)

See analysis details on SonarQube Cloud

greptile-apps · 2026-03-12T15:57:06Z

Greptile Summary

This PR introduces a major architectural shift for Prompt Studio IDE execution: replacing synchronous, blocking Django HTTP calls with a fire-and-forget Celery dispatch model backed by a new pluggable ExecutorFramework (SDK1), a dedicated LegacyExecutor worker, and Socket.IO for result delivery. The change is well-motivated — Django workers were being tied up waiting for LLM calls that could take minutes — and the implementation is generally thorough and well-tested.

Key findings from the review:

Missing feature flag gating (views.py): The PR description states the async path is gated behind the async_prompt_execution feature flag with the original sync path preserved as a fallback. The actual code contains no feature flag check in any of the three view methods. All endpoints unconditionally use HTTP 202 and Celery dispatch. If executor workers aren't running, every prompt execution silently hangs forever.
Dead-code null guards in payload builders: if not tool: raise ToolNotValid() in build_index_payload and if not profile_manager: raise DefaultProfileError() in build_fetch_response_payload (and build_single_pass_payload) are both placed after the ORM calls and validation methods that would already raise, making them unreachable.
DB_USER not URL-encoded in worker_celery.py result backend connection string — DB_PASSWORD uses quote_plus but DB_USER is interpolated raw.
Redundant import uuid as _uuid appears inside three view-method bodies despite uuid being imported at module level.
The SDK1 execution framework (context.py, dispatcher.py, registry.py, result.py, orchestrator.py) is cleanly designed with proper serialization contracts and good separation of concerns.
The callback tasks (ide_index_complete, ide_prompt_complete) correctly handle both executor-level failures (checking result_dict["success"]) and unhandled task-level failures (via link_error callbacks).
The frontend usePromptStudioSocket hook is well-structured with stable useCallback-wrapped handlers and proper socket cleanup.

Confidence Score: 2/5

Not safe to merge as-is — the feature flag gating described in the PR is absent, making this a hard breaking change for all users without the required worker infrastructure.
The most critical issue is that all three Prompt Studio endpoints unconditionally use the new async (HTTP 202) path with no feature flag check, directly contradicting the PR's own safety contract. Any deployment without the executor and callback workers running will cause every prompt execution to silently hang. Two additional dead-code null guards in the payload builders indicate incomplete refactoring. The core SDK1 framework, executor worker, and frontend socket hook are all well-implemented, but the missing flag gate is a deployment-blocking issue.
backend/prompt_studio/prompt_studio_core_v2/views.py requires the async_prompt_execution feature flag gate to be added before merge. backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py needs the dead-code null guards fixed in build_index_payload and build_fetch_response_payload.

Important Files Changed

Filename	Overview
unstract/sdk1/src/unstract/sdk1/execution/context.py	New ExecutionContext dataclass — well-structured, JSON-serializable DTO with proper round-trip to/from dict; enum normalization in post_init prevents serialization issues.
unstract/sdk1/src/unstract/sdk1/execution/dispatcher.py	ExecutionDispatcher with three dispatch modes (sync, async, with-callback); well-documented, handles timeout from env var, and properly logs without exposing credentials.
unstract/sdk1/src/unstract/sdk1/execution/registry.py	Clean class-decorator self-registration pattern; correctly prevents duplicate name registration and creates a fresh executor instance per get() call.
unstract/sdk1/src/unstract/sdk1/execution/result.py	Standardized ExecutionResult with success/failure factory, proper validation (error required when success=False), and clean round-trip serialization.
backend/prompt_studio/prompt_studio_core_v2/views.py	Critical: all three async endpoints (index_document, fetch_response, single_pass_extraction) lack the async_prompt_execution feature flag gating described in the PR — the old sync path is entirely removed, making this a hard breaking change for all users; also contains redundant inline uuid imports in each view method.
backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py	New build_*_payload methods have dead-code null guards (if not tool: and if not profile_manager:) placed after the ORM calls and validation methods that would already raise first; the payload construction logic itself is thorough and correct.
backend/prompt_studio/prompt_studio_core_v2/tasks.py	New Celery callback tasks (ide_index_complete, ide_prompt_complete, ide_index_error, ide_prompt_error) follow consistent structure with proper state store setup/teardown; _json_safe helper correctly handles UUID and datetime serialization.
backend/backend/worker_celery.py	Creative Celery subclass workaround for env-var broker priority; singleton pattern without thread-lock (harmless race but worth noting); DB_USER is not URL-encoded in the result backend connection string unlike DB_PASSWORD.
workers/executor/tasks.py	Single entry-point execute_extraction task with retry policy, log correlation setup, and always-returns-a-result pattern; note that link_error callbacks are only reachable for infrastructure-level failures since the task never raises in normal operation.
workers/executor/executors/legacy_executor.py	Comprehensive strategy-pattern routing across 7 operations; handles LegacyExecutorError with real-time frontend streaming; 1,799 lines is large but well-organized by operation type.
frontend/src/hooks/usePromptRun.js	Rewritten to fire-and-forget model with 5-minute timeout safety net; freeActiveApi() correctly moved to finally; removePromptStatus now handled by socket hook on success, or immediately on error.
frontend/src/hooks/usePromptStudioSocket.js	New Socket.IO listener for prompt_studio_result events; correctly handles completed/failed status routing, clears spinners via removePromptStatus, and uses useCallback for stable event handler identity.

Sequence Diagram

sequenceDiagram
    participant FE as Frontend (PromptRun.jsx)
    participant WS as Socket.IO
    participant DJ as Django View (views.py)
    participant CB as Callback Worker (tasks.py)
    participant EX as Executor Worker (tasks.py)
    participant LLM as LLM / VectorDB

    FE->>DJ: POST /fetch_response/{tool_id}<br/>(fire-and-forget)
    DJ->>DJ: build_fetch_response_payload()<br/>(ORM loads, extract, index)
    DJ->>EX: dispatch_with_callback()<br/>celery_executor_legacy queue
    DJ-->>FE: HTTP 202 {task_id, run_id, status:"accepted"}
    Note over FE: Sets 5-min timeout safety net<br/>Spinner stays on

    EX->>LLM: LegacyExecutor._handle_answer_prompt()<br/>RetrievalService → LLM call
    LLM-->>EX: Extracted answer
    EX-->>CB: Celery link callback<br/>ide_prompt_complete(result_dict, cb_kwargs)<br/>prompt_studio_callback queue

    CB->>CB: OutputManagerHelper.handle_prompt_output_update()<br/>(ORM write)
    CB->>WS: _emit_websocket_event("prompt_studio_result")
    WS-->>FE: Socket.IO event {status:"completed", operation, result}
    FE->>FE: usePromptStudioSocket.onResult()<br/>updatePromptOutputState() → clears spinner

    Note over EX,CB: On infrastructure failure (ConnectionError etc.)<br/>link_error fires ide_prompt_error instead
    EX--xCB: ide_prompt_error(failed_task_id, cb_kwargs)
    CB->>WS: _emit_websocket_event("prompt_studio_result")<br/>{status:"failed", error}
    WS-->>FE: Socket.IO error event
    FE->>FE: handleFailed() → clears spinner + shows error alert

Prompt To Fix All With AI

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/views.py
Line: 364-422

Comment:
**Async path is unconditional — feature flag gating is absent**

The PR description explicitly states that all three endpoints (`index_document`, `fetch_response`, `single_pass_extraction`) are gated behind the `async_prompt_execution` feature flag with the old sync path preserved as a fallback. However, the actual implementation contains no feature flag check anywhere in these view methods. All three endpoints now **always** return HTTP 202 and dispatch to Celery workers, making this a hard breaking change for all users regardless of the flag.

The PR description also says:
> "When flag is OFF (default), all 3 endpoints use the old sync path returning HTTP 200. No behavior change for existing users."

This behavior is not implemented. Any deployment of this PR will immediately switch all Prompt Studio execution to the async path. If the `worker-executor-v2` or `worker-prompt-studio-callback` services are not running, every prompt execution will silently hang forever waiting for socket events that never arrive.

The conditional dispatch pattern would look like:

```python
from utils.feature_flags import check_feature_flag  # or however flags are checked

if check_feature_flag("async_prompt_execution"):
    context, cb_kwargs = PromptStudioHelper.build_index_payload(...)
    # ... dispatch with callback, return 202
else:
    # legacy sync path
    unique_id = PromptStudioHelper.index_document(...)
    return Response({"message": "Document indexed successfully."}, status=status.HTTP_200_OK)
```

The same conditional must be applied to `fetch_response` and `single_pass_extraction`. Also note that `usePromptStudioSocket.js` on the frontend is unconditionally mounted without any feature flag guard either, which reinforces that the gating mechanism described in the PR is incomplete.

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py
Line: 322-324

Comment:
**Dead code — `if not tool:` guard is unreachable**

`CustomTool.objects.get(pk=tool_id)` is called two lines earlier (line 312). Django's `.get()` raises `CustomTool.DoesNotExist` if no matching row exists, and returns a truthy model instance on success. The `if not tool:` check therefore can never be true — either the call raises before reaching this line, or `tool` is a valid model object that is always truthy. Additionally, `ProfileManager.get_default_llm_profile(tool)` is called on line 321 *before* the guard, so even if `.get()` somehow returned a falsy value, the guard would be too late.

This check should be moved to *before* the ORM call, or (more idiomatically) handled by catching `CustomTool.DoesNotExist` in the caller.

```suggestion
        default_profile = ProfileManager.get_default_llm_profile(tool)
        PromptStudioHelper.validate_adapter_status(default_profile)
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py
Line: 527-532

Comment:
**Dead code — `if not profile_manager:` guard fires after it is already dereferenced**

`validate_adapter_status(profile_manager)` (line 527) and `validate_profile_manager_owner_access(profile_manager)` (line 528) are both called *before* the `if not profile_manager:` guard on line 531. If `profile_manager` is `None`, both validation helpers will dereference it and raise an `AttributeError` before the guard is reached — making the explicit `raise DefaultProfileError()` dead code.

The same pattern is repeated in `build_single_pass_payload` at line 1428. Move the null check to immediately after `profile_manager` is resolved:

```python
profile_manager = prompt.profile_manager
if profile_manager_id:
    profile_manager = ProfileManagerHelper.get_profile_manager(
        profile_manager_id=profile_manager_id
    )

if not profile_manager:          # ← guard BEFORE any dereference
    raise DefaultProfileError()

monitor_llm, challenge_llm = PromptStudioHelper._resolve_llm_ids(tool)
PromptStudioHelper.validate_adapter_status(profile_manager)
PromptStudioHelper.validate_profile_manager_owner_access(profile_manager)
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/views.py
Line: 401-402

Comment:
**Redundant inline imports of `uuid`**

`uuid` is already imported at module level (line 3 of this file). The inline `import uuid as _uuid` inside the function body is unnecessary and appears in all three view methods (`index_document` at line 401, `fetch_response` at line 482, `single_pass_extraction` at line 573). Use the existing top-level import directly.

```suggestion
        executor_task_id = str(uuid.uuid4())
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/backend/worker_celery.py
Line: 70-74

Comment:
**`settings.DB_USER` is not URL-encoded in the result backend URL**

`quote_plus` is correctly applied to `DB_PASSWORD`, but `DB_USER` is interpolated raw. If the database username contains any URL-special characters (e.g. `@`, `:`, `/`), the resulting connection string would be malformed and the Celery result backend would fail to connect. Apply the same `quote_plus` encoding to `settings.DB_USER` for consistency and correctness, just as is done for `settings.DB_PASSWORD`.

How can I resolve this? If you propose a fix, please make it concise.

_{Last reviewed commit: 4200ac1}

greptile-apps · 2026-03-12T15:57:10Z

backend/prompt_studio/prompt_studio_core_v2/views.py

@@ -380,104 +384,249 @@ def index_document(self, request: HttpRequest, pk: Any = None) -> Response:
        document_id: str = serializer.validated_data.get(ToolStudioPromptKeys.DOCUMENT_ID)
        document: DocumentManager = DocumentManager.objects.get(pk=document_id)
        file_name: str = document.document_name
-        # Generate a run_id
        run_id = CommonUtils.generate_uuid()

-        unique_id = PromptStudioHelper.index_document(
+        context, cb_kwargs = PromptStudioHelper.build_index_payload(
            tool_id=str(tool.tool_id),
            file_name=file_name,
            org_id=UserSessionUtils.get_organization_id(request),
            user_id=tool.created_by.user_id,
            document_id=document_id,
            run_id=run_id,
        )
-        if unique_id:
-            return Response(
-                {"message": "Document indexed successfully."},
-                status=status.HTTP_200_OK,
-            )
-        else:
-            logger.error("Error occured while indexing. Unique ID is not valid.")
-            raise IndexingAPIError()
+
+        dispatcher = PromptStudioHelper._get_dispatcher()
+
+        # Pre-generate task ID so callbacks can reference it
+        import uuid as _uuid
+
+        executor_task_id = str(_uuid.uuid4())
+        cb_kwargs["executor_task_id"] = executor_task_id
+
+        task = dispatcher.dispatch_with_callback(
+            context,
+            on_success=signature(
+                "ide_index_complete",
+                kwargs={"callback_kwargs": cb_kwargs},
+                queue="prompt_studio_callback",
+            ),
+            on_error=signature(
+                "ide_index_error",
+                kwargs={"callback_kwargs": cb_kwargs},
+                queue="prompt_studio_callback",
+            ),
+            task_id=executor_task_id,
+        )
+        return Response(
+            {"task_id": task.id, "run_id": run_id, "status": "accepted"},
+            status=status.HTTP_202_ACCEPTED,


Async path is unconditional — feature flag gating is absent

The PR description explicitly states that all three endpoints (index_document, fetch_response, single_pass_extraction) are gated behind the async_prompt_execution feature flag with the old sync path preserved as a fallback. However, the actual implementation contains no feature flag check anywhere in these view methods. All three endpoints now always return HTTP 202 and dispatch to Celery workers, making this a hard breaking change for all users regardless of the flag.

The PR description also says:

"When flag is OFF (default), all 3 endpoints use the old sync path returning HTTP 200. No behavior change for existing users."

This behavior is not implemented. Any deployment of this PR will immediately switch all Prompt Studio execution to the async path. If the worker-executor-v2 or worker-prompt-studio-callback services are not running, every prompt execution will silently hang forever waiting for socket events that never arrive.

The conditional dispatch pattern would look like:

from utils.feature_flags import check_feature_flag # or however flags are checked if check_feature_flag("async_prompt_execution"): context, cb_kwargs = PromptStudioHelper.build_index_payload(...) # ... dispatch with callback, return 202 else: # legacy sync path unique_id = PromptStudioHelper.index_document(...) return Response({"message": "Document indexed successfully."}, status=status.HTTP_200_OK)

The same conditional must be applied to fetch_response and single_pass_extraction. Also note that usePromptStudioSocket.js on the frontend is unconditionally mounted without any feature flag guard either, which reinforces that the gating mechanism described in the PR is incomplete.

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/views.py Line: 364-422 Comment: **Async path is unconditional — feature flag gating is absent** The PR description explicitly states that all three endpoints (`index_document`, `fetch_response`, `single_pass_extraction`) are gated behind the `async_prompt_execution` feature flag with the old sync path preserved as a fallback. However, the actual implementation contains no feature flag check anywhere in these view methods. All three endpoints now **always** return HTTP 202 and dispatch to Celery workers, making this a hard breaking change for all users regardless of the flag. The PR description also says: > "When flag is OFF (default), all 3 endpoints use the old sync path returning HTTP 200. No behavior change for existing users." This behavior is not implemented. Any deployment of this PR will immediately switch all Prompt Studio execution to the async path. If the `worker-executor-v2` or `worker-prompt-studio-callback` services are not running, every prompt execution will silently hang forever waiting for socket events that never arrive. The conditional dispatch pattern would look like: ```python from utils.feature_flags import check_feature_flag # or however flags are checked if check_feature_flag("async_prompt_execution"): context, cb_kwargs = PromptStudioHelper.build_index_payload(...) # ... dispatch with callback, return 202 else: # legacy sync path unique_id = PromptStudioHelper.index_document(...) return Response({"message": "Document indexed successfully."}, status=status.HTTP_200_OK) ``` The same conditional must be applied to `fetch_response` and `single_pass_extraction`. Also note that `usePromptStudioSocket.js` on the frontend is unconditionally mounted without any feature flag guard either, which reinforces that the gating mechanism described in the PR is incomplete. How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T15:57:11Z

backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py

+        default_profile = ProfileManager.get_default_llm_profile(tool)
+        if not tool:
+            raise ToolNotValid()


Dead code — if not tool: guard is unreachable

CustomTool.objects.get(pk=tool_id) is called two lines earlier (line 312). Django's .get() raises CustomTool.DoesNotExist if no matching row exists, and returns a truthy model instance on success. The if not tool: check therefore can never be true — either the call raises before reaching this line, or tool is a valid model object that is always truthy. Additionally, ProfileManager.get_default_llm_profile(tool) is called on line 321 before the guard, so even if .get() somehow returned a falsy value, the guard would be too late.

This check should be moved to before the ORM call, or (more idiomatically) handled by catching CustomTool.DoesNotExist in the caller.

Suggested change

default_profile = ProfileManager.get_default_llm_profile(tool)

if not tool:

raise ToolNotValid()

default_profile = ProfileManager.get_default_llm_profile(tool)

PromptStudioHelper.validate_adapter_status(default_profile)

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py Line: 322-324 Comment: **Dead code — `if not tool:` guard is unreachable** `CustomTool.objects.get(pk=tool_id)` is called two lines earlier (line 312). Django's `.get()` raises `CustomTool.DoesNotExist` if no matching row exists, and returns a truthy model instance on success. The `if not tool:` check therefore can never be true — either the call raises before reaching this line, or `tool` is a valid model object that is always truthy. Additionally, `ProfileManager.get_default_llm_profile(tool)` is called on line 321 *before* the guard, so even if `.get()` somehow returned a falsy value, the guard would be too late. This check should be moved to *before* the ORM call, or (more idiomatically) handled by catching `CustomTool.DoesNotExist` in the caller. ```suggestion default_profile = ProfileManager.get_default_llm_profile(tool) PromptStudioHelper.validate_adapter_status(default_profile) ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T15:57:12Z

backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py

+
+        PromptStudioHelper.validate_adapter_status(profile_manager)
+        PromptStudioHelper.validate_profile_manager_owner_access(profile_manager)
+
+        if not profile_manager:
+            raise DefaultProfileError()


Dead code — if not profile_manager: guard fires after it is already dereferenced

validate_adapter_status(profile_manager) (line 527) and validate_profile_manager_owner_access(profile_manager) (line 528) are both called before the if not profile_manager: guard on line 531. If profile_manager is None, both validation helpers will dereference it and raise an AttributeError before the guard is reached — making the explicit raise DefaultProfileError() dead code.

The same pattern is repeated in build_single_pass_payload at line 1428. Move the null check to immediately after profile_manager is resolved:

profile_manager = prompt.profile_manager if profile_manager_id: profile_manager = ProfileManagerHelper.get_profile_manager( profile_manager_id=profile_manager_id ) if not profile_manager: # ← guard BEFORE any dereference raise DefaultProfileError() monitor_llm, challenge_llm = PromptStudioHelper._resolve_llm_ids(tool) PromptStudioHelper.validate_adapter_status(profile_manager) PromptStudioHelper.validate_profile_manager_owner_access(profile_manager)

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py Line: 527-532 Comment: **Dead code — `if not profile_manager:` guard fires after it is already dereferenced** `validate_adapter_status(profile_manager)` (line 527) and `validate_profile_manager_owner_access(profile_manager)` (line 528) are both called *before* the `if not profile_manager:` guard on line 531. If `profile_manager` is `None`, both validation helpers will dereference it and raise an `AttributeError` before the guard is reached — making the explicit `raise DefaultProfileError()` dead code. The same pattern is repeated in `build_single_pass_payload` at line 1428. Move the null check to immediately after `profile_manager` is resolved: ```python profile_manager = prompt.profile_manager if profile_manager_id: profile_manager = ProfileManagerHelper.get_profile_manager( profile_manager_id=profile_manager_id ) if not profile_manager: # ← guard BEFORE any dereference raise DefaultProfileError() monitor_llm, challenge_llm = PromptStudioHelper._resolve_llm_ids(tool) PromptStudioHelper.validate_adapter_status(profile_manager) PromptStudioHelper.validate_profile_manager_owner_access(profile_manager) ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T15:57:13Z

backend/prompt_studio/prompt_studio_core_v2/views.py

+        import uuid as _uuid
+


Redundant inline imports of uuid

uuid is already imported at module level (line 3 of this file). The inline import uuid as _uuid inside the function body is unnecessary and appears in all three view methods (index_document at line 401, fetch_response at line 482, single_pass_extraction at line 573). Use the existing top-level import directly.

Suggested change

import uuid as _uuid

executor_task_id = str(uuid.uuid4())

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/views.py Line: 401-402 Comment: **Redundant inline imports of `uuid`** `uuid` is already imported at module level (line 3 of this file). The inline `import uuid as _uuid` inside the function body is unnecessary and appears in all three view methods (`index_document` at line 401, `fetch_response` at line 482, `single_pass_extraction` at line 573). Use the existing top-level import directly. ```suggestion executor_task_id = str(uuid.uuid4()) ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T15:57:14Z

backend/backend/worker_celery.py

+    result_backend = (
+        f"db+postgresql://{settings.DB_USER}:"
+        f"{quote_plus(settings.DB_PASSWORD)}"
+        f"@{settings.DB_HOST}:{settings.DB_PORT}/"
+        f"{settings.CELERY_BACKEND_DB_NAME}"


settings.DB_USER is not URL-encoded in the result backend URL

quote_plus is correctly applied to DB_PASSWORD, but DB_USER is interpolated raw. If the database username contains any URL-special characters (e.g. @, :, /), the resulting connection string would be malformed and the Celery result backend would fail to connect. Apply the same quote_plus encoding to settings.DB_USER for consistency and correctness, just as is done for settings.DB_PASSWORD.

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/backend/worker_celery.py Line: 70-74 Comment: **`settings.DB_USER` is not URL-encoded in the result backend URL** `quote_plus` is correctly applied to `DB_PASSWORD`, but `DB_USER` is interpolated raw. If the database username contains any URL-special characters (e.g. `@`, `:`, `/`), the resulting connection string would be malformed and the Celery result backend would fail to connect. Apply the same `quote_plus` encoding to `settings.DB_USER` for consistency and correctness, just as is done for `settings.DB_PASSWORD`. How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T16:00:30Z

Greptile Summary

This PR introduces a pluggable async executor framework that replaces blocking Django HTTP calls to the prompt-service with a fire-and-forget Celery dispatch chain (Django → RabbitMQ → ExecutorWorker → CallbackWorker → Socket.IO). The architecture is well-designed: the SDK1 ExecutionContext/ExecutionResult DTOs are clean and fully serialisable, the ExecutorRegistry self-registration pattern is solid, and the callback task structure correctly separates ORM writes from result delivery. The frontend migration to a Socket.IO listener (usePromptStudioSocket) with a 5-minute timeout safety net is a sensible approach.

Key issues found during review:

Feature flag gate is missing: The PR description explicitly states that all three IDE endpoints (index_document, fetch_response, single_pass_extraction) are gated behind the async_prompt_execution Flipt flag with a sync fallback. The actual views.py contains no feature flag check — all users receive HTTP 202 unconditionally. This directly breaks the stated safe-rollout strategy.
IDOR risk in task_status: The endpoint looks up a Celery task by task_id from the URL without verifying that the task was produced by an operation on the tool identified by pk. A user with access to any tool can query results belonging to a different user's execution.
Null guards after use in prompt_studio_helper.py: Both build_fetch_response_payload and build_single_pass_payload call validators on profile_manager/default_profile and then dereference them before the if not …: raise DefaultProfileError() guard, making those guards dead code for the None case.
Queue name mismatch in worker_celery.py: task_queues=[Queue("executor")] is configured, but ExecutionDispatcher dispatches to "celery_executor_legacy". The declared queue is never consumed by the dispatched tasks.

Confidence Score: 2/5

Not safe to merge without the feature flag gate — the async path is enabled for all users despite the PR's claim of a gated rollout, and there is an IDOR risk in the new task_status endpoint.
The SDK1 execution framework, executor worker, and callback task infrastructure are well-implemented. However, the missing feature flag check is a critical gap between the stated design (safe, flag-gated rollout) and the actual implementation (always-async). Every Prompt Studio user is immediately affected. The task_status IDOR allows cross-user data access. The null-guard ordering issues are latent bugs that could surface with incomplete profile configurations. These issues need to be resolved before merging.
backend/prompt_studio/prompt_studio_core_v2/views.py (missing feature flag gate + IDOR), backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py (null guard ordering in two build methods), backend/backend/worker_celery.py (queue name mismatch)

Important Files Changed

Filename	Overview
backend/prompt_studio/prompt_studio_core_v2/views.py	Three new async endpoints (index_document, fetch_response, single_pass_extraction) are missing the advertised `async_prompt_execution` feature flag gate, unconditionally breaking the sync fallback; task_status endpoint has an IDOR risk; redundant inner `import uuid` in three methods.
backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py	New `build_*_payload` methods correctly pre-load ORM data and construct ExecutionContext objects; two methods have misordered null guards (DefaultProfileError check placed after the variable is already dereferenced).
backend/backend/worker_celery.py	New second Celery app for dispatching to executor workers. Queue name in task_queues config ("executor") does not match the actual queue used by ExecutionDispatcher ("celery_executor_legacy"), making the queue config a no-op.
backend/prompt_studio/prompt_studio_core_v2/tasks.py	New Celery callback tasks (ide_index_complete, ide_prompt_complete, ide_index_error, ide_prompt_error) cleanly handle ORM writes and Socket.IO emission; _json_safe serialisation guard is present; legacy tasks are correctly retained for rollback safety.
unstract/sdk1/src/unstract/sdk1/execution/context.py	Clean, well-documented ExecutionContext dataclass with correct to_dict/from_dict round-trips, enum normalisation in post_init, and auto-generated request_id; no issues found.
unstract/sdk1/src/unstract/sdk1/execution/dispatcher.py	ExecutionDispatcher provides three dispatch modes; queue naming convention is clear; disable_sync_subtasks=False use is justified; timeout resolution logic is sound.
unstract/sdk1/src/unstract/sdk1/execution/registry.py	ExecutorRegistry class-decorator self-registration pattern is well-implemented with duplicate-name protection and a clear get/list/clear API for testing.
workers/executor/tasks.py	Single execute_extraction entry point with sensible retry policy (ConnectionError, TimeoutError, OSError), proper context deserialization error handling, and lightweight log-correlation setup.
frontend/src/hooks/usePromptStudioSocket.js	New Socket.IO listener correctly maps prompt_studio_result events to UI state updates (clearResultStatuses, updatePromptOutputState); handles fetch_response, single_pass_extraction, and index_document operations cleanly.
frontend/src/hooks/usePromptRun.js	Correctly rewritten to fire-and-forget POST; 5-minute timeout safety net clears stale spinners; freeActiveApi() in finally is intentional (frees HTTP slot, not Celery task slot).
unstract/sdk1/src/unstract/sdk1/execution/orchestrator.py	ExecutionOrchestrator correctly wraps all executor exceptions into failed ExecutionResult, measures elapsed time, and logs lifecycle events.

Sequence Diagram

sequenceDiagram
    participant FE as Frontend (PromptRun.jsx)
    participant DJV as Django View (views.py)
    participant PSH as PromptStudioHelper
    participant DISP as ExecutionDispatcher
    participant RMQ as RabbitMQ
    participant EW as Executor Worker (execute_extraction)
    participant LE as LegacyExecutor
    participant CBW as Callback Worker (prompt_studio_callback)
    participant SIO as Socket.IO
    participant FEH as Frontend (usePromptStudioSocket)

    FE->>DJV: POST /fetch_response/{tool_id}
    DJV->>PSH: build_fetch_response_payload() [ORM + extract + index - blocking]
    PSH-->>DJV: (ExecutionContext, cb_kwargs)
    DJV->>DISP: dispatch_with_callback(context, on_success=ide_prompt_complete, on_error=ide_prompt_error)
    DISP->>RMQ: send_task("execute_extraction", queue="celery_executor_legacy")
    DJV-->>FE: HTTP 202 {task_id, run_id, status:"accepted"}

    RMQ->>EW: execute_extraction(context_dict)
    EW->>LE: LegacyExecutor.execute(context)
    LE-->>EW: ExecutionResult
    EW-->>RMQ: result.to_dict() [Celery link callback triggered]

    RMQ->>CBW: ide_prompt_complete(result_dict, callback_kwargs)
    CBW->>CBW: OutputManagerHelper.handle_prompt_output_update() [ORM write]
    CBW->>SIO: _emit_websocket_event(room=log_events_id, event="prompt_studio_result")
    SIO-->>FEH: "prompt_studio_result" {status:"completed", operation, result}
    FEH->>FEH: handleCompleted() → updatePromptOutputState() + clearResultStatuses()

Prompt To Fix All With AI

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/views.py
Line: 364-595

Comment:
**Missing feature flag gate on async endpoints**

The PR description states that all three IDE endpoints (`index_document`, `fetch_response`, `single_pass_extraction`) are gated behind the `async_prompt_execution` Flipt feature flag, with the old synchronous path preserved as a fallback when the flag is `OFF`. However, none of the three view methods contain any feature flag check — they unconditionally invoke the async/Celery path and return HTTP 202.

This means the breaking architectural change (fire-and-forget + Socket.IO result delivery) is deployed for **all users** regardless of the feature flag, directly contradicting the safe-rollout strategy described in the PR. When `async_prompt_execution=false`, users would still receive HTTP 202 with no result, because the old synchronous code path is never reached.

The sync fallback (e.g. delegating to the old `run_index_document` / `run_fetch_response` / `run_single_pass_extraction` Celery tasks or the direct helper methods) should be invoked when the flag is off.

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/views.py
Line: 597-629

Comment:
**`task_status` lacks task-ownership verification (IDOR risk)**

The endpoint looks up `task_id` directly in the Celery result backend without verifying that the task belongs to the tool identified by `pk`. A user who has legitimate access to any Prompt Studio tool can supply an arbitrary `task_id` from a different tool/user's execution and retrieve that execution's `result` (the full `ExecutionResult` dict, which may contain extracted document data).

For example:
```
GET /prompt-studio/<my_tool_pk>/task-status/<other_users_task_id>
```
The permission check only validates access to `pk` (via `IsOwnerOrSharedUserOrSharedToOrg`), not whether `task_id` was produced by operations on that tool.

Consider either (a) storing a `(tool_id, task_id)` mapping server-side and validating the lookup, or (b) returning only the task's `status` from this endpoint (omitting the full `result` payload, since the real result is already delivered via Socket.IO).

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/views.py
Line: 401-403

Comment:
**Redundant `import uuid as _uuid` inside method bodies**

`uuid` is already imported at the module level (line 2). The three identical inner imports (`import uuid as _uuid` in `index_document`, `fetch_response`, and `single_pass_extraction`) are redundant. Simply use the already-imported `uuid.uuid4()`.

```suggestion
        executor_task_id = str(uuid.uuid4())
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py
Line: 520-532

Comment:
**Null guard after the variable is already dereferenced**

`validate_adapter_status(profile_manager)` and `validate_profile_manager_owner_access(profile_manager)` are both called **before** the `if not profile_manager` guard. If `profile_manager` is `None` (e.g. when `prompt.profile_manager` is unset and no `profile_manager_id` is passed), those helper calls will raise an `AttributeError` inside them, not the intended `DefaultProfileError`. The guard at line 531–532 is effectively dead code for the `None` case.

The null check should be moved to immediately after `profile_manager` is resolved:

```python
profile_manager = prompt.profile_manager
if profile_manager_id:
    profile_manager = ProfileManagerHelper.get_profile_manager(
        profile_manager_id=profile_manager_id
    )

if not profile_manager:
    raise DefaultProfileError()

# Only then call validators
PromptStudioHelper.validate_adapter_status(profile_manager)
PromptStudioHelper.validate_profile_manager_owner_access(profile_manager)
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py
Line: 733-747

Comment:
**Null guard on `default_profile` comes after it is already used**

`default_profile.chunk_size = 0` mutates the object **before** the `if not default_profile: raise DefaultProfileError()` check. If `ProfileManager.get_default_llm_profile(tool)` returns `None`, the assignment at line 744 would raise `AttributeError` rather than the intended `DefaultProfileError`. The guard is dead code for the `None` case.

Move the null check to immediately after `default_profile` is assigned (before the validators and the `chunk_size` assignment):

```python
default_profile = ProfileManager.get_default_llm_profile(tool)
if not default_profile:
    raise DefaultProfileError()

PromptStudioHelper.validate_adapter_status(default_profile)
PromptStudioHelper.validate_profile_manager_owner_access(default_profile)
default_profile.chunk_size = 0
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: backend/backend/worker_celery.py
Line: 85-92

Comment:
**Configured queue name `"executor"` doesn't match the actual dispatch queue**

`get_worker_celery_app()` registers `task_queues=[Queue("executor")]`, but `ExecutionDispatcher._get_queue()` (in `sdk1/execution/dispatcher.py`) constructs the actual queue name as `celery_executor_{executor_name}` — for the legacy executor this becomes `"celery_executor_legacy"`.

The queue declared on the app (`"executor"`) never matches the queue used by `send_task`, so this `task_queues` setting has no practical effect. While `send_task` with an explicit `queue` parameter bypasses queue routing and the task is delivered correctly, the misconfigured `task_queues` setting means any queue-routing policies (e.g. prefetch limits, fair scheduling) configured on `"executor"` will not apply.

Either align the queue name to `"celery_executor_legacy"` (or the appropriate prefix), or remove the stale `task_queues` declaration from this app's config if it is intentionally unused.

How can I resolve this? If you propose a fix, please make it concise.

_{Last reviewed commit: 4200ac1}

greptile-apps · 2026-03-12T16:00:34Z

backend/prompt_studio/prompt_studio_core_v2/views.py

@@ -380,104 +384,249 @@ def index_document(self, request: HttpRequest, pk: Any = None) -> Response:
        document_id: str = serializer.validated_data.get(ToolStudioPromptKeys.DOCUMENT_ID)
        document: DocumentManager = DocumentManager.objects.get(pk=document_id)
        file_name: str = document.document_name
-        # Generate a run_id
        run_id = CommonUtils.generate_uuid()

-        unique_id = PromptStudioHelper.index_document(
+        context, cb_kwargs = PromptStudioHelper.build_index_payload(
            tool_id=str(tool.tool_id),
            file_name=file_name,
            org_id=UserSessionUtils.get_organization_id(request),
            user_id=tool.created_by.user_id,
            document_id=document_id,
            run_id=run_id,
        )
-        if unique_id:
-            return Response(
-                {"message": "Document indexed successfully."},
-                status=status.HTTP_200_OK,
-            )
-        else:
-            logger.error("Error occured while indexing. Unique ID is not valid.")
-            raise IndexingAPIError()
+
+        dispatcher = PromptStudioHelper._get_dispatcher()
+
+        # Pre-generate task ID so callbacks can reference it
+        import uuid as _uuid
+
+        executor_task_id = str(_uuid.uuid4())
+        cb_kwargs["executor_task_id"] = executor_task_id
+
+        task = dispatcher.dispatch_with_callback(
+            context,
+            on_success=signature(
+                "ide_index_complete",
+                kwargs={"callback_kwargs": cb_kwargs},
+                queue="prompt_studio_callback",
+            ),
+            on_error=signature(
+                "ide_index_error",
+                kwargs={"callback_kwargs": cb_kwargs},
+                queue="prompt_studio_callback",
+            ),
+            task_id=executor_task_id,
+        )
+        return Response(
+            {"task_id": task.id, "run_id": run_id, "status": "accepted"},
+            status=status.HTTP_202_ACCEPTED,
+        )

    @action(detail=True, methods=["post"])
    def fetch_response(self, request: HttpRequest, pk: Any = None) -> Response:
        """API Entry point method to fetch response to prompt.

-        Args:
-            request (HttpRequest): _description_
+        Builds the full execution payload (ORM work), then fires a
+        single executor task with Celery link/link_error callbacks.

-        Raises:
-            FilenameMissingError: _description_
+        Args:
+            request (HttpRequest)

        Returns:
            Response
        """
        custom_tool = self.get_object()
-        tool_id: str = str(custom_tool.tool_id)
        document_id: str = request.data.get(ToolStudioPromptKeys.DOCUMENT_ID)
-        id: str = request.data.get(ToolStudioPromptKeys.ID)
+        prompt_id: str = request.data.get(ToolStudioPromptKeys.ID)
        run_id: str = request.data.get(ToolStudioPromptKeys.RUN_ID)
-        profile_manager: str = request.data.get(ToolStudioPromptKeys.PROFILE_MANAGER_ID)
+        profile_manager_id: str = request.data.get(
+            ToolStudioPromptKeys.PROFILE_MANAGER_ID
+        )
        if not run_id:
-            # Generate a run_id
            run_id = CommonUtils.generate_uuid()

-        # Check output count before prompt run for HubSpot notification
-        # Filter through tool FK to scope by organization (PromptStudioOutputManager
-        # lacks DefaultOrganizationManagerMixin)
-        output_count_before = PromptStudioOutputManager.objects.filter(
-            tool_id__in=CustomTool.objects.values_list("tool_id", flat=True)
-        ).count()
+        org_id = UserSessionUtils.get_organization_id(request)
+        user_id = custom_tool.created_by.user_id

-        response: dict[str, Any] = PromptStudioHelper.prompt_responder(
-            id=id,
-            tool_id=tool_id,
-            org_id=UserSessionUtils.get_organization_id(request),
-            user_id=custom_tool.created_by.user_id,
+        # Resolve prompt
+        prompt = ToolStudioPrompt.objects.get(pk=prompt_id)
+
+        # Build file path
+        doc_path = PromptStudioFileHelper.get_or_create_prompt_studio_subdirectory(
+            org_id,
+            is_create=False,
+            user_id=user_id,
+            tool_id=str(custom_tool.tool_id),
+        )
+        document: DocumentManager = DocumentManager.objects.get(pk=document_id)
+        doc_path = str(Path(doc_path) / document.document_name)
+
+        context, cb_kwargs = PromptStudioHelper.build_fetch_response_payload(
+            tool=custom_tool,
+            doc_path=doc_path,
+            doc_name=document.document_name,
+            prompt=prompt,
+            org_id=org_id,
+            user_id=user_id,
            document_id=document_id,
            run_id=run_id,
-            profile_manager_id=profile_manager,
+            profile_manager_id=profile_manager_id,
        )

-        # Notify HubSpot about first prompt run
-        notify_hubspot_event(
-            user=request.user,
-            event_name="PROMPT_RUN",
-            is_first_for_org=output_count_before == 0,
-            action_label="prompt run",
+        # If document is being indexed, return pending status
+        if context is None:
+            return Response(cb_kwargs, status=status.HTTP_200_OK)
+
+        dispatcher = PromptStudioHelper._get_dispatcher()
+
+        import uuid as _uuid
+
+        executor_task_id = str(_uuid.uuid4())
+        cb_kwargs["executor_task_id"] = executor_task_id
+
+        task = dispatcher.dispatch_with_callback(
+            context,
+            on_success=signature(
+                "ide_prompt_complete",
+                kwargs={"callback_kwargs": cb_kwargs},
+                queue="prompt_studio_callback",
+            ),
+            on_error=signature(
+                "ide_prompt_error",
+                kwargs={"callback_kwargs": cb_kwargs},
+                queue="prompt_studio_callback",
+            ),
+            task_id=executor_task_id,
+        )
+        return Response(
+            {"task_id": task.id, "run_id": run_id, "status": "accepted"},
+            status=status.HTTP_202_ACCEPTED,
        )
-
-        return Response(response, status=status.HTTP_200_OK)

    @action(detail=True, methods=["post"])
    def single_pass_extraction(self, request: HttpRequest, pk: uuid) -> Response:
-        """API Entry point method to fetch response to prompt.
+        """API Entry point method for single pass extraction.
+
+        Builds the full execution payload (ORM work), then fires a
+        single executor task with Celery link/link_error callbacks.

        Args:
-            request (HttpRequest): _description_
-            pk (Any): Primary key of the CustomTool
+            request (HttpRequest)
+            pk: Primary key of the CustomTool

        Returns:
            Response
        """
-        # TODO: Handle fetch_response and single_pass_
-        # extraction using common function
        custom_tool = self.get_object()
-        tool_id: str = str(custom_tool.tool_id)
        document_id: str = request.data.get(ToolStudioPromptKeys.DOCUMENT_ID)
        run_id: str = request.data.get(ToolStudioPromptKeys.RUN_ID)
        if not run_id:
-            # Generate a run_id
            run_id = CommonUtils.generate_uuid()
-        response: dict[str, Any] = PromptStudioHelper.prompt_responder(
-            tool_id=tool_id,
-            org_id=UserSessionUtils.get_organization_id(request),
-            user_id=custom_tool.created_by.user_id,
+
+        org_id = UserSessionUtils.get_organization_id(request)
+        user_id = custom_tool.created_by.user_id
+
+        # Build file path
+        doc_path = PromptStudioFileHelper.get_or_create_prompt_studio_subdirectory(
+            org_id,
+            is_create=False,
+            user_id=user_id,
+            tool_id=str(custom_tool.tool_id),
+        )
+        document: DocumentManager = DocumentManager.objects.get(pk=document_id)
+        doc_path = str(Path(doc_path) / document.document_name)
+
+        # Fetch prompts eligible for single-pass extraction.
+        # Mirrors the filtering in _execute_prompts_in_single_pass:
+        # only active, non-NOTES, non-TABLE/RECORD prompts.
+        prompts = list(
+            ToolStudioPrompt.objects.filter(tool_id=custom_tool.tool_id).order_by(
+                "sequence_number"
+            )
+        )
+        prompts = [
+            p
+            for p in prompts
+            if p.prompt_type != ToolStudioPromptKeys.NOTES
+            and p.active
+            and p.enforce_type != ToolStudioPromptKeys.TABLE
+            and p.enforce_type != ToolStudioPromptKeys.RECORD
+        ]
+        if not prompts:
+            return Response(
+                {"error": "No active prompts found for single pass extraction."},
+                status=status.HTTP_400_BAD_REQUEST,
+            )
+
+        context, cb_kwargs = PromptStudioHelper.build_single_pass_payload(
+            tool=custom_tool,
+            doc_path=doc_path,
+            doc_name=document.document_name,
+            prompts=prompts,
+            org_id=org_id,
            document_id=document_id,
            run_id=run_id,
        )
-        return Response(response, status=status.HTTP_200_OK)
+
+        dispatcher = PromptStudioHelper._get_dispatcher()
+
+        import uuid as _uuid
+
+        executor_task_id = str(_uuid.uuid4())
+        cb_kwargs["executor_task_id"] = executor_task_id
+
+        task = dispatcher.dispatch_with_callback(
+            context,
+            on_success=signature(
+                "ide_prompt_complete",
+                kwargs={"callback_kwargs": cb_kwargs},
+                queue="prompt_studio_callback",
+            ),
+            on_error=signature(
+                "ide_prompt_error",
+                kwargs={"callback_kwargs": cb_kwargs},
+                queue="prompt_studio_callback",
+            ),
+            task_id=executor_task_id,
+        )
+        return Response(
+            {"task_id": task.id, "run_id": run_id, "status": "accepted"},
+            status=status.HTTP_202_ACCEPTED,
+        )


Missing feature flag gate on async endpoints

The PR description states that all three IDE endpoints (index_document, fetch_response, single_pass_extraction) are gated behind the async_prompt_execution Flipt feature flag, with the old synchronous path preserved as a fallback when the flag is OFF. However, none of the three view methods contain any feature flag check — they unconditionally invoke the async/Celery path and return HTTP 202.

This means the breaking architectural change (fire-and-forget + Socket.IO result delivery) is deployed for all users regardless of the feature flag, directly contradicting the safe-rollout strategy described in the PR. When async_prompt_execution=false, users would still receive HTTP 202 with no result, because the old synchronous code path is never reached.

The sync fallback (e.g. delegating to the old run_index_document / run_fetch_response / run_single_pass_extraction Celery tasks or the direct helper methods) should be invoked when the flag is off.

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/views.py Line: 364-595 Comment: **Missing feature flag gate on async endpoints** The PR description states that all three IDE endpoints (`index_document`, `fetch_response`, `single_pass_extraction`) are gated behind the `async_prompt_execution` Flipt feature flag, with the old synchronous path preserved as a fallback when the flag is `OFF`. However, none of the three view methods contain any feature flag check — they unconditionally invoke the async/Celery path and return HTTP 202. This means the breaking architectural change (fire-and-forget + Socket.IO result delivery) is deployed for **all users** regardless of the feature flag, directly contradicting the safe-rollout strategy described in the PR. When `async_prompt_execution=false`, users would still receive HTTP 202 with no result, because the old synchronous code path is never reached. The sync fallback (e.g. delegating to the old `run_index_document` / `run_fetch_response` / `run_single_pass_extraction` Celery tasks or the direct helper methods) should be invoked when the flag is off. How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T16:00:35Z

backend/prompt_studio/prompt_studio_core_v2/views.py

+    @action(detail=True, methods=["get"])
+    def task_status(
+        self, request: HttpRequest, pk: Any = None, task_id: str = None
+    ) -> Response:
+        """Poll the status of an async Prompt Studio task.
+
+        Task IDs now point to executor worker tasks dispatched via the
+        worker-v2 Celery app.  Both apps share the same PostgreSQL
+        result backend, so we use the worker app to look up results.
+
+        Args:
+            request (HttpRequest)
+            pk: Primary key of the CustomTool (for permission check)
+            task_id: Celery task ID returned by the 202 response
+
+        Returns:
+            Response with {task_id, status} and optionally result or error
+        """
+        from celery.result import AsyncResult
+
+        from backend.worker_celery import get_worker_celery_app
+
+        result = AsyncResult(task_id, app=get_worker_celery_app())
+        if not result.ready():
+            return Response({"task_id": task_id, "status": "processing"})
+        if result.successful():
+            return Response(
+                {"task_id": task_id, "status": "completed", "result": result.result}
+            )
+        return Response(
+            {"task_id": task_id, "status": "failed", "error": str(result.result)},
+            status=status.HTTP_500_INTERNAL_SERVER_ERROR,
+        )


task_status lacks task-ownership verification (IDOR risk)

The endpoint looks up task_id directly in the Celery result backend without verifying that the task belongs to the tool identified by pk. A user who has legitimate access to any Prompt Studio tool can supply an arbitrary task_id from a different tool/user's execution and retrieve that execution's result (the full ExecutionResult dict, which may contain extracted document data).

For example:

GET /prompt-studio/<my_tool_pk>/task-status/<other_users_task_id>

The permission check only validates access to pk (via IsOwnerOrSharedUserOrSharedToOrg), not whether task_id was produced by operations on that tool.

Consider either (a) storing a (tool_id, task_id) mapping server-side and validating the lookup, or (b) returning only the task's status from this endpoint (omitting the full result payload, since the real result is already delivered via Socket.IO).

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/views.py Line: 597-629 Comment: **`task_status` lacks task-ownership verification (IDOR risk)** The endpoint looks up `task_id` directly in the Celery result backend without verifying that the task belongs to the tool identified by `pk`. A user who has legitimate access to any Prompt Studio tool can supply an arbitrary `task_id` from a different tool/user's execution and retrieve that execution's `result` (the full `ExecutionResult` dict, which may contain extracted document data). For example: ``` GET /prompt-studio/<my_tool_pk>/task-status/<other_users_task_id> ``` The permission check only validates access to `pk` (via `IsOwnerOrSharedUserOrSharedToOrg`), not whether `task_id` was produced by operations on that tool. Consider either (a) storing a `(tool_id, task_id)` mapping server-side and validating the lookup, or (b) returning only the task's `status` from this endpoint (omitting the full `result` payload, since the real result is already delivered via Socket.IO). How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T16:00:36Z

backend/prompt_studio/prompt_studio_core_v2/views.py

+        import uuid as _uuid
+
+        executor_task_id = str(_uuid.uuid4())


Redundant import uuid as _uuid inside method bodies

uuid is already imported at the module level (line 2). The three identical inner imports (import uuid as _uuid in index_document, fetch_response, and single_pass_extraction) are redundant. Simply use the already-imported uuid.uuid4().

Suggested change

import uuid as _uuid

executor_task_id = str(_uuid.uuid4())

executor_task_id = str(uuid.uuid4())

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/views.py Line: 401-403 Comment: **Redundant `import uuid as _uuid` inside method bodies** `uuid` is already imported at the module level (line 2). The three identical inner imports (`import uuid as _uuid` in `index_document`, `fetch_response`, and `single_pass_extraction`) are redundant. Simply use the already-imported `uuid.uuid4()`. ```suggestion executor_task_id = str(uuid.uuid4()) ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T16:00:37Z

backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py

+        profile_manager = prompt.profile_manager
+        if profile_manager_id:
+            profile_manager = ProfileManagerHelper.get_profile_manager(
+                profile_manager_id=profile_manager_id
+            )
+
+        monitor_llm, challenge_llm = PromptStudioHelper._resolve_llm_ids(tool)
+
+        PromptStudioHelper.validate_adapter_status(profile_manager)
+        PromptStudioHelper.validate_profile_manager_owner_access(profile_manager)
+
+        if not profile_manager:
+            raise DefaultProfileError()


Null guard after the variable is already dereferenced

validate_adapter_status(profile_manager) and validate_profile_manager_owner_access(profile_manager) are both called before the if not profile_manager guard. If profile_manager is None (e.g. when prompt.profile_manager is unset and no profile_manager_id is passed), those helper calls will raise an AttributeError inside them, not the intended DefaultProfileError. The guard at line 531–532 is effectively dead code for the None case.

The null check should be moved to immediately after profile_manager is resolved:

profile_manager = prompt.profile_manager if profile_manager_id: profile_manager = ProfileManagerHelper.get_profile_manager( profile_manager_id=profile_manager_id ) if not profile_manager: raise DefaultProfileError() # Only then call validators PromptStudioHelper.validate_adapter_status(profile_manager) PromptStudioHelper.validate_profile_manager_owner_access(profile_manager)

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py Line: 520-532 Comment: **Null guard after the variable is already dereferenced** `validate_adapter_status(profile_manager)` and `validate_profile_manager_owner_access(profile_manager)` are both called **before** the `if not profile_manager` guard. If `profile_manager` is `None` (e.g. when `prompt.profile_manager` is unset and no `profile_manager_id` is passed), those helper calls will raise an `AttributeError` inside them, not the intended `DefaultProfileError`. The guard at line 531–532 is effectively dead code for the `None` case. The null check should be moved to immediately after `profile_manager` is resolved: ```python profile_manager = prompt.profile_manager if profile_manager_id: profile_manager = ProfileManagerHelper.get_profile_manager( profile_manager_id=profile_manager_id ) if not profile_manager: raise DefaultProfileError() # Only then call validators PromptStudioHelper.validate_adapter_status(profile_manager) PromptStudioHelper.validate_profile_manager_owner_access(profile_manager) ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T16:00:38Z

backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py

+        default_profile = ProfileManager.get_default_llm_profile(tool)
+
+        challenge_llm_instance: AdapterInstance | None = tool.challenge_llm
+        challenge_llm: str | None = None
+        if challenge_llm_instance:
+            challenge_llm = str(challenge_llm_instance.id)
+        else:
+            challenge_llm = str(default_profile.llm.id)
+
+        PromptStudioHelper.validate_adapter_status(default_profile)
+        PromptStudioHelper.validate_profile_manager_owner_access(default_profile)
+        default_profile.chunk_size = 0
+
+        if not default_profile:
+            raise DefaultProfileError()


Null guard on default_profile comes after it is already used

default_profile.chunk_size = 0 mutates the object before the if not default_profile: raise DefaultProfileError() check. If ProfileManager.get_default_llm_profile(tool) returns None, the assignment at line 744 would raise AttributeError rather than the intended DefaultProfileError. The guard is dead code for the None case.

Move the null check to immediately after default_profile is assigned (before the validators and the chunk_size assignment):

default_profile = ProfileManager.get_default_llm_profile(tool) if not default_profile: raise DefaultProfileError() PromptStudioHelper.validate_adapter_status(default_profile) PromptStudioHelper.validate_profile_manager_owner_access(default_profile) default_profile.chunk_size = 0

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/prompt_studio/prompt_studio_core_v2/prompt_studio_helper.py Line: 733-747 Comment: **Null guard on `default_profile` comes after it is already used** `default_profile.chunk_size = 0` mutates the object **before** the `if not default_profile: raise DefaultProfileError()` check. If `ProfileManager.get_default_llm_profile(tool)` returns `None`, the assignment at line 744 would raise `AttributeError` rather than the intended `DefaultProfileError`. The guard is dead code for the `None` case. Move the null check to immediately after `default_profile` is assigned (before the validators and the `chunk_size` assignment): ```python default_profile = ProfileManager.get_default_llm_profile(tool) if not default_profile: raise DefaultProfileError() PromptStudioHelper.validate_adapter_status(default_profile) PromptStudioHelper.validate_profile_manager_owner_access(default_profile) default_profile.chunk_size = 0 ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-12T16:00:39Z

backend/backend/worker_celery.py

+    app.conf.update(
+        result_backend=result_backend,
+        task_queues=[Queue("executor")],
+        task_serializer="json",
+        accept_content=["json"],
+        result_serializer="json",
+        result_extended=True,
+    )


Configured queue name "executor" doesn't match the actual dispatch queue

get_worker_celery_app() registers task_queues=[Queue("executor")], but ExecutionDispatcher._get_queue() (in sdk1/execution/dispatcher.py) constructs the actual queue name as celery_executor_{executor_name} — for the legacy executor this becomes "celery_executor_legacy".

The queue declared on the app ("executor") never matches the queue used by send_task, so this task_queues setting has no practical effect. While send_task with an explicit queue parameter bypasses queue routing and the task is delivered correctly, the misconfigured task_queues setting means any queue-routing policies (e.g. prefetch limits, fair scheduling) configured on "executor" will not apply.

Either align the queue name to "celery_executor_legacy" (or the appropriate prefix), or remove the stale task_queues declaration from this app's config if it is intentionally unused.

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/backend/worker_celery.py Line: 85-92 Comment: **Configured queue name `"executor"` doesn't match the actual dispatch queue** `get_worker_celery_app()` registers `task_queues=[Queue("executor")]`, but `ExecutionDispatcher._get_queue()` (in `sdk1/execution/dispatcher.py`) constructs the actual queue name as `celery_executor_{executor_name}` — for the legacy executor this becomes `"celery_executor_legacy"`. The queue declared on the app (`"executor"`) never matches the queue used by `send_task`, so this `task_queues` setting has no practical effect. While `send_task` with an explicit `queue` parameter bypasses queue routing and the task is delivered correctly, the misconfigured `task_queues` setting means any queue-routing policies (e.g. prefetch limits, fair scheduling) configured on `"executor"` will not apply. Either align the queue name to `"celery_executor_legacy"` (or the appropriate prefix), or remove the stale `task_queues` declaration from this app's config if it is intentionally unused. How can I resolve this? If you propose a fix, please make it concise.

harini-venkataraman and others added 30 commits February 19, 2026 20:39

Execution backend - revamp

2da4907

async flow

41eeef8

Streaming progress to FE

f66dfb2

Removing multi hop in Prompt studio ide and structure tool

95c6592

Merge remote-tracking branch 'origin/main' into feat/execution-backend

44a2b3f

UN-3234 [FIX] Add beta tag to agentic prompt studio navigation item

2f4f2dc

Added executors for agentic prompt studio

d041201

Merge branch 'main' of github.com:Zipstack/unstract into feat/executi…

0a0cfb1

…on-backend

Merge branch 'main' of github.com:Zipstack/unstract into feat/executi…

a4e1fd7

…on-backend

Added executors for agentic prompt studio

ae77d6a

Added executors for agentic prompt studio

5c22956

Removed redundant envs

3cc3213

Removed redundant envs

d0532f8

Removed redundant envs

6173df5

[pre-commit.ci] auto fixes from pre-commit.com hooks

bbe6f58

for more information, see https://pre-commit.ci

Removed redundant envs

a3dc912

Merge branch 'main' of github.com:Zipstack/unstract into feat/executi…

98c8071

…on-backend

Merge branch 'feat/execution-backend' of github.com:Zipstack/unstract…

21157ac

… into feat/execution-backend

Removed redundant envs

0216b59

Removed redundant envs

db81b9d

Removed redundant envs

e1da202

Removed redundant envs

d119797

Removed redundant envs

fbadbf8

Removed redundant envs

882296e

Removed redundant envs

6d3bbbf

[pre-commit.ci] auto fixes from pre-commit.com hooks

292460b

for more information, see https://pre-commit.ci

Removed redundant envs

f35c0e6

Merge branch 'feat/execution-backend' of github.com:Zipstack/unstract…

9bcb458

… into feat/execution-backend

adding worker for callbacks

0cbd10a

pre-commit-ci bot and others added 13 commits March 11, 2026 17:09

[pre-commit.ci] auto fixes from pre-commit.com hooks

f59755a

for more information, see https://pre-commit.ci

UN-3266 fix: wrap long log message in dispatcher.py to fix E501

4bf9736

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

a2edb23

for more information, see https://pre-commit.ci

UN-3266 fix: remove unused RetrievalStrategy import from _handle_answ…

6391c6c

…er_prompt Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

0af0484

for more information, see https://pre-commit.ci

UN-3266 fix: rename UsageHelper params to lowercase (N803)

807e405

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

18eafe9

for more information, see https://pre-commit.ci

UN-3266 fix: remove unused locals in _handle_answer_prompt (F841)

7a01a35

execution_id, file_hash, log_events_id, custom_data are now extracted inside _execute_single_prompt from context.executor_params. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Merge branch 'main' into feat/async-prompt-service-v2

3e5ce31

Signed-off-by: harini-venkataraman <115449948+harini-venkataraman@users.noreply.github.com>

harini-venkataraman changed the title ~~Feat/async prompt service v2~~ UN-3266 [FEAT] Async Executor Backend for Prompt Studio Mar 12, 2026

harini-venkataraman marked this pull request as ready for review March 12, 2026 08:17

coderabbitai bot reviewed Mar 12, 2026

View reviewed changes

fix: resolve Biome linting errors in frontend source files

e3ca0c6

Auto-fixed 48 lint errors across 56 files: import ordering, block statements, unused variable prefixing, and formatting issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

coderabbitai bot reviewed Mar 12, 2026

View reviewed changes

harini-venkataraman and others added 3 commits March 12, 2026 17:39

fix: replace dynamic import of SharePermission with static import in …

db3d8c2

…Workflows Resolves vite build warning about SharePermission.jsx being both dynamically and statically imported across the codebase. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge branch 'main' into feat/async-prompt-service-v2

a62a9fd

Signed-off-by: harini-venkataraman <115449948+harini-venkataraman@users.noreply.github.com>

fix: resolve SonarCloud warnings in frontend components

b3a90af

- Remove unnecessary try-catch around PostHog event calls - Flip negated condition in PromptOutput.handleTable for clarity Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

coderabbitai bot reviewed Mar 12, 2026

View reviewed changes

Merge branch 'main' into feat/async-prompt-service-v2

4200ac1

greptile-apps bot reviewed Mar 12, 2026

View reviewed changes

Conversation

harini-venkataraman commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Why

How

Backend (65 files)

Workers (70 files, ~19,500 new lines)

SDK1 (22 files)

Frontend (275 files)

Docker / Infrastructure

Architecture Change

Can this PR break any existing features? If yes, please list possible items. If no, please explain why.

Review Guidelines

Code Structure Overview

Recommended Review Order

Data Flow (End-to-End)

Known Code Duplication

Files Safe to Skim

Relevant Docs

Related Issues or PRs

Dependencies Versions / Env Variables

Notes on Testing

Screenshots

Checklist

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

What that means in practice

Why you might see “less than k” even if your index is large

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 12, 2026

Frontend Lint Report (Biome)

Uh oh!

github-actions bot commented Mar 12, 2026

Test Results

Uh oh!

sonarqubecloud bot commented Mar 12, 2026

Quality Gate failed

Uh oh!

greptile-apps bot commented Mar 12, 2026

Greptile Summary

Confidence Score: 2/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot Mar 12, 2026

harini-venkataraman commented Mar 11, 2026 •

edited

Loading