[POC] feat: implement skill management in tools#6362

Open
prd-hoang-doan wants to merge 1 commit into FlowiseAI:main from prd-hoang-doan:feat/skill-tool-poc

Conversation

@prd-hoang-doan
Contributor

Proof-of-Concept: Skill management in tools

Ticket:

Flowise Roadmap

Overview:

A Skill is a new authoring primitive in Flowise — a self-contained app composed of markdown prompts, code, data, and binary assets that can be published once and invoked from any Agentflow as one or more LangChain tools. This PR implements the full Skill lifecycle across three packages: server, components, and UI.

(Attachment: Skill-Editor-demo)

Database

| Column | Type | Purpose |
| --- | --- | --- |
| id | uuid (PK) | Primary key |
| workspaceId | text | |
| name | varchar(255) | Display name; unique per workspace |
| description | text (nullable) | Optional description of the skill |
| iconSrc | varchar(255) (nullable) | UI icon metadata |
| color | varchar(16) (nullable) | UI color metadata |
| fileTree | text | Entire file/folder tree as a JSON string (`SkillFileTree`) |
| contentDigest | varchar(64) | `sha256(canonicalJson({fileTree, sortedNodeDigests}))` — invalidates caches on any change |
| publishedBundleId | varchar(64) (nullable) | Pointer to the latest published SkillBundle artefact in storage; `null` until first publish |
| createdDate | datetime | Auto-managed creation timestamp |
| updatedDate | datetime | Auto-managed update timestamp |
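The `contentDigest` formula above can be illustrated with a minimal sketch. The `canonicalJson` helper here is a hypothetical stand-in for whatever canonicalisation the PR actually uses — the point is only that key order must not affect the hash:

```typescript
import { createHash } from 'crypto'

// Hypothetical canonical-JSON serialiser: object keys are sorted so the
// same logical content always serialises (and therefore hashes) the same way.
function canonicalJson(value: unknown): string {
    if (Array.isArray(value)) return `[${value.map(canonicalJson).join(',')}]`
    if (value !== null && typeof value === 'object') {
        const obj = value as Record<string, unknown>
        const entries = Object.keys(obj)
            .sort()
            .map((k) => `${JSON.stringify(k)}:${canonicalJson(obj[k])}`)
        return `{${entries.join(',')}}`
    }
    return JSON.stringify(value)
}

// sha256(canonicalJson({fileTree, sortedNodeDigests})): any change to the
// tree shape or to any node's content changes the digest, busting caches.
function contentDigest(fileTree: unknown, sortedNodeDigests: string[]): string {
    return createHash('sha256').update(canonicalJson({ fileTree, sortedNodeDigests })).digest('hex')
}
```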

Execution Modes

The runtime node collapses into one of two modes depending on environment configuration:

| Mode | Condition | What the LLM sees |
| --- | --- | --- |
| Sandbox Shell (bash) | `E2B_APIKEY` is set, `SKILL_ALLOW_EXEC` ≠ `false`, and `enableBash` is on | Per-file skill tools enriched with execution recipes, plus a single `bash_` tool that materialises assets under `/home/user/skills/` in a sandboxed E2B VM and lets the model run shell commands |
| Fallback (read-only markdown) | E2B not configured, kill-switch flipped, or `enableBash=false` | Per-file skill tools only — compiled markdown is returned verbatim; referenced code/data becomes documentation the model reasons about without execution |

Markdown execution flow: Each selected .md file becomes a SkillFileTool. When the LLM calls it, the tool returns the pre-compiled markdown content (with all {{skill.*}} and {{tool.*}} placeholders resolved at publish time) plus an optional tool hint block. Runtime placeholders ({{question}}, {{$vars.*}}) are resolved by Flowise's agent layer after the tool returns.
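The compile-time/runtime split described above can be sketched as follows. The regex and function name are illustrative, not the PR's actual compiler code:

```typescript
// Only {{skill.*}} and {{tool.*}} placeholders are resolved at publish time;
// anything else (e.g. {{question}}, {{$vars.*}}) is left intact for
// Flowise's agent layer to resolve after the tool returns.
const COMPILE_TIME_PLACEHOLDER = /\{\{(?:skill|tool)\.[^}]+\}\}/g

function resolveCompileTime(markdown: string, values: Record<string, string>): string {
    // Unknown compile-time placeholders are left untouched rather than erased.
    return markdown.replace(COMPILE_TIME_PLACEHOLDER, (match) => values[match] ?? match)
}
```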

Bash execution flow: A single SandboxBashTool is registered per skill. On first invocation it lazily boots an E2B VM, materialises all reachable assets (markdown, code, data, binaries) under /home/user/skills/, and executes the command. Results are returned as a JSON envelope { status, stdout, stderr, exitCode, durationMs }. Built-in helper scripts handle document extraction (PDF, DOCX, PPTX, XLSX, HTML, TXT).
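As a rough TypeScript shape for that envelope (field list taken from the description above and the generated tool description further down; the concrete `status` values are not specified in this PR text):

```typescript
// Result envelope returned by the sandbox bash tool, per the description
// above; error and engine appear as optional fields in the tool docs.
interface SandboxResult {
    status: string
    stdout: string
    stderr: string
    exitCode: number
    durationMs: number
    error?: string
    engine?: string
}

// Hypothetical helper: the tool returns the envelope as a JSON string.
function parseEnvelope(raw: string): SandboxResult {
    return JSON.parse(raw) as SandboxResult
}
```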

File Kind Classification:
Files in the skill tree are classified into four kinds that determine how they are compiled and surfaced at runtime:

| Kind | Extensions | Runtime Behavior |
| --- | --- | --- |
| skill | .md, .markdown | Invocable entry points — placeholders resolved at compile time |
| code | .py, .js, .ts, .sh, .rb, .go, … | Passed verbatim to sandbox VM |
| data | .txt, .json, .csv, .yaml, .html, … | Text data — readable inside sandbox |
| binary | everything else (.pdf, .png, .xlsx, …) | Opaque bytes — surfaced as raw files |
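A sketch of the classification rule (extension sets abridged from the table above — the real classifier presumably covers more extensions):

```typescript
type FileKind = 'skill' | 'code' | 'data' | 'binary'

// Illustrative extension sets, copied from the table; not exhaustive.
const SKILL_EXTS = new Set(['.md', '.markdown'])
const CODE_EXTS = new Set(['.py', '.js', '.ts', '.sh', '.rb', '.go'])
const DATA_EXTS = new Set(['.txt', '.json', '.csv', '.yaml', '.html'])

function classify(filename: string): FileKind {
    const dot = filename.lastIndexOf('.')
    const ext = dot >= 0 ? filename.slice(dot).toLowerCase() : ''
    if (SKILL_EXTS.has(ext)) return 'skill'
    if (CODE_EXTS.has(ext)) return 'code'
    if (DATA_EXTS.has(ext)) return 'data'
    return 'binary' // everything else is treated as opaque bytes
}
```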

How the LLM detects the bash command

  • A compiler builds the bash tool description and injects the tool hint into the markdown skills. For example:

```
flowise:dev: [buildBashToolDescription] Generated bash tool description:
flowise:dev: Run a shell command inside the skill sandbox VM (engine: E2B (Bash session)). Working directory is /home/user; all reachable skill files live under /home/user/skills/ and any artefacts you want to hand back to the user should go into /home/user/output/. Returns a JSON envelope { status, stdout, stderr, exitCode, error?, durationMs, engine }; stdout/stderr are clipped, so pipe large outputs through head/tail or write them to /home/user/output/ and inspect with cat.
flowise:dev:
flowise:dev: Productivity rules — DO NOT default to cat for data files:
flowise:dev: - Always peek first: the per-file commands below are deliberately head/tail probes, not full reads. Run those before anything else.
flowise:dev: - To find specific content, use grep -nE '<pattern>' <path> (or pdfgrep for PDFs, jq for JSON, yq for YAML, xmllint --xpath for XML) — never re-read the whole file.
flowise:dev: - Need the entire file? Confirm size first with wc -c <path> and only then escalate to the explicit cat <path> alternative listed under "Productive commands per family" below.
flowise:dev: - For markdown skill files, you usually already have the content from the per-skill tool response — re-reading them with cat is wasted tokens.
flowise:dev: - Pipe noisy outputs through head -n 200 or write to /home/user/output/ and re-read selectively to stay under the stdout clamp.
flowise:dev:
flowise:dev: Starter commands per file (productive peeks/probes — escalate via "Productive commands per family" below for full reads, search, or query):
flowise:dev: - Execute with Node.js:
flowise:dev: • scoring_algorithm.js → node /home/user/skills/scoring_algorithm.js [args...]
flowise:dev: - Markdown skill files:
flowise:dev: • email-drafter.md → head -n 80 /home/user/skills/email-drafter.md
flowise:dev: • interview-questions.md → head -n 80 /home/user/skills/interview-questions.md
flowise:dev: • resume-screener.md → head -n 80 /home/user/skills/resume-screener.md
flowise:dev: - Plain text data:
flowise:dev: • job-description.txt → head -n 50 /home/user/skills/job-description.txt
flowise:dev:
flowise:dev: Built-in helpers (always available under /home/user/helpers):
flowise:dev: - pdf_extract.py python3 /home/user/helpers/pdf_extract.py # Extract text from PDF (stdlib only — single-page FlateDecode PDFs).
flowise:dev: - docx_extract.py python3 /home/user/helpers/docx_extract.py # Extract paragraph text from a .docx (stdlib only — body paragraphs).
flowise:dev: - xlsx_extract.py python3 /home/user/helpers/xlsx_extract.py # Extract rows from a .xlsx as TSV; multi-sheet workbooks get headers (stdlib only).
flowise:dev: - pptx_extract.py python3 /home/user/helpers/pptx_extract.py # Extract slide text from a .pptx with === Slide N === separators (stdlib only).
flowise:dev: - html_to_text.py python3 /home/user/helpers/html_to_text.py # Strip HTML to plain text; <script>/<style> are dropped, whitespace collapsed (stdlib only).
flowise:dev:
flowise:dev: Productive commands per family (template-only — substitute <path> / <pattern>):
flowise:dev: - Markdown skill files:
flowise:dev: • grep -nE '<pattern>' <path> — Locate a regex pattern with line numbers; replace <pattern> before issuing.
flowise:dev: • cat <path> — Streams the entire file to stdout — escalate here only after a peek/search proves you need the whole content.
flowise:dev: - Plain text data:
flowise:dev: • grep -nE '<pattern>' <path> — Locate a regex pattern with line numbers; replace <pattern> before issuing.
flowise:dev: • cat <path> — Streams the entire file to stdout — escalate here only after a peek/search proves you need the whole content.
flowise:dev: • wc -l <path> — Line count — use to size a file before reading it whole.
```

Testing

(Attachment: storage-skill)
  • Storage: local, s3, gcs
  • Database: postgres, mysql, mariadb, sqlite

Demo Recording

(video attachment)

UI changes:
- Components: SkillPublishBar, SkillCreateNodeDialog, and reference pickers for files and tools.
- Hooks & Utils: Added useNodeBlobUrl for resource handling, along with tree management, file validation, and extension utilities.
- Feature Integration: Enhanced the Tools view to support full skill lifecycle management.

@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request introduces a comprehensive "Skills V2" feature, enabling the creation, management, and execution of self-contained skill bundles within an E2B sandbox environment. It includes a full-screen UI editor with file tree and graph visualizations, a server-side compiler for transitive dependency resolution, and enhanced storage providers with raw blob primitives. Feedback focuses on improving code robustness and data integrity, specifically by simplifying boolean logic, ensuring file cleanup in error scenarios, and addressing potential race conditions and state inconsistencies during database and storage operations. Additionally, a suggestion was made to use a more portable UUID generation function in PostgreSQL migrations.

```javascript
// Capability + manifest. The manifest is only needed for the bash
// path, so skipping it when the user opted out keeps init() cheap.
// ------------------------------------------------------------------
const bashEnabledByUser = nodeData.inputs?.enableBash === false ? false : true
```

medium

The ternary operator can be simplified for better readability, prioritizing code understandability.

```javascript
const bashEnabledByUser = nodeData.inputs?.enableBash !== false
```
References
  1. Prioritize code readability and understandability over conciseness.

Comment on lines +278 to +282
```javascript
try {
    fs.unlinkSync(file.path)
} catch {
    /* best-effort cleanup */
}
```

medium

The file cleanup logic (fs.unlinkSync) should be placed in a finally block to ensure the temporary file is deleted even if fs.readFileSync fails, preventing file leaks.

```javascript
try {
    buffer = fs.readFileSync(file.path)
} finally {
    try {
        fs.unlinkSync(file.path)
    } catch {
        /* best-effort cleanup */
    }
}
```

Comment on lines +37 to +40
```javascript
const existing = await repo().findOneBy({ workspaceId, name: dto.name.trim() })
if (existing) {
    throw new InternalFlowiseError(StatusCodes.CONFLICT, `A skill named "${dto.name}" already exists`)
}
```

medium

The check for an existing skill name followed by a save is prone to race conditions. Consider using a database-level unique constraint to ensure data integrity.

Comment on lines +64 to +67
```javascript
await Promise.all([
    writeBuffer(JSON_MIME, buf, `${nodeId}.json`, ...prefix(workspaceId, skillId), 'nodes'),
    putNodeMeta(workspaceId, skillId, nodeId, meta)
])
```

medium

Using Promise.all for writing the payload and metadata can lead to an inconsistent state if one operation fails. Consider sequential execution or a transaction-like approach to ensure both succeed.
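The sequential alternative the reviewer suggests could be sketched like this. The function and parameter names here are hypothetical stand-ins, not the PR's actual storage API:

```typescript
// Write the payload first, then the metadata; if the metadata write fails,
// best-effort delete the payload so the two never diverge silently.
async function writeNodeAtomically(
    writePayload: () => Promise<void>,
    writeMeta: () => Promise<void>,
    deletePayload: () => Promise<void>
): Promise<void> {
    await writePayload()
    try {
        await writeMeta()
    } catch (err) {
        await deletePayload().catch(() => undefined) // best-effort rollback
        throw err
    }
}
```

Object stores offer no real transactions, so this only narrows the inconsistency window; a periodic sweep for payloads without metadata would close it fully.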

Comment on lines +182 to +186
```javascript
for (const id of toDelete) {
    await SkillStorage.deleteNodeAssets(workspaceId, skillId, id)
}

await saveFileTree(workspaceId, skillId, tree)
```

medium

If saveFileTree fails, the assets are already deleted, leaving the tree in an inconsistent state. Consider wrapping these operations in a transaction or ensuring the tree is updated before deleting assets.
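The safer ordering the reviewer proposes can be sketched as follows (names are illustrative stand-ins for the PR's actual helpers). Persisting the tree first means a failure leaves only orphaned assets, which a later sweep can reclaim, rather than a tree that references deleted assets:

```typescript
// Save the updated tree before deleting assets: a crash mid-way leaves
// orphaned-but-unreferenced assets instead of dangling tree references.
async function applyTreeUpdate(
    saveTree: () => Promise<void>,
    deleteAsset: (id: string) => Promise<void>,
    toDelete: string[]
): Promise<void> {
    await saveTree()
    for (const id of toDelete) {
        await deleteAsset(id).catch(() => undefined) // orphans swept later
    }
}
```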

@prd-hoang-doan
Contributor Author

Hi @HenryHengZJ , @harshit-flowise, @jchui-wd. Could you review the general approach? If you’re happy with this solution, I’m ready to dive back in and handle any further requirements to get this production-ready.
