Detect invisible characters in LLM input and output

Similarly to the Minder rule type that detects invisible characters, the purpose of this feature would be to detect invisible unicode characters. In the context of LLMs, these can be used as a venue for [prompt injection attacks](https://www.trendmicro.com/en_us/research/25/a/invisible-prompt-injection-secure-ai.html).

Invisible characters can also be present in the code that the LLM generates, poisoning the generated application.

We should add a pipeline step that detects and blocks these.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detect invisible characters in LLM input and output #776

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Detect invisible characters in LLM input and output #776

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions