[New feature] LLM Weaviate Code Generation Benchmark by g-despot · Pull Request #379 · weaviate/docs

g-despot · 2026-03-21T07:09:32Z

What's being changed:

This benchmark evaluates how well large language models (LLMs) generate working Weaviate v4 Python client code when given natural language task descriptions. It measures whether an LLM can produce code that actually connects to a Weaviate cluster and performs the requested operation without errors.
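The pass/fail criterion described above (does the generated code run without errors?) could be checked roughly as follows. This is a minimal sketch and not the harness from this PR: the names `run_generated_code` and `pass_rate` and the timeout value are hypothetical. In a real run each snippet would import the Weaviate client and talk to a cluster; here any Python source string stands in for LLM output.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_generated_code(code: str, timeout_s: float = 60.0) -> bool:
    """Run a generated snippet in a fresh interpreter.

    Returns True only if the process exits with code 0 before the timeout.
    (Hypothetical helper, not part of the PR.)
    """
    with tempfile.TemporaryDirectory() as tmp:
        script = Path(tmp) / "candidate.py"
        script.write_text(code, encoding="utf-8")
        try:
            result = subprocess.run(
                [sys.executable, str(script)],
                capture_output=True,  # keep stdout/stderr out of the harness output
                text=True,
                timeout=timeout_s,
                cwd=tmp,
            )
        except subprocess.TimeoutExpired:
            # A hung snippet counts as a failure.
            return False
    return result.returncode == 0


def pass_rate(snippets: list[str]) -> float:
    """Fraction of snippets that ran without errors (0.0 for an empty list)."""
    if not snippets:
        return 0.0
    return sum(run_generated_code(s) for s in snippets) / len(snippets)
```

Running each snippet in a separate process keeps an uncaught exception or a hang in generated code from taking down the harness itself. Exit code 0 only shows that nothing raised; checking that the requested operation had the intended effect would need task-specific assertions on top of this.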

Type of change:

Documentation content updates (non-breaking change to fix/update documentation)
Feature or enhancements (non-breaking change to add functionality)

How has this been tested?

Local build - the site works as expected when running yarn start

orca-security-eu

Orca Security Scan Summary

| Status | Check | Issues by priority |
| --- | --- | --- |
| Passed | Infrastructure as Code | 0 0 0 0 |
| Passed | SAST | 0 0 0 0 |
| Passed | Secrets | 0 0 0 0 |
| Passed | Vulnerabilities | 0 0 0 0 |

Add vibe code benchmarking

3b296d4

orca-security-eu bot reviewed Mar 21, 2026

Retrigger build

711feba

g-despot wants to merge 2 commits into main from vibe-code-evaluation