For rebuttal, add GLiNER2 eval results by Ki-Seki · Pull Request #102 · SculptAI/GIMBench

Ki-Seki · 2026-04-10T18:09:45Z

No description provided.

for more information, see https://pre-commit.ci

Copilot

Pull request overview

Adds evaluation artifacts for the GLiNER2 CV-parsing run, capturing both the exact invocation used and the resulting per-item/aggregate metrics for later comparison and reproducibility.

Changes:

Added a full CV parsing evaluation result JSON for fastino/gliner2-large-v1 on Sculpt-AI/GIMBench-cv-parse.
Added a small eval.sh helper script to rerun the same evaluation configuration.

Reviewed changes

Copilot reviewed 1 out of 2 changed files in this pull request and generated 2 comments.

File	Description
results/260411-kdd-rebuttal-cv-gliner2-model/fastino_gliner2-large-v1_Sculpt-AI_GIMBench-cv-parse_260411-010729.json	Stores the recorded environment/args plus detailed per-item extraction outcomes and aggregate accuracy.
results/260411-kdd-rebuttal-cv-gliner2-model/eval.sh	Provides a reproducible command to rerun the GLiNER2 CV parsing evaluation.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+                "phone_number": {
+                    "prediction": "",
+                    "expected": "+49 (0) 621 181 2098",
+                    "verbatim_correct": false,
+                    "judge_model_correct": false,
+                    "correct": false
+                },
+                "email": {
+                    "prediction": "B n.zhang@uni-mannheim.de",
+                    "expected": "n.zhang@uni-mannheim.de",
+                    "verbatim_correct": false,
+                    "judge_model_correct": false,
+                    "correct": false


+python -m gimbench.cv.cv_parse \
+    --use_gliner2 \
+    --model_name "fastino/gliner2-large-v1" \
+    --model_type "openai" \
+    --judge_model_name "google/gemini-2.5-flash" \
+    --api_key $API_KEY --base_url $API_BASE


Add GLiNER2 eval results

6392111

Ki-Seki added the do not merge label Apr 10, 2026

Copilot AI review requested due to automatic review settings April 10, 2026 18:09

[pre-commit.ci] auto fixes from pre-commit.com hooks

d267019

for more information, see https://pre-commit.ci

Ki-Seki changed the title ~~Add GLiNER2 eval results~~ For rebuttal, add GLiNER2 eval results Apr 10, 2026

Copilot started reviewing on behalf of Ki-Seki April 10, 2026 18:10 View session

Copilot AI reviewed Apr 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

For rebuttal, add GLiNER2 eval results#102

For rebuttal, add GLiNER2 eval results#102
Ki-Seki wants to merge 2 commits into
rebuttal/gliner2from
rebuttal/gliner2-results

Ki-Seki commented Apr 10, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Ki-Seki commented Apr 10, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants