Skip to content

Add show-cases tool for per-document regression/improvement analysis#615

Merged
de-code merged 1 commit into
mainfrom
show-examples
May 27, 2026
Merged

Add show-cases tool for per-document regression/improvement analysis#615
de-code merged 1 commit into
mainfrom
show-examples

Conversation

@de-code
Copy link
Copy Markdown
Collaborator

@de-code de-code commented May 27, 2026

Introduces benchmarks/show_cases.py with word-level diffs and file export, and Makefile targets dev-show-regressions / dev-show-improvements.

Per-case exports include the field text files alongside the source PDF, gold JATS XML, and both run TEI XMLs so regressions can be inspected end-to-end without leaving the examples directory.

SHOW_RUN_B defaults to the first baseline from eval.yml so the common comparison requires no extra arguments beyond SHOW_FIELD.

Introduces benchmarks/show_cases.py with word-level diffs and file
export, and Makefile targets dev-show-regressions / dev-show-improvements.

Per-case exports include the field text files alongside the source PDF,
gold JATS XML, and both run TEI XMLs so regressions can be inspected
end-to-end without leaving the examples directory.

SHOW_RUN_B defaults to the first baseline from eval.yml so the common
comparison requires no extra arguments beyond SHOW_FIELD.
@de-code de-code marked this pull request as ready for review May 27, 2026 20:11
@de-code de-code enabled auto-merge (squash) May 27, 2026 20:11
@de-code de-code merged commit 0f960be into main May 27, 2026
6 checks passed
@de-code de-code deleted the show-examples branch May 27, 2026 20:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant