WIP: Add Capture Dataset by bodsul · Pull Request #974 · open-compass/VLMEvalKit

bodsul · 2025-05-02T19:51:06Z

Add evaluation for newly released CAPTURE dataset. Arxiv preprint: https://arxiv.org/abs/2504.15485, repo: https://github.com/atinpothiraj/CAPTURe.

Opened issue regarding prompt choice and using Llama-3.1-8B-Instruct to extract answers here. Will update PR from WIP once this issues are clarified.

kennymckormick · 2025-05-26T03:00:45Z

Hi, @bodsul ,

Maybe you can try to re-implement and use some API models (like gpt-4o / gpt-4.1-mini). If you don't have the credit for GPT API, you can just implement in this PR and I'll help check the results.

atinpothiraj · 2025-05-30T03:23:37Z

Added my comments to the issue in the original repo, lmk if any other clarification is needed.

bodsul added 3 commits May 1, 2025 00:11

impl tsv from capture data logic

923cda2

WIP: adding CAPTURE dataset

496aec6

fix flake8 errors introduced by me

e9df441

bodsul mentioned this pull request May 2, 2025

Prompting atinpothiraj/CAPTURe#1

Open

kennymckormick added the WIP label May 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Add Capture Dataset#974

WIP: Add Capture Dataset#974
bodsul wants to merge 3 commits intoopen-compass:mainfrom
bodsul:capture

bodsul commented May 2, 2025 •

edited

Loading

Uh oh!

kennymckormick commented May 26, 2025

Uh oh!

atinpothiraj commented May 30, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

bodsul commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kennymckormick commented May 26, 2025

Uh oh!

atinpothiraj commented May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

bodsul commented May 2, 2025 •

edited

Loading

atinpothiraj commented May 30, 2025 •

edited

Loading