Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP][Benchmark] Add MEGA-Bench core and core_single_image support #724

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

TianhaoLiang2000
Copy link

@TianhaoLiang2000 TianhaoLiang2000 commented Jan 15, 2025

Hi, VLMEvalKit team,

This PR incorporates our recent work, MEGA-Bench, a multimodal evaluation suite with over 500 real-world tasks and 45 metrics.

The evaluation process involves two steps: 1) run VLMEvalKit to produce the response/submission file; 2) run our evaluator with 45 diverse metrics to get the scores.

Example usage:

python3 run.py
--data MEGABench_core_single_image_16frame
--model Qwen2-VL-7B-Instruct
--verbose
--work-dir ~/LMUData

This PR implements the MEGA-Bench core subset and the core_single_image subset, and the core_single_image subset has been successfully tested using the Qwen2VL-7B model, confirming its functionality and compatibility. The open-ended subset will be committed soon in the same PR.

@TianhaoLiang2000 TianhaoLiang2000 changed the title [Benchmark] Add MEGA-Bench core and core_single_image support [WIP][Benchmark] Add MEGA-Bench core and core_single_image support Jan 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant