-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1,319 changed files
with
87,318 additions
and
16 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,56 @@ | ||
# For the main pipeline structure-related code, we maintain the original license provided with lm-evaluation-harness, which is the MIT License. | ||
|
||
MIT License | ||
|
||
Copyright (c) 2024 LMMs-Lab | ||
|
||
Permission is hereby granted, free of charge, to any person obtaining a copy | ||
of this software and associated documentation files (the "Software"), to deal | ||
in the Software without restriction, including without limitation the rights | ||
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell | ||
copies of the Software, and to permit persons to whom the Software is | ||
furnished to do so, subject to the following conditions: | ||
|
||
The above copyright notice and this permission notice shall be included in all | ||
copies or substantial portions of the Software. | ||
|
||
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR | ||
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, | ||
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE | ||
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER | ||
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, | ||
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE | ||
SOFTWARE. | ||
|
||
# For the multimodal models and datasets that we have added (defined as code in the lmms_eval/tasks and lmms_eval/models folders), we apply the Apache License. | ||
|
||
Apache 2.0 License | ||
|
||
Copyright (c) 2024 LMMs-Lab | ||
|
||
Licensed under the Apache License, Version 2.0 (the "License"); | ||
you may not use this file except in compliance with the License. | ||
You may obtain a copy of the License at | ||
|
||
http://www.apache.org/licenses/LICENSE-2.0 | ||
|
||
Unless required by applicable law or agreed to in writing, software | ||
distributed under the License is distributed on an "AS IS" BASIS, | ||
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
See the License for the specific language governing permissions and | ||
limitations under the License. | ||
|
||
When modifying the code, please include the following information about the original lmms-eval source: | ||
# Adopted from lmms-eval from https://github.com/EvolvingLMMs-Lab/lmms-eval. Below is the original copyright: | ||
# | ||
# Licensed under the Apache License, Version 2.0 (the "License"); | ||
# you may not use this file except in compliance with the License. | ||
# You may obtain a copy of the License at | ||
# | ||
# http://www.apache.org/licenses/LICENSE-2.0 | ||
# | ||
# Unless required by applicable law or agreed to in writing, software | ||
# distributed under the License is distributed on an "AS IS" BASIS, | ||
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
# See the License for the specific language governing permissions and | ||
# limitations under the License. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,16 +1,84 @@ | ||
## Hi there 👋 | ||
|
||
<!-- | ||
**videommmu/VideoMMMU** is a ✨ _special_ ✨ repository because its `README.md` (this file) appears on your GitHub profile. | ||
Here are some ideas to get you started: | ||
- 🔭 I’m currently working on ... | ||
- 🌱 I’m currently learning ... | ||
- 👯 I’m looking to collaborate on ... | ||
- 🤔 I’m looking for help with ... | ||
- 💬 Ask me about ... | ||
- 📫 How to reach me: ... | ||
- 😄 Pronouns: ... | ||
- ⚡ Fun fact: ... | ||
--> | ||
<p align="center" width="80%"> | ||
<img src="https://i.postimg.cc/g0QRgMVv/WX20240228-113337-2x.png" width="100%" height="70%"> | ||
</p> | ||
|
||
# The Evaluation Suite of Large Multimodal Models | ||
|
||
[](https://pypi.org/project/lmms-eval) | ||
 | ||
 | ||
[](https://github.com/EvolvingLMMs-Lab/lmms-eval/issues) | ||
[](https://github.com/EvolvingLMMs-Lab/lmms-eval/issues) | ||
|
||
> Accelerating the development of large multimodal models (LMMs) with `lmms-eval` | ||
🏠 [LMMs-Lab Homepage](https://lmms-lab.framer.ai) | 🤗 [Huggingface Datasets](https://huggingface.co/lmms-lab) | <a href="https://emoji.gg/emoji/1684-discord-thread"><img src="https://cdn3.emoji.gg/emojis/1684-discord-thread.png" width="14px" height="14px" alt="Discord_Thread"></a> [discord/lmms-eval](https://discord.gg/zdkwKUqrPy) | ||
|
||
📖 [Supported Tasks (90+)](https://github.com/EvolvingLMMs-Lab/lmms-eval/blob/main/docs/current_tasks.md) | 🌟 [Supported Models (30+)](https://github.com/EvolvingLMMs-Lab/lmms-eval/tree/main/lmms_eval/models) | 📚 [Documentation](docs/README.md) | ||
|
||
--- | ||
|
||
## Annoucement | ||
- [2025-1] 🎉🎉 We introduce [VideoMMMU](https://videommmu.github.io/), a massive, multi-modal, multi-disciplinary video benchmark that evaluates the knowledge acquisition capability from educational videos. | ||
|
||
## Installation | ||
|
||
For formal usage, you can install the package from PyPI by running the following command: | ||
```bash | ||
pip install lmms-eval | ||
``` | ||
|
||
For development, you can install the package by cloning the repository and running the following command: | ||
```bash | ||
git clone https://github.com/EvolvingLMMs-Lab/lmms-eval | ||
cd lmms-eval | ||
pip install -e . | ||
``` | ||
|
||
If you want to test LLaVA, you will have to clone their repo from [LLaVA](https://github.com/haotian-liu/LLaVA) and | ||
```bash | ||
git clone https://github.com/LLaVA-VL/LLaVA-NeXT | ||
cd LLaVA-NeXT | ||
pip install -e . | ||
``` | ||
|
||
## Evaluation | ||
|
||
**Evaluation of LLaVA-OneVision on VideoMMMU** | ||
|
||
```bash | ||
accelerate launch --num_processes=1 --main_process_port 12345 -m lmms_eval \ | ||
--model llava_onevision \ | ||
--model_args pretrained=lmms-lab/llava-onevision-qwen2-7b-ov,conv_template=qwen_1_5,model_name=llava_qwen,max_frames_num=32,torch_dype=bfloat16 \ | ||
--tasks video_mmmu \ | ||
--batch_size 1 \ | ||
--log_samples \ | ||
--log_samples_suffix debug \ | ||
--output_path ./logs/ | ||
``` | ||
|
||
**Evaluate a single track of VideoMMMU** | ||
|
||
```bash | ||
accelerate launch --num_processes=1 --main_process_port 12345 -m lmms_eval \ | ||
--model llava_onevision \ | ||
--model_args pretrained=lmms-lab/llava-onevision-qwen2-7b-ov,conv_template=qwen_1_5,model_name=llava_qwen,max_frames_num=32,torch_dype=bfloat16 \ | ||
--tasks video_mmmu_perception \ | ||
--batch_size 1 \ | ||
--log_samples \ | ||
--log_samples_suffix debug \ | ||
--output_path ./logs/ | ||
``` | ||
|
||
**Evaluate the question_only track of VideoMMMU (Knowledge Acquisition Experiment)** | ||
|
||
```bash | ||
accelerate launch --num_processes=1 --main_process_port 12345 -m lmms_eval \ | ||
--model llava_onevision \ | ||
--model_args pretrained=lmms-lab/llava-onevision-qwen2-7b-ov,conv_template=qwen_1_5,model_name=llava_qwen,max_frames_num=1,torch_dype=bfloat16 \ | ||
--tasks video_mmmu_adaptation_question_only \ | ||
--batch_size 1 \ | ||
--log_samples \ | ||
--log_samples_suffix debug \ | ||
--output_path ./logs/ | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
# LMMs Eval Documentation | ||
|
||
Welcome to the docs for `lmms-eval`! | ||
|
||
Majority of this documentation is adapted from [lm-eval-harness](https://github.com/EleutherAI/lm-evaluation-harness/) | ||
|
||
## Table of Contents | ||
|
||
* To learn about the command line flags, see the [commands](commands.md) | ||
* To learn how to add a new moddel, see the [Model Guide](model_guide.md). | ||
* For a crash course on adding new tasks to the library, see our [Task Guide](task_guide.md). | ||
* If you need to upload your datasets into correct HF format with viewer supported, please refer to [tools](https://github.com/EvolvingLMMs-Lab/lmms-eval/tree/pufanyi/hf_dataset_docs/tools) |
Oops, something went wrong.