Skip to content

Commit

Permalink
Update packages/tasks/src/tasks/image-text-to-text/about.md
Browse files Browse the repository at this point in the history
Co-authored-by: Omar Sanseviero <[email protected]>
  • Loading branch information
merveenoyan and osanseviero authored Jun 19, 2024
1 parent d1339ee commit b2737d1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion packages/tasks/src/tasks/image-text-to-text/about.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
Vision language models come in three types:

- **Base:** Pre-trained models that can be fine-tuned. A good example is for base models is [PaliGemma models](https://huggingface.co/models?sort=trending&search=google%2Fpaligemma-3b-pt) by Google.
- **Instruction:** Base models fine-tuned on instruction datasets. A good example for instruction fine-tuned models is [idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b).
- **Instruction:** Base models fine-tuned on instruction datasets. A good example of instruction fine-tuned models is [idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b).
- **Chatty/Conversation:** Base models fine-tuned on conversation datasets. A good example for chatty models is [deepseek-vl-7b-chat](https://huggingface.co/deepseek-ai/deepseek-vl-7b-chat).

![VLM uses](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/blog/vlm/visual.jpg)
Expand Down

0 comments on commit b2737d1

Please sign in to comment.