
Try OV-DINO, a more powerful open-vocabulary detector. #452

Open
wanghao9610 opened this issue Jul 30, 2024 · 2 comments

wanghao9610 commented Jul 30, 2024

Thanks for the awesome YOLO-World! I'd like to share our recent work 🦖OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion.

  • OV-DINO is a novel unified open-vocabulary detection approach that offers superior performance and effectiveness for practical real-world applications.

  • OV-DINO entails a Unified Data Integration pipeline that integrates diverse data sources for end-to-end pre-training, and a Language-Aware Selective Fusion module to improve the vision-language understanding of the model (see the conceptual sketch after this list).

  • OV-DINO shows significant performance improvements on the COCO and LVIS benchmarks over previous methods, achieving relative gains of +5.5% AP on COCO and +4.7% AP on LVIS over YOLO-World in zero-shot evaluation.
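To give a rough intuition for the fusion idea, here is a minimal, self-contained PyTorch sketch of a "language-aware selective fusion" style block. This is not the official OV-DINO implementation: the module name, gating scheme, and tensor shapes are illustrative assumptions. The idea it shows is that text tokens are first re-weighted by a learned relevance gate ("selective"), and image tokens then attend to the re-weighted text tokens via cross-attention ("fusion").

```python
# Conceptual sketch only (not the official OV-DINO code).
import torch
import torch.nn as nn


class SelectiveFusionSketch(nn.Module):
    """Select relevant text tokens per image, then fuse them into image features."""

    def __init__(self, dim: int = 256, num_heads: int = 8):
        super().__init__()
        # Scores how relevant each text token is (the "selective" part).
        self.relevance = nn.Linear(dim, 1)
        # Cross-attention: image queries attend to the re-weighted text tokens.
        self.cross_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, img_feats: torch.Tensor, txt_feats: torch.Tensor) -> torch.Tensor:
        # img_feats: (B, N, D) visual tokens; txt_feats: (B, T, D) text tokens.
        # 1) Down-weight irrelevant text tokens with a learned gate.
        gate = torch.sigmoid(self.relevance(txt_feats))      # (B, T, 1)
        selected_txt = txt_feats * gate
        # 2) Fuse: image tokens attend to the selected text tokens.
        fused, _ = self.cross_attn(img_feats, selected_txt, selected_txt)
        return self.norm(img_feats + fused)                  # residual + norm


if __name__ == "__main__":
    block = SelectiveFusionSketch(dim=256)
    img = torch.randn(2, 900, 256)   # e.g. 900 DETR-style visual tokens
    txt = torch.randn(2, 32, 256)    # e.g. 32 prompt/category tokens
    print(block(img, txt).shape)     # torch.Size([2, 900, 256])
```

For the actual architecture and training details, please refer to the paper and code linked below.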

We have released the evaluation, fine-tuning, and demo code in our project; feel free to try our model for your application.

Project: https://wanghao9610.github.io/OV-DINO

Paper: https://arxiv.org/abs/2407.07844

Code: https://github.com/wanghao9610/OV-DINO

Demo: http://47.115.200.157:7860

Everyone is welcome to try our model, and feel free to raise an issue if you encounter any problems.

wanghao9610 changed the title from "Try OV-DINO, a more strong open-vocabulary detector." to "Try OV-DINO, a more powerful open-vocabulary detector." on Jul 30, 2024
@ForestWang

How does the inference time compare with YOLO-World?

@wanghao9610 (Author)

@ForestWang We haven't benchmarked the inference time systematically, but inference is already fast in the demo, even without any deployment optimization.
