Stars
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
PyTorch Implementation of introducing diffusion approach to 3D depth perception ECCV 2024
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion (CVPR 2024)
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Drag & drop UI to build your customized LLM flow
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
A PyTorch library and evaluation platform for end-to-end compression research
Faster Whisper transcription with CTranslate2
VRT: A Video Restoration Transformer (official repository)
A high-throughput and memory-efficient inference and serving engine for LLMs
Implementation of RSGC-BD (Blur Detection)
Intel® NPU Acceleration Library
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…
Build real-time multimodal AI applications 🤖🎙️📹
Third party firmware for Asus routers (newer codebase)
Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
A Deep Learning based project for creating line art portraits.
A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone