- United Kingdom
- @PTudosiu
Stars
AeroSpace is an i3-like tiling window manager for macOS
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
SIGGRAPH 2024 Conference Paper: Deep Fourier-based Arbitrary-scale Super-resolution for Real-time Rendering
FastVideo is a lightweight framework for accelerating large video diffusion models.
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolution'
Python bindings to the Zstandard (zstd) compression library
[CVPR 2025] Official implementation of the paper "Generative Inbetweening through Frame-wise Conditions-Driven Video Generation"
Enhance-A-Video: Better Generated Video for Free
[CVPR2025] PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/
A generative world for general-purpose robotics & embodied AI learning.
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Port of OpenAI's Whisper model in C/C++
[CVPR 2025🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
A natural language interface for computers
Create a Conda environment file from a Python project using uv.
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
Simple, unified interface to multiple Generative AI providers
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Efficient Triton Kernels for LLM Training
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥