Stars
A collection of easy-to-use image/video filter.
osama-ata / Wan2.2-mlx
Forked from Wan-Video/Wan2.2Wan: Open and Advanced Large-Scale Video Generative Models
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
Weighs the soul of incoming HTTP requests to stop AI crawlers
MLX native implementations of state-of-the-art generative image models
General purpose 3D and 2D game engine using Go (golang) and Vulkan with built in editor
True P2P Email on top of Yggdrasil Network for Android
LiveLinkFace ARKit Receiver is a Blender add-on that receives facial tracking data sent from the Live Link Face app on iPhone and automatically applies it to Shape Keys in Blender.
Track personal Bluetooth devices via Apple's "Find My" network using OpenHaystack and Macless-Haystack with tools written in Go/TinyGo. No Apple hardware required!
Cross-platform Bluetooth API for Go and TinyGo. Supports Linux, macOS, Windows, and bare metal using Nordic SoftDevice or HCI
DeepSeek-OCR Inspired Optical Encoder for Qwen3-VLM
Unofficial Galaxy Buds Manager for Windows, macOS, Linux, and Android
Unofficial Galaxy Buds Manager for Windows, macOS, Linux, and Android
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…
LL3M writes Python code that generates 3D assets in Blender.
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, …
vurte / Wan2.1-MAC
Forked from Wan-Video/Wan2.1Wan: Open and Advanced Large-Scale Video Generative Models
A simple web-based tool for Spriting and Pixel art.
Portable file server with accelerated resumable uploads, dedup, WebDAV, SFTP, FTP, TFTP, zeroconf, media indexer, thumbnails++ all in one file
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

