Lists (1)
Sort Name ascending (A-Z)
Stars
💎1MB lightweight face detection model (1MB轻量级人脸检测模型)
Noise supression using deep filtering
A video processing framework with simplicity in mind
Official implementation of AnimateDiff.
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness and temporal consistency while maintaining acceptable efficiency.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
[IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Use OCR technology to extract video hard subtitles, automatically merge multi-line subtitles, add large model auto-calibration. Ultra-high accuracy with no omissions.使用OCR技术提取视频字幕,用大模型自动校准,超高准确率。
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
andupotorac / Glyph-ByT5
Forked from AIGText/Glyph-ByT5This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering"
visual-text-ai / Glyph-ByT5
Forked from AIGText/Glyph-ByT5This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering"
[NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…
This AI tool cuts editing time from 1 hour to 3 minutes (95% faster!). It automatically extracts dialogue, identifies characters, generates commentary based on the plot, separates background noise,…
Large World Model -- Modeling Text and Video with Millions Context
AI-Video-Cropper is a Python-based tool that leverages the power of GPT-4 (OpenAI's language model) to automatically analyze videos, extract the most interesting sections, and crop them for improve…
auto video translation-video translator can auto translate video hard subtitles, auto video translation and dubbing, remove any video text, auto remove video subtitles/text. 自动视频翻译配音,自动翻译视频字幕和回填样式,…
视频二创剪辑,用AI对视频去字幕、改写文案并重新配音,一种快速的视频二创方法。Remake the video by rephrase the captions and AI dubbing. Remove all the original subtitles by auto OCR and video Inpainting.
Remove any video text by auto video OCR and video inpainting. Auto erase video watermarks、texts 、subtitles...自动视频去字幕、自动去除视频文本和台词、去除视频水印和台标、视频擦除和修复等
AI Image Translation Tool-An excellent translator for photos, pictures, posters, covers, banners and product images.AI图片翻译-很棒的批量跨境电商|海报|商品图片翻译,擦除干净,排版整齐。
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Easily train a good VC model with voice data <= 10 mins!
Source code for the X Recommendation Algorithm
