Stars
A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and ent…
📑 PageIndex: Document Index for Reasoning-based RAG
The open-source CapCut alternative
PersonaLive! : Expressive Portrait Image Animation for Live Streaming
An executable to convert SOCKS5 proxy into HTTP proxy
此仓库存储我在YouTube频道分享的N8N工作流配置文件,用户可直接下载JSON文件导入N8N使用
JJYB_AI 智剪 - 智能视频自动剪辑与AI解说工具(离线TTS、原创解说、混剪、AI配音)
CLI tool to extract (meta)data from PDF and manipulate PDF files
AIdea 是一款支持 GPT 以及国产大语言模型通义千问、文心一言等,支持 Stable Diffusion 文生图、图生图、 SDXL1.0、超分辨率、图片上色的全能型 APP。
Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers powerful capabilities for agent running control, data processing …
A framework for efficient model inference with omni-modality models
Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
SoFlow: Solution Flow Models for One-Step Generative Modeling
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…
Next generation Cosmic desktop environment
Backup automation for self-hosters. Built on top of restic
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
阅你所想,绘你所梦,从一个想法到一本完整的精彩小说。ReaDreamAI为你包办写作、插图与视频。
Translate EPUB books using Large Language Models while preserving the original text. The translated content is displayed side-by-side with the original, creating bilingual books perfect for languag…
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…
The successful integration of Qwen3-VL-Instruct series into the ComfyUI platform has enabled a smooth operation, supporting (but not limited to) text-based queries, video queries, single-image quer…
