Stars
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A latent text-to-image diffusion model
High-speed downloader for multiple platforms
DeepSeek-VL: Towards Real-World Vision-Language Understanding
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Reference PyTorch implementation and models for DINOv3
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Wan: Open and Advanced Large-Scale Video Generative Models
Wan: Open and Advanced Large-Scale Video Generative Models
Port of OpenAI's Whisper model in C/C++
Robust Speech Recognition via Large-Scale Weak Supervision
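A comprehensive collection of classic programming books, covering: computer systems and networks, system architecture, algorithms and data structures, front-end development, back-end development, mobile development, databases, testing, projects and teams, programmer career development, job interviews, and more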
The fantastic ORM library for Golang, aims to be developer friendly
A high-performance and stable proxy for MySQL, developed by Qihoo's DBA and infrastructure team
A curated list of awesome Go frameworks, libraries and software
Scalable datastore for metrics, events, and real-time analytics
A high productivity, full-stack web framework for the Go language.
Vitess is a database clustering system for horizontal scaling of MySQL.