Skip to content
View wangxihao's full-sized avatar
🎯
AIGC
🎯
AIGC

Block or report wangxihao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The Best Agent Harness. Meet Sisyphus: The Batteries-Included Agent that codes like you.

TypeScript 12,336 827 Updated Jan 9, 2026

"DeepTutor: AI-Powered Personalized Learning Assistant"

Python 7,194 865 Updated Jan 9, 2026

本人的科研经验

9,713 522 Updated Dec 12, 2025

The open source coding agent.

TypeScript 56,142 4,792 Updated Jan 9, 2026

A set of ready to use scientific skills for Claude

Python 4,996 598 Updated Jan 8, 2026

Official code for StoryMem: Multi-shot Long Video Storytelling with Memory

Python 589 58 Updated Dec 26, 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 535 18 Updated Jan 6, 2026

StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars

HTML 8 1 Updated Dec 29, 2025

The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"

Python 556 36 Updated Dec 11, 2025

A high-performance, 100% client-side tool for removing Gemini AI watermarks. Built with pure JavaScript, it leverages a mathematically precise Reverse Alpha Blending algorithm rather than unpredict…

JavaScript 1,934 217 Updated Jan 4, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 924 65 Updated Jan 6, 2026

Qwen-Image-Layered: Layered Decomposition for Inherent Editablity

Python 1,374 108 Updated Dec 31, 2025

Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Python 663 37 Updated Jan 6, 2026

A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.

Python 273 38 Updated Dec 15, 2025

HunyuanVideo-1.5: A leading lightweight video generation model

Python 2,957 113 Updated Jan 2, 2026

We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference speed.

Python 406 28 Updated Dec 24, 2025
TypeScript 1,967 351 Updated Jan 5, 2026
Jupyter Notebook 193 2 Updated Dec 19, 2025

A refactored codebase for Gaussian Splatting. Training 3DGS in 50 seconds!

Cuda 320 28 Updated Dec 18, 2025

我的开发经验+提示词库=vibecoding工作站;My development experience + prompt dictionary = Vibecoding workstation;ניסיון הפיתוח שלי + מילון פרומפטים = תחנת עבודה Vibecoding;私の開発経験 + プロンプト辞書 = Vibecoding ワークステーション;나…

Python 6,354 708 Updated Jan 4, 2026

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 1,359 130 Updated Dec 30, 2025

One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer

Python 423 42 Updated Dec 21, 2025

Performs Bazel Target Diffing between two revisions in Git, allowing for Test Target Selection and Selective Building

Kotlin 485 72 Updated Nov 4, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,128 211 Updated Jan 9, 2026

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,386 1,500 Updated Jan 7, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 18,862 2,097 Updated Jan 7, 2026

🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine

Python 1,337 217 Updated Jan 8, 2026

[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Python 518 22 Updated Jan 5, 2026

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)

Python 362 48 Updated Oct 29, 2025
Next