CuiRobert

Follow

Rob Cui CuiRobert

Follow

I am a graduate student in ECNU,interested in CV NLP and ML

2 followers · 9 following

Achievements

Achievements

Stars

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,961 404 Updated Dec 31, 2025

yfzhang114 / Awesome-Multimodal-Large-Language-Models

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

863 36 Updated Dec 4, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 13,536 1,605 Updated Dec 17, 2025

KlingTeam / LivePortrait

Bring portraits to life!

Python 17,596 1,828 Updated Nov 16, 2025

BytedTsinghua-SIA / MemAgent

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 852 58 Updated Jul 31, 2025

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,824 310 Updated Aug 14, 2025

VectorSpaceLab / OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,299 369 Updated Dec 4, 2025

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,900 878 Updated Jul 18, 2024

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,728 313 Updated Jan 12, 2026

wenet-e2e / WeTextProcessing

Text Normalization & Inverse Text Normalization

Python 718 95 Updated Dec 1, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 18,922 2,107 Updated Jan 12, 2026

sooftware / attentions

PyTorch implementation of some attentions for Deep Learning Researchers.

Python 547 73 Updated Mar 4, 2022

donahowe / AutoStudio

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Jupyter Notebook 447 31 Updated Apr 13, 2025

lllyasviel / IC-Light

More relighting!

Python 8,340 525 Updated Feb 20, 2025

YaoFANGUK / video-subtitle-remover

基于AI的图片/视频硬字幕去除、文本水印去除，无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API，本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 9,197 1,152 Updated Dec 3, 2025

Kiteretsu77 / APISR

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Python 1,068 68 Updated Oct 16, 2025

lllyasviel / sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Python 4,109 351 Updated Aug 30, 2024

lllyasviel / LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

2,186 36 Updated Jun 16, 2024

ZHO-ZHO-ZHO / ComfyUI-I2VGenXL

Unofficial implementation of I2VGenXL for ComfyUI

Python 114 10 Updated May 22, 2024

YangLing0818 / RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,841 102 Updated Feb 1, 2025

TencentARC / PhotoMaker

PhotoMaker [CVPR 2024]

Jupyter Notebook 10,108 823 Updated Oct 31, 2024

Zulko / moviepy

Video editing with Python

Python 14,227 2,016 Updated Sep 25, 2025

zyddnys / manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

Python 9,203 897 Updated Dec 17, 2025

zai-org / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,715 449 Updated May 29, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 11,981 1,037 Updated Jul 31, 2024

eastmountyxz / ImageProcessing-Python

该资源为作者在CSDN的撰写Python图像处理文章的支撑，主要是Python实现图像处理、图像识别、图像分类等算法代码实现，希望该资源对您有所帮助，一起加油。

Jupyter Notebook 2,088 474 Updated Apr 12, 2024

tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,405 412 Updated Jun 28, 2024

yerfor / GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,659 300 Updated Oct 18, 2024

lllyasviel / Fooocus

Focus on prompting and generating

Python 47,497 7,746 Updated Dec 1, 2025

wuyangecit / prompting_strategy_for_diffusers

Python 5 2 Updated Aug 4, 2023