Researcher at Shanghai AI Lab & Shanghai Innovation Institute.
Previous PhD. from CUHK, Postdoc at NUS.
- Shanghai
- https://qiaoshengzhang.github.io/
Pinned Loading
-
-
-
COPO
COPO PublicForked from Baichenjia/COPO
Online Preference Alignment for Language Models via Count-based Exploration
Python
-
MM-EUREKA
MM-EUREKA PublicForked from ModalMinds/MM-EUREKA
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
Python
-
MM-Eureka-V0
MM-Eureka-V0 PublicForked from FanqingM/MM-Eureka-V0
MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka
Python
-
MM-PRM
MM-PRM PublicForked from ModalMinds/MM-PRM
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.