A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,697 118 Updated Mar 12, 2026

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,993 84 Updated Aug 24, 2025

A series of technical reports on Slow Thinking with LLMs

Python 761 41 Updated Aug 13, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 772 31 Updated Sep 7, 2025

A fork to add multimodal model training to open-r1

Python 1,499 70 Updated Feb 8, 2025

Witness the aha moment of VLM with less than $3.

Python 4,036 287 Updated May 19, 2025

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,494 183 Updated Mar 28, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 74,826 8,351 Updated Mar 12, 2026

LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs

Python 418 21 Updated Dec 20, 2025

LLM inference in C/C++

C++ 97,698 15,423 Updated Mar 12, 2026

[EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models

Python 66 7 Updated Sep 22, 2024

Fast Multimodal LLM on Mobile Devices

C++ 1,429 174 Updated Mar 7, 2026

A family of lightweight multimodal models.

Python 1,052 77 Updated Nov 18, 2024

World's Smallest Vision-Language Model

Python 33 4 Updated Apr 7, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,333 275 Updated May 4, 2024

Egocentric Video Understanding Dataset (EVUD)

Python 33 3 Updated Jul 4, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,550 2,741 Updated Aug 12, 2024
Python 45 4 Updated Mar 19, 2024

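Chinese and English sensitive words, language detection, location and carrier lookup for Chinese and foreign mobile/landline numbers, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of continuous English strings, various Chinese word vectors, company name collection, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …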

Python 79,344 15,152 Updated May 10, 2024
Java 3 Updated Oct 7, 2016

WebChatGPT: A browser extension that augments your ChatGPT prompts with web results.

TypeScript 6,483 832 Updated Aug 13, 2024

This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL.

Python 959 164 Updated Sep 22, 2025

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 1,040 84 Updated Sep 19, 2024

This is the source code of PFRec

Python 14 Updated Dec 16, 2022

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Python 2,802 375 Updated Mar 1, 2026
Python 31 12 Updated Jul 14, 2021

A collection of recommender system papers I have read.

70 11 Updated Jun 26, 2021

Transfer learning / domain adaptation / domain generalization / multi-task learning, etc.: papers, code, datasets, applications, and tutorials.

Python 14,292 3,843 Updated Feb 18, 2025

A collection of classic and cutting-edge industry papers and resources on recommendation and advertising / Must-read Papers on Recommendation System and CTR Prediction

1,022 219 Updated Jan 20, 2024