Skip to content
View caichun's full-sized avatar

Block or report caichun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,030 2,096 Updated May 19, 2025

Awesome tooling and resources in the Chrome DevTools & DevTools Protocol ecosystem

6,752 406 Updated Sep 29, 2025

科技爱好者周刊,每周五发布

82,109 3,822 Updated Jan 9, 2026

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 12,740 1,142 Updated Sep 3, 2025

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 19,205 1,951 Updated Nov 19, 2025

微信HOOK、微信机器人 wxhook,数据库解密 微信公众号采集 微信公众号爬虫,企业微信HOOK

C++ 7,024 2,367 Updated Dec 17, 2025

CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTor…

Python 3,713 533 Updated Sep 21, 2025

Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…

Python 4,074 653 Updated Apr 2, 2025

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 96,950 10,746 Updated Jan 5, 2026

GuwenBERT: 古文预训练语言模型(古文BERT) A Pre-trained Language Model for Classical Chinese (Literary Chinese)

546 40 Updated Aug 31, 2021

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,929 426 Updated Nov 30, 2025

基于Pytorch的Bert应用,包括命名实体识别、情感分析、文本分类以及文本相似度等

Python 814 116 Updated Jun 18, 2021

all kinds of text classification models and more with deep learning

Python 7,954 2,554 Updated Sep 28, 2023

中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。

Python 5,693 1,266 Updated Sep 23, 2020

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力

Python 7,443 1,208 Updated Aug 24, 2022

🌿 基于springboot的快速学习示例,整合自己遇到的开源框架,如:rabbitmq(延迟队列)、Kafka、jpa、redies、oauth2、swagger、jsp、docker、k3s、k3d、k8s、mybatis加解密插件、异常处理、日志输出、多模块开发、多环境打包、缓存cache、爬虫、jwt、GraphQL、dubbo、zookeeper和Async等等📌

Java 2,745 930 Updated Mar 7, 2025

v2ray/xray多用户管理部署程序

Python 7,015 2,393 Updated Feb 24, 2024

金融风控系统(springboot+drools)、flink流计算、mongodb

Java 168 108 Updated Jun 17, 2022

基于Flink流处理的动态实时亿级全端用户画像系统

Java 486 201 Updated Dec 14, 2022

一个轻量且功能全面的中文分词器,帮助学生了解分词器的工作原理。MicroTokenizer: A lightweight Chinese tokenizer designed for educational and research purposes. Provides a practical, hands-on approach to understanding NLP concepts,…

Python 159 22 Updated Oct 18, 2024

📚 专门为自然语言处理(NLP)面试准备的学习笔记与资料

Jupyter Notebook 352 47 Updated Nov 21, 2020

学习验证码识别的相关技术,包括opencv、tesseract、机器学习算法(kNN和SVM)等,将原作者的算法改为python

Python 135 52 Updated Oct 18, 2016

机器学习、自然语言处理、深度学习部分算法实现

Python 40 24 Updated Jan 6, 2019

The most cited deep learning papers

TeX 26,092 4,462 Updated Jan 18, 2024

中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc

2,522 395 Updated Jan 17, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,736 31,659 Updated Jan 8, 2026

A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型

Python 3,987 750 Updated Nov 21, 2022

AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2

Python 41,920 11,616 Updated Nov 12, 2024

Java后端知识图谱🔥 帮助Java初学者成长

19,041 3,821 Updated May 28, 2024
Next