Stars
Let your Claude able to think
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
An open-source tool-augmented conversational language model from Fudan University
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
🦜🔗 The platform for reliable agents.
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Running large language models on a single GPU for throughput-oriented scenarios.
Serve, optimize and scale PyTorch models in production
PLATO dialog model with pre-trained parameters in pytorch version
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
An annotated implementation of the Transformer paper.
TensorFlow code and pre-trained models for BERT
A very simple BiLSTM-CRF model for Chinese Named Entity Recognition 中文命名实体识别 (TensorFlow)
Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.