transformers

Star

Here are 3,311 public repositories matching this topic...

hiyouga / LlamaFactory

Star

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Updated Jan 13, 2026
Python

labmlai / annotated_deep_learning_paper_implementations

Star

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

machine-learning reinforcement-learning deep-learning transformers pytorch transformer gan neural-networks literate-programming attention lora deep-learning-tutorial optimizers

Updated Nov 11, 2025
Python

lucidrains / vit-pytorch

Star

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

computer-vision transformers artificial-intelligence image-classification attention-mechanism

Updated Jan 15, 2026
Python

huggingface / peft

Star

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

python adapter transformers pytorch lora diffusion fine-tuning peft parameter-efficient-learning llm

Updated Jan 15, 2026
Python

arc53 / DocsGPT

Sponsor

Star

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

react python search machine-learning natural-language-processing information-retrieval ai transformers pytorch language-model agents semantic-search hacktoberfest rag llm hacktoberfest2025 chatgpt agent-builder docsgpt

Updated Jan 15, 2026
Python

stas00 / ml-engineering

Star

Machine Learning Engineering Open Book

training debugging machine-learning ai storage network scalability transformers slurm inference pytorch machine-learning-engineering gpus mlops large-language-models llm

Updated Jan 11, 2026
Python

NVIDIA / Megatron-LM

Star

Ongoing research training transformer models at scale

transformers model-para large-language-models

Updated Jan 15, 2026
Python

BlinkDL / RWKV-LM

Star

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

deep-learning transformers pytorch transformer lstm rnn gpt language-model attention-mechanism gpt-2 gpt-3 linear-attention rwkv chatgpt

Updated Jan 12, 2026
Python

PaddlePaddle / PaddleNLP

Star

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

nlp search-engine compression sentiment-analysis transformers information-extraction question-answering llama pretrained-models embedding bert semantic-analysis distributed-training ernie neural-search uie document-intelligence paddlenlp llm

Updated Dec 17, 2025
Python

neuml / txtai

Star

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

Updated Jan 15, 2026
Python

qubvel-org / segmentation_models.pytorch

Star

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Updated Dec 23, 2025
Python

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated Jan 5, 2026
Python

OpenRLHF / OpenRLHF

Star

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

reinforcement-learning raylib transformers proximal-policy-optimization large-language-models reinforcement-learning-from-human-feedback vllm openai-o1

Updated Jan 8, 2026
Python

intel / ipex-llm

Star

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

gpu transformers pytorch llm

Updated Oct 14, 2025
Python

EleutherAI / gpt-neo

Star

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

transformers gpt language-model gpt-2 gpt-3

Updated Feb 25, 2022
Python

lucidrains / PaLM-rlhf-pytorch

Star

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

reinforcement-learning deep-learning transformers artificial-intelligence attention-mechanisms human-feedback

Updated Oct 11, 2025
Python

jessevig / bertviz

Star

BertViz: Visualize Attention in Transformer Models

visualization nlp machine-learning natural-language-processing neural-network transformers pytorch transformer bert roberta gpt2

Updated Jan 8, 2026
Python

EleutherAI / gpt-neox

Star

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

transformers language-model gpt-3 deepspeed-library

Updated Dec 10, 2025
Python

MaartenGr / BERTopic

Star

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

nlp machine-learning topic transformers topic-modeling bert topic-models sentence-embeddings topic-modelling ldavis

Updated Jan 5, 2026
Python

microsoft / presidio

Star

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Updated Jan 15, 2026
Python

Improve this page

Add a description, image, and links to the transformers topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the transformers topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

transformers

Here are 3,311 public repositories matching this topic...

hiyouga / LlamaFactory

labmlai / annotated_deep_learning_paper_implementations

lucidrains / vit-pytorch

huggingface / peft

arc53 / DocsGPT

stas00 / ml-engineering

NVIDIA / Megatron-LM

BlinkDL / RWKV-LM

PaddlePaddle / PaddleNLP

neuml / txtai

qubvel-org / segmentation_models.pytorch

speechbrain / speechbrain

OpenRLHF / OpenRLHF

intel / ipex-llm

EleutherAI / gpt-neo

lucidrains / PaLM-rlhf-pytorch

jessevig / bertviz

EleutherAI / gpt-neox

MaartenGr / BERTopic

microsoft / presidio

Improve this page

Add this topic to your repo