dungdx34

BlueSky dungdx34

NLP

8 followers · 86 following

Hanoi, Vietnam

Stars

ant-research / long-context-modeling

Research work aimed at addressing the problem of modeling infinite-length context

Python 40 6 Updated Dec 18, 2025

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,105 238 Updated Dec 18, 2025

bytedance / DiffLM

Findings of ACL'25 DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models

Python 4 1 Updated Jun 9, 2025

ise-uiuc / xft

XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts

Python 35 2 Updated Jul 2, 2024

SpaceHunterInf / Segment_Level_Diffusion

Python 1 Updated Jul 20, 2025

louaaron / Score-Entropy-Discrete-Diffusion

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 684 91 Updated Feb 29, 2024

LiQiiiii / DLLM-Survey

[Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey

Python 348 3 Updated Nov 1, 2025

Oswald1997 / SumSurvey

Python 2 Updated May 31, 2024

NirDiamant / GenAI_Agents

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 19,131 3,176 Updated Oct 30, 2025

icip-cas / PPTAgent

An Autonomous Agentic Framework for Reflective PowerPoint Generation

Python 3,057 365 Updated Dec 28, 2025

RUC-NLPIR / WebThinker

[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python 1,381 135 Updated Dec 8, 2025

okoge-kaz / llm-recipes

Ongoing Research Project for continaual pre-training LLM(dense mode)

Python 44 4 Updated Mar 3, 2025

rioyokotalab / moe-recipes

Ongoing Research Project for Mixture of Expert models

Python 6 1 Updated Oct 2, 2024

ML-GSAI / SMDM

Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"

Python 353 25 Updated Dec 22, 2024

allenai / ai2-scholarqa-lib

Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

Python 242 45 Updated Nov 17, 2025

hamishivi / tess-2

Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"

Python 53 4 Updated Feb 20, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,461 233 Updated Nov 12, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,591 1,306 Updated Jan 5, 2026

taewonpark / D3

Source code for "Discrete Dictionary-based Decomposition Layer for Structured Representation Learning"

Python 2 Updated Nov 4, 2024

CodeCreator / WebOrganizer

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Jupyter Notebook 73 6 Updated May 2, 2025

HKUNLP / DiffuLLaMA

[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Python 358 25 Updated May 31, 2025

allenai / open-instruct

AllenAI's post-training codebase

Python 3,506 480 Updated Jan 6, 2026

allenai / OLMoE

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 943 89 Updated Sep 23, 2025

Tencent-Hunyuan / Tencent-Hunyuan-Large

Python 1,590 119 Updated Dec 6, 2024

deepseek-ai / DeepSeek-V3

Python 100,974 16,448 Updated Aug 28, 2025

multimodal-art-projection / MAP-NEO

Python 973 89 Updated Feb 7, 2025

AI-Hypercomputer / maxtext

A simple, performant and scalable Jax LLM!

Python 2,071 444 Updated Jan 5, 2026

phandat128 / TPR-Pascal-Transformer

Python 1 Updated Apr 14, 2024

rwitten / HighPerfLLMs2024

Python 551 56 Updated Jul 11, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 6,271 693 Updated Nov 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly