Skip to content
View dungdx34's full-sized avatar
  • Hanoi, Vietnam

Block or report dungdx34

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Research work aimed at addressing the problem of modeling infinite-length context

Python 40 6 Updated Dec 18, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,105 238 Updated Dec 18, 2025

Findings of ACL'25 DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models

Python 4 1 Updated Jun 9, 2025

XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts

Python 35 2 Updated Jul 2, 2024

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 684 91 Updated Feb 29, 2024

[Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey

Python 348 3 Updated Nov 1, 2025
Python 2 Updated May 31, 2024

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 19,131 3,176 Updated Oct 30, 2025

An Autonomous Agentic Framework for Reflective PowerPoint Generation

Python 3,057 365 Updated Dec 28, 2025

[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python 1,381 135 Updated Dec 8, 2025

Ongoing Research Project for continaual pre-training LLM(dense mode)

Python 44 4 Updated Mar 3, 2025

Ongoing Research Project for Mixture of Expert models

Python 6 1 Updated Oct 2, 2024

Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"

Python 353 25 Updated Dec 22, 2024

Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

Python 242 45 Updated Nov 17, 2025

Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"

Python 53 4 Updated Feb 20, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,461 233 Updated Nov 12, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,591 1,306 Updated Jan 5, 2026

Source code for "Discrete Dictionary-based Decomposition Layer for Structured Representation Learning"

Python 2 Updated Nov 4, 2024

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Jupyter Notebook 73 6 Updated May 2, 2025

[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Python 358 25 Updated May 31, 2025

AllenAI's post-training codebase

Python 3,506 480 Updated Jan 6, 2026

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 943 89 Updated Sep 23, 2025

A simple, performant and scalable Jax LLM!

Python 2,071 444 Updated Jan 5, 2026
Python 1 Updated Apr 14, 2024
Python 551 56 Updated Jul 11, 2024

Modeling, training, eval, and inference code for OLMo

Python 6,271 693 Updated Nov 24, 2025
Next