-
[LLaDA] Large Language Diffusion ModelsLLMs/Diffusion 2026. 5. 7. 13:57
(NeurIPS 2025)
https://ml-gsai.github.io/LLaDA-demo/
LLaDA - Large Language Diffusion Models
LLaDA is a diffusion model with an unprecedented 8B scale, trained entirely from scratch, rivaling LLaMA3 8B in performance.
ml-gsai.github.io










































'LLMs > Diffusion' 카테고리의 다른 글
[D-CFG] Simple Guidance Mechanism for Discrete Diffusion Models (0) 2026.05.08 [d1] Scaling Reasoning in dLLMs via RL (0) 2026.05.07 [MDLMs] Simple and Effective Masked Diffusion Language Models (0) 2026.05.05 Likelihood-Based Diffusion Language Models (0) 2026.01.01 Diffusion-LM Improves Controllable Text Generation (0) 2025.12.31