(On-going) Mixture-of-Experts

LLMs/Reasoning 2025. 12. 31. 11:30

Still studying

* Jan 2024 DeepSeekMoE

(I read the Experiments & Ablation study sections that follow, but I am omitting them here. The ablation study is well done!)

More references will be added...

