(On-going) Mixture-of-Experts
LLMs/Reasoning 2025. 12. 31. 11:30
Still studying.
* Jan 2024 DeepSeekMoE










(I read the Experiments & Ablation study sections that follow, but I am omitting them here. The ablation study is well done!)
More references will be added...
