-
f-DPGResearch/... 2024. 12. 23. 14:11
* f-divergence
* f-divergence examples (KL-divergence, Total Variation Distance)
* Aligning LMs with Preferences through f-divergence Minimization
* Algorithm
'Research > ...' 카테고리의 다른 글
Aligning Language Models with Preferences through f-divergence Minimization (0) 2024.12.23 [cdpg] Controlling Conditional Language Models without Catastrophic Forgetting (0) 2024.12.22 (3/3) GAN, F-Divergence, IPM (0) 2024.12.22 (2/3) GAN, F-Divergence, IPM (0) 2024.12.21 (1/3) GAN, F-Divergence, IPM (0) 2024.12.20