GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models Paper • 2605.29398 • Published 9 days ago • 7
GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models Paper • 2605.29398 • Published 9 days ago • 7
diffusion-reasoning/sudoku_llfree_dream_batchll_tlc_mu8_cl256_gc4_lr1e-5_kl5e-4_psi1.0_mc2 Text Generation • Updated Apr 21
diffusion-reasoning/sudoku_llfree_dream_batchll_tlc_mu8_cl256_gc4_lr1e-5_kl5e-4_psi1.0_mc2 Text Generation • Updated Apr 21