arxiv:2506.02294
Niclas P
NPBP26
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
Less is More: Early Stopping Rollout for On-Policy Distillation
upvoted a paper 14 days ago
MixSD: Mixed Contextual Self-Distillation for Knowledge Injection
upvoted a paper 21 days ago
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training
Organizations
None yet