arxiv:2605.07865
Minjae Oh
Riasok
·
AI & ML interests
None yet
Recent Activity
upvoted a paper about 12 hours ago
Human Psychometric Questionnaires Mischaracterize LLM Behavior authored a paper 6 days ago
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding authored a paper 6 days ago
KL for a KL: On-Policy Distillation with Control Variate Baseline