arxiv:2603.08091
Hongli Zhou
Joe-Hall-Lee
AI & ML interests
Large Language Models
Recent Activity
upvoted a paper about 14 hours ago
Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation upvoted a paper 2 months ago
Think-J: Learning to Think for Generative LLM-as-a-Judge upvoted a paper 2 months ago
Mitigating the Bias of Large Language Model EvaluationOrganizations
None yet