Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 4 days ago • 38 • 3
When Cloud Agents Meet Device Agents: Lessons from Hybrid Multi-Agent Systems Paper • 2605.30102 • Published 4 days ago • 11 • 3
REPOT: Recoverable Program-of-Thought via Checkpoint Repair Paper • 2605.30052 • Published 4 days ago • 6 • 3
Thinking Before Constraining: A Unified Decoding Framework for Large Language Models Paper • 2601.07525 • Published 4 days ago • 6 • 3
Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases Paper • 2605.27355 • Published 6 days ago • 2 • 3
Convex Low-resource Accent-Robust Language Detection in Speech Recognition Paper • 2605.23235 • Published 10 days ago • 3 • 3
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory Paper • 2603.03269 • Published Mar 3 • 63 • 7
Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems Paper • 2605.27766 • Published 6 days ago • 1 • 3
EarlyTom: Early Token Compression Completes Fast Video Understanding Paper • 2605.30010 • Published 4 days ago • 27 • 3
SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement Paper • 2603.06333 • Published Mar 6 • 1 • 3
Small Vision-Language Models are Smart Compressors for Long Video Understanding Paper • 2604.08120 • Published Apr 9 • 20 • 3
Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding Paper • 2604.08537 • Published Apr 9 • 9 • 3
Time is Not a Label: Continuous Phase Rotation for Temporal Knowledge Graphs and Agentic Memory Paper • 2604.11544 • Published Apr 13 • 4 • 3
Models That Know How Evaluations Are Designed Score Safer Paper • 2605.28591 • Published 5 days ago • 6 • 5
SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context Paper • 2604.11716 • Published Apr 13 • 5 • 3
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation Paper • 2605.28293 • Published 5 days ago • 81 • 3
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published 5 days ago • 54 • 3
CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation Paper • 2605.25378 • Published 7 days ago • 53 • 3