PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 10 days ago • 44
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 6 days ago • 128
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 10 days ago • 208
view article Article OlmoEarth v1.1: A more efficient family of Earth observation models allenai • 12 days ago • 20
view article Article Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality ibm-granite • 17 days ago • 32
MinT: Managed Infrastructure for Training and Serving Millions of LLMs Paper • 2605.13779 • Published 19 days ago • 219
SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters Paper • 2605.00528 • Published about 1 month ago • 1
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction Paper • 2605.05242 • Published 29 days ago • 117
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 29 days ago • 166
view article Article vLLM V0 to V1: Correctness Before Corrections in RL ServiceNow-AI • 25 days ago • 11
CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation Paper • 2604.19636 • Published Apr 21 • 87
view article Article Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents +2 thebajajra, ai-queen, pmonad, burtenshaw • Apr 16 • 20
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published Apr 15 • 163
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published Apr 13 • 143