HRM-Text: Efficient Pretraining Beyond Scaling Paper • 2605.20613 • Published 20 days ago • 312
VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation Paper • 2605.16079 • Published 25 days ago • 28
Running on Zero MCP Featured 1.43k FireRed Image Edit 1.0 Fast 🌖 1.43k FireRed-Image-Edit × Qwen-Image-Edit-Rapid (Transformers)
Running on Zero MCP 155 Qwen Image Edit 2509 LoRAs Fast âš¡ 155 Demo of the Collection of Qwen Image Editing LoRAs
FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization Paper • 2605.15824 • Published 25 days ago • 65
alibaba-multimodal-industrial-ai/IndustryBench Viewer • Updated 27 days ago • 2.05k • 385 • 29
deepseek-ai/DeepSeek-V4-Flash Text Generation • 158B • Updated about 19 hours ago • 3.26M • • 1.45k
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published May 4 • 348
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published Apr 13 • 143