HRM-Text: Efficient Pretraining Beyond Scaling Paper • 2605.20613 • Published 17 days ago • 310
VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation Paper • 2605.16079 • Published 22 days ago • 28
Running on Zero MCP Featured 1.41k FireRed Image Edit 1.0 Fast 🌖 1.41k FireRed-Image-Edit × Qwen-Image-Edit-Rapid (Transformers)
Running on Zero MCP 145 Qwen Image Edit 2509 LoRAs Fast âš¡ 145 Demo of the Collection of Qwen Image Editing LoRAs
FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization Paper • 2605.15824 • Published 22 days ago • 64
alibaba-multimodal-industrial-ai/IndustryBench Viewer • Updated 23 days ago • 2.05k • 377 • 29
deepseek-ai/DeepSeek-V4-Flash Text Generation • 158B • Updated about 1 month ago • 3.47M • • 1.41k
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published May 4 • 348
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published Apr 13 • 143