AutoRubric-T2I: Robust Rule-Based Reward Model for Text-to-Image Alignment Paper • 2605.17602 • Published 9 days ago • 19
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 22 days ago • 230
Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles Paper • 2605.22177 • Published 8 days ago • 20
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 15 days ago • 144
DiffRetriever: Parallel Representative Tokens for Retrieval with Diffusion Language Models Paper • 2605.07210 • Published 21 days ago • 4
CGM-JEPA: Learning Consistent Continuous Glucose Monitor Representations via Predictive Self-Supervised Pretraining Paper • 2605.00933 • Published 28 days ago • 2
Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling Paper • 2604.27039 • Published 30 days ago • 25
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 29 days ago • 57
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 29 days ago • 217
Modeling Sparse and Bursty Vulnerability Sightings: Forecasting Under Data Constraints Paper • 2604.16038 • Published Apr 17 • 4