RiT: Vanilla Diffusion Transformers Suffice in Representation Space Paper • 2605.21981 • Published 7 days ago • 10
Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization Paper • 2605.13641 • Published 15 days ago • 49
A Single Neuron Is Sufficient to Bypass Safety Alignment in Large Language Models Paper • 2605.08513 • Published 20 days ago • 15
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 242
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published Apr 9 • 101
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks Paper • 2604.08539 • Published Apr 9 • 50
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 184k • • 2.85k
CREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions Paper • 2603.26174 • Published Mar 27 • 5