-
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Paper • 2508.21113 • Published • 110 -
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
Paper • 2508.16949 • Published • 24 -
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper • 2508.21112 • Published • 78 -
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper • 2508.21767 • Published • 12
Jeff Nyzio
TheOneTrueNiz
AI & ML interests
None yet
Recent Activity
liked a model about 4 hours ago
Ex0bit/Gemma4-PRISM-PRO-DQ liked a model about 13 hours ago
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 liked a model 1 day ago
LilaRest/gemma-4-31B-it-NVFP4-turboOrganizations
None yet