view article Article Hugging Face and Cerebras bring Gemma 4 to real-time voice AI +2 A-Mahla, andito, lvwerra, vyassaurabh • 4 days ago • 60
ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving Paper • 2607.00466 • Published 4 days ago • 24
MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training Paper • 2606.30406 • Published 6 days ago • 13
Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Sampling Paper • 2607.01642 • Published 3 days ago • 26
Optimizing Visual Generative Models via Distribution-wise Rewards Paper • 2607.02291 • Published 3 days ago • 14
PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception Paper • 2606.28322 • Published 9 days ago • 38
CausalMix: Data Mixture as Causal Inference for Language Model Training Paper • 2607.01104 • Published 4 days ago • 17
SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History Paper • 2606.08671 • Published 12 days ago • 27
Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesis Paper • 2606.29814 • Published 6 days ago • 12
Monte Carlo Energy Aggregation for Mobile 3D Gaussian Splatting Paper • 2606.30017 • Published 6 days ago • 19
Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent Paper • 2606.30616 • Published 6 days ago • 86
Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs Paper • 2606.27378 • Published May 7 • 58
Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs Paper • 2606.32032 • Published 5 days ago • 22
BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language Paper • 2606.30319 • Published 6 days ago • 8
LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent Paper • 2604.17931 • Published Apr 20 • 3
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction Paper • 2604.27393 • Published Apr 30 • 81
Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning Paper • 2606.24133 • Published 12 days ago • 11
IV-CoT: Implicit Visual Chain-of-Thought for Structure-Aware Text-to-Image Generation Paper • 2606.24849 • Published 12 days ago • 17