view article Article Hugging Face and Cerebras bring Gemma 4 to real-time voice AI +2 A-Mahla, andito, lvwerra, vyassaurabh • 5 days ago • 62
ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving Paper • 2607.00466 • Published 5 days ago • 24
MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training Paper • 2606.30406 • Published 7 days ago • 13
Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Sampling Paper • 2607.01642 • Published 4 days ago • 28
Optimizing Visual Generative Models via Distribution-wise Rewards Paper • 2607.02291 • Published 4 days ago • 14
AEON-7/Ornith-1.0-35B-AEON-Ultimate-Uncensored-BF16 Text Generation • 35B • Updated 6 days ago • 1.44k • 10
PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception Paper • 2606.28322 • Published 10 days ago • 38
AEON-7/Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-NVFP4 Any-to-Any • 20B • Updated 14 days ago • 878 • 8