Basit mustafa

BasitMustafa

195 217

AI & ML interests

None yet

Recent Activity

upvoted an article about 13 hours ago

Hugging Face and Cerebras bring Gemma 4 to real-time voice AI

upvoted a paper about 13 hours ago

ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving

upvoted a paper about 13 hours ago

Morphing into Hybrid Attention Models

View all activity

Organizations

upvoted an article about 13 hours ago

Article

Hugging Face and Cerebras bring Gemma 4 to real-time voice AI

A-Mahla, andito, lvwerra, vyassaurabh

•

4 days ago

• 60

upvoted 6 papers about 13 hours ago

ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving

Paper • 2607.00466 • Published 4 days ago • 24

Morphing into Hybrid Attention Models

Paper • 2606.30562 • Published 6 days ago • 37

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

Paper • 2606.30406 • Published 6 days ago • 13

Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Sampling

Paper • 2607.01642 • Published 3 days ago • 26

Optimizing Visual Generative Models via Distribution-wise Rewards

Paper • 2607.02291 • Published 3 days ago • 14

PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception

Paper • 2606.28322 • Published 9 days ago • 38

upvoted 9 papers 3 days ago

CausalMix: Data Mixture as Causal Inference for Language Model Training

Paper • 2607.01104 • Published 4 days ago • 17

SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History

Paper • 2606.08671 • Published 12 days ago • 27

Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesis

Paper • 2606.29814 • Published 6 days ago • 12

Monte Carlo Energy Aggregation for Mobile 3D Gaussian Splatting

Paper • 2606.30017 • Published 6 days ago • 19

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

Paper • 2606.30616 • Published 6 days ago • 86

Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs

Paper • 2606.27378 • Published May 7 • 58

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs

Paper • 2606.32032 • Published 5 days ago • 22

BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language

Paper • 2606.30319 • Published 6 days ago • 8

LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent

Paper • 2604.17931 • Published Apr 20 • 3

upvoted a paper 4 days ago

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Paper • 2604.27393 • Published Apr 30 • 81

upvoted 3 papers 7 days ago

Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning

Paper • 2606.24133 • Published 12 days ago • 11

IV-CoT: Implicit Visual Chain-of-Thought for Structure-Aware Text-to-Image Generation

Paper • 2606.24849 • Published 12 days ago • 17

Discretizing Reward Models

Paper • 2606.21795 • Published 16 days ago • 17

Basit mustafa

AI & ML interests

Recent Activity

Organizations

BasitMustafa's activity

Hugging Face and Cerebras bring Gemma 4 to real-time voice AI