Shaobai Jiang

shaobaij

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

upvoted a paper 1 day ago

Efficient Universal Perception Encoder

upvoted a paper 1 day ago

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Organizations

None yet

upvoted a paper about 14 hours ago

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 46

upvoted 6 papers 1 day ago

Efficient Universal Perception Encoder

Paper • 2603.22387 • Published 17 days ago • 6

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published 3 days ago • 28

upvoted 5 papers 2 days ago

Agentic Critical Training

Paper • 2603.08706 • Published about 1 month ago • 14

Meta-Reinforcement Learning with Self-Reflection for Agentic Search

Paper • 2603.11327 • Published 28 days ago • 9

ActionParty: Multi-Subject Action Binding in Generative Video Games

Paper • 2604.02330 • Published 7 days ago • 6

Therefore I am. I Think

Paper • 2604.01202 • Published 7 days ago • 30

Meta-Harness: End-to-End Optimization of Model Harnesses

Paper • 2603.28052 • Published 10 days ago • 16

upvoted 8 papers 3 days ago

VOID: Video Object and Interaction Deletion

Paper • 2604.02296 • Published 7 days ago • 47

FlashSampling: Fast and Memory-Efficient Exact Sampling

Paper • 2603.15854 • Published 24 days ago • 9

LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation

Paper • 2603.10899 • Published 29 days ago • 7

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Paper • 2603.12252 • Published 28 days ago • 12

Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion

Paper • 2603.06577 • Published Mar 6 • 48

Lost in Backpropagation: The LM Head is a Gradient Bottleneck

Paper • 2603.10145 • Published 30 days ago • 12

Exclusive Self Attention

Paper • 2603.09078 • Published about 1 month ago • 3

Adaptive Loops and Memory in Transformers: Think Harder or Know More?

Paper • 2603.08391 • Published 29 days ago • 1
