arxiv:2602.05711
Loser Cheems
JingzeShi
AI & ML interests
I like training small languge models.
Recent Activity
updated a model about 2 hours ago
JingzeShi/flash-sparse-attention liked a model 30 days ago
BAAI/OpenSeek-Mid-v1 published a model about 1 month ago
JingzeShi/flash-sparse-attention