view article Article Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains JetBrains • about 4 hours ago • 8
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 43 items • Updated 10 days ago • 46
tongrow/MLX-Qwopus3.5-9B-Coder-oQ4-fp16-mtp Image-Text-to-Text • 2B • Updated 14 days ago • 1.88k • 1
stamsam/Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-MLX-oQ4-MTP Text Generation • 35B • Updated 20 days ago • 7.38k • 11
mudler/Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-APEX-MTP-GGUF 36B • Updated 11 days ago • 37.9k • 37