APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) • 23 items • Updated about 22 hours ago • 48
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated about 8 hours ago • 48
Running Featured 78 Cohere Transcribe WebGPU ⚡ 78 Run Cohere Transcribe locally in your browser on WebGPU.
Running Featured 75 Nemotron 3 Nano WebGPU ⚛ 75 A compact reasoning-capable model running in your browser.