Please add support for GGUF‑quantized models

by makisekurisu-jp - opened 3 days ago

Yes, I need the Q4K_M quantized model. Its size is roughly half that of an FP8 model. @makisekurisu-jp

comfyui won't support gguf...but I've converted it to 4-bit safetensors, try it, so it's 3.5x smaller and almost no degradation https://huggingface.co/dummy9996/seedvr2_comfyui_bf16_mxfp8_nvfp4/blob/main/seedvr2_3b_nvfp4.safetensors

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment