Please add support for GGUF‑quantized models
#3
by makisekurisu-jp - opened
Yes, I need the Q4_K_M quantized model. It is roughly half the size of an FP8 model. @makisekurisu-jp
ComfyUI won't support GGUF, but I've converted the model to 4-bit safetensors. Give it a try: it's about 3.5x smaller, with almost no degradation. https://huggingface.co/dummy9996/seedvr2_comfyui_bf16_mxfp8_nvfp4/blob/main/seedvr2_3b_nvfp4.safetensors