numind/NuExtract3-mlx-nvfp4

MLX quantized version of numind/NuExtract3.

Quantization

  • mode: nvfp4
  • group_size: 16
  • bits: 4
  • dtype: source/default
  • quant_predicate: none

Usage

pip install -U mlx-vlm
mlx_vlm.generate --model numind/NuExtract3-mlx-nvfp4 --prompt "your prompt"
Downloads last month
19
Safetensors
Model size
1B params
Tensor type
U8
·
U32
·
BF16
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for numind/NuExtract3-mlx-nvfp4

Finetuned
Qwen/Qwen3.5-4B
Quantized
(9)
this model

Collection including numind/NuExtract3-mlx-nvfp4