numind/NuExtract3-mlx-8bits

MLX quantized version of numind/NuExtract3.

Quantization

  • mode: affine
  • group_size: 64
  • bits: 8
  • dtype: source/default
  • quant_predicate: none

Usage

pip install -U mlx-vlm
mlx_vlm.generate --model numind/NuExtract3-mlx-8bits --prompt "your prompt"
Downloads last month
-
Safetensors
Model size
2B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for numind/NuExtract3-mlx-8bits

Finetuned
Qwen/Qwen3.5-4B
Quantized
(9)
this model

Collection including numind/NuExtract3-mlx-8bits