Instructions to use moondream/moondream3-preview with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use moondream/moondream3-preview with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="moondream/moondream3-preview", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("moondream/moondream3-preview", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use moondream/moondream3-preview with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "moondream/moondream3-preview" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "moondream/moondream3-preview", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/moondream/moondream3-preview
- SGLang
How to use moondream/moondream3-preview with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "moondream/moondream3-preview" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "moondream/moondream3-preview", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "moondream/moondream3-preview" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "moondream/moondream3-preview", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use moondream/moondream3-preview with Docker Model Runner:
docker model run hf.co/moondream/moondream3-preview
Add PBench average evaluation result
#38 opened 6 days ago
by
merve
Always output inaccurate results
2
#34 opened 2 months ago
by
Gureumi
Segmentation not supported in moondream3-preview
👀 5
#31 opened 3 months ago
by
theblackcat102
AttributeError: 'HfMoondream' object has no attribute 'all_tied_weights_keys'. Did you mean: '_tied_weights_keys'?
👍 1
8
#30 opened 3 months ago
by
Clausss
How to perform multi-image inference with Moondream3?
1
#29 opened 5 months ago
by
StrideH
Having issues trying to run moondream3-preview in vllm?
1
#28 opened 6 months ago
by
mmonti
Use nn.Linear instead of a custom linear function
#27 opened 6 months ago
by
Disty0
fix: preserve spatial_toks when reasoning=true
2
#26 opened 7 months ago
by
jasonmoo
CPU is forbidden ?
1
#24 opened 7 months ago
by
edmond
Does this model only detect one type of target at a time?
🔥 1
1
#23 opened 7 months ago
by
zgh1997
PDF support
1
#22 opened 7 months ago
by
chiragshahcompass
Dataset ?
1
#21 opened 7 months ago
by
Ink
What are the metrics/eval framework?
1
#20 opened 7 months ago
by
wamreyaz
Perceptual Ambiguity and Visual-Verbal Coherence in Multimodal Models
#19 opened 7 months ago
by
elly99
Request: DOI
1
#17 opened 8 months ago
by
HasanXLS
Any plan to release other size model
4
#16 opened 8 months ago
by
Beomsun
Quantization
8
#14 opened 8 months ago
by
SirCodesAlot
Inconsistency between reasoning and answer
#13 opened 8 months ago
by
putazon
Local Installation Video and Testing - Step by Step
#12 opened 8 months ago
by
fahdmirzac
Inference failed: Torch not compiled with CUDA enabled
5
#11 opened 8 months ago
by
maz-net-au
how to run ?
#10 opened 8 months ago
by
houxiaowei
could support vllm or sglang?
#9 opened 8 months ago
by
willy1212009
training/ft guide
👍 5
#7 opened 8 months ago
by
nanatang
Degreaded point performance on some examples compared to the playground
11
#6 opened 8 months ago
by
AndyB12
New model not detecting the tap to type text
#5 opened 8 months ago
by
Camper3649
Amazing work !
2
#4 opened 8 months ago
by
Antix5
Is there any official approach for fine-tuning the model for custom tasks ?
1
#3 opened 8 months ago
by
Vimax97
Struggles with images of chastity cages
🤯🔥 11
2
#1 opened 8 months ago
by
NyxKrage