Text Classification
Scikit-learn
Joblib
English
llm-routing
model-selection
budget-optimization
nearest-neighbor
Instructions to use JiaqiXue/R2-Router-RouterArena with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Scikit-learn
How to use JiaqiXue/R2-Router-RouterArena with Scikit-learn:
from huggingface_hub import hf_hub_download import joblib model = joblib.load( hf_hub_download("JiaqiXue/R2-Router-RouterArena", "sklearn_model.joblib") ) # only load pickle files from sources you trust # read more about it here https://skops.readthedocs.io/en/stable/persistence.html - Notebooks
- Google Colab
- Kaggle
fix: update vllm serve comment to --runner pooling
Browse files
router.py
CHANGED
|
@@ -12,7 +12,7 @@ Usage:
|
|
| 12 |
result = router.route_text("What is the capital of France?")
|
| 13 |
|
| 14 |
# Option B: Remote vLLM server (no local GPU needed for embedding)
|
| 15 |
-
# Start server: vllm serve Qwen/Qwen3-0.6B --
|
| 16 |
router = R2Router.from_pretrained(path, embed_url="http://localhost:8000")
|
| 17 |
result = router.route_text("What is the capital of France?")
|
| 18 |
|
|
|
|
| 12 |
result = router.route_text("What is the capital of France?")
|
| 13 |
|
| 14 |
# Option B: Remote vLLM server (no local GPU needed for embedding)
|
| 15 |
+
# Start server: vllm serve Qwen/Qwen3-0.6B --runner pooling
|
| 16 |
router = R2Router.from_pretrained(path, embed_url="http://localhost:8000")
|
| 17 |
result = router.route_text("What is the capital of France?")
|
| 18 |
|