Text Classification
Scikit-learn
Joblib
English
llm-routing
model-selection
budget-optimization
nearest-neighbor
Instructions to use JiaqiXue/R2-Router-RouterArena with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Scikit-learn
How to use JiaqiXue/R2-Router-RouterArena with Scikit-learn:
from huggingface_hub import hf_hub_download import joblib model = joblib.load( hf_hub_download("JiaqiXue/R2-Router-RouterArena", "sklearn_model.joblib") ) # only load pickle files from sources you trust # read more about it here https://skops.readthedocs.io/en/stable/persistence.html - Notebooks
- Google Colab
- Kaggle
docs: note sub_10 only training following chayan
Browse files
README.md
CHANGED
|
@@ -185,7 +185,9 @@ checkpoints/
|
|
| 185 |
|
| 186 |
## Training Details
|
| 187 |
|
| 188 |
-
-
|
|
|
|
|
|
|
| 189 |
- **Method**: Nearest-neighbor regression with cosine distance, distance-weighted
|
| 190 |
- **Evaluation**: Full 8,400 RouterArena queries (no data leakage)
|
| 191 |
- **Training Time**: < 1 second
|
|
|
|
| 185 |
|
| 186 |
## Training Details
|
| 187 |
|
| 188 |
+
Following [chayan](https://huggingface.co/adaptive-classifier/chayan), we only use the official **sub_10 split** (809 queries, 10% of the full 8,400) for training. No full-set data is used during training or hyperparameter tuning.
|
| 189 |
+
|
| 190 |
+
- **Training Data**: RouterArena sub_10 split (809 queries)
|
| 191 |
- **Method**: Nearest-neighbor regression with cosine distance, distance-weighted
|
| 192 |
- **Evaluation**: Full 8,400 RouterArena queries (no data leakage)
|
| 193 |
- **Training Time**: < 1 second
|