Instructions to use nsadeq/ReDis-Mistral with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use nsadeq/ReDis-Mistral with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="nsadeq/ReDis-Mistral")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("nsadeq/ReDis-Mistral", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use nsadeq/ReDis-Mistral with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "nsadeq/ReDis-Mistral" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "nsadeq/ReDis-Mistral", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/nsadeq/ReDis-Mistral
- SGLang
How to use nsadeq/ReDis-Mistral with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "nsadeq/ReDis-Mistral" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "nsadeq/ReDis-Mistral", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "nsadeq/ReDis-Mistral" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "nsadeq/ReDis-Mistral", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use nsadeq/ReDis-Mistral with Docker Model Runner:
docker model run hf.co/nsadeq/ReDis-Mistral
Model Card for Model ID
ReDis-Llama is trained for improved inductive reasoning performance.
Model Description
- Developed by: Nafis Sadeq
- Language(s) (NLP): English
- Finetuned from model: mistralai/Mistral-7B-Instruct-v0.3
Model Sources [optional]
- Repository: https://github.com/NafisSadeq/reasoning-distillation
- Paper: https://arxiv.org/abs/2504.10647
How to Get Started with the Model
Follow the instructions here: https://github.com/NafisSadeq/reasoning-distillation
Training Details
Training details can be found in the paper: https://arxiv.org/abs/2504.10647
Environmental Impact
- Hardware Type: 2 × 48 GB Nvidia RTX A6000 GPUs
- Hours used: 72 hours
Model Architecture and Objective
This model has the same architecture as mistralai/Mistral-7B-Instruct-v0.3
Compute Infrastructure
2 × 48 GB Nvidia RTX A6000 GPUs
Citation
If you use this model, please cite the following paper.
@misc{sadeq2025improvingincontextlearningreasoning, title={Improving In-Context Learning with Reasoning Distillation}, author={Nafis Sadeq and Xin Xu and Zhouhang Xie and Julian McAuley and Byungkyu Kang and Prarit Lamba and Xiang Gao}, year={2025}, eprint={2504.10647}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2504.10647}, }
Model tree for nsadeq/ReDis-Mistral
Base model
mistralai/Mistral-7B-v0.3