Model Card for Model ID

ReDis-Llama is trained for improved inductive reasoning performance.

Model Description

Developed by: Nafis Sadeq
Language(s) (NLP): English
Finetuned from model: mistralai/Mistral-7B-Instruct-v0.3

Model Sources [optional]

Repository: https://github.com/NafisSadeq/reasoning-distillation
Paper: https://arxiv.org/abs/2504.10647

How to Get Started with the Model

Follow the instructions here: https://github.com/NafisSadeq/reasoning-distillation

Training Details

Training details can be found in the paper: https://arxiv.org/abs/2504.10647

Environmental Impact

Hardware Type: 2 × 48 GB Nvidia RTX A6000 GPUs
Hours used: 72 hours

Model Architecture and Objective

This model has the same architecture as mistralai/Mistral-7B-Instruct-v0.3

Compute Infrastructure

2 × 48 GB Nvidia RTX A6000 GPUs

Citation

If you use this model, please cite the following paper.

@misc{sadeq2025improvingincontextlearningreasoning, title={Improving In-Context Learning with Reasoning Distillation}, author={Nafis Sadeq and Xin Xu and Zhouhang Xie and Julian McAuley and Byungkyu Kang and Prarit Lamba and Xiang Gao}, year={2025}, eprint={2504.10647}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2504.10647}, }

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for nsadeq/ReDis-Mistral

Base model

mistralai/Mistral-7B-v0.3

Finetuned

mistralai/Mistral-7B-Instruct-v0.3

Finetuned

(479)

this model

Datasets used to train nsadeq/ReDis-Mistral

Paper for nsadeq/ReDis-Mistral

Improving In-Context Learning with Reasoning Distillation

Paper • 2504.10647 • Published Apr 14, 2025