MiniCPM4-8B-PaperProf

Fine-tuned from openbmb/MiniCPM4-8B for exam-question generation in PaperProf, an AI study buddy that turns course PDFs into interactive quiz sessions.

Training

  • Method: QLoRA (4-bit NF4, r=16, alpha=32, all-linear targets), merged to bf16
  • Data: ~3500 multi-task pairs in PaperProf's three production formats: open question generation (SQuAD), MCQ with distractors and per-option explanations (SciQ), and structured answer evaluation (SQuAD-derived), so the model is optimized for the exact tasks it serves.
  • Epochs: 1, lr 2e-4 cosine, bf16 compute

Usage

Drop-in replacement for the base model:

from transformers import AutoTokenizer, AutoModelForCausalLM
tok = AutoTokenizer.from_pretrained("build-small-hackathon/MiniCPM4-8B-PaperProf", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("build-small-hackathon/MiniCPM4-8B-PaperProf", trust_remote_code=True, torch_dtype="bfloat16")

Built for the Build Small Hackathon, June 2026, by Team PaperProf (EPITA).

Downloads last month
-
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for build-small-hackathon/MiniCPM4-8B-PaperProf

Adapter
(1)
this model
Quantizations
1 model

Datasets used to train build-small-hackathon/MiniCPM4-8B-PaperProf

Space using build-small-hackathon/MiniCPM4-8B-PaperProf 1