MiniCPM4-8B-PaperProf

Fine-tuned from openbmb/MiniCPM4-8B for exam-question generation in PaperProf, an AI study buddy that turns course PDFs into interactive quiz sessions.

Training

Method: QLoRA (4-bit NF4, r=16, alpha=32, all-linear targets), merged to bf16
Data: ~3500 multi-task pairs in PaperProf's three production formats: open question generation (SQuAD), MCQ with distractors and per-option explanations (SciQ), and structured answer evaluation (SQuAD-derived), so the model is optimized for the exact tasks it serves.
Epochs: 1, lr 2e-4 cosine, bf16 compute

Usage

Drop-in replacement for the base model:

from transformers import AutoTokenizer, AutoModelForCausalLM
tok = AutoTokenizer.from_pretrained("build-small-hackathon/MiniCPM4-8B-PaperProf", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("build-small-hackathon/MiniCPM4-8B-PaperProf", trust_remote_code=True, torch_dtype="bfloat16")

Built for the Build Small Hackathon, June 2026, by Team PaperProf (EPITA).

Downloads last month: -

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for build-small-hackathon/MiniCPM4-8B-PaperProf

Base model

openbmb/MiniCPM4-8B

Adapter

(1)

this model

Quantizations

1 model

build-small-hackathon
/

MiniCPM4-8B-PaperProf

MiniCPM4-8B-PaperProf

Training

Usage

Model tree for build-small-hackathon/MiniCPM4-8B-PaperProf

Datasets used to train build-small-hackathon/MiniCPM4-8B-PaperProf

Space using build-small-hackathon/MiniCPM4-8B-PaperProf 1