synapse/Qwen-3-4B-recipes
Text Generation • 4B • Updated • 157
I appreciate your time, a couple of quick more questions:
Oh, actually I found it. I was searching for this https://huggingface.co/datasets/HuggingFaceTB/smollm3-configs
In SmalLM2 you guys reported the weights for each dataset, for example: FineWeb-Edu (60%), DCLM (40%). Where can we find those specifics, for instance, what were the weights for each Web dataset?:
Web: 85% (12% multilingual) - FineWeb-Edu, DCLM, FineWeb2 and FineWeb2-HQ