Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
43.9
TFLOPS
13
10
239
Aunali
Cossale
Follow
21world's profile picture
PhysiQuanty's profile picture
alexandreteles's profile picture
8 followers
Β·
42 following
https://auna.li?q=hf
XCossale
Aunali321
AI & ML interests
Text2Image and Text2Text generation.
Recent Activity
reacted
to
qgallouedec
's
post
with π₯
1 day ago
TRL v1.2 introduces the SSDTrainer π Simple Self-Distillation (SSD) from Apple's paper "Embarrassingly Simple Self-Distillation Improves Code Generation" is now available as an experimental trainer in TRL. The recipe is as minimal as the name suggests: sample completions from the model itself at a training-time temperature, then fine-tune on those raw, unverified samples with plain cross-entropy. No reward model. No verifier. No teacher model. No reinforcement learning. Just prompts and the model. ```python from trl.experimental.ssd import SSDConfig, SSDTrainer trainer = SSDTrainer( model="Qwen/Qwen3-4B-Instruct", args=SSDConfig(temperature=0.6, top_k=20, top_p=0.95), train_dataset=dataset, ) trainer.train() ``` v1.2 also ships expanded tool-calling support (LLaMA 3.1 / 3.2, DeepSeek-V3), another round of KTO β DPO alignment getting us closer to promoting KTO to stable, a big GRPO simplification for overlong tool results, deprecation of `use_transformers_paged`, and key fixes for VLM response parsing. Full release notes: https://github.com/huggingface/trl/releases/tag/v1.2.0
liked
a model
11 days ago
openbmb/VoxCPM2
updated
a dataset
12 days ago
Cossale/memory-traces
View all activity
Organizations
Cossale
's models
16
Sort:Β Recently updated
Cossale/poetry-gemma3-4B
Text Generation
β’
Updated
Mar 15, 2025
β’
4
Cossale/poetry-gemma3-4B-LoRA
Updated
Mar 15, 2025
Cossale/Frames2-Flex.1
Text-to-Image
β’
Updated
Jan 22, 2025
β’
81
β’
5
Cossale/Frames-Flex.1
Text-to-Image
β’
Updated
Jan 19, 2025
β’
28
β’
4
Cossale/frames
Text-to-Image
β’
Updated
Nov 10, 2024
β’
42
β’
Cossale/aya-expanse-8b-formal
8B
β’
Updated
Oct 24, 2024
β’
2
Cossale/phi3.5-ft
4B
β’
Updated
Sep 22, 2024
β’
3
Cossale/mms-tts-guj-train
Text-to-Audio
β’
83M
β’
Updated
Aug 19, 2024
β’
2
Cossale/Solar-Claude
Updated
Mar 29, 2024
Cossale/tinyllama-claude_q5_k_m
1B
β’
Updated
Feb 20, 2024
β’
37
Cossale/tinyllama-claude_q4_k_m
1B
β’
Updated
Feb 20, 2024
β’
79
β’
1
Cossale/tinyllama-claude_16bit_GGUF
1B
β’
Updated
Feb 20, 2024
β’
56
Cossale/tinyllama-claude_8bit_GGUF
1B
β’
Updated
Feb 20, 2024
β’
30
Cossale/tinyllama-claude_16bit
Text Generation
β’
Updated
Feb 20, 2024
β’
7
Cossale/tinyllama-claude
Updated
Feb 20, 2024
Cossale/tinyllama-claude-lora
Updated
Feb 20, 2024