Ernani Britto
Ernani
AI & ML interests
AI Agents and LLMs
Recent Activity
liked a Space 3 days ago: build-small-hackathon/registration
liked a model about 1 month ago: pyannote/segmentation-3.0
reacted to burtenshaw's post about 1 year ago:
Qwen 3 fine-tuning >> MoE. Updated the experiment thread to include the config and script for fine-tuning the Qwen3-30B-A3B model.
The goal is to make a low-latency, non-thinking model for daily-driver coding, so 3 billion active parameters should be perfect.
- training running
- evals running
- next: improve dataset
The MoE isn't going to fit into Colab's A100 even with quantization (@UnslothAI), so I've been working on HF Spaces' H100s for this. Everything is available in the thread, and I'll share more tomorrow.
https://huggingface.co/burtenshaw/Qwen3-Code-Lite/discussions/1