veg
ciprianv
AI & ML interests
None yet
Recent Activity
new activity 3 days ago
INC4AI/MiMo-V2.5-Pro-int4-mixed:working vllm or sglang command new activity 3 days ago
lukealonso/MiMo-V2.5-NVFP4:Looping in OpenCode upvoted a collection 2 months ago
Cerebras REAPOrganizations
None yet
working vllm or sglang command
#1 opened 3 days ago
by
ciprianv
Looping in OpenCode
👀 1
5
#4 opened 17 days ago
by
Jon-Nielsen
GPTQ vs Q4 GGUF
👀 3
1
#2 opened 3 months ago
by
ciprianv
Cant get it to work on 8x RTX3090
14
#1 opened 3 months ago
by
maglat
"w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected."
👍 1
21
#2 opened 3 months ago
by
zenmagnets
accuracy
26
#4 opened 3 months ago
by
ktsaou
Fastest for my 3090x8
2
#1 opened 3 months ago
by
ciprianv
Please create also Minimax 2.1 REAP versions
2
#1 opened 4 months ago
by
ciprianv
Report: getting 20 t/s with UD-Q4_K_XL and 72 VRAM
🔥 2
10
#2 opened 5 months ago
by
SlavikF
Hot Damn This Model Cooks!
👍 6
12
#5 opened 5 months ago
by
aaron-newsome
Please make 4 bit dwq mlx quant
2
#1 opened 5 months ago
by
Narutoouz
Please update llama.cpp to see improved performance!
🚀 4
4
#7 opened 5 months ago
by
danielhanchen
Updated Title: UDQ4_K_XL - Great Rust coder
👍 3
5
#11 opened 10 months ago
by
wonderfuldestruction
download link creates Q5_K_M instead of UD-Q5_K_XL named files
1
#2 opened 10 months ago
by
ciprianv
Confused about the eval score
❤️ 2
3
#15 opened 10 months ago
by
Denisssy
IQ3_KS metrics on mixed CUDA + CPU, pretty good model!
🔥 2
34
#2 opened 11 months ago
by
Panchovix
What are the recommended settings?
1
#7 opened 11 months ago
by
ciprianv
Thanks for your work! Any chance for something between Q2_K_R and Q3_K_R?
🚀👀 5
19
#7 opened 12 months ago
by
Panchovix
Update - Tool Calling + Chat Template bug fixes
9
#20 opened 11 months ago
by
danielhanchen