veg

ciprianv

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

INC4AI/MiMo-V2.5-Pro-int4-mixed:working vllm or sglang command

new activity 3 days ago

lukealonso/MiMo-V2.5-NVFP4:Looping in OpenCode

upvoted a collection 2 months ago

Organizations

None yet

New activity in INC4AI/MiMo-V2.5-Pro-int4-mixed 3 days ago

working vllm or sglang command

#1 opened 3 days ago by

New activity in lukealonso/MiMo-V2.5-NVFP4 3 days ago

Looping in OpenCode

#4 opened 17 days ago by

New activity in Qwen/Qwen3.5-397B-A17B-GPTQ-Int4 3 months ago

GPTQ vs Q4 GGUF

#2 opened 3 months ago by

New activity in mratsim/MiniMax-M2.5-BF16-INT4-AWQ 3 months ago

Can't get it to work on 8x RTX3090

#1 opened 3 months ago by

New activity in lukealonso/MiniMax-M2.5-NVFP4 3 months ago

"w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected."

#2 opened 3 months ago by

New activity in mratsim/MiniMax-M2.5-BF16-INT4-AWQ 3 months ago

accuracy

#4 opened 3 months ago by

New activity in mratsim/MiniMax-M2.1-BF16-INT4-AWQ 3 months ago

Fastest for my 3090x8

#1 opened 3 months ago by

New activity in 0xSero/MiniMax-M2.1-REAP-40 4 months ago

Hey, I like the model. Could you maybe make an NVFP4 version or a version optimised for the DGX Spark?

#1 opened 4 months ago by

New activity in cerebras/GLM-4.7-REAP-218B-A32B 4 months ago

Please also create MiniMax 2.1 REAP versions

#1 opened 4 months ago by

New activity in unsloth/MiniMax-M2.1-GGUF 5 months ago

Report: getting 20 t/s with UD-Q4_K_XL and 72 VRAM

#2 opened 5 months ago by

Hot Damn This Model Cooks!

#5 opened 5 months ago by

New activity in MiniMaxAI/MiniMax-M2.1 5 months ago

Please make 4 bit dwq mlx quant

#1 opened 5 months ago by

New activity in unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF 5 months ago

Please update llama.cpp to see improved performance!

#7 opened 5 months ago by

New activity in unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF 10 months ago

Updated Title: UDQ4_K_XL - Great Rust coder

#11 opened 10 months ago by wonderfuldestruction

New activity in unsloth/Qwen3-235B-A22B-Thinking-2507-GGUF 10 months ago

download link creates Q5_K_M instead of UD-Q5_K_XL named files

#2 opened 10 months ago by

New activity in Qwen/Qwen3-Coder-480B-A35B-Instruct 10 months ago

Confused about the eval score

#15 opened 10 months ago by

New activity in ubergarm/DeepSeek-TNG-R1T2-Chimera-GGUF 11 months ago

IQ3_KS metrics on mixed CUDA + CPU, pretty good model!

#2 opened 11 months ago by

New activity in tngtech/DeepSeek-TNG-R1T2-Chimera 11 months ago

What are the recommended settings?

#7 opened 11 months ago by

New activity in ubergarm/DeepSeek-R1-0528-GGUF 11 months ago

Thanks for your work! Any chance for something between Q2_K_R and Q3_K_R?

#7 opened 12 months ago by

New activity in unsloth/DeepSeek-R1-0528-GGUF 11 months ago

Update - Tool Calling + Chat Template bug fixes

#20 opened 11 months ago by