docs: deprecate `llama-gemma3-cli` and update usage to `llama-mtmd-cli`
#11 opened about 10 hours ago
by
danchev

GGUF seems to be broken
4
#10 opened 19 days ago
by
aportnoy
VLLM support
15
1
#8 opened 23 days ago
by
potanin-marat
Request for benchmarks comparing the original model with the quantized models
2
#7 opened 24 days ago
by
riversnow
KeyError: 'general.name'
3
7
#4 opened about 1 month ago
by
vilyuha
Ollama run returns authentication error on Windows.
6
#3 opened about 1 month ago
by
theforgehermit
Run on Ollama problem
5
#2 opened about 1 month ago
by
Sagicc

Why is the size bigger than regular Q4_0 quants?
3
6
#1 opened about 1 month ago
by
lefromage