Static quants of https://huggingface.co/nvidia/Llama-3_1-Nemotron-Ultra-253B-v1

Until https://github.com/ggml-org/llama.cpp/pull/12843 is merged, running these quants requires building llama.cpp from the fork at https://github.com/ymcki/llama.cpp
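
As a rough sketch of building from that fork, the script below assembles the usual git and CMake steps and prints them. It assumes the fork's default branch carries the needed changes (this README does not name a branch), that the fork uses the standard llama.cpp CMake build, and that `git` and `cmake` are installed. The checkout directory name `llama.cpp` and the helper names are illustrative only.

```python
import subprocess

# Fork named in this README. The branch is not stated here,
# so the default branch is assumed to contain the required changes.
FORK_URL = "https://github.com/ymcki/llama.cpp"


def build_commands(fork_url=FORK_URL, dest="llama.cpp"):
    """Return the clone/configure/build commands as argument lists.

    Nothing is executed here; `dest` is an arbitrary local directory name.
    """
    build_dir = f"{dest}/build"
    return [
        ["git", "clone", fork_url, dest],
        ["cmake", "-S", dest, "-B", build_dir],
        ["cmake", "--build", build_dir, "--config", "Release"],
    ]


def run_build(dry_run=True):
    """Print each command; run it as well when dry_run is False."""
    for cmd in build_commands():
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run by default: only shows what would be executed.
    run_build(dry_run=True)
```

Calling `run_build(dry_run=False)` would execute the steps. Any backend-specific CMake options (GPU support, for example) are left out and would need to be added to the configure step.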