bnb-4bit-experimental
Collection
Experimental 4-bit dynamic quantization using Unsloth. These models are intended for inference and RoPE fine-tuning.
These weights are converted from the DeepHermes-3-Llama-3-8B-Preview model to Unsloth's 4-bit dynamic quantization format using this Colab notebook.
This conversion uses Unsloth to load the model in 4-bit format and force-save it in the same 4-bit format.
This reduces memory usage and speeds up inference while keeping the saved model compact.
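The load-then-force-save step described above can be sketched roughly as follows. This is a minimal sketch, not the notebook's actual code: the function name `convert_to_4bit`, the default `max_seq_length` of 2048, and the choice of `save_method="merged_4bit_forced"` are assumptions about how such a conversion is typically done with Unsloth, and whether the result matches the notebook's dynamic quantization settings is not confirmed here.

```python
def convert_to_4bit(source_repo: str, output_dir: str, max_seq_length: int = 2048) -> None:
    """Load a model in 4-bit with Unsloth and save it in the same 4-bit format.

    Hypothetical helper for illustration; the original notebook may differ.
    """
    # Imported lazily so the function can be defined on machines without
    # Unsloth or a supported GPU; the import is only needed when converting.
    from unsloth import FastLanguageModel

    model, tokenizer = FastLanguageModel.from_pretrained(
        model_name=source_repo,
        max_seq_length=max_seq_length,
        dtype=None,         # let Unsloth choose the compute dtype
        load_in_4bit=True,  # load the weights as bitsandbytes 4-bit
    )

    # "merged_4bit_forced" asks Unsloth to write the model out in 4-bit
    # instead of dequantizing it back to 16-bit first (assumed to be the
    # "force-save" step the description refers to).
    model.save_pretrained_merged(
        output_dir,
        tokenizer,
        save_method="merged_4bit_forced",
    )
```

A call might look like `convert_to_4bit("NousResearch/DeepHermes-3-Llama-3-8B-Preview", "DeepHermes-3-Llama-3-8B-Preview-bnb-4bit")`, where the source repository id is assumed and the output directory name is purely illustrative. Running it requires Unsloth and a GPU it supports.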
Base model
meta-llama/Llama-3.1-8B