Model Card for Model ID

prepared with mergekit

quantized 4 bits with bitsandbytes

base_model: meta-llama/Llama-2-7b-chat-hf
gate_mode: cheap_embed
experts:
  - source_model: meta-llama/Llama-2-7b-chat-hf
    positive_prompts: ["You are an helpful assistant."]
  - source_model: TheTravellingEngineer/llama2-7b-hf-guanaco
    positive_prompts: ["You are an helpful general-pupose assistant"]
Downloads last month
8
Safetensors
Model size
5.83B params
Tensor type
F32
·
U8
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including uisikdag/Mixllama-2x7b-4bit-bitsnbytes