LLM / VLM Quantization
Prepared with mergekit and quantized to 4 bits with bitsandbytes.
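The 4-bit bitsandbytes quantization can be sketched with the transformers integration. A minimal sketch, assuming NF4 quantization with fp16 compute; the exact settings used for this model are not stated in the card, so these parameters are illustrative.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Illustrative 4-bit settings; the actual quantization parameters
# used for this checkpoint may differ.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",             # NormalFloat4 data type
    bnb_4bit_compute_dtype=torch.float16,  # matmuls computed in fp16
    bnb_4bit_use_double_quant=True,        # quantize the quantization constants too
)

# Loading requires a GPU and access to the gated Llama-2 checkpoint.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-chat-hf",
    quantization_config=bnb_config,
    device_map="auto",
)
```

With `device_map="auto"`, the quantized weights are placed on available accelerators automatically; saving the result with `save_pretrained` produces a 4-bit checkpoint like this one.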
```yaml
base_model: meta-llama/Llama-2-7b-chat-hf
gate_mode: cheap_embed
experts:
  - source_model: meta-llama/Llama-2-7b-chat-hf
    positive_prompts: ["You are a helpful assistant."]
  - source_model: TheTravellingEngineer/llama2-7b-hf-guanaco
    positive_prompts: ["You are a helpful general-purpose assistant."]
```