---
license: apache-2.0
inference: false 
base_model: berkeley-nest/Starling-LM-7B-alpha
base_model_relation: quantized 
tags: [green, p7, llmware-chat, gguf]
---

# starling-lm-7b-alpha-gguf

**starling-lm-7b-alpha-gguf** is a GGUF Q4_K_M int4 quantized version of Berkeley Nest's popular finetune of mistral, providing a very fast, very small inference implementation.    

[**starling-lm-7b-alpha-gguf**](https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha) is a leading chat finetuned version of mistral 7b.  


### Model Description

- **Developed by:** berkeley-nest
- **Quantized by:** llmware   
- **Model type:** mistral-7b  
- **Parameters:** 7 billion
- **Model Parent:** berkeley-nest/Starling-LM-7B-alpha
- **Language(s) (NLP):** English  
- **License:** Apache 2.0  
- **Uses:** General purpose chat
- **RAG Benchmark Accuracy Score:** NA  
- **Quantization:** int4  
  

## Model Card Contact

[llmware on github](https://www.github.com/llmware-ai/llmware)  

[llmware on hf](https://www.huggingface.co/llmware)  

[llmware website](https://www.llmware.ai)