doberst commited on
Commit
02d26d7
·
verified ·
1 Parent(s): 029f8ff

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -3
README.md CHANGED
@@ -1,3 +1,33 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ inference: false
4
+ base_model: microsoft/Phi-3-mini-4k-instruct
5
+ base_model_relation: quantized
6
+ tags: [green, llmware-chat, p3, ov]
7
+ ---
8
+
9
+ # phi-3-npu-ov
10
+
11
+ <!-- Provide a quick summary of what the model is/does. -->
12
+
13
+ **phi-3-npu-ov** is an OpenVino int4 quantized version of [Microsoft Phi-3-mini-4k-instruct](https://www.huggingface.co/microsoft/Phi-3-mini-4k-instruct), providing a very fast, very small inference implementation, optimized for AI PCs using Intel NPU.
14
+
15
+
16
+ ### Model Description
17
+
18
+ - **Developed by:** microsoft
19
+ - **Quantized by:** llmware
20
+ - **Model type:** phi3
21
+ - **Parameters:** 3.8 billion
22
+ - **Model Parent:** microsoft/Phi-3-mini-4k-instruct
23
+ - **Language(s) (NLP):** English
24
+ - **License:** Apache 2.0
25
+ - **Uses:** Chat, general-purpose LLM
26
+ - **Quantization:** int4
27
+
28
+
29
+ ## Model Card Contact
30
+
31
+ [llmware on hf](https://www.huggingface.co/llmware)
32
+
33
+ [llmware website](https://www.llmware.ai)