onnxruntime
/

DeepSeek-R1-Distill-ONNX

Text Generation

Inference Endpoints

Model card Files Files and versions Community

kvaishnavi commited on Feb 16

Commit

5f0a440

·

verified ·

1 Parent(s): f192828

Update README.md

Files changed (1) hide show

README.md +15 -2

README.md CHANGED Viewed

@@ -32,16 +32,29 @@ For CUDA:
 ```bash
 # Download the model directly using the Hugging Face CLI
-huggingface-cli download onnxruntime/DeepSeek-R1-Distill-ONNX --include 'deepseek-r1-distill-qwen-1.5B/cuda/*' --local-dir .
 # Install the CUDA package of ONNX Runtime GenAI
 pip install onnxruntime-genai-cuda
 # Please adjust the model directory (-m) accordingly
 curl -o https://raw.githubusercontent.com/microsoft/onnxruntime-genai/refs/heads/main/examples/python/model-chat.py
-python model-chat.py -m /path/to/cuda-int4-rtn-block-32/ -e cuda --chat_template "<|begin▁of▁sentence|><|User|>{input}<|Assistant|>"
 ```
 ## ONNX Models

 ```bash
 # Download the model directly using the Hugging Face CLI
+huggingface-cli download onnxruntime/DeepSeek-R1-Distill-ONNX --include 'deepseek-r1-distill-qwen-1.5B/gpu/*' --local-dir .
 # Install the CUDA package of ONNX Runtime GenAI
 pip install onnxruntime-genai-cuda
 # Please adjust the model directory (-m) accordingly
 curl -o https://raw.githubusercontent.com/microsoft/onnxruntime-genai/refs/heads/main/examples/python/model-chat.py
+python model-chat.py -m /path/to/gpu-int4-rtn-block-32/ -e cuda --chat_template "<|begin▁of▁sentence|><|User|>{input}<|Assistant|>"
 ```
+For DirectML:
+```bash
+# Download the model directly using the Hugging Face CLI
+huggingface-cli download onnxruntime/DeepSeek-R1-Distill-ONNX --include 'deepseek-r1-distill-qwen-1.5B/gpu/*' --local-dir .
+# Install the DirectML package of ONNX Runtime GenAI
+pip install onnxruntime-genai-directml
+# Please adjust the model directory (-m) accordingly
+curl -o https://raw.githubusercontent.com/microsoft/onnxruntime-genai/refs/heads/main/examples/python/model-chat.py
+python model-chat.py -m /path/to/gpu-int4-rtn-block-32/ -e dml --chat_template "<|begin▁of▁sentence|><|User|>{input}<|Assistant|>"
+```
 ## ONNX Models