DeepSeek testing - a gaunernst Collection

gaunernst 's Collections

DeepSeek testing

Gemma 3 QAT INT4 (from GGUF)

Gemma 3 QAT INT4 (from Flax)

Mini BERT models

Face Recognition Models

Smallish LLM pre-training datasets

Llama2-compatible

Llama3-compatible

DeepSeek testing

updated 30 days ago

A collection of MoE+MLA models, serving as testing proxies for DeepSeek-V3/R1