q4_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q4_K 0f58395 verified datatab commited on Mar 6, 2024