MTP weights?
#45
by
SzymonOzog
- opened
Does this contain multi token prediciton weights? I know that they are marked as layer 61 in the huggingface implementation but I can see this goes up to blk_60. Are they named differently or are they just prunned?