MTP weights?

#45
by SzymonOzog - opened

Does this contain the multi-token prediction (MTP) weights? I know they are marked as layer 61 in the Hugging Face implementation, but I can see this only goes up to blk_60. Are they named differently, or were they just pruned?
