128k version of YARN
#6
by
sovetboga
- opened
Hello, does this model have the same situation as in Qwen3. Is it possible to get the 128k version?
https://github.com/THUDM/GLM-4#model-and-prompt-implementation
Hi there good idea we'll see waht we can do :)
It would be awesome, looking for it.